×
img

英伟达:2025年Nemotron Nano 2 AI模型技术报告(英文版)

发布者:wx****6b
2025-08-21
2 MB 43 页
人工智能(AI)
文件列表:
英伟达:2025年Nemotron Nano 2 AI模型技术报告(英文版).pdf
下载文档

We introduce Nemotron-Nano-9B-v2, a hybrid Mamba-Transformer language model designed to increase throughput for reasoning workloads while achieving state-of-the-art accuracy compared to similarly-sized models. Nemotron-Nano-9B-v2 builds on the Nemotron-H architecture, in which the majority of the self-attention layers in the common Transformer architecture are replaced with Mamba-2 layers, to achieve improved inference speed when generating the long thinking traces needed for reasoning. We cr


加载中...

本文档仅能预览20页

继续阅读请下载文档

网友评论>