NVIDIA: Nemotron Nano 2 AI Model Technical Report, 2025 (English version)
Abstract
We introduce Nemotron-Nano-9B-v2, a hybrid Mamba-Transformer language model designed to increase throughput for reasoning workloads while achieving state-of-the-art accuracy compared to similarly-sized models. Nemotron-Nano-9B-v2 builds on the Nemotron-H architecture, in which the majority of the self-attention layers in the common Transformer architecture are replaced with Mamba-2 layers, to achieve improved inference speed when generating the long thinking traces needed for reasoning. […]
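The abstract's core architectural idea, replacing most self-attention layers with linear-time state-space layers while keeping a small minority of attention layers, can be illustrated with a minimal sketch. The Python code below is an assumption-labeled toy, not the report's implementation: the names (ToySSMMixer, SSMBlock, AttentionBlock, build_hybrid_stack), the gated-recurrence mixer, and the 1-in-8 attention ratio are all hypothetical stand-ins, whereas the actual model uses Mamba-2 layers and its own published layer layout.

```python
# Minimal, self-contained sketch of a hybrid Mamba-Transformer layer stack.
# Hypothetical module names and layer ratio, for illustration only; the real
# Nemotron-Nano-9B-v2 uses Mamba-2 layers, not this toy recurrent mixer.

import torch
import torch.nn as nn

class ToySSMMixer(nn.Module):
    """Stand-in for a Mamba-2 layer: a gated, per-channel linear recurrence.

    Per-token cost is O(d_model) and independent of sequence length, which is
    the property that speeds up generating long reasoning traces.
    """
    def __init__(self, d_model: int):
        super().__init__()
        self.in_proj = nn.Linear(d_model, 2 * d_model)
        self.decay = nn.Parameter(torch.full((d_model,), -1.0))
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model)
        u, g = self.in_proj(x).chunk(2, dim=-1)
        a = torch.sigmoid(self.decay)            # per-channel decay in (0, 1)
        h = torch.zeros_like(u[:, 0])            # constant-size recurrent state
        outs = []
        for t in range(u.shape[1]):              # sequential scan, O(1) per token
            h = a * h + (1 - a) * u[:, t]
            outs.append(h)
        y = torch.stack(outs, dim=1) * torch.sigmoid(g)  # gated output
        return self.out_proj(y)

class SSMBlock(nn.Module):
    """Pre-norm residual block around the toy SSM mixer."""
    def __init__(self, d_model: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.mixer = ToySSMMixer(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.mixer(self.norm(x))

class AttentionBlock(nn.Module):
    """A standard causal self-attention layer, kept at a minority of positions."""
    def __init__(self, d_model: int, n_heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        seq = h.shape[1]
        mask = torch.triu(torch.ones(seq, seq, dtype=torch.bool,
                                     device=h.device), diagonal=1)
        out, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        return x + out

def build_hybrid_stack(d_model: int, n_layers: int, attn_every: int = 8):
    """Replace most attention layers with SSM layers; keep one in `attn_every`.

    The 1-in-8 ratio is an illustrative assumption, not the published layout.
    """
    return nn.Sequential(*[
        AttentionBlock(d_model) if (i + 1) % attn_every == 0 else SSMBlock(d_model)
        for i in range(n_layers)
    ])

if __name__ == "__main__":
    model = build_hybrid_stack(d_model=256, n_layers=16)
    x = torch.randn(2, 64, 256)                  # (batch, seq, d_model)
    print(model(x).shape)                        # torch.Size([2, 64, 256])
```

The design point the sketch tries to surface is that the per-token state of the SSM layers stays fixed-size during decoding, so only the few remaining attention layers grow a key-value cache with the length of the thinking trace.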