DeepSeek-V3 Deep Dive: Scaling Challenges and Reflections on Hardware for AI Architectures (English Edition)
Resource Overview
The rapid scaling of large language models (LLMs) has unveiled critical limitations in current hardware architectures, including constraints in memory capacity, computational efficiency, and interconnection bandwidth. DeepSeek-V3, trained on 2,048 NVIDIA H800 GPUs, demonstrates how hardware-aware model co-design can effectively address these challenges, enabling cost-efficient training and inference at scale. This paper presents an in-depth analysis of the DeepSeek-V3/R1 model architecture and its AI infrastructure.
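To make the memory-capacity constraint mentioned above concrete, the following is a minimal back-of-the-envelope sketch (not taken from the paper) of how the key/value cache of a standard multi-head attention model grows with context length. All model dimensions used here are hypothetical placeholders, not DeepSeek-V3's actual configuration.

```python
# Hypothetical illustration of per-request KV-cache growth under standard
# multi-head attention; the dimensions below are placeholders, not DeepSeek-V3's.

def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_elem: int = 2) -> int:
    """Size of the key/value cache for one sequence (keys + values)."""
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_elem

if __name__ == "__main__":
    # Assumed 60-layer model with 128 KV heads of dimension 128, FP16 cache.
    for ctx in (4_096, 32_768, 128_000):
        gib = kv_cache_bytes(60, 128, 128, ctx) / 2**30
        print(f"context {ctx:>7,} tokens -> KV cache ~ {gib:6.1f} GiB per request")
```

Even under these assumed dimensions the cache reaches hundreds of GiB per request at long context lengths, which is the kind of pressure that motivates memory-efficiency techniques such as those analyzed in the paper.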