Post-Training of Large Language Models: A Deep Dive into Reasoning (English Edition)
Resource Summary
Abstract—Large Language Models (LLMs) have transformed the natural language processing landscape and brought to life diverse applications. Pretraining on vast web-scale data has laid the foundation for these models, yet the research community is now increasingly shifting focus toward post-training techniques to achieve further breakthroughs. While pretraining provides a broad linguistic foundation, post-training methods enable LLMs to refine their knowledge, improve reasoning, enhance factual