Investigating the Impact of Training Data and Evaluation on Chinese Instruction-Following Language Models [English version].pdf
Resource summary
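English title: Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Summary: This study analyzes publicly available datasets combined with the authors' own Chinese multi-turn conversation data, and selects a range of evaluation metrics to assess the performance of various open-source chatbots. It also extends the vocabulary of LLaMA and carries out secondary pre-training on 3.4 billion Chinese words, with the aim of improving chatbot performance and efficiency in Chinese. The model, data, and code are publicly released.
English abstract: Recently, significant public efforts have been directed towards developing low-cost models with capabilities akin to ChatGPT, thereby fostering the growth of open-source conversational models. However, there remains a scarcity of compreh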