大模型如何判决?从生成到判决:大型语言模型作为裁判的机遇与挑战(英文版)
大模型如何判决?从生成到判决:大型语言模型作为裁判的机遇与挑战(英文版).pdf |
下载文档 |
资源简介
Assessment and evaluation have long been criti.cal challenges in artificial intelligence (Al) andnatural language processing (NLP). However,traditional methods, whether matching-basedor embedding-based, often fall short of judg.ing subtle attributes and delivering satisfactoryresults. Recent advancements in Large Lan-guage Models (LLMs) inspire the "LLM-as-a-judge" paradigm, where LLMs are leveragedto perform scoring, ranking, or selection acrossvarious tasks and applications. This
本文档仅能预览20页