A Systematic Survey of Large Language Model Psychometrics: Evaluation, Validation, and Enhancement (English Edition)
Resource Overview
The rapid advancement of large language models (LLMs) has outpaced traditional evaluation methodologies. It presents novel challenges, such as measuring human-like psychological constructs, navigating beyond static and task-specific benchmarks, and establishing human-centered evaluation. These challenges intersect with Psychometrics, the science of quantifying the intangible aspects of human psychology, such as personality, values, and intelligence. This survey introduces and synthesizes