文件列表:
面向人工智能协同的语言指导强化学习【英文版】.pdf |
下载文档 |
资源简介
>
英文标题:Language Instructed Reinforcement Learning for Human-AI Coordination中文摘要:在缺乏高质量人类行为数据的情况下,使用预训练的大型语言模型生成人类语言指令的先验策略并规范化强化学习目标可以帮助人工智能代理与人类协作,并在多智能体强化学习问题中实现人工智能代理与人类偏好一致的均衡解。案例中验证了该框架的有效性。英文摘要:One of the fundamental quests of AI is to produce agents that coordinate wellwith humans. This problem is challenging, especially in domains that lack highquality human behavioral data, because multi-agent reinforcement learning (RL)often converges to different equilibria from the ones that humans pre
加载中...
已阅读到文档的结尾了