文件列表:
利用奖励塑形模仿学习方法合成生成类似人类数据以解决序列决策问题【英文版】.pdf |
下载文档 |
资源简介
>
英文标题:Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning中文摘要:本研究通过结合奖励塑造和模仿学习算法,提出了一种生成人工智能系统中类似于人类决策数据的新算法,证明使用这种合成的数据可以成功解决具有逐步增加难度的计算机游戏中的决策任务,并且与人类表现几乎无差异。英文摘要:We consider the problem of synthetically generating data that can closelyresemble human decisions made in the context of an interactive human-AI systemlike a computer game. We propose a novel algorithm that can generate synthetic,human-like, decision making data while sta
加载中...
已阅读到文档的结尾了