×
img

STAS: 多智能体强化学习的时空回报分解(英文版)

发布者:wx****aa
2023-04-21
10 MB 10 页
人工智能(AI)
文件列表:
STAS: 多智能体强化学习的时空回报分解【英文版】.pdf
下载文档
英文标题:STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning中文摘要:提出了一种名为 Spatial-Temporal Attention with Shapley(STAS)的新方法,该方法可以在时间和空间维度上学习信用分配,在多智能体强化学习中实现有效的空间 - 时间信用分配,优于所有现有的基线。英文摘要:Centralized Training with Decentralized Execution (CTDE) has been proven tobe an effective paradigm in cooperative multi-agent reinforcement learning(MARL). One of the major challenges is yet credit assignment, which aims tocredit agents by their contributions. Prior studies focus on e

加载中...

已阅读到文档的结尾了

下载文档

网友评论>