STAS: 多智能体强化学习的时空回报分解（英文版）

发布者：wx****aa

2023-04-21

10 MB 10 页

人工智能（AI）

文件列表：

STAS: 多智能体强化学习的时空回报分解【英文版】.pdf

下载文档

资源简介

英文标题：STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning中文摘要：提出了一种名为 Spatial-Temporal Attention with Shapley（STAS）的新方法，该方法可以在时间和空间维度上学习信用分配，在多智能体强化学习中实现有效的空间 - 时间信用分配，优于所有现有的基线。英文摘要：Centralized Training with Decentralized Execution (CTDE) has been proven tobe an effective paradigm in cooperative multi-agent reinforcement learning(MARL). One of the major challenges is yet credit assignment, which aims tocredit agents by their contributions. Prior studies focus on e

加载中...

已阅读到文档的结尾了

下载文档