文件列表:
双重文本图像指示下的多模式程序规划【英文版】.pdf |
下载文档 |
资源简介
>
英文标题:Multimodal Procedural Planning via Dual Text-Image Prompting中文摘要:研究了利用图文混合信息来辅助人类完成任务的方法,提出了基于多模态程序规划的任务,使用基于大型语言模型的有提示和图片描述提示的方法可以生成具有信息性和准确性的图文混合任务规划。英文摘要:Embodied agents have achieved prominent performance in following humaninstructions to complete tasks. However, the potential of providinginstructions informed by texts and images to assist humans in completing tasksremains underexplored. To uncover this capability, we present the multimodalprocedural planning (MPP) task, in which models
加载中...
本文档仅能预览20页