AV-SAM: Combining the Segment Anything Model with Audio-Visual Localization (English version)

Published by: wx****41

2023-05-06

1 MB, 4 pages

Artificial Intelligence (AI)

File list:

AV-SAM: 模型将任何物体分割与视听定位相结合【英文版】.pdf

Resource summary

English title: AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

Summary: This paper proposes AV-SAM, a simple yet effective audio-visual localization and segmentation framework built on the SAM model. It generates sounding-object masks that correspond to the audio, enabling audio-visual tasks such as sound localization and segmentation.

English abstract: Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation. In this work, we propose a simple yet effective audio-visual localization and segmentation fr
