AV-SAM: Segment Anything Model Combined with Audio-Visual Localization (English version)

Published by: wx****41
2023-05-06
1 MB, 4 pages
Artificial Intelligence (AI)
File list:
AV-SAM: 模型将任何物体分割与视听定位相结合【英文版】.pdf
English title: AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation

Chinese abstract (translated): This paper proposes AV-SAM, a simple yet effective audio-visual localization and segmentation framework based on the SAM model. It can generate sounding-object masks corresponding to the audio, enabling audio-visual tasks such as sound localization and segmentation.

English abstract: Segment Anything Model (SAM) has recently shown its powerful effectiveness in visual segmentation tasks. However, there is less exploration concerning how SAM works on audio-visual tasks, such as visual sound localization and segmentation. In this work, we propose a simple yet effective audio-visual localization and segmentation fr
