文件列表:
Multimodal C4: 亿级图文混合语料库【英文版】.pdf |
下载文档 |
资源简介
>
英文标题:Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text中文摘要:Multimodal C4 is a publicly available dataset that supports in-context vision and language models, including linear assignment algorithm, for complex learning between images and texts.英文摘要:In-context vision and language models like Flamingo support arbitrarilyinterleaved sequences of images and text as input. This format not only enablesfew-shot learning via interleaving independent supervised (image, text)exa
加载中...
已阅读到文档的结尾了