Improving Diffusion Models for Scene Text Editing with Dual Encoders (English version)

Published by: wx****c7
2023-04-22
41 MB, 22 pages
Artificial Intelligence (AI)
File list:
使用双编码器改进场景文本编辑的扩散模型【英文版】.pdf
English title: Improving Diffusion Models for Scene Text Editing with Dual Encoders

Summary (translated from Chinese): DIFFSTE is a framework that improves pre-trained diffusion models with a dual-encoder design. Trained with instruction tuning, it achieves correct text rendering and style control in scene text editing, and it generalizes zero-shot.

English abstract: Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance. Most previous approaches to this task rely on style-transfer models that crop out text regions and feed them into image transfer models, such as GANs. However, these methods a
