Tel Aviv University | Localizing Object-level Shape Variations with Text-to-Image Diffusion Models

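【Why we recommend it】The global nature of text-to-image generation prevents users from narrowing their exploration to a specific object. This paper proposes a technique that generates a collection of images depicting shape variations of a specific object, enabling an object-level shape exploration process.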

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models

Or Patashnik, Daniel Garibi, Idan Azuri, Hadar Averbuch-Elor, Daniel Cohen-Or

[Tel-Aviv University, Independent Researcher]

【Paper link】https://arxiv.org/pdf/2303.11306v1.pdf

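【Abstract】Text-to-image models give rise to workflows that often begin with an exploration step, in which users sift through a large collection of generated images. Because the text-to-image generation process is global in nature, users cannot narrow their exploration to a particular object in the image. This paper presents a technique for generating a collection of images that depict variations in the shape of a specific object, enabling an object-level shape exploration process. Creating plausible variations is challenging because it requires controlling the shape of the generated object while respecting its semantics. A particular challenge when generating object variations is accurately localizing the manipulation applied to the object's shape. The authors introduce a prompt-mixing technique that switches between prompts along the denoising process to obtain a variety of shape choices. To localize the image-space operation, they propose two techniques that use the self-attention layers together with the cross-attention layers. They also show that these localization techniques are general and effective beyond the scope of generating object variations. Extensive results and comparisons demonstrate the effectiveness of the method in generating object variations, as well as the competence of the localization techniques.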
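The prompt-mixing idea mentioned in the abstract, switching between prompts along the denoising process, can be illustrated with a minimal sketch. The function name, the step window, and the example prompts below are all hypothetical and are not taken from the paper; the sketch only builds a per-step prompt schedule and does not call any diffusion model.

```python
def build_prompt_schedule(base_prompt, variant_prompt, num_steps,
                          switch_start, switch_end):
    """Return a list with one prompt per denoising step.

    Steps in the half-open window [switch_start, switch_end) use
    variant_prompt; every other step uses base_prompt.
    All parameter names and the window convention are assumptions
    made for illustration only.
    """
    if not 0 <= switch_start <= switch_end <= num_steps:
        raise ValueError(
            "expected 0 <= switch_start <= switch_end <= num_steps"
        )
    return [
        variant_prompt if switch_start <= step < switch_end else base_prompt
        for step in range(num_steps)
    ]


# Hypothetical example: swap the object word only in a middle window of steps.
base = "a basket with oranges"
variant = "a basket with pumpkins"
schedule = build_prompt_schedule(base, variant, num_steps=10,
                                 switch_start=3, switch_end=6)

for step, prompt in enumerate(schedule):
    print(step, prompt)
```

In a real pipeline, each entry of such a schedule would be encoded and passed as the text condition for the corresponding denoising step; which steps to switch at is a design choice this sketch leaves open.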

