Strictly-ID-Preserved and Controllable Accessory Advertising Image Generation

Kavli Affiliate: Jiansheng Chen

| First 5 Authors: Youze Xue, Binghui Chen, Yifeng Geng, Xuansong Xie, Jiansheng Chen

| Summary:

Customized generative text-to-image models have the ability to produce images
that closely resemble a given subject. However, in the context of generating
advertising images for e-commerce scenarios, it is crucial that the generated
subject’s identity aligns perfectly with the product being advertised. In order
to address the need for strictly-ID preserved advertising image generation, we
have developed a Control-Net based customized image generation pipeline and
have taken earring model advertising as an example. Our approach facilitates a
seamless interaction between the earrings and the model’s face, while ensuring
that the identity of the earrings remains intact. Furthermore, to achieve a
diverse and controllable display, we have proposed a multi-branch
cross-attention architecture, which allows for control over the scale, pose,
and appearance of the model, going beyond the limitations of text prompts. Our
method manages to achieve fine-grained control of the generated model’s face,
resulting in controllable and captivating advertising effects.

| Search Query: ArXiv Query: search_query=au:”Jiansheng Chen”&id_list=&start=0&max_results=3