Presentation | SIGGRAPH 2023

Presentation / Installation

· Contributors · Organizations · Search Program · My Schedule · Maps

CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable and Controllable Text-Guided Face Manipulation

SessionText-Guided Generation

DescriptionWe found that Contrastive Language-Image Pre-Training (CLIP) embeds texts and images in different regions in the joint space, leading to artifacts in the resulting images when optimizing toward text embeddings. Disentanglement, interpretability, and controllability are hard to guarantee. Therefore, we introduce CLIP projection-augmentation embedding (PAE) as an alternative optimization target.

Authors