Abstract
In this paper, we use a tensor model based on the Higher-Order Singular Value Decomposition (HOSVD) to discover semantic directions in Generative Adversarial Networks. This is achieved by first embedding a structured facial expression database into the latent space using the e4e encoder. Specifically, we discover directions in latent space corresponding to the six prototypical emotions (anger, disgust, fear, happiness, sadness, and surprise), as well as a direction for yaw rotation. These latent space directions are employed to change the expression or yaw rotation of real face images. We compare the directions we discover to similar directions found by two other methods. The results show that the visual quality of the resulting edits is on par with the state of the art. We also conclude that the tensor-based model is well suited for emotion and yaw editing, i.e., that the emotion or yaw rotation of a novel face image can be robustly changed without a significant effect on identity or other attributes in the image.
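As a rough illustration of the decomposition named in the abstract, the following is a minimal NumPy sketch of a generic HOSVD applied to a tensor of latent codes. It is not the authors' implementation. The tensor layout (identity × expression × latent dimension), the sizes, and the random data standing in for e4e latent codes are all assumptions made for this example, and the helper names `unfold`, `mode_dot`, and `hosvd` are hypothetical.

```python
import numpy as np


def unfold(tensor, mode):
    """Mode-n unfolding: bring axis `mode` to the front and flatten the rest."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def mode_dot(tensor, matrix, mode):
    """Mode-n product: multiply `tensor` by `matrix` along axis `mode`."""
    moved = np.moveaxis(tensor, mode, 0)
    result = np.tensordot(matrix, moved, axes=(1, 0))
    return np.moveaxis(result, 0, mode)


def hosvd(tensor):
    """Return the core tensor and one orthonormal factor matrix per mode."""
    factors = []
    for mode in range(tensor.ndim):
        # Left singular vectors of each unfolding span that mode's subspace.
        u, _, _ = np.linalg.svd(unfold(tensor, mode), full_matrices=False)
        factors.append(u)
    core = tensor
    for mode, u in enumerate(factors):
        core = mode_dot(core, u.T, mode)
    return core, factors


# Assumed toy data: 5 identities, 7 expressions, 16-dimensional latent codes.
# Random values stand in for latent codes produced by an encoder.
rng = np.random.default_rng(0)
latents = rng.standard_normal((5, 7, 16))

core, factors = hosvd(latents)

# Multiplying the core by every factor matrix recovers the original tensor.
reconstructed = core
for mode, u in enumerate(factors):
    reconstructed = mode_dot(reconstructed, u, mode)
```

Each factor matrix holds an orthonormal basis for one mode (here identity, expression, and latent dimension), while the core tensor describes how those bases interact. How semantic directions are read off from such factors is specific to the paper and is not reproduced here.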
Original language | English |
---|---|
Publication date | 13 May 2022 |
Number of pages | 10 |
Status | Published - 13 May 2022 |
Event | AI for Content Creation Workshop @ CVPR 2022 - New Orleans Ernest N. Morial Convention Center, New Orleans, USA Duration: 19 Jun 2022 → 20 Jun 2022 https://ai4cc.net/ |
Conference
Conference | AI for Content Creation Workshop @ CVPR 2022 |
---|---|
Location | New Orleans Ernest N. Morial Convention Center |
Country/Territory | USA |
City | New Orleans |
Period | 19/06/2022 → 20/06/2022 |
Internet address |
Keywords
- Computer vision and pattern recognition
- Face Synthesis
- generative adversarial network