- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- 【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Natural Text-Driven, Multi-Attribute Editing of Facial Images with Robustness in Sparse Latent Space
Search this article
Description
Due to the development of GAN and the proposal of many excellent models like StyleGAN, text-driven image editing and image generation have made great progress in recent years, but the task of generating diverse images of specific people under the guidance of text is still lacking. This paper combines two pre-training models, CLIP and StyleGAN2, to conduct a preliminary exploration of the above tasks. The latent code of the input portrait is driven to be edited and manipulated in the StyleGAN latent space via a CLIP-based text-driven module. Especially in the sparse region of the generator latent space, and when editing multiple attributes at the same time, some good results have finally been achieved.
Journal
-
- 法政大学大学院紀要. 情報科学研究科編
-
法政大学大学院紀要. 情報科学研究科編 18 1-6, 2023-03-24
法政大学大学院情報科学研究科
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390014868195782016
-
- NII Book ID
- AA12746425
-
- HANDLE
- 10114/00026277
-
- ISSN
- 24321192
-
- Text Lang
- en
-
- Article Type
- departmental bulletin paper
-
- Data Source
-
- JaLC
- IRDB
-
- Abstract License Flag
- Allowed