Semantic Images Editing by Operations on Latent Space of Deep Generative Models

AOSHIMA Takehiro, MATSUBARA Takashi

doi:10.11370/isj.62.579

Description

The creation of images and other data is one of the ultimate goals of computer vision research. For this purpose, various deep learning methods have been proposed, such as variational autoencoders, generative adversarial networks, and diffusion models. These methods learn the distributions of photographs and illustrations and reproduce them. A generated image is determined by its coordinates in the latent space, so several studies have manipulated these coordinates to edit the generated images. However, existing methods frequently produce unintended or low-quality editing results because, among other reasons, the coordinate system in the latent space is not properly learned. In this study, we focus on the coordinate system in the representation space and introduce deep curvilinear editing. In particular, we propose a method for editing representation vectors in a representation space equipped with a curvilinear coordinate system. We combined the method with generative adversarial networks, and the results demonstrated that the proposed method enables high-quality editing of generated images.
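
The contrast the abstract draws between editing along straight directions and editing along a curvilinear coordinate system can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the two-dimensional latent space, the polar-coordinate map standing in for a learned coordinate system, and the helper names `to_curvilinear`, `from_curvilinear`, `linear_edit`, and `curvilinear_edit`. It is not the authors' implementation.

```python
import numpy as np

def to_curvilinear(z):
    """Toy invertible map (assumption): Cartesian (x, y) -> polar (r, theta)."""
    x, y = z
    return np.array([np.hypot(x, y), np.arctan2(y, x)])

def from_curvilinear(u):
    """Inverse of the toy map: polar (r, theta) -> Cartesian (x, y)."""
    r, theta = u
    return np.array([r * np.cos(theta), r * np.sin(theta)])

def linear_edit(z, direction, amount):
    """Conventional edit: move the latent vector along a fixed straight direction."""
    return z + amount * direction

def curvilinear_edit(z, axis, amount):
    """Edit one curvilinear coordinate and map the result back to the latent space."""
    u = to_curvilinear(z)
    u[axis] += amount
    return from_curvilinear(u)

z = np.array([1.0, 0.0])

# Straight-line edit: moving "up" also changes the distance from the origin.
z_lin = linear_edit(z, np.array([0.0, 1.0]), 1.0)

# Curvilinear edit: changing only the angle leaves the radius untouched.
z_cur = curvilinear_edit(z, axis=1, amount=np.pi / 2)

# Edits along different curvilinear axes can be applied in either order.
z_ab = curvilinear_edit(curvilinear_edit(z, 0, 0.5), 1, 0.3)
z_ba = curvilinear_edit(curvilinear_edit(z, 1, 0.3), 0, 0.5)
```

In this toy setting the radius plays the role of an attribute that should stay fixed while the angle is edited. The straight-line edit changes both, whereas the curvilinear edit changes only the intended coordinate.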

Published in

日本画像学会誌 (Journal of the Imaging Society of Japan)

日本画像学会誌 62 (6), 579-587, 2023-12-10

一般社団法人日本画像学会 (The Imaging Society of Japan)

Details

CRID: 1390298433281727616

DOI: 10.11370/isj.62.579

ISSN: 18804675; 13444425

NDL Bibliographic ID: 033225815

Web Site: http://id.ndl.go.jp/bib/033225815; https://ndlsearch.ndl.go.jp/books/R000000004-I033225815

Text language code: en

Data source type

JaLC
NDL Search

Abstract license flag: use not permitted
