Improving Perceptual Loss for Super-Resolution Using Integrated Image and Language Features

Bibliographic Information

Alternative Title
  • Improving Perceptual Loss with CLIP for Super-Resolution

Abstract

Perceptual loss computed with a VGG network pre-trained on ImageNet has been widely employed for super-resolution tasks, enabling the generation of photo-realistic images. However, grid-like artifacts have frequently been reported in the generated images. To address this problem, we consider that large-scale pre-trained models can contribute significantly to super-resolution across diverse scenes. In particular, by combining vision with language, such models exhibit a strong capability to comprehend complex scenes, potentially enhancing super-resolution performance. Therefore, this paper proposes a new perceptual loss based on Contrastive Language-Image Pre-training (CLIP) with a Vision Transformer (ViT) backbone instead of a VGG network. The results demonstrate that the proposed perceptual loss can generate photo-realistic images without grid-like artifacts.
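The abstract describes replacing the VGG feature extractor in the perceptual loss with CLIP's ViT image encoder. As a rough illustration of that general idea only (this is not the authors' implementation), the following minimal sketch assumes the openly available OpenAI `clip` package and a simple embedding-distance formulation between a super-resolved image and its high-resolution target:

```python
import torch
import torch.nn.functional as F
import clip  # OpenAI CLIP: pip install git+https://github.com/openai/CLIP.git


class CLIPPerceptualLoss(torch.nn.Module):
    """Hypothetical sketch: compare frozen CLIP ViT embeddings of SR output and HR target."""

    def __init__(self):
        super().__init__()
        # Load the ViT-B/32 image encoder on CPU (fp32); move the module to GPU as needed.
        self.model, _ = clip.load("ViT-B/32", device="cpu")
        self.model.eval()
        for p in self.model.parameters():
            p.requires_grad_(False)  # encoder stays frozen; gradients still reach the SR network
        # CLIP's standard input normalization constants.
        self.register_buffer("mean", torch.tensor([0.48145466, 0.4578275, 0.40821073]).view(1, 3, 1, 1))
        self.register_buffer("std", torch.tensor([0.26862954, 0.26130258, 0.27577711]).view(1, 3, 1, 1))

    def forward(self, sr: torch.Tensor, hr: torch.Tensor) -> torch.Tensor:
        # Resize both images to CLIP's 224x224 input resolution and apply its normalization.
        sr = (F.interpolate(sr, size=(224, 224), mode="bicubic", align_corners=False) - self.mean) / self.std
        hr = (F.interpolate(hr, size=(224, 224), mode="bicubic", align_corners=False) - self.mean) / self.std
        # The distance between ViT image embeddings serves as the perceptual term.
        return F.mse_loss(self.model.encode_image(sr), self.model.encode_image(hr))
```

In a typical training setup, a term like this would be added to a pixel-wise loss (e.g. L1), in the same way a VGG-based perceptual loss is usually combined; the exact features, layers, and weighting used in the paper may differ.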

Published in

  • Journal of the Japan Society for Precision Engineering (精密工学会誌), 90 (2), 217-223, 2024-02-05

    The Japan Society for Precision Engineering
