Building a Manga Dataset “Manga109” With Annotations for Multimedia Applications

Kiyoharu Aizawa, Azuma Fujimoto, Atsushi Otsubo, Toru Ogawa, Yusuke Matsui, Koki Tsubota, Hikaru Ikuta

doi:10.48550/arxiv.2005.04425

Building a Manga Dataset “Manga109” With Annotations for Multimedia Applications

DOI DOI PDF 被引用文献6件オープンアクセス

Kiyoharu Aizawa

The University of Tokyo
Azuma Fujimoto

The University of Tokyo
Atsushi Otsubo

The University of Tokyo
Toru Ogawa

The University of Tokyo
Yusuke Matsui

The University of Tokyo
Koki Tsubota

The University of Tokyo
Hikaru Ikuta

The University of Tokyo

この論文をさがす

CiNii Books

説明

Manga, or comics, which are a type of multimodal artwork, have been left behind in the recent trend of deep learning applications because of the lack of a proper dataset. Hence, we built Manga109, a dataset consisting of a variety of 109 Japanese comic books (94 authors and 21,142 pages) and made it publicly available by obtaining author permissions for academic use. We carefully annotated the frames, speech texts, character faces, and character bodies; the total number of annotations exceeds 500k. This dataset provides numerous manga images and annotations, which will be beneficial for use in machine learning algorithms and their evaluation. In addition to academic use, we obtained further permission for a subset of the dataset for industrial use. In this article, we describe the details of the dataset and present a few examples of multimedia processing applications (detection, retrieval, and generation) that apply existing deep learning methods and are made possible by the dataset.

10 pages, 8 figures

収録刊行物

IEEE MultiMedia

IEEE MultiMedia 27 (2), 8-18, 2020-04-01

Institute of Electrical and Electronics Engineers (IEEE)

被引用文献 (6)*注記

詳細情報詳細情報について

CRID

1361137045657200256
DOI

10.1109/mmul.2020.2987895

10.48550/arxiv.2005.04425
ISSN

19410166

1070986X
Web Site

http://xplorestaging.ieee.org/ielx7/93/9115798/09069265.pdf?arnumber=9069265
データソース種別
- Crossref
- OpenAIRE

書き出し

問題の指摘

ページトップへ

Building a Manga Dataset “Manga109” With Annotations for Multimedia Applications

この論文をさがす

説明

収録刊行物

被引用文献 (6)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について