Ruby Removal Filters by Genetic Programming using the classification of printing type data for Early-Modern Japanese Printed Books
Bibliographic Information
- Other Title
-
- 活字データの分類を用いた進化計算による近代書籍からのルビ除去
Search this article
Description
In National Diet Library, books which are possessed in library as "the digital library from meiji era" are open to the public on Web. Since these are shown as image data and cannot search using document contents, an automatic text conversion is needed. There is a major obstacle to text conversion. It is ruby. Ruby can not be removed in the histogram method. Therefore, we have proposed a ruby removal method for early-modern Japanese printed books. However, since the proposed method is based on the external information added to the books, the feasibility is low. In this paper, we propose a method to remove the ruby automatically from early-modern Japanese printed books by generating ruby removal formula in Genetic Programming using the training data was based on the data of book image.
Journal
-
- IPSJ SIG Notes
-
IPSJ SIG Notes 2014 (20), 1-6, 2014-06-18
Information Processing Society of Japan (IPSJ)
- Tweet
Details 詳細情報について
-
- CRID
- 1570572702972833024
-
- NII Article ID
- 110009795498
-
- NII Book ID
- AN10505667
-
- ISSN
- 09196072
-
- Text Lang
- ja
-
- Data Source
-
- CiNii Articles