Ruby Removal Filters by Genetic Programming using the classification of printing type data for Early-Modern Japanese Printed Books

Bibliographic Information

Other Title
  • 活字データの分類を用いた進化計算による近代書籍からのルビ除去

Description

The National Diet Library makes books from its collection available to the public on the Web as the "Digital Library from the Meiji Era". Because these books are provided as image data and cannot be searched by their contents, automatic conversion to text is needed. A major obstacle to this conversion is ruby, the small phonetic glosses printed beside the main text. Ruby cannot be removed by the histogram method. We therefore previously proposed a ruby removal method for early-modern Japanese printed books. However, because that method relies on external information added to the books, its feasibility is low. In this paper, we propose a method that removes ruby automatically from early-modern Japanese printed books by generating a ruby removal formula with Genetic Programming, using training data based on book image data.
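The histogram method mentioned above is not specified in this record. As a rough illustration only, the sketch below assumes it means a projection profile: ink pixels are counted per pixel column of a binarized page, and narrow runs of inked columns are treated as ruby candidates. The function names, the toy images, and the 0.6 width ratio are all hypothetical and are not taken from the paper. The second toy image shows one plausible failure mode (ruby touching the main text column), which the abstract itself does not describe.

```python
def column_profile(image):
    """Count ink pixels (value 1) in each pixel column of a binary image."""
    return [sum(col) for col in zip(*image)]


def ink_runs(profile):
    """Return (start, end) ranges of consecutive inked columns; end is exclusive."""
    runs, start = [], None
    for x, count in enumerate(profile):
        if count > 0 and start is None:
            start = x
        elif count == 0 and start is not None:
            runs.append((start, x))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def split_by_width(runs, ratio=0.6):
    """Split runs into (main, ruby) by width relative to the widest run.

    The ratio is a hypothetical threshold chosen for this toy example.
    """
    widest = max(end - start for start, end in runs)
    main = [r for r in runs if r[1] - r[0] >= ratio * widest]
    ruby = [r for r in runs if r[1] - r[0] < ratio * widest]
    return main, ruby


# Toy vertical-text column: 3-pixel-wide main text, 1-pixel-wide ruby to its right.
separated = [[0, 1, 1, 1, 0, 1, 0]] * 4
# Same layout, but the ruby touches the main text, so no empty column separates them.
touching = [[0, 1, 1, 1, 1, 0, 0]] * 4

main_a, ruby_a = split_by_width(ink_runs(column_profile(separated)))
main_b, ruby_b = split_by_width(ink_runs(column_profile(touching)))
print(main_a, ruby_a)  # ruby is isolated as its own narrow run
print(main_b, ruby_b)  # ruby is merged into the main run and not detected
```

In the second case the profile has no gap between the main text and the ruby, so a fixed width rule has nothing to separate. A learned removal formula, such as the one the paper generates with Genetic Programming, is one way to avoid hand-tuning such rules.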

Journal

  • IPSJ SIG Notes

    IPSJ SIG Notes 2014 (20), 1-6, 2014-06-18

    Information Processing Society of Japan (IPSJ)

Details

  • CRID
    1570572702972833024
  • NII Article ID
    110009795498
  • NII Book ID
    AN10505667
  • ISSN
    09196072
  • Text Lang
    ja
  • Data Source
    • CiNii Articles
