Region segmentation for table image with unknown complex structure
説明
In this paper, we describe a system of region segmentation and conversion into an HTML file for an unknown machine-printed table image. Ruled lines delimit some cells of the table, and omitted ruled lines also delimit other cells. We consider a table analysis system for both types of table cell. First, our system segments a table by means of the ruled lines into some regions. Secondly, these segmented regions are further segmented into cells by the omitted ruled lines that are indicators (such as numerals and characters). The cells include several character lines, and our system can convert a table of unknown complex structure into an HTML file. Also, we confirm the effectiveness of our region segmentation method for various kinds of tables with omitted ruled lines by computer experiments.
収録刊行物
-
- Proceedings of Sixth International Conference on Document Analysis and Recognition
-
Proceedings of Sixth International Conference on Document Analysis and Recognition 709-713, 2002-11-13
IEEE Comput. Soc