Document categorization for document image understanding
説明
In the knowledge-based document image understanding, it is important to distinguish the layout structures of individual documents exactly with a view to making use of adaptable document model. At least, the document models which are characterized heuristically by the application-specific layout structures are not always applicable to every document. In this paper, we propose a categorization method of various kinds of documents. Our categorization method on the basis of the classification and verification paradigm divides various kinds of documents into appropriate document types stepwisely. First, the classification procedure divides the given documents using rough features about documents, and then the verification procedure is applied to the globally categorized document sets, using the detail features.