Consideration of New Word Extraction from Large-scale Directory Service

Takeshita Kazutoshi, Takama Yasufumi

doi:10.14864/fss.21.0.22.0

Bibliographic Information

Other Title

大規模ディレクトリサービスからの新出語抽出に関する考察

Abstract

Recently documents of various specific fields exist on the Web, which are updated frequently. As a result, there exist many fields-specific new words that are not listed in dictionaries. When a document including fields-specific new words is processed with computers for the purposes of indexing and information extraction, the treatment of new words becomes a problem. This paper proposes a method for extracting new words from category names in a large-scale Web dictionary service. The method is based on several characteristics of a word, such as the number of hits in Google search, the number of categories containing the word in the directory service, and the part-of-speech pattern. The experimental results show their effectiveness for extracting new words.

Journal

Proceedings of the Fuzzy System Symposium

Proceedings of the Fuzzy System Symposium 21 (0), 22-22, 2005

Japan Society for Fuzzy Theory and Intelligent Informatics

Keywords

Details 詳細情報について

CRID: 1390282680644898688

NII Article ID: 130005035028

DOI: 10.14864/fss.21.0.22.0

Data Source

JaLC
CiNii Articles

Abstract License Flag: Disallowed

Export