Consideration of New Word Extraction from Large-scale Directory Service
-
- Takeshita Kazutoshi
- Tokyo Metropolitan Institute of Technology
-
- Takama Yasufumi
- Tokyo Metropolitan University
Bibliographic Information
- Other Title
-
- 大規模ディレクトリサービスからの新出語抽出に関する考察
Abstract
Recently documents of various specific fields exist on the Web, which are updated frequently. As a result, there exist many fields-specific new words that are not listed in dictionaries. When a document including fields-specific new words is processed with computers for the purposes of indexing and information extraction, the treatment of new words becomes a problem. This paper proposes a method for extracting new words from category names in a large-scale Web dictionary service. The method is based on several characteristics of a word, such as the number of hits in Google search, the number of categories containing the word in the directory service, and the part-of-speech pattern. The experimental results show their effectiveness for extracting new words.
Journal
-
- Proceedings of the Fuzzy System Symposium
-
Proceedings of the Fuzzy System Symposium 21 (0), 22-22, 2005
Japan Society for Fuzzy Theory and Intelligent Informatics
- Tweet
Details 詳細情報について
-
- CRID
- 1390282680644898688
-
- NII Article ID
- 130005035028
-
- Data Source
-
- JaLC
- CiNii Articles
-
- Abstract License Flag
- Disallowed