Word-Length Distribution of the Keywords using KANA Characters

Bibliographic Information

Other Title
  • 新聞記事情報に与えるカナ表記キーワードの語長分布
  • シンブン キジ ジョウホウ ニ アタエル カナ ヒョウキ キーワード ノ ゴチ

Search this article

Abstract

This is a report of the study on the word-length distribution of keywords using KANA characters. 5,564 keywords are assigned to about 10,000 articles of two newspapers (ASAHI and NIPPON KEIZAI) for 87 months (over 1969-1976). For these keywords, the word-length distribution patterns are shown for each category, such as natural science, industry, economy and sociology, general culture and so on. And the feature of patterns, the construction of words and the maximum/mean values of word-lengths, are described. It is revealed that (1) the mean value of word-lengths in all fields, including the proper noun, is 8.4 characters, (2) in modern developing fields of categories, the mean value of word-lengths is long compared to other fields, (3) and its pattern of distribution is not simple, (4) the maximum value of word-lengths for over all is 37 characters, and 99.9% of words are within the length of 30 characters and also 99% are within 20 characters.

Journal

  • Dokumenteshon kenkyu

    Dokumenteshon kenkyu 27 (11), 532-538, 1977

    Information Science and Technology Association, Japan

Details 詳細情報について

Report a problem

Back to top