Author identification of Japanese texts based on notations and syllabic writings usage patterns

  • MAESHIRO Tetsuya
    School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
  • JOHO Hideo
    School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
  • NAKAYAMA Shin-ichi
    School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
  • HAYAKURA Mai
    College of Informatics, University of Tsukuba

Bibliographic Information

Other Title
  • 表記と送り仮名の使用パターンを用いた日本語文章の著者判別
  • ヒョウキ ト オクリガナ ノ シヨウ パターン オ モチイタ ニホンゴ ブンショウ ノ チョシャ ハンベツ


Abstract

This paper proposes a novel feature for identifying the authors of Japanese texts, based on usage patterns of notation and okurigana (the kana written after a kanji stem). Both properties are specific to Japanese writing. The performance of the proposed indicator was compared with that of part-of-speech 3-grams and comma usage patterns, both regarded as among the best methods for author identification. Texts written by college students and published novels were analyzed. The proposed indicator offers a new viewpoint for capturing authors' characteristics, and its precision is comparable to that of the best conventional indicators.
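
As a rough illustration of what a notation and okurigana (syllabic writing) usage feature might look like, the Python sketch below counts which spelling variant of a word an author prefers. This is not the authors' method: the variant list `VARIANT_GROUPS`, the functions `notation_profile` and `profile_distance`, and the plain surface-string matching (which ignores conjugated forms) are all assumptions made for this example.

```python
# Hypothetical variant groups (not from the paper): each tuple lists
# alternative written forms of the same word.
VARIANT_GROUPS = [
    ("行う", "行なう"),      # okurigana variation
    ("できる", "出来る"),    # kana vs. kanji notation
    ("ください", "下さい"),  # kana vs. kanji notation
]


def notation_profile(text):
    """Return, for every variant in every group, its share within the group.

    Matching is done on raw surface strings, so conjugated forms are
    not counted. A group that never occurs contributes zeros.
    """
    profile = []
    for group in VARIANT_GROUPS:
        counts = [text.count(variant) for variant in group]
        total = sum(counts)
        profile.extend(c / total if total else 0.0 for c in counts)
    return profile


def profile_distance(a, b):
    """L1 distance between two profiles (smaller means more similar usage)."""
    return sum(abs(x - y) for x, y in zip(a, b))


sample_a = "実験を行う。実験を行う。調査を行なう。出来ると思う。"
sample_b = "実験を行なう。できると思う。"
print(notation_profile(sample_a))
print(profile_distance(notation_profile(sample_a), notation_profile(sample_b)))
```

In such a scheme, an unattributed text would be assigned to the candidate author whose profile is closest. A real system would need a morphological analyzer to handle conjugation and a much larger variant inventory.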
