Author identification of Japanese texts based on notations and syllabic writings usage patterns
- MAESHIRO Tetsuya
  - School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
- JOHO Hideo
  - School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
- NAKAYAMA Shin-ichi
  - School of Library and Information Science, University of Tsukuba; Research Center for Knowledge Communities, University of Tsukuba
- HAYAKURA Mai
  - College of Informatics, University of Tsukuba
Bibliographic Information
- Other Title
- 表記と送り仮名の使用パターンを用いた日本語文章の著者判別
- ヒョウキ ト オクリガナ ノ シヨウ パターン オ モチイタ ニホンゴ ブンショウ ノ チョシャ ハンベツ
Abstract
This paper presents a novel feature for discriminating between authors of Japanese texts, based on usage patterns of notation and syllabic writing (okurigana). These two properties are particular to Japanese texts. The performance of the proposed indicator was compared with that of part-of-speech 3-grams and comma usage patterns, both of which are considered among the best methods for author identification tasks. Texts written by college students and published novels were analyzed. The proposed indicator offers a novel viewpoint for capturing authors' characteristics, and its precision is equivalent to that of the conventional best indicators.
Journal
- Joho Chishiki Gakkaishi
Joho Chishiki Gakkaishi 24 (3), 342-364, 2014
Japan Society of Information and Knowledge
Details
- CRID: 1390282679401579008
- NII Article ID: 130004843610
- NII Book ID: AN10459774
- ISSN: 18817661, 09171436
- NDL BIB ID: 025870952
- Text Lang: ja
- Data Source: JaLC, NDL, Crossref, CiNii Articles
- Abstract License Flag: Disallowed