Today's Japanese corpora(<Special feature>Japanese language in the digital era)

Bibliographic Information

Other Title
  • 日本語コーパスの今(<特集>デジタル時代の日本語)
  • 日本語コーパスの今
  • ニホンゴ コーパス ノ イマ

Search this article

Abstract

In 2011, 100 million scale "Balanced Corpus of Contemporary Written Japanese" was released, and it is widely used in many fields such as Japanese linguistics, Japanese language education, and natural language processing. In this paper, history of the Japanese corpora is described first. Next, the language resources constructed at the National Institute for Japanese Language and Linguistics, including the "Corpus of Spontaneous Japanese", "Taiyo Corpus", "Balanced Corpus of Contemporary Written Japanese", "Corpus of Historical Japanese" and "UniDic" are introduced. Furthermore, roles which Japanese corpora carry out for today's digital age, and desirable way of Japanese corpora in future are discussed.

Journal

Details 詳細情報について

Report a problem

Back to top