日本語節境界検出プログラムCBAPの開発と評価

書誌事項

タイトル別名
  • Development and Evaluation of Japanese Clause Boundaries Annotation Program
  • ニホンゴセツ キョウカイ ケンシュツ プログラム CBAP ノ カイハツ ト ヒョウカ

この論文をさがす

抄録

Sentences generally tend to be long and complicated in monologues, and they cause problems for parsing and translation. It is desirable to define some short unit to process monologues efficiently. We developed “CBAP (Clause Boundaries Annotation Program), ” which detects and labels every clause boundary in Japanese text. CBAP accepts a series of morphemes with part-of-speech information and detectsthe final boundary of every clause with more than 97% accuracy. It also inserts 147 kinds of labels which represent the types of the boundaries. Since clauses are syntactically and semantically sufficient constituents, we can use the annotated labels for effective and flexible sentence segmentation. In this paper, we show the method for annotating Japanese clause boundaries, and present the result of experiments to examine the performance of CBAP.

収録刊行物

  • 自然言語処理

    自然言語処理 11 (3), 39-68, 2004

    一般社団法人 言語処理学会

被引用文献 (26)*注記

もっと見る

参考文献 (35)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ