A Study for Recognition·Selection of Chemical Substance Name in Documents – Using Patent Documents

  • TANAKA Rumiko
    Graduate School of Library Information and Media Studies, University of Tsukuba
  • NAKAYAMA Shin-ichi
    Faculty of Library Information and Media Science, University of Tsukuba

Bibliographic Information

Other Title
  • 文章からの化学物質名を含む単語の認識法の確立と化学物質名 の選択法の検討-特許公開公報を用いて
  • 文章からの化学物質名を含む単語の認識法の確立と化学物質名の選定法の検討 : 特許公開公報を用いて
  • ブンショウ カラ ノ カガク ブッシツメイ オ フクム タンゴ ノ ニンシキホウ ノ カクリツ ト カガク ブッシツメイ ノ センテイホウ ノ ケントウ : トッキョ コウカイ コウホウ オ モチイテ

Search this article

Abstract

<p> The chemical substance names described have various descriptions and the description of the name depends on the author. Such variation causes hindering information sharing of chemical knowledge. Auto-extraction of chemical substance names is useful for information sharing. In order to find a method for extracting the names of chemical substances in Japanese documents, we created a corpus of patent documents tagged with chemical substance names. We studied cutting out words from sentences and recognized chemical substance names by concatenating cut-out words using the part of speech information. We also studied selecting chemical substance names from concatenated cut-out words and made a selection comparison between chemical substance names and functional group names that are similar to chemical substance names.</p>

Journal

  • Joho Chishiki Gakkaishi

    Joho Chishiki Gakkaishi 29 (3), 238-246, 2019-10-15

    Japan Society of Information and Knowledge

Citations (1)*help

See more

References(5)*help

See more

Details 詳細情報について

Report a problem

Back to top