A Study for Recognition·Selection of Chemical Substance Name in Documents – Using Patent Documents
- TANAKA Rumiko (Graduate School of Library Information and Media Studies, University of Tsukuba)
- NAKAYAMA Shin-ichi (Faculty of Library Information and Media Science, University of Tsukuba)
Bibliographic Information
- Other Title
- 文章からの化学物質名を含む単語の認識法の確立と化学物質名 の選択法の検討-特許公開公報を用いて
- 文章からの化学物質名を含む単語の認識法の確立と化学物質名の選定法の検討 : 特許公開公報を用いて
- ブンショウ カラ ノ カガク ブッシツメイ オ フクム タンゴ ノ ニンシキホウ ノ カクリツ ト カガク ブッシツメイ ノ センテイホウ ノ ケントウ : トッキョ コウカイ コウホウ オ モチイテ
Abstract
Chemical substance names are written in many different ways, and the form used depends on the author. This variation hinders the sharing of chemical knowledge, so automatic extraction of chemical substance names is useful for information sharing. To find a method for extracting chemical substance names from Japanese documents, we created a corpus of patent documents tagged with chemical substance names. We studied how to cut words out of sentences, and recognized chemical substance names by concatenating the cut-out words using part-of-speech information. We also studied how to select chemical substance names from the concatenated words, and compared the selection of chemical substance names with that of functional group names, which resemble chemical substance names.
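The concatenation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the actual rules, so everything here is an assumption made for illustration: the function name `concatenate_candidates`, the `JOINABLE` tag set, and the hand-tagged toy input are hypothetical and are not the authors' method. The sketch joins each maximal run of adjacent tokens whose part-of-speech tag is in the joinable set into one candidate string.

```python
# Hypothetical set of part-of-speech tags treated as joinable.
# The abstract does not list the real rules; this set is an assumption.
JOINABLE = {"noun", "prefix", "number", "symbol"}


def concatenate_candidates(tokens):
    """Join maximal runs of adjacent tokens whose POS tag is in JOINABLE.

    `tokens` is a list of (surface, pos) pairs, assumed to be already
    segmented and tagged by some morphological analyzer.
    """
    candidates = []
    run = []
    for surface, pos in tokens:
        if pos in JOINABLE:
            run.append(surface)
        else:
            if run:
                candidates.append("".join(run))
            run = []
    if run:  # flush a run that reaches the end of the sentence
        candidates.append("".join(run))
    return candidates


# Hand-tagged toy sentence (assumed segmentation, not real analyzer output).
tokens = [
    ("2", "number"),
    ("-", "symbol"),
    ("メチル", "noun"),
    ("プロパン", "noun"),
    ("を", "particle"),
    ("加える", "verb"),
]
print(concatenate_candidates(tokens))
```

A run-based join like this would also produce candidates that are not chemical substance names, such as functional group names, which is why a separate selection step of the kind the abstract describes would still be needed.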
Journal
- Joho Chishiki Gakkaishi
Joho Chishiki Gakkaishi 29 (3), 238-246, 2019-10-15
Japan Society of Information and Knowledge
Details
- CRID: 1390564227346158208
- NII Article ID: 130007745932
- NII Book ID: AN10459774
- ISSN: 1881-7661, 0917-1436
- HANDLE: 2241/00160075
- NDL BIB ID: 030052862
- Text Lang: ja
- Data Source: JaLC, IRDB, NDL, Crossref, CiNii Articles
- Abstract License Flag: Disallowed