Automatic Term Recognition Using the Corpora of the Different Academic Areas

KUBO Junko, TSUJI Keita, SUGIMOTO Shigeo

doi:10.2964/jsik.19-320

Bibliographic Information

Other Title

異なる学問分野のコーパスを利用した専門用語抽出手法の提案
コトナルガクモンブンヤノコーパスオリヨウシタセンモンヨウゴチュウシュツシュホウノテイアン

Search this article

Description

In this paper, we propose a method for automatic term recognition (ATR) which is using the statistical differences of relative frequencies of terms in target domain corpus and in others. The target terms more frequently appear in target domain corpus than in other domain corpus. Utilizing such characteristics will lead to the improvement of extraction performance. Most of the ATR methods proposed so far only use the target domain corpus and do not take such characteristics into account. For the extraction experiment, we used the abstracts of the Women's Studies International Forum as a target domain corpus and those of academic journals of 39 domains as non-target domain corpus. The extraction performance was examined and we found that our method outperformed the existing ATR methods. We confirmed that it is possible to decrease the size of the other domain corpus by the experiments which used random journals out of 39 domains. As a result, we found that we used some corpus consists of journals which is similar to target domain is almost as high extraction performance as the corpus consists of 39 journals.

Journal

Joho Chishiki Gakkaishi

Joho Chishiki Gakkaishi 20 (1), 15-31, 2010

Japan Society of Information and Knowledge

Keywords

Details 詳細情報について

CRID: 1390001204423416448

NII Article ID: 10025992205

NII Book ID: AN10459774

DOI: 10.2964/jsik.19-320

ISSN: 18817661; 09171436

HANDLE: 2241/106716

NDL BIB ID: 10633248

Web Site: https://tsukuba.repo.nii.ac.jp/records/20527; http://id.ndl.go.jp/bib/10633248; https://ndlsearch.ndl.go.jp/books/R000000004-I10633248

Text Lang: ja

Article Type: journal article

Data Source

JaLC
IRDB
NDL Search
Crossref
CiNii Articles

Abstract License Flag: Disallowed

Export

Report a problem