Identification and Classification of Research Data Cited in Scholarly Papers

Bibliographic Information

Other Title
  • 論文で引用された研究データの同定と分類
  • ロンブン デ インヨウ サレタ ケンキュウ データ ノ ドウテイ ト ブンルイ

Abstract

This paper proposes a method for identifying and classifying the research data cited in scholarly papers, with the aim of automatically generating metadata to be stored in data repositories. The study focuses on URL citations in scholarly papers: the goals are to identify the URLs that refer to research data and to classify them as tools or data. The method is formulated as multi-class classification (tool/data/others). It acquires distributed representations of the URLs from the context around them and uses these as input features, which has the advantage that the meaning of a URL can be derived from its surrounding words. The study adopts an approach that computes the meaning of an entire URL from the meanings of its components. To evaluate the proposed method, experiments on URL classification were conducted, using scholarly papers from the proceedings of an international conference as experimental data. The experimental results show that the proposed method is effective for identifying and classifying URLs that refer to research data.
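
The idea of composing the meaning of a whole URL from its components and then assigning one of the three classes can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the tokenization rule, the averaging of component vectors, the nearest-centroid classifier, and the hand-written two-dimensional toy vectors (`TOY_VECTORS`, `TOY_CENTROIDS`) are all hypothetical. In the paper, the representations are learned from the words surrounding each URL rather than written by hand.

```python
import re
from urllib.parse import urlparse


def url_components(url):
    """Split a URL into lowercase host and path tokens (the scheme is dropped)."""
    parsed = urlparse(url)
    raw = parsed.netloc + "/" + parsed.path
    return [tok for tok in re.split(r"[^A-Za-z0-9]+", raw.lower()) if tok]


def compose(tokens, vectors, dim):
    """Average the vectors of known tokens; unknown tokens are skipped.

    Averaging is an assumed composition function, chosen for simplicity.
    """
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def classify(vec, centroids):
    """Return the label whose centroid is nearest in squared Euclidean distance."""
    def dist(label):
        return sum((a - b) ** 2 for a, b in zip(vec, centroids[label]))
    return min(centroids, key=dist)


# Hypothetical toy vectors; real ones would be learned from citation contexts.
TOY_VECTORS = {
    "github": [1.0, 0.0],
    "parser": [0.9, 0.1],
    "corpus": [0.0, 1.0],
    "dataset": [0.1, 0.9],
}
# Hypothetical class centroids for the three labels named in the abstract.
TOY_CENTROIDS = {"tool": [1.0, 0.0], "data": [0.0, 1.0], "others": [0.5, 0.5]}


def classify_url(url):
    """Tokenize a URL, compose its vector, and pick the nearest class."""
    tokens = url_components(url)
    return classify(compose(tokens, TOY_VECTORS, 2), TOY_CENTROIDS)
```

Under these toy vectors, a call such as `classify_url("https://github.com/example/parser")` composes the vectors for `github` and `parser` and compares the result with each class centroid. A URL with no known components falls back to the zero vector, which in this toy setup lies closest to the `others` centroid.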
