- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- 【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
-
- TAKAHASHI KAZUKO
- Faculty of International Studies, Keiai University Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
-
- TAKAMURA HIROYA
- Precision and Intelligence Laboratory, Tokyo Institute of Technology
-
- OKUMURA MANABU
- Precision and Intelligence Laboratory, Tokyo Institute of Technology
Bibliographic Information
- Other Title
-
- 機械学習とルールベースの組み合わせによる自動職業コーディング
Search this article
Description
We apply a machine learning method to occupation coding, which is a task to categorize answers to open-ended questions about respondent's occupation.Specifically, we use Support Vector Machines (SVMs) and their combination with hand-crafted rules.Conducting occupation coding manually is expensive and sometimes leads to inconsistent coding results when coders are not experts in occupation coding. For this reason, a rule-based automatic method was developed and applied.However, its categorization performance was not satisfactory.Therefore, we adopt SVMs, which show high performance in various fields, and compare them with the rule-based method.We also investigate effective combination methods of SVMs and the rulebased method.We empirically show that SVMs outperform the rule-based method in occupation coding and that the combination of the two methods yields even better accuracy, and that the accuracy of each method increases if the part of the new samples is added to the training data.
Journal
-
- Journal of Natural Language Processing
-
Journal of Natural Language Processing 12 (2), 3-23, 2005
The Association for Natural Language Processing
- Tweet
Details 詳細情報について
-
- CRID
- 1390282679451314176
-
- NII Article ID
- 130004291834
-
- ISSN
- 21858314
- 13407619
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed