Automatic Occupation Coding by Combining Machine Learning and Rule-Based Methods (機械学習とルールベースの組み合わせによる自動職業コーディング)

Bibliographic information

Alternative title
  • Automatic Occupation Coding with Machine Learning and Hand-Crafted Rules

Description

We apply a machine learning method to occupation coding, the task of categorizing answers to open-ended questions about respondents' occupations. Specifically, we use Support Vector Machines (SVMs) and their combination with hand-crafted rules. Manual occupation coding is expensive and can produce inconsistent results when the coders are not experts in occupation coding. For this reason, a rule-based automatic method was developed and applied. However, its categorization performance was not satisfactory. Therefore, we adopt SVMs, which show high performance in various fields, and compare them with the rule-based method. We also investigate effective ways of combining SVMs with the rule-based method. We empirically show that SVMs outperform the rule-based method in occupation coding, that combining the two methods yields even better accuracy, and that the accuracy of each method increases when part of the new samples is added to the training data.
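The abstract does not say how the SVMs and the hand-crafted rules are combined. As a rough illustration only, the sketch below shows one possible scheme: the code proposed by a rule-based coder is added as an extra feature to the bag-of-words input of a learned classifier. Everything in it is an assumption made for illustration: the rules, the occupation codes (`EDU`, `MED`, `TRN`), the toy training answers, and the function names are hypothetical, and a multiclass perceptron stands in for the SVM so that the example needs only the Python standard library.

```python
from collections import defaultdict

# Hypothetical hand-crafted rules: keyword -> occupation code (toy codes, not a real scheme).
RULES = {"teacher": "EDU", "nurse": "MED", "driver": "TRN"}

def rule_code(answer):
    """Return the code of the first token that matches a rule, or None if no rule fires."""
    for token in answer.lower().split():
        if token in RULES:
            return RULES[token]
    return None

def features(answer):
    """Bag-of-words features plus one feature carrying the rule-based coder's output."""
    feats = {"w=" + t for t in answer.lower().split()}
    code = rule_code(answer)
    if code is not None:
        feats.add("rule=" + code)
    return feats

def train(samples, epochs=10):
    """Multiclass perceptron over sparse binary features (a stand-in for an SVM)."""
    labels = sorted({gold for _, gold in samples})
    weights = {y: defaultdict(float) for y in labels}
    for _ in range(epochs):
        for answer, gold in samples:
            feats = features(answer)
            pred = max(labels, key=lambda y: sum(weights[y][f] for f in feats))
            if pred != gold:
                # Move weights toward the gold code and away from the wrong prediction.
                for f in feats:
                    weights[gold][f] += 1.0
                    weights[pred][f] -= 1.0
    return labels, weights

def predict(model, answer):
    """Return the highest-scoring occupation code for a free-text answer."""
    labels, weights = model
    feats = features(answer)
    return max(labels, key=lambda y: sum(weights[y].get(f, 0.0) for f in feats))

# Toy training data (invented for this sketch).
TRAIN = [
    ("high school teacher", "EDU"),
    ("hospital nurse", "MED"),
    ("truck driver", "TRN"),
    ("teaches math at school", "EDU"),
    ("night shift at hospital", "MED"),
    ("delivers goods by truck", "TRN"),
]

model = train(TRAIN)
print(predict(model, "nurse at clinic"))
print(predict(model, "taxi driver"))
```

In this scheme the rule output is just one more piece of evidence, so the learned classifier can override a rule when the other words in the answer point elsewhere. Other combinations are equally plausible, for example using the rule-based code only when the classifier's score margin is small.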

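Published in

  • Journal of Natural Language Processing (自然言語処理)

    Journal of Natural Language Processing 12 (2), 3-23, 2005

    The Association for Natural Language Processing (一般社団法人 言語処理学会)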