-
- Qian Li
- Beihang University, Haidian district, Beijing, China
-
- Hao Peng
- Beihang University, Haidian district, Beijing, China
-
- Jianxin Li
- Beihang University, Haidian district, Beijing, China
-
- Congying Xia
- University of Illinois at Chicago, Chicago, IL, USA
-
- Renyu Yang
- University of Leeds, Leeds, England, UK
-
- Lichao Sun
- Lehigh University, Bethlehem, PA, USA
-
- Philip S. Yu
- University of Illinois at Chicago, Chicago, IL, USA
-
- Lifang He
- Lehigh University, Bethlehem, PA, USA
説明
<jats:p>Text classification is the most fundamental and essential task in natural language processing. The last decade has seen a surge of research in this area due to the unprecedented success of deep learning. Numerous methods, datasets, and evaluation metrics have been proposed in the literature, raising the need for a comprehensive and updated survey. This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021, focusing on models from traditional models to deep learning. We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification. We then discuss each of these categories in detail, dealing with both the technical developments and benchmark datasets that support tests of predictions. A comprehensive comparison between different techniques, as well as identifying the pros and cons of various evaluation metrics are also provided in this survey. Finally, we conclude by summarizing key implications, future research directions, and the challenges facing the research area.</jats:p>
収録刊行物
-
- ACM Transactions on Intelligent Systems and Technology
-
ACM Transactions on Intelligent Systems and Technology 13 (2), 1-41, 2022-04-08
Association for Computing Machinery (ACM)
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1360299458377434496
-
- DOI
- 10.1145/3495162
-
- ISSN
- 21576912
- 21576904
-
- データソース種別
-
- Crossref