- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- 【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Automated Labeling of Entities in CVE Vulnerability Descriptions with Natural Language Processing
-
- SUMOTO Kensuke
- Waseda University
-
- KANAKOGI Kenta
- Waseda University
-
- WASHIZAKI Hironori
- Waseda University
-
- TSUDA Naohiko
- Waseda University
-
- YOSHIOKA Nobukazu
- Waseda University
-
- FUKAZAWA Yoshiaki
- Waseda University
-
- KANUKA Hideyuki
- Hitachi, Ltd.
Search this article
Description
<p>Security-related issues have become more significant due to the proliferation of IT. Collating security-related information in a database improves security. For example, Common Vulnerabilities and Exposures (CVE) is a security knowledge repository containing descriptions of vulnerabilities about software or source code. Although the descriptions include various entities, there is not a uniform entity structure, making security analysis difficult using individual entities. Developing a consistent entity structure will enhance the security field. Herein we propose a method to automatically label select entities from CVE descriptions by applying the Named Entity Recognition (NER) technique. We manually labeled 3287 CVE descriptions and conducted experiments using a machine learning model called BERT to compare the proposed method to labeling with regular expressions. Machine learning using the proposed method significantly improves the labeling accuracy. It has an f1 score of about 0.93, precision of about 0.91, and recall of about 0.95, demonstrating that our method has potential to automatically label select entities from CVE descriptions.</p>
Journal
-
- IEICE Transactions on Information and Systems
-
IEICE Transactions on Information and Systems E107.D (5), 674-682, 2024-05-01
The Institute of Electronics, Information and Communication Engineers
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390299993933127680
-
- ISSN
- 17451361
- 09168532
-
- Text Lang
- en
-
- Article Type
- journal article
-
- Data Source
-
- JaLC
- Crossref
- KAKEN
- OpenAIRE
-
- Abstract License Flag
- Disallowed