大規模言語モデルを用いた静的解析の支援可能性検証

藤井, 翔太, 山岸, 伶

大規模言語モデル（LLM）が隆盛しており，サイバーセキュリティ分野を含む様々な領域への適用可能性が示されている．ここで，マルウェアの静的解析は，サイバーセキュリティ分野において重要な要素の一つであるものの，高い専門知識や長い時間が必要なことが知られている．そこで我々は，LLMが静的解析の支援に寄与し得るかに着目して実証実験を実施した．まず，LLMでマルウェアの説明を生成し，機能網羅性や解説記事との一致性を用いた評価を実施した．この結果，最大で90.9%の精度で機能を網羅した説明文を生成できることを実証した．また，LLMを実務に活用可能か検証するために，6名の静的解析技術者にLLMの説明を用いての疑似的な静的解析タスクを依頼した．その後の協力者へのアンケートやインタビューを通して，実務の観点からのLLMの活用可能性も示した．以上の評価を通して，LLMの静的解析支援への活用可能性を示した．さらに，アンケートやインタビュー結果を分析し，LLMを静的解析支援に活用する際の課題や求められる機能と共に，今後の研究可能性を提言としてまとめた．

Large Language Models (LLMs) are flourishing and have shown their applicability to various domains, including cybersecurity. A static analysis of malware is one of the important elements, but it is known to require a high level of expertise and a long time. Therefore, we conducted a demonstration experiment focusing on whether LLM can contribute to supporting static analysis. First, we evaluated the explanatory ability for malware functionality. The results demonstrated LLM can generate descriptions that cover functions with an accuracy of up to 90.9%. In addition, six static analysis engineers were asked to perform a pseudo static analysis task using LLM explanations in order to verify whether LLM can be used in practice. Through subsequent questionnaires and interviews with the collaborators, we also demonstrated the applicability of LLM from a practical perspective. Furthermore, we summarized the issues and required functions when utilizing LLM as a static analysis support, and recommendations for future research possibilities.

大規模言語モデルを用いた静的解析の支援可能性検証

書誌事項

抄録

収録刊行物

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

大規模言語モデルを用いた静的解析の支援可能性検証

書誌事項

抄録

収録刊行物

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について