抽象構文木に基づくネスト構造に関する特徴を用いた悪性JavaScript検知手法

佐野, 涼太, 花田, 真樹, 早稲田, 篤志, 村上, 洋一, 布広, 永示, 折田, 彰, 関口, 竜也

ドライブバイダウンロード攻撃による被害は年々深刻化している．その攻撃の被害に遭う主な原因としては，Webサイトに埋め込まれた不正スクリプトによって，訪問者が悪性 Web サイトにアクセスさせられてしまうということが挙げられる．その不正スクリプトの多くは攻撃の検知を回避するための様々な手法が施されているため，このような悪性 Web サイトによる攻撃を高精度で検知する手法が求められている．本稿では，JavaScriptを用いた不正スクリプトを高精度に検知するために，スクリプトの抽象構文木における出現キーワード（出現文字列）とその属性（変数や関数など），プログラムの構造的特徴（ネストの深さとその位置的な分布）を利用して，機械学習アルゴリズム（SVM, NBC, Random Forest）に基づく検知手法を提案し，評価する．

Damage caused by drive-by download has been getting worse year by year. One of the main causes of the attack is that malicious scripts embedded in web sites force visitors to access malicious web sites. Also, many of the scripts use obfuscation to prevent detection. Therefore, it is required to detect such attacks with high accuracy. In this study, in order to detect malicious JavaScript code with high accuracy, we propose a machine learning method using not only statistical features, i.e. the number of keywords in the code and their attributes, such as variables and functions, but also structural features (nest depth) based on abstract syntax tree. In addition, the accuracies of the methods with different combinations of the features and machine learning algorithms, SVM, Native Bayes and Random Forest, are evaluated.

抽象構文木に基づくネスト構造に関する特徴を用いた悪性JavaScript検知手法

Bibliographic Information

Abstract

Journal

Keywords

Details 詳細情報について

Export

Report a problem