Malware Clustering Based on Graph Convolutional Networks
DOI:
Author:
Affiliation:

Clc Number:

TP391.1

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
    Abstract:

    Many new types of malwares are often modified by attackers based on the existing malwares. Therefore, family homology analysis of malwares can help to study of evolutionary trend and traceability of malwares. In this paper, starting from API call graphs of malwares and combined with Graph Convolutional Networks (GCN), we proposed a similarity calculation and family clustering model for malwares. Firstly, we extract API call graphs of malwares with disassembly tools and the attribution of the API functions in the graphs are labeled. Then, we select key API functions by its contribution to the malware families and the API call graphs of malwares are constructed. We use GCN and Convolutional Neural Networks (CNN) as the model of the malware similarity calculation which the inputs are the API call graphs. Finally, we use DBSCAN algorithm to cluster malwares. The experimental results show that the proposed method can achieve 87.3% accuracy and can effectively cluster malware families.

    Reference
    Related
    Cited by
Get Citation

Cite this article as: liukai, fangyong, zhanglei, zuozheng, liuliang. Malware Clustering Based on Graph Convolutional Networks [J]. J Sichuan Univ: Nat Sci Ed, 2019, 56: 654.

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:January 09,2019
  • Revised:March 05,2019
  • Adopted:March 07,2019
  • Online: July 15,2019
  • Published: