Journal of Lanzhou University of Technology ›› 2023, Vol. 49 ›› Issue (6): 100-106.

• Automation Technique and Computer Technology • Previous Articles     Next Articles

A generation method of malicious domain name training data based on generating adversarial network

LIU Wei-shan1,2, MA Xu-qi1,3, WANG Hang1, WU Zi-yan1   

  1. 1. GSCERT, Lanzhou 730000, China;
    2. School of Mechanical and Electrical Engineering, Lanzhou Univ. of Tech., Lanzhou 730050, China;
    3. School of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China
  • Received:2022-01-14 Online:2023-12-28 Published:2024-01-05

Abstract: Domain generation algorithm(DGA) is widely used by cyber attackers to generate a large number of random domain names to evade detection at present. While the existing DGA domain name detection can not effectively detect unknown malicious domains, because these models are all trained and constructed on publicly available datasets. In this paper, an autoencoder(AE) is first trained using real domain names, and then it is combined with the generative adversarial network(GAN) to construct a new DGA domain name generating model. Experiment results show that the sequences generated by this model are similar to the Alexa domain names in terms of length and character distribution, and it also can effectively reduce the performance of the DGA domain name classifier based on a long short-term memory (LSTM) network. These generated sequences enrich the malicious domain name dataset, which can significantly improve the performance of existing DGA domain name detectors with further utilization.

Key words: malicious domains, DGA, AE, GAN

CLC Number: