Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure

Detecting and controlling illegal websites (gambling and pornography sites) through their domain names remains an unsolved problem. Mining and discovering potential illegal domain names in advance has therefore become a research hotspot. This paper studies a method of generating illega...


Bibliographic Details
Main Authors: Yuchen Liang, Yanan Cheng, Zhaoxin Zhang, Tingting Chai, Chao Li
Format: Article
Language: English
Published: MDPI AG 2023-03-01
Series: Applied Sciences
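Subjects: illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm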
Online Access: https://www.mdpi.com/2076-3417/13/6/4061
author Yuchen Liang
Yanan Cheng
Zhaoxin Zhang
Tingting Chai
Chao Li
collection DOAJ
description Detecting and controlling illegal websites (gambling and pornography sites) through their domain names remains an unsolved problem. Mining and discovering potential illegal domain names in advance has therefore become a research hotspot. This paper studies a method of generating illegal domain names based on the character similarity of domain name structure. First, the K-means algorithm groups illegal domain names with similar structures into clusters. Then, the clusters are fed into a generative adversarial network for training. Finally, evaluated with a specific result verification method, the generation algorithm achieves an average concentration of 23.82%, an effective concentration of 63.54%, and an expansion rate of 7.5. Compared with the enumeration algorithm, the generation algorithm is substantially better in both generation efficiency and accuracy.
first_indexed 2024-03-11T06:57:59Z
format Article
id doaj.art-d0210ca141b24fdd93cb55fcbd84e500
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T06:57:59Z
publishDate 2023-03-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-d0210ca141b24fdd93cb55fcbd84e500 | 2023-11-17T09:30:56Z | eng | MDPI AG | Applied Sciences | 2076-3417 | 2023-03-01 | vol. 13, no. 6, article 4061 | 10.3390/app13064061 | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure | Yuchen Liang, Yanan Cheng, Zhaoxin Zhang, Tingting Chai, Chao Li (all: Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China) | https://www.mdpi.com/2076-3417/13/6/4061 | illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm
title Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure
topic illegal domain names
K-means
generation algorithm
adversarial generative network
enumeration algorithm
url https://www.mdpi.com/2076-3417/13/6/4061
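
The description above says that K-means first groups domain names with similar structures. The following is a minimal sketch of that clustering step only, not the authors' implementation: this record does not state which structural features the paper uses, so the five features in `structure_features`, the helper names, the plain-Python `kmeans`, and the sample `.example` domains are all assumptions chosen for illustration. The generative adversarial network stage is not covered.

```python
import random


def structure_features(domain):
    """Describe the character structure of a domain's first label as numbers.

    The five features are an assumption of this sketch, not the paper's.
    """
    label = domain.lower().split(".")[0]
    n = max(len(label), 1)  # avoid dividing by zero on an empty label
    classes = ["L" if c.isalpha() else "D" if c.isdigit() else "O" for c in label]
    # Count switches between character classes: "ab12cd" switches twice.
    switches = sum(1 for a, b in zip(classes, classes[1:]) if a != b)
    return [
        float(len(label)),       # label length
        classes.count("L") / n,  # share of letters
        classes.count("D") / n,  # share of digits
        label.count("-") / n,    # share of hyphens
        float(switches),         # letter/digit/other transitions
    ]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd-style K-means; returns (cluster labels, centroids)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


if __name__ == "__main__":
    # Hypothetical sample names under the reserved .example TLD.
    domains = [
        "bet123.example",
        "win456.example",
        "a-b-c-d-e-f-g.example",
        "x-y-z-u-v-w-q.example",
    ]
    points = [structure_features(d) for d in domains]
    labels, _ = kmeans(points, k=2)
    for name, lab in zip(domains, labels):
        print(lab, name)
```

Names with the same letter, digit, and hyphen layout map to identical feature vectors and therefore always share a cluster. In practice the features would usually be scaled before clustering, since the raw length and transition counts dominate the ratio features here.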