Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure
Detecting and controlling illegal websites (gambling and pornography sites) through illegal domain names has been an unsolved problem. Therefore, how to mine and discover potential illegal domain names in advance has become a current research hotspot. This paper studies a method of generating illega...
Main Authors: | Yuchen Liang, Yanan Cheng, Zhaoxin Zhang, Tingting Chai, Chao Li |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2023-03-01 |
Series: | Applied Sciences |
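Subjects: | illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm |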
Online Access: | https://www.mdpi.com/2076-3417/13/6/4061 |
_version_ | 1797613584024338432 |
---|---|
author | Yuchen Liang; Yanan Cheng; Zhaoxin Zhang; Tingting Chai; Chao Li |
author_facet | Yuchen Liang; Yanan Cheng; Zhaoxin Zhang; Tingting Chai; Chao Li |
author_sort | Yuchen Liang |
collection | DOAJ |
description | Detecting and controlling illegal websites (gambling and pornography sites) through their domain names remains an unsolved problem, so mining and discovering potential illegal domain names in advance has become a research hotspot. This paper studies a method of generating illegal domain names based on the character similarity of domain name structure. First, the K-means algorithm groups illegal domain names with similar structures into clusters. The clusters are then fed into an adversarial generative network for training. Finally, under a specific result verification method, experiments show that the generation algorithm achieves an average concentration of 23.82%, an effective concentration of 63.54%, and an expansion rate of 7.5. Compared with the enumeration algorithm, the generation algorithm greatly improves generation efficiency and accuracy. |
first_indexed | 2024-03-11T06:57:59Z |
format | Article |
id | doaj.art-d0210ca141b24fdd93cb55fcbd84e500 |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-11T06:57:59Z |
publishDate | 2023-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-d0210ca141b24fdd93cb55fcbd84e500; 2023-11-17T09:30:56Z; eng; MDPI AG; Applied Sciences; 2076-3417; 2023-03-01; 13; 6; 4061; 10.3390/app13064061; Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure; Yuchen Liang (0); Yanan Cheng (1); Zhaoxin Zhang (2); Tingting Chai (3); Chao Li (4); Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China (all five authors); Detecting and controlling illegal websites (gambling and pornography sites) through their domain names remains an unsolved problem, so mining and discovering potential illegal domain names in advance has become a research hotspot. This paper studies a method of generating illegal domain names based on the character similarity of domain name structure. First, the K-means algorithm groups illegal domain names with similar structures into clusters. The clusters are then fed into an adversarial generative network for training. Finally, under a specific result verification method, experiments show that the generation algorithm achieves an average concentration of 23.82%, an effective concentration of 63.54%, and an expansion rate of 7.5. Compared with the enumeration algorithm, the generation algorithm greatly improves generation efficiency and accuracy.; https://www.mdpi.com/2076-3417/13/6/4061; illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm |
spellingShingle | Yuchen Liang; Yanan Cheng; Zhaoxin Zhang; Tingting Chai; Chao Li; Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure; Applied Sciences; illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm |
title | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure |
title_full | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure |
title_fullStr | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure |
title_full_unstemmed | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure |
title_short | Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure |
title_sort | illegal domain name generation algorithm based on character similarity of domain name structure |
topic | illegal domain names; K-means; generation algorithm; adversarial generative network; enumeration algorithm |
url | https://www.mdpi.com/2076-3417/13/6/4061 |
work_keys_str_mv | AT yuchenliang illegaldomainnamegenerationalgorithmbasedoncharactersimilarityofdomainnamestructure AT yanancheng illegaldomainnamegenerationalgorithmbasedoncharactersimilarityofdomainnamestructure AT zhaoxinzhang illegaldomainnamegenerationalgorithmbasedoncharactersimilarityofdomainnamestructure AT tingtingchai illegaldomainnamegenerationalgorithmbasedoncharactersimilarityofdomainnamestructure AT chaoli illegaldomainnamegenerationalgorithmbasedoncharactersimilarityofdomainnamestructure |
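
The description field says that K-means first groups domain names with similar structures before the clusters are used to train the generative network. The record does not say which structural features the authors use, so the following is only a minimal sketch of that clustering step under stated assumptions: the feature set (label length plus letter, digit, and hyphen counts), the function names `structure_features` and `kmeans`, and the placeholder domains under the reserved `.example` TLD are all hypothetical and not taken from the paper. The training stage is not shown.

```python
import random


def structure_features(domain):
    """Map a domain's first label to (length, letters, digits, hyphens).

    This feature set is an assumption made for illustration only.
    """
    label = domain.split(".")[0].lower()
    letters = sum(c.isalpha() for c in label)
    digits = sum(c.isdigit() for c in label)
    hyphens = label.count("-")
    return (float(len(label)), float(letters), float(digits), float(hyphens))


def squared_distance(a, b):
    """Squared Euclidean distance between two feature tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd's algorithm; returns one cluster index per point."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    assignment = None
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: move each non-empty cluster's centroid to its mean.
        for j in range(k):
            members = [p for p, a in zip(points, new_assignment) if a == j]
            if members:
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
        if new_assignment == assignment:
            break  # assignments stopped changing, so the run has converged
        assignment = new_assignment
    return assignment


# Hypothetical placeholder names: three short digit-heavy labels and
# three long letter-only labels.
domains = [
    "ab123456.example",
    "cd654321.example",
    "ef112233.example",
    "alphabetagammadelta.example",
    "northsoutheastwest.example",
    "springsummerautumn.example",
]
labels = kmeans([structure_features(d) for d in domains], k=2)
for domain, label in zip(domains, labels):
    print(label, domain)
```

With these placeholder inputs the two structural patterns are far apart in feature space, so each pattern ends up in its own cluster. Real domain sets would likely need richer features and a tuned number of clusters.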