PassTCN-PPLL: A Password Guessing Model Based on Probability Label Learning and Temporal Convolutional Neural Network.

Advanced Search

Junbin Ye, Min Jin, Guoliang Gong, Rongxuan Shen, Huaxiang Lu

Author Information

Junbin Ye: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
Min Jin: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
Guoliang Gong: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
Rongxuan Shen: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
Huaxiang Lu: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.

PMID: 36080943 DOI: 10.3390/s22176484

The frequent incidents of password leakage have increased people's attention and research on password security. Password guessing is an essential part of password cracking and password security research. The progression of deep learning technology provides a promising way to improve the efficiency of password guessing. However, the mainstream models proposed for password guessing, such as RNN (or other variants, such as LSTM, GRU), GAN and VAE still face some problems, such as the low efficiency and high repetition rate of the generated passwords. In this paper, we propose a password-guessing model based on the temporal convolutional neural network (PassTCN). To further improve the performance of the generated passwords, we propose a novel password probability label-learning method, which reconstructs labels based on the password probability distribution of the training set and deduplicates the training set when training. Experiments on the RockYou dataset showed that, when generating 108 passwords, the coverage rate of PassTCN with password probability label learning (PassTCN-PPLL) reached 12.6%, which is 87.2%, 72.6% and 42.9% higher than PassGAN (a password-guessing model based on GAN), VAEPass (a password-guessing model based on VAE) and FLA (a password-guessing model based on LSTM), respectively. The repetition rate of our model is 25.9%, which is 45.1%, 31.7% and 17.4% lower than that of PassGAN, VAEPass and FLA, respectively. The results confirm that our approach not only improves the coverage rate but also reduces the repetition rate.

password guessing password label temporal convolutional neural network

U19A2080/National Natural Science Foundation of China
U1936106/National Natural Science Foundation of China
XDA27040303/CAS Strategic Leading Science and Technology Project
XDA18040400/CAS Strategic Leading Science and Technology Project
XDB44000000/CAS Strategic Leading Science and Technology Project
31513070501/High Technology Project
1916312ZD00902201/High Technology Project

Computer Security

Humans

Neural Networks, Computer

Probability

Journal Article

OpenLB
Open Library of Bioscience