PassTCN-PPLL: A Password Guessing Model Based on Probability Label Learning and Temporal Convolutional Neural Network.

Junbin Ye, Min Jin, Guoliang Gong, Rongxuan Shen, Huaxiang Lu
Author Information
  1. Junbin Ye: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
  2. Min Jin: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
  3. Guoliang Gong: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
  4. Rongxuan Shen: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.
  5. Huaxiang Lu: Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China.

Abstract

The frequent incidents of password leakage have increased people's attention and research on password security. Password guessing is an essential part of password cracking and password security research. The progression of deep learning technology provides a promising way to improve the efficiency of password guessing. However, the mainstream models proposed for password guessing, such as RNN (or other variants, such as LSTM, GRU), GAN and VAE still face some problems, such as the low efficiency and high repetition rate of the generated passwords. In this paper, we propose a password-guessing model based on the temporal convolutional neural network (PassTCN). To further improve the performance of the generated passwords, we propose a novel password probability label-learning method, which reconstructs labels based on the password probability distribution of the training set and deduplicates the training set when training. Experiments on the RockYou dataset showed that, when generating 108 passwords, the coverage rate of PassTCN with password probability label learning (PassTCN-PPLL) reached 12.6%, which is 87.2%, 72.6% and 42.9% higher than PassGAN (a password-guessing model based on GAN), VAEPass (a password-guessing model based on VAE) and FLA (a password-guessing model based on LSTM), respectively. The repetition rate of our model is 25.9%, which is 45.1%, 31.7% and 17.4% lower than that of PassGAN, VAEPass and FLA, respectively. The results confirm that our approach not only improves the coverage rate but also reduces the repetition rate.

Keywords

Grants

  1. U19A2080/National Natural Science Foundation of China
  2. U1936106/National Natural Science Foundation of China
  3. XDA27040303/CAS Strategic Leading Science and Technology Project
  4. XDA18040400/CAS Strategic Leading Science and Technology Project
  5. XDB44000000/CAS Strategic Leading Science and Technology Project
  6. 31513070501/High Technology Project
  7. 1916312ZD00902201/High Technology Project

MeSH Term

Computer Security
Humans
Neural Networks, Computer
Probability

Word Cloud

Created with Highcharts 10.0.0passwordratemodelbasedguessingpassword-guessingrepetitionpasswordsprobabilitytrainingresearchsecurityPasswordlearningimproveefficiencyLSTMGANVAEgeneratedproposetemporalconvolutionalneuralnetworkPassTCNsetcoveragelabel6%9%PassGANVAEPassFLArespectivelyfrequentincidentsleakageincreasedpeople'sattentionessentialpartcrackingprogressiondeeptechnologyprovidespromisingwayHowevermainstreammodelsproposedRNNvariantsGRUstillfaceproblemslowhighpaperperformancenovellabel-learningmethodreconstructslabelsdistributiondeduplicatesExperimentsRockYoudatasetshowedgenerating108PassTCN-PPLLreached12872%7242higher25451%317%174%lowerresultsconfirmapproachimprovesalsoreducesPassTCN-PPLL:GuessingModelBasedProbabilityLabelLearningTemporalConvolutionalNeuralNetwork

Similar Articles

Cited By