PaSSw0rdVib3s! AI-assisted password recognition for digital forensic investigations

Open Access
Authors
Publication date 03-2025
Journal Forensic Science International: Digital Investigation
Article number 301870
Volume | Issue number 52 | supplement
Number of pages 8
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In digital forensic investigations, the ability to identify passwords in cleartext within digital evidence is often essential for the acquisition of data from encrypted devices. Passwords may be stored in cleartext, knowingly or accidentally, in various locations within a device, e.g., in text messages, notes, or system log files. Finding those passwords is a challenging task, as devices typically contain a substantial amount and a wide variety of textual data. This paper explores the performance of several different types of machine learning models trained to distinguish passwords from non-passwords, and ranks them according to their likelihood of being a human-generated password. Three deep learning models (PassGPT, CodeBERT and DistilBERT) were fine-tuned, and two traditional machine learning models (a feature-based XGBoost and a TF/IDF-based XGBoost) were trained. These were compared to the existing state-of-the-art technology, a password recognition model based on probabilistic context-free grammars. Our research shows that the fine-tuned PassGPT model outperforms the other models. We show that the combination of multiple different types of training datasets, carefully chosen based on the context, is needed to achieve good results. In particular, it is important to train not only on dictionary words and leaked credentials, but also on data scraped from chats and websites. Our approach was evaluated with realistic hardware that could fit inside an investigator's workstation. The evaluation was conducted on the publicly available RockYou and MyHeritage leaks, but also on a dataset derived from real casework, showing that these innovations can indeed be used in a real forensic context.
Document type Article
Note In special issue: DFRWS EU 2025 - Selected Papers from the 12th Annual Digital Forensics Research Conference Europe
Language English
Published at https://doi.org/10.1016/j.fsidi.2025.301870
Other links https://www.scopus.com/pages/publications/105000751935
Downloads
PaSSw0rdVib3s! (Final published version)
Permalink to this page
Back