Created
January 23, 2019 14:37
-
-
Save ahvonenj/ce9d0d304ceec7ed0682df7b30664522 to your computer and use it in GitHub Desktop.
1.4 billion password breach compilation wordlist
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
wordlist created from original 41G stash via: | |
grep -rohP '(?<=:).*$' | uniq > breachcompilation.txt | |
Then, compressed with: | |
7z a breachcompilation.txt.7z breachcompilation.txt | |
Size: | |
4.1G compressed | |
9.0G uncompressed | |
No personal information included - just a list of passwords. | |
magnet url: | |
magnet:?xt=urn:btih:5a9ba318a5478769ddc7393f1e4ac928d9aa4a71&dn=breachcompilation.txt.7z | |
Welp, looks like this is not sorted, so not a unique list. :(
sort with the number of cores you have available:
LC_ALL=C sort --parallel=8 -u breachcompilation.txt -o breachcompilation.sorted.txt
remove leading spaces:
sed -i -e's/^\s*//' breachcompilation.sorted.txt
grep -rohP '(?<=:).*$' |sort |uniq -c|sort -nr > breachcompilation_index.txt
cat breachcompilation_index.txt |cut -b 9- > breachcompilation.txt
How can i read filtering for email with cat?
Hi does anyone have instructions on how to compile this? My email is in this breach and I want to verify it.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
magnet:?xt=urn:btih:7ffbcd8cee06aba2ce6561688cf68ce2addca0a3&dn=BreachCompilation&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.leechers-paradise.org%3A6969&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Fglotorrents.pw%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337