Created
December 18, 2012 16:01
-
-
Save edsu/4329253 to your computer and use it in GitHub Desktop.
example of reading the www-talk 1991-1994 mbox archive with python
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env python | |
# prints the subject lines for the www-talk 1991-1994 email archive | |
import os | |
import mailbox | |
os.system("wget http://dl.dropbox.com/u/2797650/www-talk_1991-1994.tar.gz") | |
os.system("tar xvfz www-talk_1991-1994.tar.gz") | |
for mbox_file in os.listdir("www-talk_1991-1994/data"): | |
for msg in mailbox.mbox("www-talk_1991-1994/data/" + mbox_file): | |
print msg["subject"] |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment