@ZeroRaven
Last active April 4, 2019 08:43
10.2.py
10.2 Write a program to read through mbox-short.txt and work out the distribution of messages by hour of the day. You can pull the hour out of the 'From ' line by finding the time and then splitting that string a second time on the colon.
From stephen.marquard@uct.ac.za Sat Jan 5 09:14:16 2008
Once you have accumulated the counts for each hour, print out the counts, sorted by hour as shown below.
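The two-step split described above can be tried on the sample line alone. This is a minimal sketch, not part of the gist's solution; the variable names `fields` and `time_field` are chosen here for illustration.

```python
# Sample line copied from the exercise text.
line = "From stephen.marquard@uct.ac.za Sat Jan 5 09:14:16 2008"

# First split: on whitespace, giving
# ['From', address, weekday, month, day, time, year].
fields = line.split()
time_field = fields[5]

# Second split: on the colon, giving [hours, minutes, seconds].
hour = time_field.split(":")[0]
print(hour)
```

The hour stays a string, which is convenient here: zero-padded hours sort in the same order as the numbers they represent.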
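name = input("Enter file:")
if len(name) < 1:
    name = "mbox-short.txt"
handle = open(name)
counts = dict()
for line in handle:
    # Only envelope lines starting with 'From ' carry the timestamp
    if not line.startswith('From '):
        continue
    # Fields: From, address, weekday, month, day, time, year
    lin = line.rstrip().split()
    time = lin[5]
    # Second split, on the colon: 'HH:MM:SS' -> ['HH', 'MM', 'SS']
    tmesp = time.split(':')
    hour = tmesp[0]
    counts[hour] = counts.get(hour, 0) + 1
# Sort by hour; zero-padded strings sort in numeric order
x = sorted(counts.items())
for k, v in x:
    print(k, v)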
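For checking the counting logic without the data file, the same idea can be wrapped in a function that accepts any iterable of lines. This is a hedged sketch rather than the gist author's code: the helper name `count_hours` and the sample lines are hypothetical, the length check for malformed lines is an added assumption, and `collections.Counter` is swapped in for the `dict.get` idiom.

```python
from collections import Counter


def count_hours(lines):
    """Return sorted (hour, count) pairs for mbox-style 'From ' lines.

    Hypothetical helper, not part of the original gist.
    """
    counts = Counter()
    for line in lines:
        # Envelope lines start with 'From ' (trailing space);
        # header lines such as 'From:' are skipped.
        if not line.startswith("From "):
            continue
        fields = line.split()
        if len(fields) < 6:
            continue  # assumption: ignore lines with no time field
        hour = fields[5].split(":")[0]
        counts[hour] += 1
    return sorted(counts.items())


# Made-up sample lines in the format shown in the exercise.
sample = [
    "From stephen.marquard@uct.ac.za Sat Jan 5 09:14:16 2008\n",
    "From: stephen.marquard@uct.ac.za\n",
    "From someone@example.com Fri Jan 4 16:10:39 2008\n",
    "From other@example.com Fri Jan 4 09:05:31 2008\n",
]
for hour, count in count_hours(sample):
    print(hour, count)
```

Passing an open file object works the same way, since iterating over a file yields its lines.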