Skip to content

Instantly share code, notes, and snippets.

@tangotiger
Last active February 7, 2016 21:24
Show Gist options
  • Save tangotiger/4fec9a63b2cb4692ecb9 to your computer and use it in GitHub Desktop.
Save tangotiger/4fec9a63b2cb4692ecb9 to your computer and use it in GitHub Desktop.
Parse Schedule
print("Parse start")
sourcefile = "C:\\Users\\TOM\\PycharmProjects\\downloadNHL\\datafiles\\schedulebyseason.htm"
targetfile = "C:\\Users\\TOM\\PycharmProjects\\downloadNHL\\datafiles\\parsed_schedulebyseason.txt"
searchstr = "recap?id="
sample_recstr = "2015020001"
reclen = len(sample_recstr)
i = 0
with open(sourcefile,'r') as infile, open(targetfile,'w') as outfile:
for line in infile:
line_iterator = str(line).split(searchstr)
if len(line_iterator) > 1:
game_id = line_iterator[1][0:reclen]
outfile.write(game_id)
outfile.write("\n")
i = i + 1
print(str(i) + " : records written")
print("Parse end")
@tangotiger
Copy link
Author

Someone pointed to here:
https://gist.github.com/Ja1meMartin/db1b71ed90921aff24fa
And I made my updates accordingly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment