@pyk
Created January 22, 2019 09:00
Typod
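import json

# Flatten the JSON Lines dataset into plain text:
# each record's title, then its content when present.
with open("typo-detector-dataset.json") as source, \
        open("kumparan-news-dataset-19062018-16112018.txt", "w") as output:
    for line in source:
        data = json.loads(line)
        output.write(data["title"].strip() + "\n")
        # Not every record carries a "content" field.
        if "content" in data:
            output.write(data["content"].strip() + "\n")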
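
To try the conversion without the real dataset, the same logic can be run on a couple of in-memory records. This is only a sketch: the function name `jsonl_to_text`, the sample records, and the choice to skip blank lines are assumptions for illustration and are not part of the original script.

```python
import io
import json


def jsonl_to_text(lines, output):
    """Write each record's title, then its content if present (hypothetical helper)."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # assumption: blank lines are skipped instead of raising an error
        data = json.loads(line)
        output.write(data["title"].strip() + "\n")
        if "content" in data:
            output.write(data["content"].strip() + "\n")


# Made-up sample records in the same shape the script expects:
# one JSON object per line, with "title" and an optional "content".
sample = [
    '{"title": " First headline ", "content": "Body text. "}\n',
    '{"title": "Second headline"}\n',
]

buffer = io.StringIO()
jsonl_to_text(sample, buffer)
print(buffer.getvalue(), end="")
```

Passing any iterable of lines and any writable text object keeps the logic testable; for the real files, an open file handle can be passed for each argument.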