@caniszczyk
Created December 28, 2012 20:41
A query against the GitHub Archive for "seven dirty words" usage in 2012
SELECT
  repository_name,
  repository_language,
  repository_description,
  repository_watchers,
  payload_commit_msg,
  url,
  PARSE_UTC_USEC(created_at) AS timestamp
FROM [githubarchive:github.timeline]
WHERE type = "PushEvent"
  AND (
    LOWER(payload_commit_msg) CONTAINS "shit" OR
    LOWER(payload_commit_msg) CONTAINS "piss" OR
    LOWER(payload_commit_msg) CONTAINS "fuck" OR
    LOWER(payload_commit_msg) CONTAINS "cunt" OR
    LOWER(payload_commit_msg) CONTAINS "cocksucker" OR
    LOWER(payload_commit_msg) CONTAINS "motherfucker" OR
    LOWER(payload_commit_msg) CONTAINS "tits"
  )
  AND PARSE_UTC_USEC(created_at) >= PARSE_UTC_USEC('2012-01-01 00:00:00')
  AND PARSE_UTC_USEC(created_at) < PARSE_UTC_USEC('2013-01-01 00:00:00')
ORDER BY timestamp DESC;
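The WHERE clause lowercases each commit message and then tests for substrings, so it also matches longer words that happen to contain one of the listed words. The Python sketch below mimics that filter locally to make the behavior easy to check. It is an illustration only, not part of the original gist: the names DIRTY_WORDS, matches_filter, and matched_words are hypothetical, and it does not talk to BigQuery.

```python
# Hypothetical local imitation of the query's WHERE-clause word filter.
# The word list is copied from the query; everything else is an assumption.
DIRTY_WORDS = (
    "shit", "piss", "fuck", "cunt",
    "cocksucker", "motherfucker", "tits",
)


def matched_words(commit_msg):
    """Return the listed words found in the message, as case-insensitive substrings."""
    lowered = (commit_msg or "").lower()
    return [word for word in DIRTY_WORDS if word in lowered]


def matches_filter(commit_msg):
    """Return True if the message would pass the word filter."""
    return bool(matched_words(commit_msg))


if __name__ == "__main__":
    samples = [
        "Fix this SHIT before release",
        "Initial commit",
        "Update petits fours recipe",  # substring false positive
    ]
    for msg in samples:
        print(msg, "->", matched_words(msg))
```

Because the test is a plain substring match, a message such as "Update petits fours recipe" passes the filter. A word-boundary regular expression would be one way to tighten it, at the cost of missing compound spellings.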