Greg Reda gjreda

@gjreda
gjreda / saban-seasons.py
Created Nov 11, 2018
Scraping Nick Saban's seasons as Alabama head coach
View saban-seasons.py
"""
Scraping Nick Saban's seasons as Alabama head coach
I was curious what % of his time Alabama has spent at #1
"""
from collections import Counter
from bs4 import BeautifulSoup
import requests
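The preview stops at the imports, so the scraping logic itself is not shown here. As a minimal sketch of the counting step only, assuming the weekly rankings have already been collected into a list, `Counter` can give the share of weeks spent at #1. The `weekly_ranks` values and the `share_at_rank` helper are hypothetical illustrations, not data or code from the gist.

```python
from collections import Counter

# Hypothetical weekly poll ranks (None = unranked); made-up values, not real data.
weekly_ranks = [1, 1, 2, 1, None, 3, 1, 1]


def share_at_rank(ranks, target=1):
    """Return the fraction of weeks spent at the `target` rank."""
    counts = Counter(ranks)
    total = sum(counts.values())
    return counts[target] / total if total else 0.0


pct_at_one = share_at_rank(weekly_ranks)
print(f"{pct_at_one:.1%} of weeks at #1")
```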
View towed.json
View senators.json
{
"meta": {
"limit": 100,
"offset": 0,
"total_count": 100
},
"objects": [
{
"caucus": null,
"congress_numbers": [
View gjreda-tweets.csv
id text
805168201126518784 @ryanisaac this is the weirdest quarter of football I’ve seen in a while
804818096942968833 @SportsTribution I don't follow.
804816669281546240 Looking for a weekend longread? @samhinkie's resignation letter is still one of the best things I've read in 2016 https://t.co/y7464DISgX
804759041318813696 Have used Postico to query our Redshift cluster for the last few months and it's been great. Similar to Sequel Pro. https://t.co/NN0DvdCpa6
804699067590840320 @jrmontag @tanehisicoates Agreed. Important book.
804690839221964801 The Year of the Looking Glass: Building Products https://t.co/0MVAbxeSze
804469380352446464 "So how do we build trust? The easy answer is by producing high quality work. The hard part is how you get there." https://t.co/M4MgJYU2Wm
804015210621239297 Holywow this looks awesome. Continuously impressed by the data products the @awscloud team keeps churning out: https://t.co/jmkLqFjyn7
803734870706896896 RT @jevnin: I'd recommend working with this guy. https://t.
View gjreda-tweets.json
[
{
"id": "805168201126518784",
"text": "@ryanisaac this is the weirdest quarter of football I’ve seen in a while"
},
{
"id": "804818096942968833",
"text": "@SportsTribution I don't follow."
},
{
@gjreda
gjreda / useful-one-liners.sh
Last active Apr 11, 2017
Random bash one-liners that are useful but I always forget
View useful-one-liners.sh
# Installing/upgrading old requirements.txt from python2 to python3
sed 's/=/ /g' requirements.txt | awk '{print $1}' | xargs -n1 pip3 install --upgrade
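The one-liner above strips version pins by replacing `=` with a space and keeping the first field. A rough Python sketch of the same name-extraction step follows, in case it is easier to reuse from a script. The `package_names` helper is hypothetical, and it goes slightly beyond the `sed` version by also splitting on `<`, `>`, `!`, `~`, extras, and markers, and by skipping comments and option lines.

```python
import re


def package_names(requirements_text):
    """Extract bare package names, dropping version specifiers and comments."""
    names = []
    for line in requirements_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Skip blank lines and option lines such as "-r other.txt".
        if not line or line.startswith("-"):
            continue
        # Cut at the first specifier, extras, marker, or whitespace character.
        name = re.split(r"[=<>!~\[;\s]", line, maxsplit=1)[0]
        if name:
            names.append(name)
    return names
```

Each returned name could then be passed to `pip3 install --upgrade`, as the shell pipeline does.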
@gjreda
gjreda / concurrent_futures_example.py
Created Sep 25, 2016
example of using Python3's concurrent.futures module
View concurrent_futures_example.py
from concurrent.futures import ProcessPoolExecutor
import concurrent.futures
from halas.parsers import boxscore
GAMES = [ ... ]
results = []
with ProcessPoolExecutor(max_workers=4) as executor:
    future_results = {executor.submit(boxscore, game):
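The preview is cut off mid-expression, so the rest of the gist is not reproduced here. As a self-contained sketch of the submit-then-collect pattern it appears to start, assuming a dict that maps each future back to its input: `parse_game` is a hypothetical stand-in for `halas.parsers.boxscore`, and `run_all` and its `executor_cls` parameter are illustrative names, not part of the original.

```python
from concurrent.futures import ProcessPoolExecutor, as_completed


def parse_game(game_id):
    """Hypothetical stand-in for halas.parsers.boxscore."""
    return {"game": game_id, "length": len(game_id)}


def run_all(games, executor_cls=ProcessPoolExecutor, max_workers=4):
    """Run parse_game over every game and return {game: result or exception}."""
    results = {}
    with executor_cls(max_workers=max_workers) as executor:
        # Map each future back to its input so results can be labelled.
        futures = {executor.submit(parse_game, game): game for game in games}
        # as_completed yields futures in finish order, not submission order.
        for future in as_completed(futures):
            game = futures[future]
            try:
                results[game] = future.result()
            except Exception as exc:
                # Keep the failure alongside its input instead of aborting.
                results[game] = exc
    return results
```

With the process-based default, the call to `run_all` is best placed under an `if __name__ == "__main__":` guard so that worker processes can import the module safely.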
@gjreda
gjreda / pandas-groupby-cumulative-count-with-reset.py
Last active Aug 2, 2016
days since last login -- pandas groupby cumulative count with reset
View pandas-groupby-cumulative-count-with-reset.py
import pandas as pd

# for creating a column like "days since last login"
df = pd.read_clipboard(index_col=['customer_id', 'days'])
(df
.groupby(level='customer_id')
.did_login
.cumsum()
.to_frame()
.groupby(level='customer_id')
.apply(lambda g: g.groupby('did_login').cumcount())
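The chain above is truncated in the preview. The idea it relies on can be sketched on its own: a per-customer running sum of `did_login` increases at every login, so rows that share a value belong to the same "since this login" block, and a cumulative count within each block gives the days elapsed. The sample data is hypothetical, and this sketch uses plain columns rather than the index levels used in the gist.

```python
import pandas as pd

# Hypothetical sample data; column names follow the snippet above.
df = pd.DataFrame({
    "customer_id": [1, 1, 1, 1, 2, 2, 2],
    "days":        [0, 1, 2, 3, 0, 1, 2],
    "did_login":   [1, 0, 0, 1, 0, 1, 0],
})

# Running login count per customer: it steps up by one at each login,
# so all rows since the same login share one value.
df["login_block"] = df.groupby("customer_id")["did_login"].cumsum()

# Position within each (customer, login block) = days since that login.
df["days_since_login"] = df.groupby(["customer_id", "login_block"]).cumcount()

print(df)
```

Rows before a customer's first login fall into block 0, so their count runs from the start of the data rather than from a login.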
@gjreda
gjreda / pandas-groupby-cumsum.py
Last active Oct 19, 2018
add grouped cumulative sum column to pandas dataframe
View pandas-groupby-cumsum.py
"""
add grouped cumulative sum column to pandas dataframe
Add a new column to a pandas dataframe which holds the cumulative sum for a given grouped window
Desired output:
user_id,day,session_minutes,cumulative_minutes
516530,0,NaN,0
516530,1,0,0
516532,0,5,5
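The preview ends before any code. A minimal sketch that reproduces the three output rows shown above might look like this, assuming missing `session_minutes` should count as zero (as the `NaN` to `0` row suggests) and that rows are already ordered by `day` within each user.

```python
import numpy as np
import pandas as pd

# The three rows shown in the desired output above.
df = pd.DataFrame({
    "user_id": [516530, 516530, 516532],
    "day": [0, 1, 0],
    "session_minutes": [np.nan, 0, 5],
})

# Treat missing minutes as 0, then take a running sum within each user.
df["cumulative_minutes"] = (
    df["session_minutes"].fillna(0).groupby(df["user_id"]).cumsum()
)

print(df.to_csv(index=False))
```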
@gjreda
gjreda / coding-music.md
Last active Aug 18, 2017
Music to code by
View coding-music.md