Skip to content

Instantly share code, notes, and snippets.

@OHua
Last active September 3, 2020 03:59
Show Gist options
  • Save OHua/371984de3ed252e43189ef01215ffb0d to your computer and use it in GitHub Desktop.
Save OHua/371984de3ed252e43189ef01215ffb0d to your computer and use it in GitHub Desktop.
##
https://www.contextures.com/xlSampleData01.html#data
https://stackoverflow.com/questions/14529838/apply-multiple-functions-to-multiple-groupby-columns
https://cmdlinetips.com/2018/02/how-to-subset-pandas-dataframe-based-on-values-of-a-column/
https://stackoverflow.com/questions/11869910/pandas-filter-rows-of-dataframe-with-operator-chaining
##
import pandas as pd
import numpy as np
##
xlsx = pd.ExcelFile("SampleData.xlsx")
xlsx.sheet_names[1]
data = pd.read_excel("SampleData.xlsx", sheet_name=xlsx.sheet_names[1])
data.groupby('Region').count()
##
xlsx = pd.ExcelFile("SampleData.xlsx")
xlsx.sheet_names[1]
data = pd.read_excel("SampleData.xlsx", sheet_name=xlsx.sheet_names[1])
data[data['Units']>50][['Region', 'Units']].groupby('Region').count()['Units']
##
d1 = data.groupby('Region').count()
d2 = data[data['Units']>50][['Region', 'Units']].groupby('Region').count()['Units']
d1['count_units_gt_50'] = d2
d1['count_units_lte_50'] = d1['Units']-d2
d1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment