@Alex-Kopylov
Last active September 22, 2019 08:36
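import pandas as pd
from scipy import stats


def resumetable(df):
    """Return a per-column summary of df: dtype, missing count, unique count,
    the first three values, and the Shannon entropy (in bits) of the values."""
    print(f"Dataset Shape: {df.shape}")
    summary = pd.DataFrame(df.dtypes, columns=['dtypes'])
    summary = summary.reset_index()
    summary['Name'] = summary['index']
    summary = summary[['Name', 'dtypes']]
    summary['Missing'] = df.isnull().sum().values
    summary['Uniques'] = df.nunique().values
    # Positional indexing, so a non-default index still works (df needs >= 3 rows).
    summary['First Value'] = df.iloc[0].values
    summary['Second Value'] = df.iloc[1].values
    summary['Third Value'] = df.iloc[2].values
    # Entropy of each column's empirical value distribution (NaN values are excluded).
    for name in summary['Name'].value_counts().index:
        summary.loc[summary['Name'] == name, 'Entropy'] = round(
            stats.entropy(df[name].value_counts(normalize=True), base=2), 2)
    return summary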
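
The Entropy column is the Shannon entropy, in bits, of each column's empirical value distribution. As a rough illustration of what that number means, here is a minimal standard-library sketch. `column_entropy` is a hypothetical helper and not part of the gist; skipping `None` entries and returning 0.0 for an empty column are assumptions made for this example.

```python
import math
from collections import Counter


def column_entropy(values):
    """Shannon entropy (bits) of the empirical distribution of `values`.

    Hypothetical helper for illustration only. None entries are skipped,
    similar to how value_counts() drops missing values by default.
    """
    counts = Counter(v for v in values if v is not None)
    total = sum(counts.values())
    if total == 0:
        # Assumption for this sketch: an empty column has zero entropy.
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Two equally likely values carry one bit; four equally likely values carry two.
print(column_entropy(["a", "a", "b", "b"]))
print(column_entropy(["a", "b", "c", "d"]))
```

A column with a single repeated value scores 0, and the score grows as values become more numerous and more evenly spread, so a high entropy alongside a high Uniques count usually points to an ID-like column.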