Skip to content

Instantly share code, notes, and snippets.

@hujuu
Last active March 23, 2019 12:56
Show Gist options
  • Save hujuu/c449de2d310842ba697fd235ce75c69c to your computer and use it in GitHub Desktop.
Save hujuu/c449de2d310842ba697fd235ce75c69c to your computer and use it in GitHub Desktop.
pandasでgroupbyしたところにapplyして使う用の関数です
def dup(df, columns=['columns_name']):
return df.duplicated(subset=columns)
def dup_count(df, columns=['columns_name']):
return df.duplicated(subset=columns).value_counts()
def vcount(df):
return df.value_counts()
def count(df):
return df.count()
def count_max(df):
return df.value_counts().max()
def diff(df):
return df.count() - df.value_counts().max()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment