Skip to content

Instantly share code, notes, and snippets.

What would you like to do?
Function to find duplicated columns in pandas dataframe
def duplicate_columns(frame):
groups = frame.columns.to_series().groupby(frame.dtypes).groups
dups = []
for t, v in groups.items():
cs = frame[v].columns
vs = frame[v]
lcs = len(cs)
for i in range(lcs):
iv = vs.iloc[:,i].tolist()
for j in range(i+1, lcs):
jv = vs.iloc[:,j].tolist()
if iv == jv:
return dups
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment