sokarob/Find and Isolate Duplicate Rows Across Multiple Columns.r

## Find and Isolate Duplicate Rows Across Multiple Columns.r
# Search for duplicate rows of records by grouping all columns, counting, and filtering.
Duplicates <- Data %>%
  group_by(Column1,Column2,Column3,`Column 4`,`Column 5`) %>%
    mutate(dupe = n()>1) %>%
      ungroup() %>%
        filter(dupe==TRUE)


# Credit to: https://stackoverflow.com/questions/6986657/find-duplicated-rows-based-on-2-columns-in-data-frame-in-r

# Note: Only 2 or more columns are needed to use this method.  It isn't necessary to group by all columns if the data is structured in a way that fewer columns can still identify duplicates.  Some use cases may be looking only for duplicates of specific information as well.

# This could also be adapted to group together and isolate related transactions that are not actually the same, like ones from a specific vendor and food item.
	# Search for duplicate rows of records by grouping all columns, counting, and filtering.
	Duplicates <- Data %>%
	group_by(Column1,Column2,Column3,`Column 4`,`Column 5`) %>%
	mutate(dupe = n()>1) %>%
	ungroup() %>%
	filter(dupe==TRUE)


	# Credit to: https://stackoverflow.com/questions/6986657/find-duplicated-rows-based-on-2-columns-in-data-frame-in-r

	# Note: Only 2 or more columns are needed to use this method. It isn't necessary to group by all columns if the data is structured in a way that fewer columns can still identify duplicates. Some use cases may be looking only for duplicates of specific information as well.

	# This could also be adapted to group together and isolate related transactions that are not actually the same, like ones from a specific vendor and food item.