@joelbecker
Created November 23, 2017 20:22
Draft API reference for the recordlinkage conflict resolution function suite.

Conflict Resolution

The recordlinkage.algorithms.conflict_resolution module contains a suite of conflict resolution functions. Pass one of these functions to recordlinkage.FuseLinks.resolve when you need a conflict handling strategy that the recordlinkage.FuseLinks interface does not yet implement.

These conflict resolution functions are based on: Bleiholder, J., & Naumann, F. (2006). Conflict Handling Strategies in an Integrated Information System. Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät II, Institut für Informatik. http://doi.org/10.18452/2460
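
To make the idea concrete: a conflict resolution function takes the values that linked records give for one attribute and returns a single fused value. The sketch below is plain Python and does not call recordlinkage. The names `first_non_null` and `fuse_attribute` are hypothetical, and the signatures that recordlinkage.FuseLinks.resolve actually expects may differ.

```python
def first_non_null(values):
    """Hypothetical resolver: return the first value that is not None."""
    for value in values:
        if value is not None:
            return value
    return None


def fuse_attribute(linked_values, resolver):
    """Hypothetical helper: apply one resolver to each group of conflicting values."""
    return [resolver(group) for group in linked_values]


# Each tuple holds the values that two linked records give for the same attribute.
names = [("Jon Smith", "Jonathan Smith"), (None, "A. Jones")]
fused = fuse_attribute(names, first_non_null)
print(fused)
```

Any callable with the same shape (a sequence of candidate values in, one value out) can be swapped in for `first_non_null`, which is the role the functions listed below are meant to play.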

recordlinkage.algorithms.conflict_resolution

aggregate

annotated_concat

choose_first

choose_last

choose_longest

choose_longest_tie_break

choose_max

choose_metadata_max

choose_metadata_min

choose_min

choose_random

choose_shortest

choose_shortest_tie_break

choose_trusted

count

group

identity

no_gossip

nullify

vote
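
This draft does not yet document the behavior of the functions listed above. As a rough illustration only, the sketch below shows how three of the strategies described by Bleiholder and Naumann might look in plain Python. The `sketch_` functions are hypothetical stand-ins, not the module's implementations. Their handling of missing values and ties is an assumption, and the real functions may behave differently.

```python
from collections import Counter


def sketch_vote(values):
    """Return the most frequent non-null value; ties go to the value seen first."""
    present = [v for v in values if v is not None]
    if not present:
        return None
    return Counter(present).most_common(1)[0][0]


def sketch_choose_longest(values):
    """Return the longest non-null value; ties go to the value seen first."""
    present = [v for v in values if v is not None]
    return max(present, key=len) if present else None


def sketch_no_gossip(values):
    """Return a value only if all non-null values agree; otherwise None."""
    distinct = {v for v in values if v is not None}
    return distinct.pop() if len(distinct) == 1 else None


cities = ("Berlin", "Berlin", "Potsdam")
print(sketch_vote(cities))            # majority value
print(sketch_choose_longest(cities))  # longest string
print(sketch_no_gossip(cities))       # None, because the sources disagree
```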
