The homepage for the Splink documentation can be found here, including a tutorial and examples that can be run in the browser. The specification of the Fellegi Sunter statistical model behind splink ...
Splink is a Python package for probabilistic record linkage (entity resolution) that allows you to deduplicate and link records from datasets that lack unique identifiers. It is used widely by within ...
In biomedical record linkage, efficient determination of a threshold to decide at which level of similarity two records should be classified as belonging to the same patient is frequently still an ...