RecordLinkage: Record Linkage Functions for Linking and Deduplicating Data Sets

Provides functions for linking and deduplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain. For details, see our paper "The RecordLinkage Package: Detecting Errors in Data" Sariyar M / Borg A (2010) <doi:10.32614/RJ-2010-017>.

Version: 0.4-12
Depends: R (≥ 3.5.0), DBI, RSQLite (≥ 1.0.0), ff, ffbase
Imports: e1071, rpart, ada, ipred, stats, evd, methods, data.table (≥ 1.7.8), nnet, xtable
Suggests: RUnit, knitr
Published: 2020-04-09
Author: Andreas Borg Developer [aut], Murat Sariyar Developer [aut, cre]
Maintainer: Murat Sariyar Developer <murat.sariyar at bfh.ch>
Contact: murat.sariyar@bfh.ch
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
URL: https://journal.r-project.org/archive/2010-2/RJournal_2010-2_Sariyar+Borg.pdf
NeedsCompilation: yes
Materials: NEWS
CRAN checks: RecordLinkage results

Downloads:

Reference manual: RecordLinkage.pdf
Vignettes: Classes for record linkage of big data sets
Record Linkage with Extreme Value Theory
Supervised Classification
Weight-based deduplication
Package source: RecordLinkage_0.4-12.tar.gz
Windows binaries: r-devel: RecordLinkage_0.4-12.zip, r-release: RecordLinkage_0.4-12.zip, r-oldrel: RecordLinkage_0.4-12.zip
macOS binaries: r-release: RecordLinkage_0.4-12.tgz, r-oldrel: RecordLinkage_0.4-12.tgz
Old sources: RecordLinkage archive

Reverse dependencies:

Reverse enhances: SoundexBR

Linking:

Please use the canonical form https://CRAN.R-project.org/package=RecordLinkage to link to this page.