Package: irelink 0.0.1


Christopher T. Kenny
irelink: Fast Probabilistic Record Linkage
Performs fast, scalable probabilistic record linkage and deduplication using the Fellegi-Sunter model. Records lacking a shared unique identifier are compared across configurable dimensions using exact, fuzzy, and distance-based comparisons, with model parameters estimated via unsupervised Expectation-Maximization. Multiple SQL backends are supported through 'DBI', enabling execution from laptop-scale ('DuckDB') through to distributed engines. This package is a translation of the Python 'splink' library by Linacre et al. into idiomatic R.
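As a rough illustration of the Fellegi-Sunter scoring described above, the following base R sketch combines per-field agreement probabilities into a posterior match probability. It does not call 'irelink'. The helper name `fs_match_probability`, the field names, the m and u values, and the prior are all hypothetical and chosen only for demonstration. In practice the package estimates such parameters (for example via Expectation-Maximization) rather than taking them as given.

```r
# Illustrative Fellegi-Sunter scoring in base R (does not use irelink).
# All m/u values and the prior below are made-up numbers for demonstration.

# m: P(field agrees | records match)
# u: P(field agrees | records do not match)
params <- data.frame(
  field = c("first_name", "surname", "dob"),
  m = c(0.90, 0.95, 0.98),
  u = c(0.01, 0.005, 0.001)
)

# Hypothetical helper: posterior match probability for one record pair.
# `agree` is a logical vector with one entry per row of `params`.
fs_match_probability <- function(agree, params, prior = 0.001) {
  # Bayes factor per field: m/u on agreement, (1 - m)/(1 - u) on disagreement
  bf <- ifelse(agree, params$m / params$u, (1 - params$m) / (1 - params$u))
  # Match weight: prior log-odds plus the sum of log2 Bayes factors
  weight <- log2(prior / (1 - prior)) + sum(log2(bf))
  odds <- 2^weight
  odds / (1 + odds)
}

p_all <- fs_match_probability(c(TRUE, TRUE, TRUE), params)
p_none <- fs_match_probability(c(FALSE, FALSE, FALSE), params)
```

With these assumed values, a pair agreeing on every field scores far above a pair agreeing on none, which is the behaviour a match threshold relies on.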
irelink_0.0.1.tar.gz
irelink_0.0.1.zip (r-4.7), irelink_0.0.1.zip (r-4.6), irelink_0.0.1.zip (r-4.5)
irelink_0.0.1.tgz (r-4.6-any), irelink_0.0.1.tgz (r-4.5-any)
irelink_0.0.1.tar.gz (r-4.7-any), irelink_0.0.1.tar.gz (r-4.6-any)
irelink_0.0.1.tgz (r-4.6-emscripten)
Install 'irelink' in R:

    install.packages('irelink', repos = c('https://christopherkenny.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/christopherkenny/irelink/issues
Pkgdown/docs site: https://christophertkenny.com
- fake_1000 - Splink Fake 1000: Deduplication Benchmark
- fake_1000_labels - Splink Fake 1000: Clerical Pairwise Labels
- fake_20 - Fake 20: Minimal Deduplication Example
- febrl4a - FEBRL 4a: Record Linkage Original Records
- febrl4b - FEBRL 4b: Record Linkage Duplicate Records
Last updated from: 51c3d60282. Checks: 9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 254 | | |
| source / vignettes | OK | 586 | | |
| linux-release-x86_64 | OK | 257 | | |
| macos-release-arm64 | OK | 1304 | | |
| macos-oldrel-arm64 | OK | 1340 | | |
| windows-devel | OK | 2829 | | |
| windows-release | OK | 2949 | | |
| windows-oldrel | OK | 2963 | | |
| wasm-release | OK | 115 | | |
Exports: block_from_labels, block_on, cl_and, cl_array_intersect, cl_array_min_distance, cl_array_subset, cl_columns_reversed, cl_cosine, cl_custom, cl_damerau_levenshtein, cl_date_diff, cl_dob, cl_else, cl_email, cl_exact, cl_first_last_name, cl_forename_surname, cl_geo_distance, cl_jaccard, cl_jaro, cl_jaro_winkler, cl_levels, cl_levenshtein, cl_literal, cl_name, cl_not, cl_null, cl_numeric_diff, cl_or, cl_pct_diff, cl_postcode, cl_soundex, cl_time_diff, cl_zip_code, days, hours, il_accuracy, il_array_element, il_attach, il_block_on, il_cast_to_string, il_cleanup, il_cleanup_all, il_cluster, il_cluster_confusion_matrix, il_comparator_score, il_comparator_threshold_chart, il_compare, il_compare_records, il_comparison_vectors, il_completeness, il_confusion_matrix, il_constrain_m, il_constraints, il_count_pairs, il_deterministic_link, il_dmetaphone, il_errors, il_estimate_em, il_estimate_m_from_column, il_estimate_m_from_labels, il_estimate_prior, il_estimate_u, il_find_blocking_below, il_find_matches, il_graph_metrics, il_largest_blocks, il_load, il_metaphone, il_model, il_nullif, il_parameters, il_phonetic_chart, il_precision_recall, il_prior_m, il_prior_prevalence, il_priors, il_profile, il_regex_extract, il_register_tf, il_roc, il_save, il_score_missing_edges, il_score_patterns, il_soundex, il_spec, il_string_similarity, il_substr, il_suggest_blocking, il_tf_chart, il_training_history, il_transform, il_try_parse_date, il_try_parse_timestamp, il_unlinkables, il_waterfall, il_weights, is_il_model, is_il_spec, km, labels_from_column, mi, minutes, months, seconds, years
Dependencies: cli, cpp11, DBI, duckdb, farver, ggplot2, glue, gtable, isoband, labeling, lifecycle, magrittr, pillar, pkgconfig, R6, RColorBrewer, rlang, S7, scales, stringdist, tibble, tidyselect, utf8, vctrs, viridisLite, withr
Advanced Workflows
Rendered from advanced.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-04-10
Deduplicating 50k Synthetic Records
Rendered from deduplicate-50k.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-04-10
Deduplication with Evaluation
Rendered from deduplication.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-03-28
Getting Started
Rendered from irelink.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-03-27
Linking Banking Transactions
Rendered from transactions.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-04-10
Record Linkage Across Datasets
Rendered from record-linkage.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-03-28
Translating from fastLink
Rendered from from_fastLink.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-05-20
Translating from Splink
Rendered from from_splink.Rmd using knitr::rmarkdown on Jun 19 2026. Last update: 2026-05-20
Started: 2026-03-27