Showing 2 of total 2 results (show query)
ngmarchant
comparator:Comparison Functions for Clustering and Record Linkage
Implements functions for comparing strings, sequences and numeric vectors for clustering and record linkage applications. Supported comparison functions include: generalized edit distances for comparing sequences/strings, Monge-Elkan similarity for fuzzy comparison of token sets, and L-p distances for comparing numeric vectors. Where possible, comparison functions are implemented in C/C++ to ensure good performance.
Maintained by Neil Marchant. Last updated 3 years ago.
clusteringdistance-measuresdistance-metricsentity-resolutionrecord-linkagesimilarity-measuresstring-similaritycpp
49.2 match 18 stars 4.63 score 47 scriptslewinfox
levitate:Fuzzy String Comparison
Provides string similarity calculations inspired by the Python 'thefuzz' package. Compare strings by edit distance, similarity ratio, best matching substring, ordered token matching and set-based token matching. A range of edit distance measures are available thanks to the 'stringdist' package.
Maintained by Lewin Appleton-Fox. Last updated 10 months ago.
data-matchingfuzzy-matchingsimilarity-measuresstring-similaritythefuzz
38.8 match 35 stars 5.24 score 4 scripts