Showing 2 of 2 results
vpnagraj
rrefine: r Client for OpenRefine API
'OpenRefine' (formerly 'Google Refine') is a popular open source data cleaning tool. This package enables users to programmatically trigger data transfer between R and 'OpenRefine'. Available functionality includes project import, export, and deletion.
Maintained by VP Nagraj. Last updated 2 years ago.
30.0 match 22 stars 5.77 score 27 scripts
chrismuir
refinr: Cluster and Merge Similar Values Within a Character Vector
These functions take a character vector as input, identify and cluster similar values, and then merge each cluster so that its values become identical. The functions implement the key collision and ngram fingerprint algorithms from the open source tool OpenRefine <https://openrefine.org/>. More information on key collision and ngram fingerprint is available at <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
Maintained by Chris Muir. Last updated 1 year ago.
approximate-string-matching, clustering, data-cleaning, data-clustering, fuzzy-matching, ngram, openrefine, cpp
10.8 match 104 stars 6.80 score 121 scripts
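
To make the key collision and ngram fingerprint ideas in the refinr description more concrete, here is a minimal Python sketch of the general technique as described in the OpenRefine clustering documentation. It is not the refinr implementation or its API: the function names (`fingerprint`, `ngram_fingerprint`, `merge_by_key`) and the rule of merging each cluster to its most frequent member are assumptions made for illustration only.

```python
import re
import unicodedata
from collections import Counter, defaultdict


def _to_ascii(text):
    # Decompose accented characters, then drop anything that is not ASCII.
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.encode("ascii", "ignore").decode("ascii")


def fingerprint(value):
    """Key collision key: normalized, sorted, de-duplicated word tokens."""
    text = _to_ascii(value.strip().lower())
    text = re.sub(r"[^a-z0-9\s]", "", text)  # drop punctuation
    return " ".join(sorted(set(text.split())))


def ngram_fingerprint(value, n=2):
    """N-gram key: sorted, de-duplicated character n-grams."""
    text = _to_ascii(value.lower())
    text = re.sub(r"[^a-z0-9]", "", text)  # drop punctuation and whitespace
    grams = {text[i:i + n] for i in range(len(text) - n + 1)}
    return "".join(sorted(grams))


def merge_by_key(values, key=fingerprint):
    """Hypothetical merge rule: map each value to the most frequent
    member of the cluster that shares its key."""
    clusters = defaultdict(list)
    for v in values:
        clusters[key(v)].append(v)
    canonical = {k: Counter(vs).most_common(1)[0][0] for k, vs in clusters.items()}
    return [canonical[key(v)] for v in values]


if __name__ == "__main__":
    names = ["Acme Pizza, Inc.", "ACME PIZZA INC", "Acme Pizza, Inc.", "Bob's Diner"]
    print(merge_by_key(names))
```

Values that reduce to the same key land in the same cluster, so differences in case, punctuation, or word order do not keep them apart. The n-gram variant also ignores whitespace, which makes it more aggressive and more prone to false matches for small `n`.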