Showing 4 of 4 results
davzim
rbm25: A Light Wrapper Around the 'BM25' 'Rust' Crate for Okapi BM25 Text Search
BM25 is a ranking function used by search engines to rank matching documents according to their relevance to a user's search query. This package provides a light wrapper around the 'BM25' 'Rust' crate for Okapi BM25 text search. For more information, see Robertson et al. (1994) <https://trec.nist.gov/pubs/trec3/t3_proceedings.html>.
Maintained by David Zimmermann-Kollenda. Last updated 3 months ago.
bm25, rust, similarity-search, cargo
25.3 match 5 stars 4.00 score
bnosac
udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser, namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns), sentiment scoring and semantic similarity analysis.
Maintained by Jan Wijffels. Last updated 2 years ago.
conll, dependency-parser, lemmatization, natural-language-processing, nlp, pos-tagging, r-pkg, rcpp, text-mining, tokenizer, udpipe, cpp
2.1 match 215 stars 11.83 score 1.2k scripts 9 dependents
ropensci
pkgmatch: Find R Packages Matching Either Descriptions or Other R Packages
Find R packages matching either descriptions or other R packages.
Maintained by Mark Padgham. Last updated 5 days ago.
embeddings, llms, natural-language-processing, cpp
3.4 match 3 stars 5.28 score
saraswatmks
superml: Build Machine Learning Models Like Using Python's Scikit-Learn Library in R
The idea is to provide a standard interface to users who use both R and Python for building machine learning models. This package provides a scikit-learn-style fit/predict interface to train machine learning models in R.
Maintained by Manish Saraswat. Last updated 1 year ago.
2.3 match 32 stars 7.05 score 117 scripts