R-universe search: lemmatization

Currently serving 26318 packages, 22487 articles, and 64222 datasets by 1261 organizations, 13659 maintainers and 22065 contributors.

Showing 10 of 10 results

trinker

textstem: Tools for Stemming and Lemmatizing Text

Tools that stem and lemmatize text. Stemming is a process that removes endings such as affixes. Lemmatization is the process of grouping inflected forms together as a single base form.

Maintained by Tyler Rinker. Last updated 7 years ago.

lemmatization stemming text-mining

19.8 match 45 stars 8.71 score 888 scripts 11 dependents

bnosac

udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser, namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns), sentiment scoring and semantic similarity analysis.

Maintained by Jan Wijffels. Last updated 2 years ago.

conll dependency-parser lemmatization natural-language-processing nlp pos-tagging r-pkg rcpp text-mining tokenizer udpipe cpp

13.5 match 215 stars 11.83 score 1.2k scripts 9 dependents

trinker

lexicon: Lexicons for Text Analysis

A collection of lexical hash tables, dictionaries, and word lists.

Maintained by Tyler Rinker. Last updated 3 years ago.

hash lexicon lookup names-frequent stopwords text-dictionaries text-mining

4.5 match 111 stars 8.80 score 224 scripts 25 dependents

musajajorge

CINE: Classification International Normalized of Education

Function using lemmatization to classify educational programs according to the CINE (Classification International Normalized of Education) for Peru.

Maintained by Jorge L. C. Musaja. Last updated 2 years ago.

education lemmatization

12.3 match 2.70 score

shusei-e

RcppJagger: An R Wrapper for Jagger

A wrapper for Jagger, a morphological analyzer proposed in Yoshinaga (2023) <arXiv:2305.19045>. Jagger uses patterns derived from morphological dictionaries and training data sets and applies them from the beginning of the input. This simultaneous and deterministic process enables it to effectively perform tokenization, POS tagging, and lemmatization.

Maintained by Shusei Eshima. Last updated 2 years ago.

japanese-nlp morphological-analyser nlp part-of-speech-tagger text-analysis cpp

7.1 match 3 stars 3.18 score 3 scripts

tidymodels

textrecipes: Extra 'Recipes' for Text Processing

Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allow for tokenization, filtering, counting (tf and tfidf) and feature hashing.

Maintained by Emil Hvitfeldt. Last updated 8 days ago.

2.0 match 160 stars 10.87 score 964 scripts 1 dependent

massimoaria

tall: Text Analysis for All

An R 'shiny' app designed for diverse text analysis tasks, offering a wide range of methodologies tailored to Natural Language Processing (NLP) needs. It is a versatile, general-purpose tool for analyzing textual data. 'tall' features a comprehensive workflow, including data cleaning, preprocessing, statistical analysis, and visualization, all integrated for effective text analysis.

Maintained by Massimo Aria. Last updated 3 days ago.

r-shiny text-analysis-and-sentiment-analysis text-classification text-mining textual-analysis cpp

3.4 match 14 stars 5.12 score

kidoishi

MadanText: Persian Textmining Tool for Frequency Analysis, Statistical Analysis, and Word Clouds

MadanText is an open-source software designed specifically for text mining in the Persian language. It allows users to examine word frequencies, download data for analysis, and generate word clouds. This tool is particularly useful for researchers and analysts working with Persian language data.

Maintained by Kido Ishikawa. Last updated 1 year ago.

openjdk

2.3 match 2.70 score

kidoishi

MadanTextNetwork: Persian Textmining Tool for Co-Occurrence_Network

MadanText_co-occurrence_network is an open-source software designed specifically for text mining in the Persian language. It adds co-occurrence network functionality to MadanText. Input is supplied as an Excel file instead of a text file.

Maintained by Kido Ishikawa. Last updated 1 year ago.

openjdk

2.3 match 2.70 score

imbi-heidelberg

MetaNLP: Natural Language Processing for Meta Analysis

Given a CSV file with titles and abstracts, the package creates a document-term matrix that is lemmatized and stemmed and can directly be used to train machine learning methods for automatic title-abstract screening in the preparation of a meta analysis.

Maintained by Maximilian Pilz. Last updated 4 days ago.

0.5 match 3 stars 4.32 score 1 script