Showing 200 of total 657 results (show query)

quanteda

stopwords:Multilingual Stopword Lists

Provides multiple sources of stopwords, for use in text analysis and natural language processing.

Maintained by Kenneth Benoit. Last updated 3 years ago.

text-analysis

9.6 match 114 stars 10.54 score 1.1k scripts 65 dependents

miraisolutions

XLConnect:Excel Connector for R

Provides comprehensive functionality to read, write and format Excel data.

Maintained by Martin Studer. Last updated 16 days ago.

cross-platformexcelr-languagexlconnectopenjdk

7.5 match 130 stars 12.28 score 1.2k scripts 1 dependents

kumes

deepRstudio:Seamless Language Translation in 'RStudio' using 'DeepL' API and 'Rstudioapi'

Enhancing cross-language compatibility within the 'RStudio' environment and supporting seamless language understanding, the 'deepRstudio' package leverages the power of the 'DeepL' API (see <https://www.deepl.com/docs-api>) to enable seamless, fast, accurate, and affordable translation of code comments, documents, and text. This package offers the ability to translate selected text into English (EN), as well as from English into various languages, namely Japanese (JA), Chinese (ZH), Spanish (ES), French (FR), Russian (RU), Portuguese (PT), and Indonesian (ID). With much of the text being written in English, the emphasis is on compatibility from English. It is also designed for developers working on multilingual projects and data analysts collaborating with international teams, simplifying the translation process and making code more accessible and comprehensible to people with diverse language backgrounds. This package uses the 'rstudioapi' package and 'DeepL' API, and is simply implemented, executed from addins or via shortcuts on 'RStudio'. With just a few steps, content can be translated between supported languages, promoting better collaboration and expanding the global reach of work. The functionality of this package works only on 'RStudio' using 'rstudioapi'.

Maintained by Satoshi Kume. Last updated 1 years ago.

deepldeeprstudiolanguage-translationrstudiorstudioapiseamlessseamless-languagetranslation

23.3 match 2 stars 3.48 score 4 scripts 1 dependents

vincentarelbundock

countrycode:Convert Country Names and Country Codes

Standardize country names, convert them into one of 40 different coding schemes, convert between coding schemes, and assign region descriptors.

Maintained by Vincent Arel-Bundock. Last updated 2 months ago.

5.0 match 351 stars 14.80 score 6.3k scripts 119 dependents

tidyverse

purrr:Functional Programming Tools

A complete and consistent functional programming toolkit for R.

Maintained by Hadley Wickham. Last updated 1 months ago.

functional-programming

3.3 match 1.3k stars 22.12 score 59k scripts 6.9k dependents

ropensci

rnaturalearth:World Map Data from Natural Earth

Facilitates mapping by making natural earth map data from <https://www.naturalearthdata.com/> more easily available to R users.

Maintained by Philippe Massicotte. Last updated 15 days ago.

peer-reviewed

4.6 match 232 stars 15.35 score 7.2k scripts 47 dependents

hadley

pryr:Tools for Computing on the Language

Useful tools to pry back the covers of R and understand the language at a deeper level.

Maintained by Hadley Wickham. Last updated 1 years ago.

cpp

5.8 match 204 stars 11.85 score 1.9k scripts 56 dependents

trinker

wakefield:Generate Random Data Sets

Generates random data sets including: data.frames, lists, and vectors.

Maintained by Tyler Rinker. Last updated 5 years ago.

data-generationwakefield

9.4 match 256 stars 7.13 score 209 scripts

cran

nlme:Linear and Nonlinear Mixed Effects Models

Fit and compare Gaussian linear and nonlinear mixed-effects models.

Maintained by R Core Team. Last updated 2 months ago.

fortran

4.5 match 6 stars 13.00 score 13k scripts 8.7k dependents

hoxo-m

githubinstall:A Helpful Way to Install R Packages Hosted on GitHub

Provides an helpful way to install packages hosted on GitHub.

Maintained by Koji Makiyama. Last updated 7 years ago.

r-language

7.5 match 49 stars 7.35 score 177 scripts

kurthornik

NLP:Natural Language Processing Infrastructure

Basic classes and methods for Natural Language Processing.

Maintained by Kurt Hornik. Last updated 4 months ago.

5.6 match 6 stars 9.37 score 1.0k scripts 127 dependents

hoxo-m

magicfor:Magic Functions to Obtain Results from for Loops

Magic functions to obtain results from for loops.

Maintained by Koji Makiyama. Last updated 8 years ago.

r-language

7.5 match 20 stars 5.72 score 53 scripts

sjewo

readstata13:Import 'Stata' Data Files

Function to read and write the 'Stata' file format.

Maintained by Sebastian Jeworutzki. Last updated 2 years ago.

statacpp

3.8 match 41 stars 10.74 score 1.7k scripts 45 dependents

quanteda

spacyr:Wrapper to the 'spaCy' 'NLP' Library

An R wrapper to the 'Python' 'spaCy' 'NLP' library, from <https://spacy.io>.

Maintained by Kenneth Benoit. Last updated 1 months ago.

extract-entitiesnlpspacyspeech-tagging

3.6 match 253 stars 10.68 score 408 scripts 6 dependents

bnosac

doc2vec:Distributed Representations of Sentences, Documents and Topics

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at <arXiv:1405.4053>. The package also provides an implementation to cluster documents based on these embedding using a technique called top2vec. Top2vec finds clusters in text documents by combining techniques to embed documents and words and density-based clustering. It does this by embedding documents in the semantic space as defined by the 'doc2vec' algorithm. Next it maps these document embeddings to a lower-dimensional space using the 'Uniform Manifold Approximation and Projection' (UMAP) clustering algorithm and finds dense areas in that space using a 'Hierarchical Density-Based Clustering' technique (HDBSCAN). These dense areas are the topic clusters which can be represented by the corresponding topic vector which is an aggregate of the document embeddings of the documents which are part of that topic cluster. In the same semantic space similar words can be found which are representative of the topic. More details can be found in the paper 'Top2Vec: Distributed Representations of Topics' by D. Angelov available at <arXiv:2008.09470>.

Maintained by Jan Wijffels. Last updated 3 years ago.

doc2vecembeddingsnatural-language-processingparagraph2vecword2veccpp

6.7 match 48 stars 5.74 score 23 scripts

docopt

docopt:Command-Line Interface Specification Language

Define a command-line interface by just giving it a description in the specific format.

Maintained by Edwin de Jonge. Last updated 4 years ago.

3.0 match 213 stars 11.29 score 1.5k scripts 19 dependents

uribo

textlintr:Natural Language Linter Tools for 'R Markdown' and R Code

What the package does (one paragraph).

Maintained by Shinya Uryu. Last updated 2 years ago.

lintnatural-language-processing

9.5 match 9 stars 2.95 score 4 scripts

ropensci

babelquarto:Renders a Multilingual Quarto Book

Automate rendering and cross-linking of Quarto books following a prescribed structure.

Maintained by Maëlle Salmon. Last updated 1 months ago.

3.7 match 43 stars 7.52 score 23 scripts 1 dependents

giocomai

zoteror:Access the Zotero API in R

zoteror provides tools to access the Zotero API

Maintained by Giorgio Comai. Last updated 5 years ago.

r-languagezoterozotero-api

7.5 match 37 stars 3.27 score 5 scripts

bioc

BiocGenerics:S4 generic functions used in Bioconductor

The package defines many S4 generic functions used in Bioconductor.

Maintained by Hervé Pagès. Last updated 1 months ago.

infrastructurebioconductor-packagecore-package

1.7 match 12 stars 14.22 score 612 scripts 2.2k dependents