Showing 13 of 13 results
oscarkjell
text:Analyses of Text using Transformer Models from HuggingFace, Natural Language Processing and Machine Learning
Link R with Transformers from Hugging Face to transform text variables into word embeddings; the word embeddings are then used to statistically test mean differences between sets of texts, compute semantic similarity scores between texts, predict numerical variables, and visualize statistically significant words along various dimensions. For more information see <https://www.r-text.org>.
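A minimal sketch of the embedding step the description refers to, assuming the package's `textEmbed()` interface (the model name and input texts are illustrative; the first call downloads the model):

```r
library(text)

# Transform a text variable into word embeddings via a Hugging Face model.
embeddings <- textEmbed(
  c("happy and calm", "sad and anxious"),
  model = "bert-base-uncased"
)
# The resulting embeddings feed the package's statistical tools,
# e.g. semantic similarity scores or mean-difference tests between texts.
```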
Maintained by Oscar Kjell. Last updated 3 days ago.
deep-learning · machine-learning · nlp · transformers · openjdk
13.7 match · 146 stars · 13.16 score · 436 scripts · 1 dependent
michelnivard
gptstudio:Use Large Language Models Directly in your Development Environment
Large language models are readily accessible via API. This package lowers the barrier to using the API inside your development environment. For more on the API, see <https://platform.openai.com/docs/introduction>.
Maintained by James Wade. Last updated 5 days ago.
chatgpt · gpt-3 · rstudio · rstudio-addin
12.5 match · 924 stars · 10.83 score · 43 scripts · 1 dependent
psychbruce
FMAT:The Fill-Mask Association Test
The Fill-Mask Association Test ('FMAT') <doi:10.1037/pspa0000396> is an integrative and probability-based method using Masked Language Models to measure conceptual associations (e.g., attitudes, biases, stereotypes, social norms, cultural values) as propositions in natural language. Supported language models include 'BERT' <doi:10.48550/arXiv.1810.04805> and its variants available at 'Hugging Face' <https://huggingface.co/models?pipeline_tag=fill-mask>. Methodological references and installation guidance are provided at <https://psychbruce.github.io/FMAT/>.
Maintained by Han-Wu-Shuang Bao. Last updated 5 months ago.
ai · artificial-intelligence · bert · bert-model · bert-models · contextualized-representation · fill-in-the-blank · fill-mask · huggingface · language-model · language-models · large-language-models · masked-language-models · natural-language-processing · natural-language-understanding · nlp · pretrained-models · transformer · transformers
10.5 match · 12 stars · 4.82 score · 2 scripts
jameshwade
gpttools:Extensions and Tools for gptstudio
gpttools is an R package that extends gptstudio with devtools-like functionality powered by the latest natural language processing (NLP) models. It aims to make package development easier by providing tools to improve the quality of your package's documentation, testing, and maybe even functionality.
Maintained by James Wade. Last updated 7 months ago.
chatgpt · nlp · openai · package-development · rstudio-addin
7.1 match · 293 stars · 7.06 score · 14 scripts
dyfanjones
sagemaker.mlframework:`sagemaker` Machine Learning Developed by Amazon
`sagemaker` machine learning developed by Amazon.
Maintained by Dyfan Jones. Last updated 3 years ago.
amazon-sagemaker · aws · machine-learning · sagemaker · sdk
5.0 match · 2.48 score · 2 dependents
mlverse
hfhub:Hugging Face Hub Interface
Provides functionality to download and cache files from 'Hugging Face Hub' <https://huggingface.co/models>. Uses the same caching structure so files can be shared between different client libraries.
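A hedged sketch of the download-and-cache workflow, assuming `hub_download()` is the entry point (the repo and file names here are illustrative):

```r
library(hfhub)

# Fetch a single file from a model repo on the Hub; the returned path
# points into the shared Hugging Face cache, so repeated calls hit the
# cache rather than the network.
path <- hub_download("bert-base-uncased", "config.json")
readLines(path, n = 3)
```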
Maintained by Daniel Falbel. Last updated 6 months ago.
2.4 match · 16 stars · 4.28 score · 24 scripts
ropensci
pangoling:Access to Large Language Model Predictions
Provides access to word predictability estimates using large language models (LLMs) based on 'transformer' architectures via integration with the 'Hugging Face' ecosystem. The package interfaces with pre-trained neural networks and supports both causal/auto-regressive LLMs (e.g., 'GPT-2'; Radford et al., 2019) and masked/bidirectional LLMs (e.g., 'BERT'; Devlin et al., 2019, <doi:10.48550/arXiv.1810.04805>) to compute the probability of words, phrases, or tokens given their linguistic context. By enabling a straightforward estimation of word predictability, the package facilitates research in psycholinguistics, computational linguistics, and natural language processing (NLP).
Maintained by Bruno Nicenboim. Last updated 4 days ago.
nlp · psycholinguistics · transformers
1.8 match · 8 stars · 4.90 score
dyfanjones
sagemaker:R SDK for `AWS Sagemaker`
A library for training and deploying machine learning models on Amazon SageMaker <https://aws.amazon.com/sagemaker/> from R through the `paws` SDK.
Maintained by Dyfan Jones. Last updated 3 years ago.
amazon-sagemaker · aws · machine-learning · sagemaker · sdk
3.0 match · 12 stars · 2.78 score · 6 scripts
psychbruce
PsychWordVec:Word Embedding Research Framework for Psychological Science
An integrative toolbox for word embedding research that provides: (1) a collection of 'pre-trained' static word vectors in the '.RData' compressed format <https://psychbruce.github.io/WordVector_RData.pdf>; (2) a series of functions to process, analyze, and visualize word vectors; (3) a range of tests to examine conceptual associations, including the Word Embedding Association Test <doi:10.1126/science.aal4230> and the Relative Norm Distance <doi:10.1073/pnas.1720347115>, with a permutation test of significance; (4) a set of training methods to locally train (static) word vectors from text corpora, including 'Word2Vec' <arXiv:1301.3781>, 'GloVe' <doi:10.3115/v1/D14-1162>, and 'FastText' <arXiv:1607.04606>; (5) a group of functions to download 'pre-trained' language models (e.g., 'GPT', 'BERT') and extract contextualized (dynamic) word vectors (based on the R package 'text').
Maintained by Han-Wu-Shuang Bao. Last updated 1 year ago.
bert · cosine-similarity · fasttext · glove · gpt · language-model · natural-language-processing · nlp · pretrained-models · psychology · semantic-analysis · text-analysis · text-mining · tsne · word-embeddings · word-vectors · word2vec · openjdk
1.8 match · 22 stars · 4.04 score · 10 scripts
atomashevic
transforEmotion:Sentiment Analysis for Text, Image and Video using Transformer Models
Implements sentiment analysis using Hugging Face <https://huggingface.co> transformer zero-shot classification model pipelines for text and image data. The default text pipeline is Cross-Encoder's DistilRoBERTa <https://huggingface.co/cross-encoder/nli-distilroberta-base> and the default image/video pipeline is OpenAI's CLIP <https://huggingface.co/openai/clip-vit-base-patch32>. All other zero-shot classification model pipelines can be used by supplying their model name from <https://huggingface.co/models?pipeline_tag=zero-shot-classification>.
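A minimal sketch of zero-shot scoring with the default text pipeline, assuming the package's `transformer_scores()` interface (the texts and class labels are illustrative):

```r
library(transforEmotion)

# Score each text against arbitrary labels with the default zero-shot
# pipeline (DistilRoBERTa); returns one score per class per text.
transformer_scores(
  text    = c("I loved every minute", "This was a waste of time"),
  classes = c("joy", "anger", "neutral")
)
```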
Maintained by Aleksandar Tomašević. Last updated 2 months ago.
1.0 match · 26 stars · 6.40 score · 12 scripts
jaytimm
textpress:A Lightweight and Versatile NLP Toolkit
A simple Natural Language Processing (NLP) toolkit focused on search-centric workflows with minimal dependencies. The package offers key features for web scraping, text processing, corpus search, and text embedding generation via the 'HuggingFace API' <https://huggingface.co/docs/api-inference/index>.
Maintained by Jason Timm. Last updated 5 months ago.
corpus-search · nlp · openai-embeddings · web-scraping
0.8 match · 3 stars · 4.18 score
gesistsa
grafzahl:Supervised Machine Learning for Textual Data Using Transformers and 'Quanteda'
Duct tape the 'quanteda' ecosystem (Benoit et al., 2018) <doi:10.21105/joss.00774> to modern Transformer-based text classification models (Wolf et al., 2020) <doi:10.18653/v1/2020.emnlp-demos.6>, in order to facilitate supervised machine learning for textual data. This package mimics the behaviors of 'quanteda.textmodels' and provides a function to setup the 'Python' environment to use the pretrained models from 'Hugging Face' <https://huggingface.co/>. More information: <doi:10.5117/CCR2023.1.003.CHAN>.
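A hedged sketch of the supervised workflow, assuming the package's `grafzahl()` entry point mimics the `quanteda.textmodels` style (the corpus objects, docvar name, and model argument are hypothetical):

```r
library(quanteda)
library(grafzahl)

# Fine-tune a pretrained Hugging Face model on a labelled quanteda
# corpus, then predict labels for held-out documents
# (training_corpus and heldout_corpus are hypothetical objects).
model <- grafzahl(x = training_corpus, y = "label",
                  model_name = "distilbert-base-uncased")
predict(model, newdata = heldout_corpus)
```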
Maintained by Chung-hong Chan. Last updated 25 days ago.
0.5 match · 41 stars · 5.91 score · 3 scripts
macmillancontentscience
wordpiece.data:Data for Wordpiece-Style Tokenization
Provides data to be used by the wordpiece algorithm in order to tokenize text into somewhat meaningful chunks. Included vocabularies were retrieved from <https://huggingface.co/bert-base-cased/resolve/main/vocab.txt> and <https://huggingface.co/bert-base-uncased/resolve/main/vocab.txt> and parsed into an R-friendly format.
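A minimal sketch of loading the bundled vocabulary, assuming a `wordpiece_vocab()` accessor with a cased/uncased switch (not verified against the package's exports):

```r
library(wordpiece.data)

# Load the BERT uncased vocabulary as an R object for use by
# the companion 'wordpiece' tokenizer.
vocab <- wordpiece_vocab(cased = FALSE)
length(vocab)  # the BERT vocabularies hold roughly 30k entries
```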
Maintained by Jon Harmon. Last updated 3 years ago.
0.8 match · 3.18 score · 5 scripts · 1 dependent