R-universe search: tidytext

Showing 8 of total 8 results (show query)

juliasilge

tidytext:Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools

Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like 'dplyr', 'broom', 'tidyr', and 'ggplot2'. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages.

Maintained by Julia Silge. Last updated 11 months ago.

natural-language-processing text-mining tidy-data tidyverse

58.8 match 1.2k stars 16.86 score 17k scripts 61 dependents

tommyjones

textmineR:Functions for Text Mining and Topic Modeling

An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly-formatted input and give similarly-formatted output. Has additional functionality for analyzing and diagnostics for topic models.

Maintained by Tommy Jones. Last updated 2 years ago.

cpp

4.7 match 106 stars 10.83 score 310 scripts 7 dependents

andrewheiss

quRan:Complete Text of the Qur'an

Full text, in data frames containing one row per verse, of the Qur'an in Arabic (with and without vowels) and in English (the Yusuf Ali and Saheeh International translations), formatted to be convenient for text analysis.

Maintained by Andrew Heiss. Last updated 6 years ago.

islam quran text-mining tidytext

11.0 match 29 stars 4.44 score 19 scripts

andrewheiss

scriptuRs:Complete Text of the LDS Scriptures

Full text, in data frames containing one row per verse, of the Standard Works of The Church of Jesus Christ of Latter-day Saints (LDS). These are the Old Testament, (KJV), the New Testament (KJV), the Book of Mormon, the Doctrine and Covenants, and the Pearl of Great Price.

Maintained by Andrew Heiss. Last updated 6 years ago.

lds lds-scriptures text-mining tidytext

11.0 match 14 stars 4.32 score 30 scripts

simmieyungie

texter:An Easy Text and Sentiment Analysis Library

Implement text and sentiment analysis with 'texter'. Generate sentiment scores on text data and also visualize sentiments. 'texter' allows you to quickly generate insights on your data. It includes support for lexicons such as 'NRC' and 'Bing'.

Maintained by Simi Kafaru. Last updated 3 years ago.

3.8 match 2 stars 3.00 score 4 scripts

hope-data-science

akc:Automatic Knowledge Classification

A tidy framework for automatic knowledge classification and visualization. Currently, the core functionality of the framework is mainly supported by modularity-based clustering (community detection) in keyword co-occurrence network, and focuses on co-word analysis of bibliometric research. However, the designed functions in 'akc' are general, and could be extended to solve other tasks in text mining as well.

Maintained by Tian-Yuan Huang. Last updated 20 days ago.

1.8 match 15 stars 5.85 score 47 scripts

paithiov909

gibasa:An Alternative 'Rcpp' Wrapper of 'MeCab'

A plain 'Rcpp' wrapper for 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.

Maintained by Akiru Kato. Last updated 29 days ago.

mecab pos-tagging rcpp cpp

0.5 match 15 stars 5.02 score 3 scripts

tconwell

textTools:Functions for Text Cleansing and Text Analysis

A framework for text cleansing and analysis. Conveniently prepare and process large amounts of text for analysis. Includes various metrics for word counts/frequencies that scale efficiently. Quickly analyze large amounts of text data using a text.table (a data.table created with one word (or unit of text analysis) per row, similar to the tidytext format). Offers flexibility to efficiently work with text data stored in vectors as well as text data formatted as a text.table.

Maintained by Timothy Conwell. Last updated 4 years ago.

0.5 match 1.00 score 4 scripts