Showing 8 of total 8 results (show query)
juliasilge
tidytext:Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools
Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like 'dplyr', 'broom', 'tidyr', and 'ggplot2'. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages.
Maintained by Julia Silge. Last updated 11 months ago.
natural-language-processingtext-miningtidy-datatidyverse
58.8 match 1.2k stars 16.86 score 17k scripts 61 dependentstommyjones
textmineR:Functions for Text Mining and Topic Modeling
An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly-formatted input and give similarly-formatted output. Has additional functionality for analyzing and diagnostics for topic models.
Maintained by Tommy Jones. Last updated 2 years ago.
4.7 match 106 stars 10.83 score 310 scripts 7 dependentsandrewheiss
quRan:Complete Text of the Qur'an
Full text, in data frames containing one row per verse, of the Qur'an in Arabic (with and without vowels) and in English (the Yusuf Ali and Saheeh International translations), formatted to be convenient for text analysis.
Maintained by Andrew Heiss. Last updated 6 years ago.
11.0 match 29 stars 4.44 score 19 scriptsandrewheiss
scriptuRs:Complete Text of the LDS Scriptures
Full text, in data frames containing one row per verse, of the Standard Works of The Church of Jesus Christ of Latter-day Saints (LDS). These are the Old Testament, (KJV), the New Testament (KJV), the Book of Mormon, the Doctrine and Covenants, and the Pearl of Great Price.
Maintained by Andrew Heiss. Last updated 6 years ago.
ldslds-scripturestext-miningtidytext
11.0 match 14 stars 4.32 score 30 scriptssimmieyungie
texter:An Easy Text and Sentiment Analysis Library
Implement text and sentiment analysis with 'texter'. Generate sentiment scores on text data and also visualize sentiments. 'texter' allows you to quickly generate insights on your data. It includes support for lexicons such as 'NRC' and 'Bing'.
Maintained by Simi Kafaru. Last updated 3 years ago.
3.8 match 2 stars 3.00 score 4 scriptshope-data-science
akc:Automatic Knowledge Classification
A tidy framework for automatic knowledge classification and visualization. Currently, the core functionality of the framework is mainly supported by modularity-based clustering (community detection) in keyword co-occurrence network, and focuses on co-word analysis of bibliometric research. However, the designed functions in 'akc' are general, and could be extended to solve other tasks in text mining as well.
Maintained by Tian-Yuan Huang. Last updated 20 days ago.
1.8 match 15 stars 5.85 score 47 scriptspaithiov909
gibasa:An Alternative 'Rcpp' Wrapper of 'MeCab'
A plain 'Rcpp' wrapper for 'MeCab' that can segment Chinese, Japanese, and Korean text into tokens. The main goal of this package is to provide an alternative to 'tidytext' using morphological analysis.
Maintained by Akiru Kato. Last updated 29 days ago.
0.5 match 15 stars 5.02 score 3 scriptstconwell
textTools:Functions for Text Cleansing and Text Analysis
A framework for text cleansing and analysis. Conveniently prepare and process large amounts of text for analysis. Includes various metrics for word counts/frequencies that scale efficiently. Quickly analyze large amounts of text data using a text.table (a data.table created with one word (or unit of text analysis) per row, similar to the tidytext format). Offers flexibility to efficiently work with text data stored in vectors as well as text data formatted as a text.table.
Maintained by Timothy Conwell. Last updated 4 years ago.
0.5 match 1.00 score 4 scripts