Showing 164 of 164 results

qinwf

jiebaR:Chinese Text Segmentation

Chinese text segmentation, keyword extraction and speech tagging for R.

Maintained by Qin Wenfeng. Last updated 5 years ago.

chinese, chinese-text-segmentation, cppjieba, jieba, lexical-analysis, nlp, cpp

9.6 match 352 stars 10.46 score 456 scripts 6 dependents

r-gregmisc

gtools:Various R Programming Tools

Functions to assist in R programming, including:
- assist in developing, updating, and maintaining R and R packages ('ask', 'checkRVersion', 'getDependencies', 'keywords', 'scat'),
- calculate the logit and inverse logit transformations ('logit', 'inv.logit'),
- test if a value is missing, empty or contains only NA and NULL values ('invalid'),
- manipulate R's .Last function ('addLast'),
- define macros ('defmacro'),
- detect odd and even integers ('odd', 'even'),
- convert strings containing non-ASCII characters (like single quotes) to plain ASCII ('ASCIIfy'),
- perform a binary search ('binsearch'),
- sort strings containing both numeric and character components ('mixedsort'),
- create a factor variable from the quantiles of a continuous variable ('quantcut'),
- enumerate permutations and combinations ('combinations', 'permutations'),
- calculate and convert between fold-change and log-ratio ('foldchange', 'logratio2foldchange', 'foldchange2logratio'),
- calculate probabilities and generate random numbers from Dirichlet distributions ('rdirichlet', 'ddirichlet'),
- apply a function over adjacent subsets of a vector ('running'),
- modify the TCP_NODELAY ('de-Nagle') flag for socket objects,
- efficient 'rbind' of data frames, even if the column names don't match ('smartbind'),
- generate significance stars from p-values ('stars.pval'),
- convert characters to/from ASCII codes ('asc', 'chr'),
- convert character vector to ASCII representation ('ASCIIfy'),
- apply title capitalization rules to a character vector ('capwords').

Maintained by Ben Bolker. Last updated 9 months ago.

5.3 match 25 stars 14.47 score 11k scripts 1.1k dependents
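
Two of the helpers named in the gtools entry, the logit / inverse logit transformations and the sorting of strings with mixed numeric and character parts, are easy to illustrate. The following is a minimal Python sketch of the underlying ideas only; it is not the gtools implementation, and the function names `logit`, `inv_logit` and `mixed_sort` are hypothetical stand-ins chosen for this example.

```python
import math
import re


def logit(p):
    """Log-odds of a probability p strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def inv_logit(x):
    """Inverse of logit: map any real x back to a probability."""
    # Branch on the sign so that exp() never receives a large positive value.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def mixed_sort(strings):
    """Sort strings so that embedded integers compare numerically."""
    def key(s):
        # re.split with a capture group alternates text and digit runs:
        # even positions hold text, odd positions hold digits.
        parts = re.split(r"(\d+)", s)
        return [int(p) if i % 2 else p.lower() for i, p in enumerate(parts)]
    return sorted(strings, key=key)


if __name__ == "__main__":
    print(inv_logit(logit(0.25)))
    print(mixed_sort(["x10", "x2", "x1"]))
```

A plain lexicographic sort would place "x10" before "x2"; splitting out the digit runs and comparing them as integers avoids that.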

emilhvitfeldt

emoji:Data and Function to Work with Emojis

Contains data about emojis with relevant metadata, and functions to work with emojis when they are in strings.

Maintained by Emil Hvitfeldt. Last updated 5 months ago.

6.4 match 28 stars 7.93 score 304 scripts 3 dependents

datawookie

emayili:Send Email Messages

A light, simple tool for sending emails with minimal dependencies.

Maintained by Andrew B. Collier. Last updated 2 months ago.

hacktoberfest

4.9 match 180 stars 9.59 score 95 scripts 3 dependents

cran

discoverableresearch:Checks Title, Abstract and Keywords to Optimise Discoverability

A suite of tools is provided here to support authors in making their research more discoverable.
- check_keywords() - this function checks the keywords to assess whether they are already represented in the title and abstract.
- check_fields() - this function compares terminology used across the title, abstract and keywords to assess where terminological diversity (i.e. the use of synonyms) could increase the likelihood of the record being identified in a search. The function looks for terms in the title and abstract that also exist in other fields and highlights these as needing attention.
- suggest_keywords() - this function takes a full text document and produces a list of unigrams, bigrams and trigrams (1-, 2- or 3-word phrases) present in the full text after removing stop words (words with a low utility in natural language processing) that do not occur in the title or abstract that may be suitable candidates for keywords.
- suggest_title() - this function takes a full text document and produces a list of the most frequently used unigrams, bigrams and trigrams after removing stop words that do not occur in the abstract or keywords that may be suitable candidates for title words.
- check_title() - this function carries out a number of sub tasks: 1) it compares the length (number of words) of the title with the mean length of titles in major bibliographic databases to assess whether the title is likely to be too short; 2) it assesses the proportion of stop words in the title to highlight titles with low utility in search engines that strip out stop words; 3) it compares the title with a given sample of record titles from an .ris import and calculates a similarity score based on phrase overlap. This highlights the level of uniqueness of the title.
This version of the package also contains functions currently in a non-CRAN package called 'litsearchr' <https://github.com/elizagrames/litsearchr>.

Maintained by Neal Haddaway. Last updated 4 years ago.

14.5 match 2.70 score
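
The keyword-suggestion idea described in the discoverableresearch entry (collect unigrams, bigrams and trigrams from the full text after removing stop words, then keep those absent from the title and abstract) can be sketched roughly as follows. This is an illustrative Python sketch, not the package's code: the function name `suggest_candidates`, the tiny stop-word list and the ranking by frequency are all assumptions made for the example.

```python
import re
from collections import Counter

# Deliberately tiny, illustrative stop-word list (an assumption, not the package's list).
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "on", "for", "to",
              "is", "are", "with", "that", "this"}


def tokens(text):
    """Lower-case alphabetic words with stop words removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def ngrams(words, n):
    """All runs of n consecutive words, joined by single spaces."""
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def suggest_candidates(full_text, title_abstract, max_n=3, top=5):
    """Most frequent 1- to max_n-grams of full_text not found in title_abstract."""
    words = tokens(full_text)
    seen = tokens(title_abstract)
    counts = Counter()
    for n in range(1, max_n + 1):
        known = set(ngrams(seen, n))
        counts.update(g for g in ngrams(words, n) if g not in known)
    return [g for g, _ in counts.most_common(top)]


if __name__ == "__main__":
    body = "soil carbon flux and soil carbon storage in forests"
    print(suggest_candidates(body, "Forests and storage"))
```

Because stop words are dropped before the n-grams are formed, words on either side of a removed stop word become adjacent; a stricter variant would break phrases at stop-word boundaries.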

elipousson

d2r:Create Diagrams with D2

Build, read, write, and render diagrams using the D2 syntax.

Maintained by Eli Pousson. Last updated 6 months ago.

d2, visualization

4.0 match 7 stars 3.24 score 6 scripts

ivan-rivera

RedditExtractoR:Reddit Data Extraction Toolkit

A collection of tools for extracting structured data from <https://www.reddit.com/>.

Maintained by Ivan Rivera. Last updated 2 years ago.

data, reddit, scraper

2.0 match 93 stars 6.02 score 153 scripts

cran

qmrparser:Parser Combinator in R

Basic functions for building parsers, with an application to PC-AXIS format files.

Maintained by Juan Gea. Last updated 3 years ago.

3.3 match 1 star 3.26 score 6 dependents

ropensci

internetarchive:An API Client for the Internet Archive

Search the Internet Archive (<https://archive.org>), retrieve metadata, and download files.

Maintained by Ahmet Akkoc. Last updated 4 months ago.

1.8 match 60 stars 5.44 score 23 scripts

hoxo-m

githubinstall:A Helpful Way to Install R Packages Hosted on GitHub

Provides a helpful way to install packages hosted on GitHub.

Maintained by Koji Makiyama. Last updated 7 years ago.

r-language

1.2 match 49 stars 7.29 score 177 scripts

felixfan

PubMedWordcloud:'Pubmed' Word Clouds

Create a word cloud using the abstract of publications from 'Pubmed'.

Maintained by Felix Yanhui Fan. Last updated 6 years ago.

1.8 match 22 stars 4.79 score 28 scripts

mrc-ide

odin2:Next generation odin

Temporary package for rewriting odin.

Maintained by Rich FitzJohn. Last updated 2 months ago.

1.3 match 5 stars 6.32 score 22 scripts

elipousson

officerExtras:Extra Helpers for 'officer'

Helper and convenience functions using the 'officer' package to modify docx files.

Maintained by Eli Pousson. Last updated 3 months ago.

microsoft-word, officer

1.7 match 13 stars 3.41 score 3 scripts

swissstatsr

dcatapchr:Create DCAT-AP CH Metadata Files

Create DCAT-AP CH metadata files, typically in rdf format.

Maintained by Sandro Burri. Last updated 3 months ago.

1.7 match 2.78 score 3 scripts

jl5000

tidyged.io:Import and Export GEDCOM Files

Import and export family tree GEDCOM files to and from tidy dataframes.

Maintained by Jamie Lendrum. Last updated 3 years ago.

1.9 match 2.48 score 2 dependents

kwb-r

kwb.prep:Markdown-Documented Data Preparation

R Package for Markdown-documented data preparation.

Maintained by Hauke Sonnenberg. Last updated 3 years ago.

1.8 match 2.18 score 1 script 1 dependent

paithiov909

aznyan:An 'Utanet' Scraper and Utilities

Scrape lyrics from 'Utanet' website.

Maintained by Akiru Kato. Last updated 11 months ago.

cpp

1.9 match 2.00 score 1 script

cran

pubmed.mineR:Text Mining of PubMed Abstracts

Text mining of PubMed Abstracts (text and XML) from <https://pubmed.ncbi.nlm.nih.gov/>.

Maintained by S. Ramachandran. Last updated 7 months ago.

1.8 match 6 stars 2.08 score

cran

PytrendsLongitudinalR:Create Longitudinal Google Trends Data

'Google Trends' provides cross-sectional and time-series data on searches, but lacks readily available longitudinal data. Researchers who want to create longitudinal 'Google Trends' data on their own face practical challenges, such as normalized counts that make it difficult to combine cross-sectional and time-series data, and limitations in data formats and timelines that limit data granularity over extended time periods. This package addresses these issues and enables researchers to generate longitudinal 'Google Trends' data. This package is built on 'pytrends', a Python library that acts as the unofficial 'Google Trends API' to collect 'Google Trends' data. As long as the 'Google Trends API', 'pytrends' and all their dependencies are working, this package will work. During testing, we noticed that for the same input (keyword, topic, data_format, timeline), the output index can vary from time to time. In addition, if the keyword is not very popular, the resulting dataset will contain a lot of zeros, which will greatly affect the final result. While this package has no control over the accuracy or quality of 'Google Trends' data, once the data is created, this package converts it to longitudinal data. The user may also encounter a 429 Too Many Requests error when using cross_section() and time_series() to collect 'Google Trends' data. This error indicates that the user has exceeded the rate limits set by the 'Google Trends API'. For more information about the 'Google Trends API' - 'pytrends', visit <https://pypi.org/project/pytrends/>.

Maintained by Taeyong Park. Last updated 7 months ago.

0.8 match 2.70 score

phgrosjean

svIDE:Functions to Ease Interactions Between R and IDE or Code Editors

Function for the GUI API to interact with external IDE/code editors.

Maintained by Philippe Grosjean. Last updated 7 years ago.

1.9 match 1.04 score 11 scripts

hakkisabah

tsentiment:Fetching Tweet Data for Sentiment Analysis

Uses Twitter APIs to fetch the data needed for sentiment analysis, acting as middleware through an approved Twitter application. Users who subscribe to the application with their Twitter account are given a special access key. With this access key, a user-defined keyword can be searched in Twitter's recent searches and the results retrieved (more information: <https://github.com/hakkisabah/tsentiment>). In addition, a service named tsentiment-services has been developed to provide all these operations (more information: <https://github.com/hakkisabah/tsentiment-services>). Once results have been obtained, and in line with the permissions given by the user, the word cloud and bar graph produced by the analysis are saved in the user folder directory, where they can be viewed. Each new analysis deletes the visual results of the previous one; this is the basic rule to keep in mind when using the package. The 'tsentiment' package provides a free service that acts as middleware for easy data extraction from Twitter. In return, 30 requests are deducted from the user's total rate limit and reserved for application analytics, and the remaining requests are available to the user. For information about endpoints, refer to the limit information in the "GET search/tweets" row in the Endpoints column in the list at <https://developer.twitter.com/en/docs/twitter-api/v1/rate-limits>.

Maintained by Hakki Sabah. Last updated 2 years ago.

sentiment, sentiment-analysis, tidyverse, twitter-api, twitter-sentiment-analysis

0.5 match 1 star 2.70 score

christopherkenny

acronames:Create Acronyms for Naming Things

Simple tool for developing names based on first letters of keywords.

Maintained by Christopher T. Kenny. Last updated 3 years ago.

0.6 match 1 star 1.70 score 1 script