Showing 200 of 294 results

nflverse

nflreadr:Download 'nflverse' Data

A minimal package for downloading data from 'GitHub' repositories of the 'nflverse' project.

Maintained by Tan Ho. Last updated 4 months ago.

nfl, nflfastr, nflverse, sports-data

142.8 match 66 stars 12.46 score 476 scripts 10 dependents

trinker

lexicon:Lexicons for Text Analysis

A collection of lexical hash tables, dictionaries, and word lists.

Maintained by Tyler Rinker. Last updated 3 years ago.

hash, lexicon, lookup, names-frequent, stopwords, text-dictionaries, text-mining

14.9 match 111 stars 8.80 score 224 scripts 25 dependents

epicentre-msf

dbc:Dictionary-Based Cleaning

Tools for dictionary-based data cleaning.

Maintained by Patrick Barks. Last updated 1 year ago.

27.4 match 2 stars 2.48 score 4 scripts 1 dependent

patzaw

BED:Biological Entity Dictionary (BED)

An interface for the 'Neo4j' database providing mapping between different identifiers of biological entities. This Biological Entity Dictionary (BED) has been developed to address three main challenges. The first one is related to the completeness of identifier mappings. Indeed, direct mapping information provided by the different systems is not always complete and can be enriched by mappings provided by other resources. More interestingly, direct mappings not identified by any of these resources can be indirectly inferred by using mappings to a third reference. For example, many human Ensembl gene IDs are not directly mapped to any Entrez gene ID, but such mappings can be inferred using their respective mappings to HGNC IDs. The second challenge is related to the mapping of deprecated identifiers. Indeed, entity identifiers can change from one resource release to another. The identifier history is provided by some resources, such as Ensembl or the NCBI, but it is generally not used by mapping tools. The third challenge is related to the automation of the mapping process according to the relationships between the biological entities of interest. Indeed, mapping between gene and protein ID scopes should not be done the same way as mapping between two gene ID scopes. Also, converting identifiers from different organisms should be possible using gene ortholog information. The method has been published by Godard and van Eyll (2018) <doi:10.12688/f1000research.13925.3>.

Maintained by Patrice Godard. Last updated 3 months ago.

9.5 match 8 stars 6.85 score 25 scripts

qinwf

jiebaR:Chinese Text Segmentation

Chinese text segmentation, keyword extraction and speech tagging for R.

Maintained by Qin Wenfeng. Last updated 5 years ago.

chinese, chinese-text-segmentation, cppjieba, jieba, lexical-analysis, nlp, cpp

6.0 match 348 stars 10.18 score 456 scripts 6 dependents

thomaschln

kgraph:Knowledge Graphs Constructions and Visualizations

Knowledge graphs make it possible to efficiently visualize and gain insights into large-scale data analysis results, such as p-values from multiple studies or embedding data matrices. In the usual workflow, a user provides a data frame of association study results and specifies target nodes, e.g. phenotypes, to visualize. The knowledge graph then shows all the features that are significantly associated with the phenotype, with the edges being proportional to the association scores. As the user adds several target nodes and grouping information about the nodes, such as biological pathways, the construction of such graphs soon becomes complex. The 'kgraph' package aims to enable users to easily build such knowledge graphs, and provides two main features: first, building a knowledge graph based on a data frame of concept relationships, be it p-values or cosine similarities; second, determining an appropriate cut-off on cosine similarities from a complete embedding matrix, so that a knowledge graph can be built directly from an embedding matrix. The 'kgraph' package provides several display, layout and cut-off options, and has already proven useful to researchers for visualizing large sets of p-value associations with various phenotypes and for quickly visualizing embedding results. Two example datasets are provided to demonstrate these behaviors, and several live 'shiny' applications are hosted by the CELEHS laboratory and Parse Health, such as the KESER Mental Health application <https://keser-mental-health.parse-health.org/> based on Hong C. (2021) <doi:10.1038/s41746-021-00519-z>.

Maintained by Thomas Charlon. Last updated 24 days ago.

11.3 match 4.85 score

trinker

qdapTools:Tools for the 'qdap' Package

A collection of tools associated with the 'qdap' package that may be useful outside of the context of text analysis.

Maintained by Tyler Rinker. Last updated 2 years ago.

7.2 match 16 stars 7.04 score 408 scripts 5 dependents

rstudio

rstudioapi:Safely Access the RStudio API

Access the RStudio API (if available) and provide informative error messages when it's not.

Maintained by Kevin Ushey. Last updated 4 months ago.

2.0 match 172 stars 18.81 score 3.6k scripts 2.1k dependents

melff

RKernel:Yet another R kernel for Jupyter

Provides a kernel for Jupyter.

Maintained by Martin Elff. Last updated 14 days ago.

jupyter, jupyter-kernel, jupyter-kernels, jupyter-notebook

7.0 match 38 stars 4.60 score

predictiveecology

NetLogoR:Build and Run Spatially Explicit Agent-Based Models

Build and run spatially explicit agent-based models using only the R platform. 'NetLogoR' follows the same framework as the 'NetLogo' software (Wilensky (1999) <http://ccl.northwestern.edu/netlogo/>) and is a translation in R of the structure and functions of 'NetLogo'. 'NetLogoR' provides new R classes to define model agents and functions to implement spatially explicit agent-based models in the R environment. This package lets users benefit from the fast and easy coding phase of the highly developed 'NetLogo' framework, coupled with the versatility, power and massive resources of the R software. Examples of two models from the NetLogo software repository, Ants (<http://ccl.northwestern.edu/netlogo/models/Ants>) and Wolf-Sheep-Predation (<http://ccl.northwestern.edu/netlogo/models/WolfSheepPredation>), and a third, Butterfly, from Railsback and Grimm (2012) <https://www.railsback-grimm-abm-book.com/>, all written using 'NetLogoR', are available. The 'NetLogo' code of the original version of these models is provided alongside. A programming guide inspired by the 'NetLogo' Programming Guide (<https://ccl.northwestern.edu/netlogo/docs/programming.html>) and a dictionary of 'NetLogo' primitive equivalences (<https://ccl.northwestern.edu/netlogo/docs/dictionary.html>) are also available. NOTE: To increment 'time', these functions can use a for loop or can be integrated with a discrete event simulator, such as 'SpaDES' (<https://cran.r-project.org/package=SpaDES>). The suggested package 'fastshp' can be installed with 'install.packages("fastshp", repos = ("<https://rforge.net>"), type = "source")'.

Maintained by Eliot J B McIntire. Last updated 4 months ago.

4.5 match 38 stars 6.94 score 19 scripts

vincentarelbundock

countrycode:Convert Country Names and Country Codes

Standardize country names, convert them into one of 40 different coding schemes, convert between coding schemes, and assign region descriptors.

Maintained by Vincent Arel-Bundock. Last updated 3 months ago.

2.0 match 351 stars 14.80 score 6.3k scripts 119 dependents

paithiov909

kelpbeds:Dictionary Tool for 'MeCab'

Provides the source 'IPAdic' for 'MeCab'.

Maintained by Akiru Kato. Last updated 11 months ago.

16.2 match 1.70 score

loelschlaeger

oeli:Utilities for Developing Data Science Software

Some general helper functions that I (and maybe others) find useful when developing data science software.

Maintained by Lennart Oelschläger. Last updated 4 months ago.

openblas, cpp

5.0 match 2 stars 5.42 score 1 script 4 dependents

trinker

qdapDictionaries:Dictionaries and Word Lists for the 'qdap' Package

A collection of text analysis dictionaries and word lists for use with the 'qdap' package.

Maintained by Tyler Rinker. Last updated 7 years ago.

3.6 match 4 stars 5.99 score 113 scripts 6 dependents

nt-williams

codebreak:Label Data Using a YAML Codebook

A light-weight framework for labeling coded data using a codebook saved as YAML text file.

Maintained by Nick Williams. Last updated 7 months ago.

codebook, data-dictionary

7.5 match 6 stars 2.48 score 1 script

usaid-oha-si

mindthegap:Mind the Gap

Package to tidy UNAIDS estimates (from the EDMS database) as well as plot trends in UNAIDS 95 goals and ART coverage gap by country.

Maintained by Karishma Srikanth. Last updated 2 months ago.

3.1 match 5 stars 5.51 score 13 scripts

scholaempirica

reschola:The Schola Empirica Package

A collection of utilities, themes and templates for data analysis at Schola Empirica.

Maintained by Jan Netík. Last updated 5 months ago.

3.5 match 4 stars 4.83 score 14 scripts

christopherkenny

acronames:Create Acronyms for Naming Things

Simple tool for developing names based on first letters of keywords.

Maintained by Christopher T. Kenny. Last updated 3 years ago.

9.8 match 1 star 1.70 score 1 script

kwb-r

kwb.monitoring:Functions Used Within Different KWB Monitoring Projects

Functions used within different KWB projects dealing with monitoring data.

Maintained by Hauke Sonnenberg. Last updated 6 years ago.

monitoring

4.1 match 3.78 score 3 scripts 4 dependents

ropensci

popler:Popler R Package

Browse and query the popler database.

Maintained by Compagnoni Aldo. Last updated 5 years ago.

3.7 match 7 stars 3.82 score 47 scripts

epicentre-msf

redcap:R Utilities For REDCap

R utilities for interacting with the REDCap API.

Maintained by Patrick Barks. Last updated 3 months ago.

3.5 match 7 stars 3.45 score 5 scripts