tidyverse
ggplot2:Create Elegant Data Visualisations Using the Grammar of Graphics
A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
Maintained by Thomas Lin Pedersen. Last updated 4 days ago.
data-visualisation, visualisation
6.6k stars 25.10 score 645k scripts 7.6k dependents
tidyverse
dplyr:A Grammar of Data Manipulation
A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
Maintained by Hadley Wickham. Last updated 26 days ago.
4.8k stars 24.68 score 659k scripts 7.8k dependents
r-lib
rlang:Functions for Base Types and Core R and 'Tidyverse' Features
A toolbox for working with base types, core R features like the condition system, and core 'Tidyverse' features like tidy evaluation.
Maintained by Lionel Henry. Last updated 4 days ago.
517 stars 20.56 score 9.8k scripts 15k dependents
business-science
timetk:A Tool Kit for Working with Time Series
Easy visualization, wrangling, and feature engineering of time series data for forecasting and machine learning prediction. Consolidates and extends time series functionality from packages including 'dplyr', 'stats', 'xts', 'forecast', 'slider', 'padr', 'recipes', and 'rsample'.
Maintained by Matt Dancho. Last updated 1 year ago.
coercion, coercion-functions, data-mining, dplyr, forecast, forecasting, forecasting-models, machine-learning, series-decomposition, series-signature, tibble, tidy, tidyquant, tidyverse, time, time-series, timeseries
626 stars 14.20 score 4.0k scripts 16 dependents
gergness
srvyr:'dplyr'-Like Syntax for Summary Statistics of Survey Data
Use piping, verbs like 'group_by' and 'summarize', and other 'dplyr' inspired syntactic style when calculating summary statistics on survey data using functions from the 'survey' package.
Maintained by Greg Freedman Ellis. Last updated 2 months ago.
215 stars 13.88 score 1.8k scripts 15 dependents
data-cleaning
validate:Data Validation Infrastructure
Declare data validation rules and data quality indicators; confront data with them and analyze or visualize the results. The package supports rules that are per-field, in-record, cross-record or cross-dataset. Rules can be automatically analyzed for rule type and connectivity. Supports checks implied by an SDMX DSD file as well. See also Van der Loo and De Jonge (2018) <doi:10.1002/9781118897126>, Chapter 6 and the JSS paper (2021) <doi:10.18637/jss.v097.i10>.
Maintained by Mark van der Loo. Last updated 25 days ago.
419 stars 12.39 score 448 scripts 8 dependents
markfairbanks
tidytable:Tidy Interface to 'data.table'
A tidy interface to 'data.table', giving users the speed of 'data.table' while using tidyverse-like syntax.
Maintained by Mark Fairbanks. Last updated 2 months ago.
460 stars 11.39 score 732 scripts 11 dependents
bioc
decoupleR:decoupleR: Ensemble of computational methods to infer biological activities from omics data
Many methods allow us to extract biological activities from omics data using information from prior knowledge resources, reducing the dimensionality for increased statistical power and better interpretability. Here, we present decoupleR, a Bioconductor package containing different statistical methods to extract these signatures within a unified framework. decoupleR allows the user to flexibly test any method with any resource. It incorporates methods that take into account the sign and weight of network interactions. decoupleR can be used with any omic, as long as its features can be linked to a biological process based on prior knowledge. For example, in transcriptomics gene sets regulated by a transcription factor, or in phospho-proteomics phosphosites that are targeted by a kinase.
Maintained by Pau Badia-i-Mompel. Last updated 5 months ago.
differentialexpression, functionalgenomics, geneexpression, generegulation, network, software, statisticalmethod, transcription
230 stars 11.27 score 316 scripts 3 dependents
rstudio
pointblank:Data Validation and Organization of Metadata for Local and Remote Tables
Validate data in data frames, 'tibble' objects, 'Spark' 'DataFrames', and database tables. Validation pipelines can be made using easily-readable, consecutive validation steps. Upon execution of the validation plan, several reporting options are available. User-defined thresholds for failure rates allow for the determination of appropriate reporting actions. Many other workflows are available including an information management workflow, where the aim is to record, collect, and generate useful information on data tables.
Maintained by Richard Iannone. Last updated 3 days ago.
data-assertions, data-checker, data-dictionaries, data-frames, data-inference, data-management, data-profiler, data-quality, data-validation, data-verification, database-tables, easy-to-understand, reporting-tool, schema-validation, testing-tools, yaml-configuration
942 stars 10.73 score 284 scripts
business-science
modeltime:The Tidymodels Extension for Time Series Modeling
The time series forecasting framework for use with the 'tidymodels' ecosystem. Models include ARIMA, Exponential Smoothing, and additional time series models from the 'forecast' and 'prophet' packages. Refer to "Forecasting Principles & Practice, Second edition" (<https://otexts.com/fpp2/>). Refer to "Prophet: forecasting at scale" (<https://research.facebook.com/blog/2017/02/prophet-forecasting-at-scale/>).
Maintained by Matt Dancho. Last updated 5 months ago.
arima, data-science, deep-learning, ets, forecasting, machine-learning, machine-learning-algorithms, modeltime, prophet, tbats, tidymodeling, tidymodels, time, time-series, time-series-analysis, timeseries, timeseries-forecasting
551 stars 10.61 score 1.1k scripts 7 dependents
bioc
ballgown:Flexible, isoform-level differential expression analysis
Tools for statistical analysis of assembled transcriptomes, including flexible differential expression analysis, visualization of transcript structures, and matching of assembled transcripts to annotation.
Maintained by Jack Fu. Last updated 5 months ago.
immunooncology, rnaseq, statisticalmethod, preprocessing, differentialexpression
145 stars 10.51 score 338 scripts 1 dependent
ropensci
tidyhydat:Extract and Tidy Canadian 'Hydrometric' Data
Provides functions to access historical and real-time national 'hydrometric' data from Water Survey of Canada data sources (<https://dd.weather.gc.ca/hydrometric/csv/> and <https://collaboration.cmc.ec.gc.ca/cmc/hydrometrics/www/>) and then applies tidy data principles.
Maintained by Sam Albers. Last updated 18 days ago.
citz, government-data, hydrology, hydrometrics, tidy-data, water-resources
71 stars 9.59 score 202 scripts 3 dependents
shixiangwang
sigminer:Extract, Analyze and Visualize Mutational Signatures for Genomic Variations
Genomic alterations including single nucleotide substitution, copy number alteration, etc. are the major force for cancer initialization and development. Due to the specificity of molecular lesions caused by genomic alterations, we can generate characteristic alteration spectra, called 'signature' (Wang, Shixiang, et al. (2021) <DOI:10.1371/journal.pgen.1009557> & Alexandrov, Ludmil B., et al. (2020) <DOI:10.1038/s41586-020-1943-3> & Steele Christopher D., et al. (2022) <DOI:10.1038/s41586-022-04738-6>). This package helps users to extract, analyze and visualize signatures from genomic alteration records, thus providing new insight into cancer study.
Maintained by Shixiang Wang. Last updated 6 months ago.
bayesian-nmf, bioinformatics, cancer-research, cnv, copynumber-signatures, cosmic-signatures, dbs, easy-to-use, indel, mutational-signatures, nmf, nmf-extraction, sbs, signature-extraction, somatic-mutations, somatic-variants, visualization, cpp
150 stars 9.48 score 123 scripts 2 dependents
business-science
modeltime.ensemble:Ensemble Algorithms for Time Series Forecasting with Modeltime
A 'modeltime' extension that implements time series ensemble forecasting methods including model averaging, weighted averaging, and stacking. These techniques are popular methods to improve forecast accuracy and stability.
Maintained by Matt Dancho. Last updated 8 months ago.
ensemble, ensemble-learning, forecast, forecasting, modeltime, stacking, stacking-ensemble, tidymodels, time, time-series, timeseries
77 stars 8.30 score 143 scripts
edjnet
tidywikidatar:Explore 'Wikidata' Through Tidy Data Frames
Query 'Wikidata' API <https://www.wikidata.org/wiki/Wikidata:Main_Page> with ease, get tidy data frames in response, and cache data in a local database.
Maintained by Giorgio Comai. Last updated 8 months ago.
26 stars 7.76 score 46 scripts 2 dependents
ropensci
ruODK:An R Client for the ODK Central API
Access and tidy up data from the 'ODK Central' API. 'ODK Central' is a clearinghouse for digitally captured data using ODK <https://docs.getodk.org/central-intro/>. It manages user accounts and permissions, stores form definitions, and allows data collection clients like 'ODK Collect' to connect to it for form download and submission upload. The 'ODK Central' API is documented at <https://docs.getodk.org/central-api/>.
Maintained by Florian W. Mayer. Last updated 5 months ago.
database, open-data, odk, api, data, dataset, odata, odata-client, odk-central, opendatakit
42 stars 7.73 score 57 scripts 1 dependent
spsanderson
healthyR.ts:The Time Series Modeling Companion to 'healthyR'
Hospital time series data analysis workflow tools, modeling, and automations. This library provides many useful tools to review common administrative time series hospital data. Some of these include average length of stay, and readmission rates. The aim is to provide a simple and consistent verb framework that takes the guesswork out of everything.
Maintained by Steven Sanderson. Last updated 6 months ago.
ai, arima-forecasting, arima-model, ets, forecasting, ggplot2, machine-learning, modeling, prophet, time-series, time-series-analysis, workflows
19 stars 7.58 score 56 scripts 1 dependent
rmi-pacta
r2dii.analysis:Measure Climate Scenario Alignment of Corporate Loans
These tools help you to assess if a corporate lending portfolio aligns with climate goals. They summarize key climate indicators attributed to the portfolio (e.g. production, emission factors), and calculate alignment targets based on climate scenarios. They implement in R the last step of the free software 'PACTA' (Paris Agreement Capital Transition Assessment; <https://www.transitionmonitor.com/>). Financial institutions use 'PACTA' to study how their capital allocation decisions align with climate change mitigation goals.
Maintained by Jacob Kastl. Last updated 25 days ago.
12 stars 7.45 score 46 scripts 2 dependents
spsanderson
healthyR.ai:The Machine Learning and AI Modeling Companion to 'healthyR'
Hospital machine learning and AI data analysis workflow tools, modeling, and automations. This library provides many useful tools to review common administrative hospital data. Some of these include predicting length of stay and readmits. The aim is to provide a simple and consistent verb framework that takes the guesswork out of everything.
Maintained by Steven Sanderson. Last updated 2 months ago.
ai, artificial-intelligence, healthcareanalytics, healthyr, healthyverse, machine-learning
16 stars 7.37 score 36 scripts 1 dependent
spsanderson
healthyR:Hospital Data Analysis Workflow Tools
Hospital data analysis workflow tools, modeling, and automations. This library provides many useful tools to review common administrative hospital data. Some of these include average length of stay, readmission rates, and average net pay amounts by service line, just to name a few. The aim is to provide a simple and consistent verb framework that takes the guesswork out of everything.
Maintained by Steven Sanderson. Last updated 9 months ago.
analysis, analytics, healthcare, healthyr
30 stars 7.27 score 103 scripts 1 dependent
shixiangwang
ezcox:Easily Process a Batch of Cox Models
A tool to run a batch of univariate or multivariate Cox models and return a tidy result.
Maintained by Shixiang Wang. Last updated 1 year ago.
21 stars 7.22 score 44 scripts 1 dependent
padrinodb
ipmr:Integral Projection Models
Flexibly implements Integral Projection Models using a mathematical(ish) syntax. This package will not help with the vital rate modeling process, but will help convert those regression models into an IPM. 'ipmr' handles density dependence and environmental stochasticity, with a couple of options for implementing the latter. In addition, provides functions to avoid unintentional eviction of individuals from models. Additionally, provides model diagnostic tools, plotting functionality, stochastic/deterministic simulations, and analysis tools. Integral projection models are described in depth by Easterling et al. (2000) <doi:10.1890/0012-9658(2000)081[0694:SSSAAN]2.0.CO;2>, Merow et al. (2013) <doi:10.1111/2041-210X.12146>, Rees et al. (2014) <doi:10.1111/1365-2656.12178>, and Metcalf et al. (2015) <doi:10.1111/2041-210X.12405>. Williams et al. (2012) <doi:10.1890/11-2147.1> discuss the problem of unintentional eviction.
Maintained by Sam Levin. Last updated 5 months ago.
demography, integral-projection-models, cpp
7 stars 6.92 score 66 scripts 1 dependent
business-science
modeltime.resample:Resampling Tools for Time Series Forecasting
A 'modeltime' extension that implements forecast resampling tools that assess time-based model performance and stability for a single time series, panel data, and cross-sectional time series analysis.
Maintained by Matt Dancho. Last updated 1 year ago.
accuracy-metrics, backtesting, bootstrap, bootstrapping, cross-validation, forecasting, modeltime, modeltime-resample, resampling, statistics, tidymodels, time-series
19 stars 6.64 score 38 scripts 1 dependent
fredhasselman
invctr:Infix Functions For Vector Operations
Vector operations between grapes: An infix-only package! The 'invctr' functions perform common and less common operations on vectors, data frames, matrices and list objects: - Extracting a value (range), or finding the indices of a value (range). - Trimming or padding a vector with a value of your choice. - Simple polynomial regression. - Set and membership operations. - General check & replace function for NAs, Inf and other values.
Maintained by Fred Hasselman. Last updated 1 month ago.
5 stars 5.30 score 40 scripts
spsanderson
healthyverse:Easily Install and Load the 'healthyverse'
The 'healthyverse' is a set of packages that work in harmony because they share common data representations and 'API' design. This package is designed to make it easy to install and load multiple 'healthyverse' packages in a single step.
Maintained by Steven Sanderson. Last updated 6 months ago.
analytics, healthcare, healthcare-application, installation, installer, metapackages
11 stars 5.12 score 24 scripts
padrinodb
Rpadrino:Interact with the 'PADRINO' IPM Database
'PADRINO' houses textual representations of Integral Projection Models which can be converted from their table format into full kernels to reproduce or extend an already published analysis. 'Rpadrino' is an R interface to this database. For more information on Integral Projection Models, see Easterling et al. (2000) <doi:10.1890/0012-9658(2000)081[0694:SSSAAN]2.0.CO;2>, Merow et al. (2013) <doi:10.1111/2041-210X.12146>, Rees et al. (2014) <doi:10.1111/1365-2656.12178>, and Metcalf et al. (2015) <doi:10.1111/2041-210X.12405>. See Levin et al. (2021) for more information on 'ipmr', the engine that powers model reconstruction <doi:10.1111/2041-210X.13683>.
Maintained by Sam Levin. Last updated 2 years ago.
3 stars 4.95 score 15 scripts
jonnob
rsetse:Strain Elevation Tension Spring Embedding
An R implementation for the Strain Elevation and Tension embedding algorithm from Bourne (2020) <doi:10.1007/s41109-020-00329-4>. The package embeds graphs and networks using the Strain Elevation and Tension embedding (SETSe) algorithm. SETSe represents the network as a physical system, where edges are elastic, and nodes exert a force either up or down based on node features. SETSe positions the nodes vertically such that the tension in the edges of a node is equal and opposite to the force it exerts for all nodes in the network. The resultant structure can then be analysed by looking at the node elevation and the edge strain and tension. This algorithm works on weighted and unweighted networks as well as networks with or without explicit node features. Edge elasticity can be created from existing edge weights or kept as a constant.
Maintained by Jonathan Bourne. Last updated 3 years ago.
embedding, embedding-graphs, graph-embedding, igraph, networks, networkscience, unsupervised-learning, openblas, cpp, openmp
7 stars 4.85 score 8 scripts
rdinnager
slimr:Create, Run and Post-Process 'SLiM' Population Genetics Forward Simulations
Lets you write 'SLiM' scripts (population genomics simulation) in your favourite R IDE, using a syntax as close as possible to the original 'SLiM' language. It offers many tools to manipulate those scripts, run them in the 'SLiM' software from R, and capture and post-process their output, after or even during a simulation.
Maintained by Russell Dinnage. Last updated 5 months ago.
8 stars 4.70 score 42 scripts
shixiangwang
tinyscholar:Get and Show Personal 'Google Scholar' Profile
Provides functions to get personal 'Google Scholar' profile data from web API and show it in table or figure format.
Maintained by Shixiang Wang. Last updated 1 year ago.
8 stars 4.60 score 7 scripts
giocomai
castarter:Content Analysis Starter Toolkit
Consistent approaches for basic web scraping, text mining and word frequency analysis of textual datasets.
Maintained by Giorgio Comai. Last updated 14 hours ago.
3 stars 4.59 score 2 scripts
shixiangwang
metawho:Meta-Analytical Implementation to Identify Who Benefits Most from Treatments
A tool for implementing the so-called 'deft' approach (see Fisher, David J., et al. (2017) <DOI:10.1136/bmj.j573>) and for model visualization.
Maintained by Shixiang Wang. Last updated 5 years ago.
deft-approach, meta-analysis, subgroup-analysis
7 stars 4.54 score 9 scripts
berkorbay
ibb:R Wrapper for Istanbul Municipality Open Data Portal
Call wrappers for Istanbul Metropolitan Municipality's Open Data Portal (Turkish: İstanbul Büyükşehir Belediyesi Açık Veri Portalı) at <https://data.ibb.gov.tr/en/>.
Maintained by Berk Orbay. Last updated 4 years ago.
10 stars 4.00 score 9 scripts
tjmahr
notestar:Notebooks Using 'Targets' and 'Bookdown'
'Targets' is an R package for dependency and build management in data analysis projects. This package provides a set of targets and project infrastructure to create 'bookdown'-based notebooks using 'targets'.
Maintained by Tristan Mahr. Last updated 2 months ago.
bookdown, knitr, pandoc, rmarkdown, targets
30 stars 3.18 score 7 scripts
jimbrig
jimstools:Tools for R
What the package does (one paragraph).
Maintained by Jimmy Briggs. Last updated 3 years ago.
2 stars 3.00 score 2 scripts
rdinnager
impac:Make packed image mosaics
impac takes a list of images or a function that generates images and packs them onto a canvas using a greedy algorithm, while respecting transparency.
Maintained by Russell Dinnage. Last updated 2 years ago.
14 stars 2.85 score 4 scripts
tjmahr
tjmisc:TJ's Miscellany
A collection of helper functions.
Maintained by Tristan Mahr. Last updated 2 years ago.
10 stars 2.70 score 5 scripts
jemus42
poddr:Collect Metadata for Selected Podcasts
Collecting all the data, but just for The Incomparable, Relay.fm and ATP.
Maintained by Lukas Burk. Last updated 27 days ago.
1 star 2.54 score 6 scripts
igordot
scooter:Streamlined scRNA-Seq Analysis Pipeline
Streamlined scRNA-Seq analysis pipeline.
Maintained by Igor Dolgalev. Last updated 1 year ago.
4 stars 2.51 score 16 scripts
wjschne
WJSmisc:Miscellaneous functions from W. Joel Schneider
Several functions I find useful.
Maintained by W. Joel Schneider. Last updated 2 years ago.
5 stars 2.40 score 10 scripts