Showing 200 of total 227 results (show query)

tidyverse

dplyr:A Grammar of Data Manipulation

A fast, consistent tool for working with data frame like objects, both in memory and out of memory.

Maintained by Hadley Wickham. Last updated 13 days ago.

data-manipulationgrammarcpp

9.8 match 4.8k stars 24.68 score 659k scripts 7.8k dependents

markfairbanks

tidytable:Tidy Interface to 'data.table'

A tidy interface to 'data.table', giving users the speed of 'data.table' while using tidyverse-like syntax.

Maintained by Mark Fairbanks. Last updated 2 months ago.

5.0 match 458 stars 11.41 score 732 scripts 10 dependents

mbedward

packcircles:Circle Packing

Algorithms to find arrangements of non-overlapping circles.

Maintained by Michael Bedward. Last updated 4 months ago.

cpp

5.7 match 57 stars 10.06 score 422 scripts 6 dependents

nathaneastwood

poorman:A Poor Man's Dependency Free Recreation of 'dplyr'

A replication of key functionality from 'dplyr' and the wider 'tidyverse' using only 'base'.

Maintained by Nathan Eastwood. Last updated 1 years ago.

base-rdata-manipulationgrammar

5.0 match 341 stars 10.79 score 156 scripts 27 dependents

branchlab

metasnf:Meta Clustering with Similarity Network Fusion

Framework to facilitate patient subtyping with similarity network fusion and meta clustering. The similarity network fusion (SNF) algorithm was introduced by Wang et al. (2014) in <doi:10.1038/nmeth.2810>. SNF is a data integration approach that can transform high-dimensional and diverse data types into a single similarity network suitable for clustering with minimal loss of information from each initial data source. The meta clustering approach was introduced by Caruana et al. (2006) in <doi:10.1109/ICDM.2006.103>. Meta clustering involves generating a wide range of cluster solutions by adjusting clustering hyperparameters, then clustering the solutions themselves into a manageable number of qualitatively similar solutions, and finally characterizing representative solutions to find ones that are best for the user's specific context. This package provides a framework to easily transform multi-modal data into a wide range of similarity network fusion-derived cluster solutions as well as to visualize, characterize, and validate those solutions. Core package functionality includes easy customization of distance metrics, clustering algorithms, and SNF hyperparameters to generate diverse clustering solutions; calculation and plotting of associations between features, between patients, and between cluster solutions; and standard cluster validation approaches including resampled measures of cluster stability, standard metrics of cluster quality, and label propagation to evaluate generalizability in unseen data. Associated vignettes guide the user through using the package to identify patient subtypes while adhering to best practices for unsupervised learning.

Maintained by Prashanth S Velayudhan. Last updated 5 days ago.

bioinformaticsclusteringmetaclusteringsnf

5.0 match 8 stars 8.21 score 30 scripts

david-barnett

microViz:Microbiome Data Analysis and Visualization

Microbiome data visualization and statistics tools built upon phyloseq.

Maintained by David Barnett. Last updated 3 months ago.

microbiomemicrobiome-analysismicrobiota

5.3 match 114 stars 6.22 score 480 scripts

mitchelloharawild

vitae:Curriculum Vitae for R Markdown

Provides templates and functions to simplify the production and maintenance of curriculum vitae.

Maintained by Mitchell OHara-Wild. Last updated 9 months ago.

cvozunconf18resumeunconf

3.0 match 1.2k stars 10.78 score 556 scripts

insightsengineering

tern:Create Common TLGs Used in Clinical Trials

Table, Listings, and Graphs (TLG) library for common outputs used in clinical trials.

Maintained by Joe Zhu. Last updated 2 months ago.

clinical-trialsgraphslistingsnestoutputstables

2.0 match 79 stars 12.62 score 186 scripts 9 dependents

ycroissant

dfidx:Indexed Data Frames

Provides extended data frames, with a special data frame column which contains two indexes, with potentially a nesting structure.

Maintained by Yves Croissant. Last updated 7 months ago.

3.0 match 2 stars 6.85 score 44 scripts 18 dependents

dmurdoch

plotrix:Various Plotting Functions

Lots of plots, various labeling, axis and color scaling functions. The author/maintainer died in September 2023.

Maintained by Duncan Murdoch. Last updated 1 years ago.

1.8 match 5 stars 11.31 score 9.2k scripts 361 dependents

poissonconsulting

mcmcdata:Manipulate MCMC Samples and Data Frames

Manipulates Monte Carlo Markov Chain samples and associated data frames.

Maintained by Joe Thorley. Last updated 2 months ago.

5.0 match 1 stars 3.56 score 4 scripts 4 dependents

nepem-ufsc

metan:Multi Environment Trials Analysis

Performs stability analysis of multi-environment trial data using parametric and non-parametric methods. Parametric methods includes Additive Main Effects and Multiplicative Interaction (AMMI) analysis by Gauch (2013) <doi:10.2135/cropsci2013.04.0241>, Ecovalence by Wricke (1965), Genotype plus Genotype-Environment (GGE) biplot analysis by Yan & Kang (2003) <doi:10.1201/9781420040371>, geometric adaptability index by Mohammadi & Amri (2008) <doi:10.1007/s10681-007-9600-6>, joint regression analysis by Eberhart & Russel (1966) <doi:10.2135/cropsci1966.0011183X000600010011x>, genotypic confidence index by Annicchiarico (1992), Murakami & Cruz's (2004) method, power law residuals (POLAR) statistics by Doring et al. (2015) <doi:10.1016/j.fcr.2015.08.005>, scale-adjusted coefficient of variation by Doring & Reckling (2018) <doi:10.1016/j.eja.2018.06.007>, stability variance by Shukla (1972) <doi:10.1038/hdy.1972.87>, weighted average of absolute scores by Olivoto et al. (2019a) <doi:10.2134/agronj2019.03.0220>, and multi-trait stability index by Olivoto et al. (2019b) <doi:10.2134/agronj2019.03.0221>. Non-parametric methods includes superiority index by Lin & Binns (1988) <doi:10.4141/cjps88-018>, nonparametric measures of phenotypic stability by Huehn (1990) <doi:10.1007/BF00024241>, TOP third statistic by Fox et al. (1990) <doi:10.1007/BF00040364>. Functions for computing biometrical analysis such as path analysis, canonical correlation, partial correlation, clustering analysis, and tools for inspecting, manipulating, summarizing and plotting typical multi-environment trial data are also provided.

Maintained by Tiago Olivoto. Last updated 10 days ago.

1.9 match 2 stars 9.48 score 1.3k scripts 2 dependents

kwb-r

kwb.plot:some useful functions for plotting

Some useful functions for plotting.

Maintained by Hauke Sonnenberg. Last updated 1 years ago.

data-visualisationproject-fakinproject-miacso

3.6 match 3.59 score 1 scripts 26 dependents

mirzaghaderi

rtpcr:qPCR Data Analysis

Various methods are employed for statistical analysis and graphical presentation of real-time PCR (quantitative PCR or qPCR) data. 'rtpcr' handles amplification efficiency calculation, statistical analysis and graphical representation of real-time PCR data based on up to two reference genes. By accounting for amplification efficiency values, 'rtpcr' was developed using a general calculation method described by Ganger et al. (2017) <doi:10.1186/s12859-017-1949-5> and Taylor et al. (2019) <doi:10.1016/j.tibtech.2018.12.002>, covering both the Livak and Pfaffl methods. Based on the experimental conditions, the functions of the 'rtpcr' package use t-test (for experiments with a two-level factor), analysis of variance (ANOVA), analysis of covariance (ANCOVA) or analysis of repeated measure data to calculate the fold change (FC, Delta Delta Ct method) or relative expression (RE, Delta Ct method). The functions further provide standard errors and confidence intervals for means, apply statistical mean comparisons and present significance. To facilitate function application, different data sets were used as examples and the outputs were explained. ‘rtpcr’ package also provides bar plots using various controlling arguments. The 'rtpcr' package is user-friendly and easy to work with and provides an applicable resource for analyzing real-time PCR data.

Maintained by Ghader Mirzaghaderi. Last updated 26 days ago.

data-analysisqpcr

2.4 match 1 stars 4.88 score 3 scripts

nicchr

fastplyr:Fast Alternatives to 'tidyverse' Functions

A full set of fast data manipulation tools with a tidy front-end and a fast back-end using 'collapse' and 'cheapr'.

Maintained by Nick Christofides. Last updated 24 days ago.

cpp

1.8 match 23 stars 6.32 score 36 scripts 1 dependents

kaiaragaki

gplate:A Grammar of Plates

`gplate` attempts to provide a succinct yet powerful grammar to describe common microwell layouts to aide in both plotting and tidying.

Maintained by Kai Aragaki. Last updated 7 months ago.

ggplot2

1.7 match 4 stars 4.56 score 9 scripts 3 dependents

alanarnholt

PASWR2:Probability and Statistics with R, Second Edition

Functions and data sets for the text Probability and Statistics with R, Second Edition.

Maintained by Alan T. Arnholt. Last updated 3 years ago.

1.8 match 1 stars 4.24 score 260 scripts

skranz

sktools:Helpful functions used in my courses

Several helpful functions that I use in my courses

Maintained by Sebastian Kranz. Last updated 4 years ago.

3.4 match 1 stars 2.15 score 28 scripts

jakubnowicki

fixtuRes:Mock Data Generator

Generate mock data in R using YAML configuration.

Maintained by Jakub Nowicki. Last updated 3 years ago.

fixturesmock-datamock-data-generatortest-data-generatoryaml-configuration

1.3 match 16 stars 4.98 score 12 scripts

wraff

wrProteo:Proteomics Data Analysis Functions

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium <doi:10.1093/nar/gky1049>) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Initial results from multiple software for protein (and peptide) quantitation can be imported (to a common format): MaxQuant (Tyanova et al 2016 <doi:10.1038/nprot.2016.136>), Dia-NN (Demichev et al 2020 <doi:10.1038/s41592-019-0638-x>), Fragpipe (da Veiga et al 2020 <doi:10.1038/s41592-020-0912-y>), ionbot (Degroeve et al 2021 <doi:10.1101/2021.07.02.450686>), MassChroq (Valot et al 2011 <doi:10.1002/pmic.201100120>), OpenMS (Strauss et al 2021 <doi:10.1038/nmeth.3959>), ProteomeDiscoverer (Orsburn 2021 <doi:10.3390/proteomes9010015>), Proline (Bouyssie et al 2020 <doi:10.1093/bioinformatics/btaa118>), AlphaPept (preprint Strauss et al <doi:10.1101/2021.07.23.453379>) and Wombat-P (Bouyssie et al 2023 <doi:10.1021/acs.jproteome.3c00636>. Meta-data provided by initial analysis software and/or in sdrf format can be integrated to the analysis. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. Help is provided to inspect the data graphically to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement algorithms. Meta-data in sdrf-format (Perez-Riverol et al 2020 <doi:10.1021/acs.jproteome.0c00376>) or similar tabular formats can be imported and included. Missing values can be inspected and imputed based on the concept of NA-neighbours or other methods. Dedicated filtering and statistical testing using the framework of package 'limma' <doi:10.18129/B9.bioc.limma> can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 <doi:10.1038/nbt.3685>, Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>), can be run with special options considering such sub-groups during normalization and testing. Subsequently, ROC curves (Hand and Till 2001 <doi:10.1023/A:1010920819831>) can be constructed to compare multiple analysis approaches. As detailed example the data-set from Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>) quantified by MaxQuant, ProteomeDiscoverer, and Proline is provided with a detailed analysis of heterologous spike-in proteins.

Maintained by Wolfgang Raffelsberger. Last updated 4 months ago.

1.3 match 3.67 score 17 scripts 1 dependents

skranz

rmdtools:Tools for RMarkdown

Tools for RMarkdown

Maintained by Sebastian Kranz. Last updated 4 years ago.

2.3 match 1 stars 1.78 score 6 scripts 2 dependents

kylebittinger

usedist:Distance Matrix Utilities

Functions to re-arrange, extract, and work with distances.

Maintained by Kyle Bittinger. Last updated 10 months ago.

0.6 match 14 stars 6.63 score 169 scripts 6 dependents

ecohealthalliance

ehallm:What the Package Does (Title Case)

More about what it does (maybe more than one line) Use four spaces when indenting paragraphs within the Description.

Maintained by The package maintainer. Last updated 4 months ago.

1.8 match 1 stars 2.18 score

dwbapst

paleotree:Paleontological and Phylogenetic Analyses of Evolution

Provides tools for transforming, a posteriori time-scaling, and modifying phylogenies containing extinct (i.e. fossil) lineages. In particular, most users are interested in the functions timePaleoPhy, bin_timePaleoPhy, cal3TimePaleoPhy and bin_cal3TimePaleoPhy, which date cladograms of fossil taxa using stratigraphic data. This package also contains a large number of likelihood functions for estimating sampling and diversification rates from different types of data available from the fossil record (e.g. range data, occurrence data, etc). paleotree users can also simulate diversification and sampling in the fossil record using the function simFossilRecord, which is a detailed simulator for branching birth-death-sampling processes composed of discrete taxonomic units arranged in ancestor-descendant relationships. Users can use simFossilRecord to simulate diversification in incompletely sampled fossil records, under various models of morphological differentiation (i.e. the various patterns by which morphotaxa originate from one another), and with time-dependent, longevity-dependent and/or diversity-dependent rates of diversification, extinction and sampling. Additional functions allow users to translate simulated ancestor-descendant data from simFossilRecord into standard time-scaled phylogenies or unscaled cladograms that reflect the relationships among taxon units.

Maintained by David W. Bapst. Last updated 8 months ago.

0.5 match 21 stars 7.53 score 216 scripts 2 dependents