Showing 200 of total 1051 results (show query)
msalibian
RobStatTM:Robust Statistics: Theory and Methods
Companion package for the book: "Robust Statistics: Theory and Methods, second edition", <http://www.wiley.com/go/maronna/robust>. This package contains code that implements the robust estimators discussed in the recent second edition of the book above, as well as the scripts reproducing all the examples in the book.
Maintained by Matias Salibian-Barrera. Last updated 2 days ago.
robustrobust-estimationrobust-regressionrobust-statisticsrobustnessstatisticsfortranopenblas
77.5 match 17 stars 10.23 score 84 scripts 8 dependentsvalentint
robust:Port of the S+ "Robust Library"
Methods for robust statistics, a state of the art in the early 2000s, notably for robust regression and robust multivariate analysis.
Maintained by Valentin Todorov. Last updated 7 months ago.
104.2 match 7.52 score 572 scripts 8 dependentsr-forge
robustbase:Basic Robust Statistics
"Essential" Robust Statistics. Tools allowing to analyze data with robust methods. This includes regression methodology including model selections and multivariate statistics where we strive to cover the book "Robust Statistics, Theory and Methods" by 'Maronna, Martin and Yohai'; Wiley 2006.
Maintained by Martin Maechler. Last updated 4 months ago.
57.7 match 13.33 score 1.7k scripts 480 dependentsvalentint
rrcov:Scalable Robust Estimators with High Breakdown Point
Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point: principal component analysis (Filzmoser and Todorov (2013), <doi:10.1016/j.ins.2012.10.017>), linear and quadratic discriminant analysis (Todorov and Pires (2007)), multivariate tests (Todorov and Filzmoser (2010) <doi:10.1016/j.csda.2009.08.015>), outlier detection (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>). See also Todorov and Filzmoser (2009) <urn:isbn:978-3838108148>, Todorov and Filzmoser (2010) <doi:10.18637/jss.v032.i03> and Boudt et al. (2019) <doi:10.1007/s11222-019-09869-x>.
Maintained by Valentin Todorov. Last updated 7 months ago.
57.1 match 2 stars 10.57 score 484 scripts 96 dependentstobiasschoch
robsurvey:Robust Survey Statistics Estimation
Robust (outlier-resistant) estimators of finite population characteristics like of means, totals, ratios, regression, etc. Available methods are M- and GM-estimators of regression, weight reduction, trimming, and winsorization. The package extends the 'survey' <https://CRAN.R-project.org/package=survey> package.
Maintained by Tobias Schoch. Last updated 3 months ago.
42.3 match 9 stars 6.16 score 5 scriptsindrajeetpatil
statsExpressions:Tidy Dataframes and Expressions with Statistical Details
Utilities for producing dataframes with rich details for the most common types of statistical approaches and tests: parametric, nonparametric, robust, and Bayesian t-test, one-way ANOVA, correlation analyses, contingency table analyses, and meta-analyses. The functions are pipe-friendly and provide a consistent syntax to work with tidy data. These dataframes additionally contain expressions with statistical details, and can be used in graphing packages. This package also forms the statistical processing backend for 'ggstatsplot'. References: Patil (2021) <doi:10.21105/joss.03236>.
Maintained by Indrajeet Patil. Last updated 20 days ago.
bayesian-inferencebayesian-statisticscontingency-tablecorrelationeffectsizemeta-analysisparametricrobustrobust-statisticsstatistical-detailsstatistical-teststidy
19.0 match 312 stars 10.97 score 146 scripts 2 dependentsr-forge
RobLox:Optimally Robust Influence Curves and Estimators for Location and Scale
Functions for the determination of optimally robust influence curves and estimators in case of normal location and/or scale (see Chapter 8 in Kohl (2005) <https://epub.uni-bayreuth.de/839/2/DissMKohl.pdf>).
Maintained by Matthias Kohl. Last updated 2 months ago.
43.7 match 4.40 score 70 scripts 1 dependentsaalfons
robmed:(Robust) Mediation Analysis
Perform mediation analysis via the fast-and-robust bootstrap test ROBMED (Alfons, Ates & Groenen, 2022a; <doi:10.1177/1094428121999096>), as well as various other methods. Details on the implementation and code examples can be found in Alfons, Ates, and Groenen (2022b) <doi:10.18637/jss.v103.i13>. Further discussion on robust mediation analysis can be found in Alfons & Schley (2024) <doi:10.31234/osf.io/2hqdy>.
Maintained by Andreas Alfons. Last updated 15 days ago.
29.1 match 6 stars 6.35 score 31 scripts 1 dependentseasystats
correlation:Methods for Correlation Analysis
Lightweight package for computing different kinds of correlations, such as partial correlations, Bayesian correlations, multilevel correlations, polychoric correlations, biweight correlations, distance correlations and more. Part of the 'easystats' ecosystem. References: Makowski et al. (2020) <doi:10.21105/joss.02306>.
Maintained by Brenton M. Wiernik. Last updated 12 days ago.
bayesianbayesian-correlationsbiserialcorcorrelationcorrelation-analysiscorrelationseasystatsgammagaussian-graphical-modelshacktoberfestmatrixmultilevel-correlationsoutlierspartialpartial-correlationsregressionrobustspearman
11.0 match 439 stars 14.23 score 672 scripts 10 dependentsr-forge
WRS2:A Collection of Robust Statistical Methods
A collection of robust statistical methods based on Wilcox' WRS functions. It implements robust t-tests (independent and dependent samples), robust ANOVA (including between-within subject designs), quantile ANOVA, robust correlation, robust mediation, and nonparametric ANCOVA models based on robust location measures.
Maintained by Patrick Mair. Last updated 3 months ago.
17.4 match 8.96 score 402 scripts 7 dependentsfbartos
RoBMA:Robust Bayesian Meta-Analyses
A framework for estimating ensembles of meta-analytic and meta-regression models (assuming either presence or absence of the effect, heterogeneity, publication bias, and moderators). The RoBMA framework uses Bayesian model-averaging to combine the competing meta-analytic models into a model ensemble, weights the posterior parameter distributions based on posterior model probabilities and uses Bayes factors to test for the presence or absence of the individual components (e.g., effect vs. no effect; Bartoš et al., 2022, <doi:10.1002/jrsm.1594>; Maier, Bartoš & Wagenmakers, 2022, <doi:10.1037/met0000405>). Users can define a wide range of prior distributions for + the effect size, heterogeneity, publication bias (including selection models and PET-PEESE), and moderator components. The package provides convenient functions for summary, visualizations, and fit diagnostics.
Maintained by František Bartoš. Last updated 1 months ago.
meta-analysismodel-averagingpublication-biasjagsopenblascpp
22.1 match 9 stars 6.97 score 53 scriptszhuwang46
mpath:Regularized Linear Models
Algorithms compute robust estimators for loss functions in the concave convex (CC) family by the iteratively reweighted convex optimization (IRCO), an extension of the iteratively reweighted least squares (IRLS). The IRCO reduces the weight of the observation that leads to a large loss; it also provides weights to help identify outliers. Applications include robust (penalized) generalized linear models and robust support vector machines. The package also contains penalized Poisson, negative binomial, zero-inflated Poisson, zero-inflated negative binomial regression models and robust models with non-convex loss functions. Wang et al. (2014) <doi:10.1002/sim.6314>, Wang et al. (2015) <doi:10.1002/bimj.201400143>, Wang et al. (2016) <doi:10.1177/0962280214530608>, Wang (2021) <doi:10.1007/s11749-021-00770-2>, Wang (2020) <arXiv:2010.02848>.
Maintained by Zhu Wang. Last updated 3 years ago.
22.8 match 1 stars 6.67 score 131 scripts 4 dependentswviechtb
metafor:Meta-Analysis Package for R
A comprehensive collection of functions for conducting meta-analyses in R. The package includes functions to calculate various effect sizes or outcome measures, fit equal-, fixed-, random-, and mixed-effects models to such data, carry out moderator and meta-regression analyses, and create various types of meta-analytical plots (e.g., forest, funnel, radial, L'Abbe, Baujat, bubble, and GOSH plots). For meta-analyses of binomial and person-time data, the package also provides functions that implement specialized methods, including the Mantel-Haenszel method, Peto's method, and a variety of suitable generalized linear (mixed-effects) models (i.e., mixed-effects logistic and Poisson regression models). Finally, the package provides functionality for fitting meta-analytic multivariate/multilevel models that account for non-independent sampling errors and/or true effects (e.g., due to the inclusion of multiple treatment studies, multiple endpoints, or other forms of clustering). Network meta-analyses and meta-analyses accounting for known correlation structures (e.g., due to phylogenetic relatedness) can also be conducted. An introduction to the package can be found in Viechtbauer (2010) <doi:10.18637/jss.v036.i03>.
Maintained by Wolfgang Viechtbauer. Last updated 20 hours ago.
meta-analysismixed-effectsmultilevel-modelsmultivariate
9.0 match 246 stars 16.30 score 4.9k scripts 92 dependentseasystats
parameters:Processing of Model Parameters
Utilities for processing the parameters of various statistical models. Beyond computing p values, CIs, and other indices for a wide variety of models (see list of supported models using the function 'insight::supported_models()'), this package implements features like bootstrapping or simulating of parameters and models, feature reduction (feature extraction and variable selection) as well as functions to describe data and variable characteristics (e.g. skewness, kurtosis, smoothness or distribution).
Maintained by Daniel Lüdecke. Last updated 2 days ago.
betabootstrapciconfidence-intervalsdata-reductioneasystatsfafeature-extractionfeature-reductionhacktoberfestparameterspcapvaluesregression-modelsrobust-statisticsstandardizestandardized-estimatesstatistical-models
9.3 match 453 stars 15.65 score 1.8k scripts 56 dependentsbraverock
PerformanceAnalytics:Econometric Tools for Performance and Risk Analysis
Collection of econometric functions for performance and risk analysis. In addition to standard risk and performance metrics, this package aims to aid practitioners and researchers in utilizing the latest research in analysis of non-normal return streams. In general, it is most tested on return (rather than price) data on a regular scale, but most functions will work with irregular return data as well, and increasing numbers of functions will work with P&L or price data where possible.
Maintained by Brian G. Peterson. Last updated 3 months ago.
8.5 match 222 stars 15.93 score 4.8k scripts 20 dependentspedrohcgs
DRDID:Doubly Robust Difference-in-Differences Estimators
Implements the locally efficient doubly robust difference-in-differences (DiD) estimators for the average treatment effect proposed by Sant'Anna and Zhao (2020) <doi:10.1016/j.jeconom.2020.06.003>. The estimator combines inverse probability weighting and outcome regression estimators (also implemented in the package) to form estimators with more attractive statistical properties. Two different estimation methods can be used to estimate the nuisance functions.
Maintained by Pedro H. C. SantAnna. Last updated 5 months ago.
15.3 match 92 stars 8.88 score 133 scripts 5 dependentsvalentint
pcaPP:Robust PCA by Projection Pursuit
Provides functions for robust PCA by projection pursuit. The methods are described in Croux et al. (2006) <doi:10.2139/ssrn.968376>, Croux et al. (2013) <doi:10.1080/00401706.2012.727746>, Todorov and Filzmoser (2013) <doi:10.1007/978-3-642-33042-1_31>.
Maintained by Valentin Todorov. Last updated 7 months ago.
12.4 match 1 stars 10.56 score 186 scripts 351 dependentskkholst
mets:Analysis of Multivariate Event Times
Implementation of various statistical models for multivariate event history data <doi:10.1007/s10985-013-9244-x>. Including multivariate cumulative incidence models <doi:10.1002/sim.6016>, and bivariate random effects probit models (Liability models) <doi:10.1016/j.csda.2015.01.014>. Modern methods for survival analysis, including regression modelling (Cox, Fine-Gray, Ghosh-Lin, Binomial regression) with fast computation of influence functions.
Maintained by Klaus K. Holst. Last updated 2 days ago.
multivariate-time-to-eventsurvival-analysistime-to-eventfortranopenblascpp
9.6 match 14 stars 13.47 score 236 scripts 42 dependentsbsaul
geex:An API for M-Estimation
Provides a general, flexible framework for estimating parameters and empirical sandwich variance estimator from a set of unbiased estimating equations (i.e., M-estimation in the vein of Stefanski & Boos (2002) <doi:10.1198/000313002753631330>). All examples from Stefanski & Boos (2002) are published in the corresponding Journal of Statistical Software paper "The Calculus of M-Estimation in R with geex" by Saul & Hudgens (2020) <doi:10.18637/jss.v092.i02>. Also provides an API to compute finite-sample variance corrections.
Maintained by Bradley Saul. Last updated 10 months ago.
asymptoticscovariance-estimatescovariance-estimationestimate-parametersestimating-equationsestimationinferencem-estimationrobustsandwich
16.5 match 8 stars 7.70 score 131 scripts 2 dependentsaalfons
robustHD:Robust Methods for High-Dimensional Data
Robust methods for high-dimensional data, in particular linear model selection techniques based on least angle regression and sparse regression. Specifically, the package implements robust least angle regression (Khan, Van Aelst & Zamar, 2007; <doi:10.1198/016214507000000950>), (robust) groupwise least angle regression (Alfons, Croux & Gelper, 2016; <doi:10.1016/j.csda.2015.02.007>), and sparse least trimmed squares regression (Alfons, Croux & Gelper, 2013; <doi:10.1214/12-AOAS575>).
Maintained by Andreas Alfons. Last updated 9 months ago.
16.4 match 10 stars 7.06 score 174 scripts 8 dependentschabert-liddell
robber:Using Block Model to Estimate the Robustness of Ecological Network
Implementation of a variety of methods to compute the robustness of ecological interaction networks with binary interactions as described in <doi:10.1002/env.2709>. In particular, using the Stochastic Block Model and its bipartite counterpart, the Latent Block Model to put a parametric model on the network, allows the comparison of the robustness of networks differing in species richness and number of interactions. It also deals with networks that are partially sampled and/or with missing values.
Maintained by Saint-Clair Chabert-Liddell. Last updated 1 years ago.
ecological-networkrobberrobustness
30.9 match 1 stars 3.70 score 4 scriptsr-forge
RobAStBase:Robust Asymptotic Statistics
Base S4-classes and functions for robust asymptotic statistics.
Maintained by Matthias Kohl. Last updated 2 months ago.
20.8 match 4.96 score 64 scripts 4 dependentsaalfons
ccaPP:(Robust) Canonical Correlation Analysis via Projection Pursuit
Canonical correlation analysis and maximum correlation via projection pursuit, as well as fast implementations of correlation estimators, with a focus on robust and nonparametric methods.
Maintained by Andreas Alfons. Last updated 6 months ago.
18.5 match 2 stars 5.58 score 27 scripts 3 dependentsasheshrambachan
HonestDiD:Robust Inference in Difference-in-Differences and Event Study Designs
Provides functions to conduct robust inference in difference-in-differences and event study designs by implementing the methods developed in Rambachan & Roth (2023) <doi:10.1093/restud/rdad018>, "A More Credible Approach to Parallel Trends" [Previously titled "An Honest Approach..."]. Inference is conducted under a weaker version of the parallel trends assumption. Uniformly valid confidence sets are constructed based upon conditional confidence sets, fixed-length confidence sets and hybridized confidence sets.
Maintained by Ashesh Rambachan. Last updated 17 days ago.
difference-in-differencesevent-studiesrobust-inference
14.0 match 195 stars 7.11 score 63 scriptscran
fBasics:Rmetrics - Markets and Basic Statistics
Provides a collection of functions to explore and to investigate basic properties of financial returns and related quantities. The covered fields include techniques of explorative data analysis and the investigation of distributional properties, including parameter estimation and hypothesis testing. Even more there are several utility functions for data handling and management.
Maintained by Georgi N. Boshnakov. Last updated 7 months ago.
13.9 match 2 stars 7.11 score 129 dependentssmartdata-analysis-and-statistics
precmed:Precision Medicine
A doubly robust precision medicine approach to fit, cross-validate and visualize prediction models for the conditional average treatment effect (CATE). It implements doubly robust estimation and semiparametric modeling approach of treatment-covariate interactions as proposed by Yadlowsky et al. (2020) <doi:10.1080/01621459.2020.1772080>.
Maintained by Thomas Debray. Last updated 5 months ago.
22.6 match 4 stars 4.20 score 4 scriptsdakep
pense:Penalized Elastic Net S/MM-Estimator of Regression
Robust penalized (adaptive) elastic net S and M estimators for linear regression. The methods are proposed in Cohen Freue, G. V., Kepplinger, D., Salibián-Barrera, M., and Smucler, E. (2019) <https://projecteuclid.org/euclid.aoas/1574910036>. The package implements the extensions and algorithms described in Kepplinger, D. (2020) <doi:10.14288/1.0392915>.
Maintained by David Kepplinger. Last updated 8 months ago.
linear-regressionpenseregressionrobust-regresssionrobust-statisticsopenblascppopenmp
15.5 match 4 stars 6.06 score 48 scriptskollerma
robustlmm:Robust Linear Mixed Effects Models
Implements the Robust Scoring Equations estimator to fit linear mixed effects models robustly. Robustness is achieved by modification of the scoring equations combined with the Design Adaptive Scale approach.
Maintained by Manuel Koller. Last updated 1 years ago.
10.3 match 28 stars 8.79 score 138 scriptsbioc
preprocessCore:A collection of pre-processing functions
A library of core preprocessing routines.
Maintained by Ben Bolstad. Last updated 5 months ago.
7.5 match 19 stars 12.03 score 1.8k scripts 204 dependentscoffeemuggler
EMMAgeo:End-Member Modelling of Grain-Size Data
End-member modelling analysis of grain-size data is an approach to unmix a data set's underlying distributions and their contribution to the data set. EMMAgeo provides deterministic and robust protocols for that purpose.
Maintained by Michael Dietze. Last updated 5 years ago.
20.3 match 10 stars 4.13 score 27 scriptsbenkeser
drtmle:Doubly-Robust Nonparametric Estimation and Inference
Targeted minimum loss-based estimators of counterfactual means and causal effects that are doubly-robust with respect both to consistency and asymptotic normality (Benkeser et al (2017), <doi:10.1093/biomet/asx053>; MJ van der Laan (2014), <doi:10.1515/ijb-2012-0038>).
Maintained by David Benkeser. Last updated 2 years ago.
causal-inferenceensemble-learningiptwstatistical-inferencetmle
11.9 match 19 stars 6.89 score 90 scripts 1 dependentsfriendly
heplots:Visualizing Hypothesis Tests in Multivariate Linear Models
Provides HE plot and other functions for visualizing hypothesis tests in multivariate linear models. HE plots represent sums-of-squares-and-products matrices for linear hypotheses and for error using ellipses (in two dimensions) and ellipsoids (in three dimensions). The related 'candisc' package provides visualizations in a reduced-rank canonical discriminant space when there are more than a few response variables.
Maintained by Michael Friendly. Last updated 8 days ago.
linear-hypothesesmatricesmultivariate-linear-modelsplotrepeated-measure-designsvisualizing-hypothesis-tests
7.1 match 9 stars 11.49 score 1.1k scripts 7 dependentsstscl
gdverse:Analysis of Spatial Stratified Heterogeneity
Analyzing spatial factors and exploring spatial associations based on the concept of spatial stratified heterogeneity, while also taking into account local spatial dependencies, spatial interpretability, complex spatial interactions, and robust spatial stratification. Additionally, it supports the spatial stratified heterogeneity family established in academic literature.
Maintained by Wenbo Lv. Last updated 1 days ago.
geographical-detectorgeoinformaticsgeospatial-analysisspatial-statisticsspatial-stratified-heterogeneitycpp
8.8 match 32 stars 9.07 score 41 scripts 2 dependentsr-forge
ROptEst:Optimally Robust Estimation
R infrastructure for optimally robust estimation in general smoothly parameterized models using S4 classes and methods as described Kohl, M., Ruckdeschel, P., and Rieder, H. (2010), <doi:10.1007/s10260-010-0133-0>, and in Rieder, H., Kohl, M., and Ruckdeschel, P. (2008), <doi:10.1007/s10260-007-0047-7>.
Maintained by Matthias Kohl. Last updated 2 months ago.
18.5 match 4.26 score 50 scripts 1 dependentsvalentint
rrcovHD:Robust Multivariate Methods for High Dimensional Data
Robust multivariate methods for high dimensional data including outlier detection (Filzmoser and Todorov (2013) <doi:10.1016/j.ins.2012.10.017>), robust sparse PCA (Croux et al. (2013) <doi:10.1080/00401706.2012.727746>, Todorov and Filzmoser (2013) <doi:10.1007/978-3-642-33042-1_31>), robust PLS (Todorov and Filzmoser (2014) <doi:10.17713/ajs.v43i4.44>), and robust sparse classification (Ortner et al. (2020) <doi:10.1007/s10618-019-00666-8>).
Maintained by Valentin Todorov. Last updated 7 months ago.
23.2 match 3.39 score 49 scriptsvalentint
tclust:Robust Trimmed Clustering
Provides functions for robust trimmed clustering. The methods are described in Garcia-Escudero (2008) <doi:10.1214/07-AOS515>, Fritz et al. (2012) <doi:10.18637/jss.v047.i12>, Garcia-Escudero et al. (2011) <doi:10.1007/s11222-010-9194-z> and others.
Maintained by Valentin Todorov. Last updated 25 days ago.
9.5 match 3 stars 8.02 score 72 scripts 3 dependentsvlyubchich
lawstat:Tools for Biostatistics, Public Policy, and Law
Statistical tests widely utilized in biostatistics, public policy, and law. Along with the well-known tests for equality of means and variances, randomness, and measures of relative variability, the package contains new robust tests of symmetry, omnibus and directional tests of normality, and their graphical counterparts such as robust QQ plot, robust trend tests for variances, etc. All implemented tests and methods are illustrated by simulations and real-life examples from legal statistics, economics, and biostatistics.
Maintained by Yulia R. Gel. Last updated 2 years ago.
9.9 match 7.17 score 484 scripts 6 dependentsjlaake
RMark:R Code for Mark Analysis
An interface to the software package MARK that constructs input files for MARK and extracts the output. MARK was developed by Gary White and is freely available at <http://www.phidot.org/software/mark/downloads/> but is not open source.
Maintained by Jeff Laake. Last updated 3 years ago.
14.4 match 4.90 score 366 scripts 4 dependentstreynkens
rospca:Robust Sparse PCA using the ROSPCA Algorithm
Implementation of robust sparse PCA using the ROSPCA algorithm of Hubert et al. (2016) <DOI:10.1080/00401706.2015.1093962>.
Maintained by Tom Reynkens. Last updated 4 months ago.
14.8 match 13 stars 4.77 score 45 scriptspaulnorthrop
chandwich:Chandler-Bate Sandwich Loglikelihood Adjustment
Performs adjustments of a user-supplied independence loglikelihood function using a robust sandwich estimator of the parameter covariance matrix, based on the methodology in Chandler and Bate (2007) <doi:10.1093/biomet/asm015>. This can be used for cluster correlated data when interest lies in the parameters of the marginal distributions or for performing inferences that are robust to certain types of model misspecification. Functions for profiling the adjusted loglikelihoods are also provided, as are functions for calculating and plotting confidence intervals, for single model parameters, and confidence regions, for pairs of model parameters. Nested models can be compared using an adjusted likelihood ratio test.
Maintained by Paul J. Northrop. Last updated 2 years ago.
clustered-dataclusterscomposite-likelihoodindependence-loglikelihoodmlerobustsandwichstatistical-inference
11.8 match 4 stars 5.88 score 18 scripts 7 dependentskkholst
targeted:Targeted Inference
Various methods for targeted and semiparametric inference including augmented inverse probability weighted (AIPW) estimators for missing data and causal inference (Bang and Robins (2005) <doi:10.1111/j.1541-0420.2005.00377.x>), variable importance and conditional average treatment effects (CATE) (van der Laan (2006) <doi:10.2202/1557-4679.1008>), estimators for risk differences and relative risks (Richardson et al. (2017) <doi:10.1080/01621459.2016.1192546>), assumption lean inference for generalized linear model parameters (Vansteelandt et al. (2022) <doi:10.1111/rssb.12504>).
Maintained by Klaus K. Holst. Last updated 1 months ago.
causal-inferencedouble-robustestimationsemiparametric-estimationstatisticsopenblascppopenmp
9.2 match 11 stars 7.20 score 30 scripts 1 dependentsbenkeser
drord:Doubly-Robust Estimators for Ordinal Outcomes
Efficient covariate-adjusted estimators of quantities that are useful for establishing the effects of treatments on ordinal outcomes (Benkeser, Diaz, Luedtke 2020 <doi:10.1111/biom.13377>)
Maintained by David Benkeser. Last updated 4 years ago.
causal-inferencecovid-19double-robustmann-whitneyordinal-regression
15.0 match 4 stars 4.38 score 12 scriptsfilzmoserp
chemometrics:Multivariate Statistical Analysis in Chemometrics
R companion to the book "Introduction to Multivariate Statistical Analysis in Chemometrics" written by K. Varmuza and P. Filzmoser (2009).
Maintained by Peter Filzmoser. Last updated 2 years ago.
9.6 match 4 stars 6.72 score 213 scripts 4 dependentseasystats
insight:Easy Access to Model Information for Various Model Objects
A tool to provide an easy, intuitive and consistent access to information contained in various R models, like model formulas, model terms, information about random effects, data that was used to fit the model or data from response variables. 'insight' mainly revolves around two types of functions: Functions that find (the names of) information, starting with 'find_', and functions that get the underlying data, starting with 'get_'. The package has a consistent syntax and works with many different model objects, where otherwise functions to access these information are missing.
Maintained by Daniel Lüdecke. Last updated 4 days ago.
easystatshacktoberfestinsightmodelsnamespredictorsrandom
3.7 match 412 stars 17.24 score 568 scripts 210 dependentsjapal
MALDIrppa:MALDI Mass Spectrometry Data Robust Pre-Processing and Analysis
Provides methods for quality control and robust pre-processing and analysis of MALDI mass spectrometry data (Palarea-Albaladejo et al. (2018) <doi:10.1093/bioinformatics/btx628>).
Maintained by Javier Palarea-Albaladejo. Last updated 1 years ago.
mass-spectrometrypre-processing
10.5 match 2 stars 6.06 score 32 scripts 1 dependentsbioc
affy:Methods for Affymetrix Oligonucleotide Arrays
The package contains functions for exploratory oligonucleotide array analysis. The dependence on tkWidgets only concerns few convenience functions. 'affy' is fully functional without it.
Maintained by Robert D. Shear. Last updated 2 months ago.
microarrayonechannelpreprocessing
5.6 match 11.12 score 2.5k scripts 98 dependentsasa12138
MetaNet:Network Analysis for Omics Data
Comprehensive network analysis package. Calculate correlation network fastly, accelerate lots of analysis by parallel computing. Support for multi-omics data, search sub-nets fluently. Handle bigger data, more than 10,000 nodes in each omics. Offer various layout method for multi-omics network and some interfaces to other software ('Gephi', 'Cytoscape', 'ggplot2'), easy to visualize. Provide comprehensive topology indexes calculation, including ecological network stability.
Maintained by Chen Peng. Last updated 11 days ago.
dataimportnetwork analysisomicssoftwarevisualization
11.3 match 13 stars 5.51 score 9 scriptsjeroen
jsonlite:A Simple and Robust JSON Parser and Generator for R
A reasonably fast JSON parser and generator, optimized for statistical data and the web. Offers simple, flexible tools for working with JSON in R, and is particularly powerful for building pipelines and interacting with a web API. The implementation is based on the mapping described in the vignette (Ooms, 2014). In addition to converting JSON data from/to R objects, 'jsonlite' contains functions to stream, validate, and prettify JSON data. The unit tests included with the package verify that all edge cases are encoded and decoded consistently for use with dynamic data in systems and applications.
Maintained by Jeroen Ooms. Last updated 22 days ago.
2.9 match 384 stars 21.15 score 27k scripts 8.6k dependentssmac-group
simts:Time Series Analysis Tools
A system contains easy-to-use tools as a support for time series analysis courses. In particular, it incorporates a technique called Generalized Method of Wavelet Moments (GMWM) as well as its robust implementation for fast and robust parameter estimation of time series models which is described, for example, in Guerrier et al. (2013) <doi: 10.1080/01621459.2013.799920>. More details can also be found in the paper linked to via the URL below.
Maintained by Stéphane Guerrier. Last updated 2 years ago.
rcpprcpparmadillosimulationtime-seriestimeseriestimeseries-dataopenblascpp
7.9 match 15 stars 7.68 score 59 scripts 4 dependentsjorischau
gslnls:GSL Multi-Start Nonlinear Least-Squares Fitting
An R interface to weighted nonlinear least-squares optimization with the GNU Scientific Library (GSL), see M. Galassi et al. (2009, ISBN:0954612078). The available trust region methods include the Levenberg-Marquardt algorithm with and without geodesic acceleration, the Steihaug-Toint conjugate gradient algorithm for large systems and several variants of Powell's dogleg algorithm. Multi-start optimization based on quasi-random samples is implemented using a modified version of the algorithm in Hickernell and Yuan (1997, OR Transactions). Robust nonlinear regression can be performed using various robust loss functions, in which case the optimization problem is solved by iterative reweighted least squares (IRLS). Bindings are provided to tune a number of parameters affecting the low-level aspects of the trust region algorithms. The interface mimics R's nls() function and returns model objects inheriting from the same class.
Maintained by Joris Chau. Last updated 2 months ago.
gnu-scientific-librarygsllevenberg-marquardtmulti-startnonlinear-least-squaresnonlinear-regressionrobust-regresssionfortranglibc
10.1 match 15 stars 6.03 score 35 scripts 1 dependentscore-bioinformatics
ClustAssess:Tools for Assessing Clustering
A set of tools for evaluating clustering robustness using proportion of ambiguously clustered pairs (Senbabaoglu et al. (2014) <doi:10.1038/srep06207>), as well as similarity across methods and method stability using element-centric clustering comparison (Gates et al. (2019) <doi:10.1038/s41598-019-44892-y>). Additionally, this package enables stability-based parameter assessment for graph-based clustering pipelines typical in single-cell data analysis.
Maintained by Andi Munteanu. Last updated 1 months ago.
softwaresinglecellrnaseqatacseqnormalizationpreprocessingdimensionreductionvisualizationqualitycontrolclusteringclassificationannotationgeneexpressiondifferentialexpressionbioinformaticsgenomicsmachine-learningparameter-optimizationrobustnesssingle-cellunsupervised-learningcpp
10.5 match 23 stars 5.70 score 18 scriptskosukeimai
MatchIt:Nonparametric Preprocessing for Parametric Causal Inference
Selects matched samples of the original treated and control groups with similar covariate distributions -- can be used to match exactly on covariates, to match on propensity scores, or perform a variety of other matching procedures. The package also implements a series of recommendations offered in Ho, Imai, King, and Stuart (2007) <DOI:10.1093/pan/mpl013>. (The 'gurobi' package, which is not on CRAN, is optional and comes with an installation of the Gurobi Optimizer, available at <https://www.gurobi.com>.)
Maintained by Noah Greifer. Last updated 1 days ago.
4.0 match 220 stars 15.03 score 2.4k scripts 21 dependentscran
grf:Generalized Random Forests
Forest-based statistical estimation and inference. GRF provides non-parametric methods for heterogeneous treatment effects estimation (optionally using right-censored outcomes, multiple treatment arms or outcomes, or instrumental variables), as well as least-squares regression, quantile regression, and survival regression, all with support for missing covariates.
Maintained by Erik Sverdrup. Last updated 4 months ago.
10.2 match 5.83 score 1.2k scripts 14 dependentsnt-williams
lmtp:Non-Parametric Causal Effects of Feasible Interventions Based on Modified Treatment Policies
Non-parametric estimators for casual effects based on longitudinal modified treatment policies as described in Diaz, Williams, Hoffman, and Schenck <doi:10.1080/01621459.2021.1955691>, traditional point treatment, and traditional longitudinal effects. Continuous, binary, categorical treatments, and multivariate treatments are allowed as well are censored outcomes. The treatment mechanism is estimated via a density ratio classification procedure irrespective of treatment variable type. For both continuous and binary outcomes, additive treatment effects can be calculated and relative risks and odds ratios may be calculated for binary outcomes. Supports survival outcomes with competing risks (Diaz, Hoffman, and Hejazi; <doi:10.1007/s10985-023-09606-7>).
Maintained by Nicholas Williams. Last updated 8 days ago.
causal-inferencecensored-datalongitudinal-datamachine-learningmodified-treatment-policynonparametric-statisticsprecision-medicinerobust-statisticsstatisticsstochastic-interventionssurvival-analysistargeted-learning
9.3 match 64 stars 6.37 score 91 scriptsyqzhong7
AIPW:Augmented Inverse Probability Weighting
The 'AIPW' package implements the augmented inverse probability weighting, a doubly robust estimator, for average causal effect estimation with user-defined stacked machine learning algorithms. To cite the 'AIPW' package, please use: "Yongqi Zhong, Edward H. Kennedy, Lisa M. Bodnar, Ashley I. Naimi (2021). AIPW: An R Package for Augmented Inverse Probability Weighted Estimation of Average Causal Effects. American Journal of Epidemiology. doi: 10.1093/aje/kwab207". Visit: <https://yqzhong7.github.io/AIPW/> for more information.
Maintained by Yongqi Zhong. Last updated 6 months ago.
causal-inferencemachine-learningrobust-estimators
8.0 match 24 stars 7.35 score 31 scripts 1 dependentsstatimagcoll
RESI:Robust Effect Size Index (RESI) Estimation
Summarize model output using a robust effect size index. The index is introduced in Vandekar, Tao, & Blume (2020) <doi:10.1007/s11336-020-09698-2>.
Maintained by Megan Jones. Last updated 13 days ago.
13.5 match 4.30 score 20 scriptsr-forge
sandwich:Robust Covariance Matrix Estimators
Object-oriented software for model-robust covariance matrix estimators. Starting out from the basic robust Eicker-Huber-White sandwich covariance methods include: heteroscedasticity-consistent (HC) covariances for cross-section data; heteroscedasticity- and autocorrelation-consistent (HAC) covariances for time series data (such as Andrews' kernel HAC, Newey-West, and WEAVE estimators); clustered covariances (one-way and multi-way); panel and panel-corrected covariances; outer-product-of-gradients covariances; and (clustered) bootstrap covariances. All methods are applicable to (generalized) linear model objects fitted by lm() and glm() but can also be adapted to other classes through S3 methods. Details can be found in Zeileis et al. (2020) <doi:10.18637/jss.v095.i01>, Zeileis (2004) <doi:10.18637/jss.v011.i10> and Zeileis (2006) <doi:10.18637/jss.v016.i09>.
Maintained by Achim Zeileis. Last updated 2 months ago.
3.9 match 14.92 score 11k scripts 887 dependentscran
MASS:Support Functions and Datasets for Venables and Ripley's MASS
Functions and datasets to support Venables and Ripley, "Modern Applied Statistics with S" (4th edition, 2002).
Maintained by Brian Ripley. Last updated 16 days ago.
5.4 match 19 stars 10.53 score 11k dependentsappliedstat
rQCC:Robust Quality Control Chart
Constructs various robust quality control charts based on the median or Hodges-Lehmann estimator (location) and the median absolute deviation (MAD) or Shamos estimator (scale). The estimators used for the robust control charts are all unbiased with a sample of finite size. For more details, see Park, Kim and Wang (2022) <doi:10.1080/03610918.2019.1699114>. In addition, using this R package, the conventional quality control charts such as X-bar, S, R, p, np, u, c, g, h, and t charts are also easily constructed. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1A2C1091319).
Maintained by Chanseok Park. Last updated 1 years ago.
control-chartgoodness-of-fitr-languageweibull
12.0 match 2 stars 4.70 score 3 scriptszackfisher
robumeta:Robust Variance Meta-Regression
Functions for conducting robust variance estimation (RVE) meta-regression using both large and small sample RVE estimators under various weighting schemes. These methods are distribution free and provide valid point estimates, standard errors and hypothesis tests even when the degree and structure of dependence between effect sizes is unknown. Also included are functions for conducting sensitivity analyses under correlated effects weighting and producing RVE-based forest plots.
Maintained by Zachary Fisher. Last updated 4 years ago.
7.1 match 8 stars 7.75 score 178 scripts 4 dependentsbiometry
bipartite:Visualising Bipartite Networks and Calculating Some (Ecological) Indices
Functions to visualise webs and calculate a series of indices commonly used to describe pattern in (ecological) webs. It focuses on webs consisting of only two levels (bipartite), e.g. pollination webs or predator-prey-webs. Visualisation is important to get an idea of what we are actually looking at, while the indices summarise different aspects of the web's topology.
Maintained by Carsten F. Dormann. Last updated 6 days ago.
5.0 match 37 stars 10.93 score 592 scripts 15 dependentsmerliseclyde
BAS:Bayesian Variable Selection and Model Averaging using Bayesian Adaptive Sampling
Package for Bayesian Variable Selection and Model Averaging in linear models and generalized linear models using stochastic or deterministic sampling without replacement from posterior distributions. Prior distributions on coefficients are from Zellner's g-prior or mixtures of g-priors corresponding to the Zellner-Siow Cauchy Priors or the mixture of g-priors from Liang et al (2008) <DOI:10.1198/016214507000001337> for linear models or mixtures of g-priors from Li and Clyde (2019) <DOI:10.1080/01621459.2018.1469992> in generalized linear models. Other model selection criteria include AIC, BIC and Empirical Bayes estimates of g. Sampling probabilities may be updated based on the sampled models using sampling w/out replacement or an efficient MCMC algorithm which samples models using a tree structure of the model space as an efficient hash table. See Clyde, Ghosh and Littman (2010) <DOI:10.1198/jcgs.2010.09049> for details on the sampling algorithms. Uniform priors over all models or beta-binomial prior distributions on model size are allowed, and for large p truncated priors on the model space may be used to enforce sampling models that are full rank. The user may force variables to always be included in addition to imposing constraints that higher order interactions are included only if their parents are included in the model. This material is based upon work supported by the National Science Foundation under Division of Mathematical Sciences grant 1106891. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
Maintained by Merlise Clyde. Last updated 4 months ago.
bayesianbayesian-inferencegeneralized-linear-modelslinear-regressionlogistic-regressionmcmcmodel-selectionpoisson-regressionpredictive-modelingregressionvariable-selectionfortranopenblas
5.1 match 44 stars 10.81 score 420 scripts 3 dependentsthie1e
cutpointr:Determine and Evaluate Optimal Cutpoints in Binary Classification Tasks
Estimate cutpoints that optimize a specified metric in binary classification tasks and validate performance using bootstrapping. Some methods for more robust cutpoint estimation are supported, e.g. a parametric method assuming normal distributions, bootstrapped cutpoints, and smoothing of the metric values per cutpoint using Generalized Additive Models. Various plotting functions are included. For an overview of the package see Thiele and Hirschfeld (2021) <doi:10.18637/jss.v098.i11>.
Maintained by Christian Thiele. Last updated 3 months ago.
bootstrappingcutpoint-optimizationroc-curvecpp
5.1 match 88 stars 10.44 score 322 scripts 1 dependentsfinyang
RRRR:Online Robust Reduced-Rank Regression Estimation
Methods for estimating online robust reduced-rank regression. The Gaussian maximum likelihood estimation method is described in Johansen, S. (1991) <doi:10.2307/2938278>. The majorisation-minimisation estimation method is partly described in Zhao, Z., & Palomar, D. P. (2017) <doi:10.1109/GlobalSIP.2017.8309093>. The description of the generic stochastic successive upper-bound minimisation method and the sample average approximation can be found in Razaviyayn, M., Sanjabi, M., & Luo, Z. Q. (2016) <doi:10.1007/s10107-016-1021-7>.
Maintained by Yangzhuoran Fin Yang. Last updated 2 years ago.
12.7 match 3 stars 4.18 score 10 scriptsbioc
RolDE:RolDE: Robust longitudinal Differential Expression
RolDE detects longitudinal differential expression between two conditions in noisy high-troughput data. Suitable even for data with a moderate amount of missing values.RolDE is a composite method, consisting of three independent modules with different approaches to detecting longitudinal differential expression. The combination of these diverse modules allows RolDE to robustly detect varying differences in longitudinal trends and expression levels in diverse data types and experimental settings.
Maintained by Medical Bioinformatics Centre. Last updated 5 months ago.
statisticalmethodsoftwaretimecourseregressionproteomicsdifferentialexpression
10.1 match 5 stars 5.18 score 1 scriptsvalentint
rrcovNA:Scalable Robust Estimators with High Breakdown Point for Incomplete Data
Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point for Incomplete Data (missing values) (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>).
Maintained by Valentin Todorov. Last updated 3 months ago.
13.9 match 1 stars 3.77 score 59 scriptsthinkr-open
golem:A Framework for Robust Shiny Applications
An opinionated framework for building a production-ready 'Shiny' application. This package contains a series of tools for building a robust 'Shiny' application from start to finish.
Maintained by Colin Fay. Last updated 7 months ago.
golemversehacktoberfestshinyshiny-appsshiny-rshinyapps
3.7 match 921 stars 14.23 score 167 scripts 62 dependentsctruciosm
RobGARCHBoot:Robust Bootstrap Forecast Densities for GARCH Models
Bootstrap forecast densities for GARCH (Generalized Autoregressive Conditional Heteroskedastic) returns and volatilities using the robust residual-based bootstrap procedure of Trucios, Hotta and Ruiz (2017) <DOI:10.1080/00949655.2017.1359601>.
Maintained by Carlos Trucios. Last updated 4 years ago.
16.3 match 3 stars 3.18 score 1 scriptstobiasschoch
wbacon:Weighted BACON Algorithms
The BACON algorithms are methods for multivariate outlier nomination (detection) and robust linear regression by Billor, Hadi, and Velleman (2000) <doi:10.1016/S0167-9473(99)00101-2>. The extension to weighted problems is due to Beguin and Hulliger (2008) <https://www150.statcan.gc.ca/n1/en/catalogue/12-001-X200800110616>; see also <doi:10.21105/joss.03238>.
Maintained by Tobias Schoch. Last updated 6 months ago.
outlieroutlier-detectionrobust-regressionstatisticsopenblasopenmp
12.6 match 2 stars 4.00 score 8 scriptspaulnorthrop
lax:Loglikelihood Adjustment for Extreme Value Models
Performs adjusted inferences based on model objects fitted, using maximum likelihood estimation, by the extreme value analysis packages 'eva' <https://cran.r-project.org/package=eva>, 'evd' <https://cran.r-project.org/package=evd>, 'evir' <https://cran.r-project.org/package=evir>, 'extRemes' <https://cran.r-project.org/package=extRemes>, 'fExtremes' <https://cran.r-project.org/package=fExtremes>, 'ismev' <https://cran.r-project.org/package=ismev>, 'mev' <https://cran.r-project.org/package=mev>, 'POT' <https://cran.r-project.org/package=POT> and 'texmex' <https://cran.r-project.org/package=texmex>. Adjusted standard errors and an adjusted loglikelihood are provided, using the 'chandwich' package <https://cran.r-project.org/package=chandwich> and the object-oriented features of the 'sandwich' package <https://cran.r-project.org/package=sandwich>. The adjustment is based on a robust sandwich estimator of the parameter covariance matrix, based on the methodology in Chandler and Bate (2007) <doi:10.1093/biomet/asm015>. This can be used for cluster correlated data when interest lies in the parameters of the marginal distributions, or for performing inferences that are robust to certain types of model misspecification. Univariate extreme value models, including regression models, are supported.
Maintained by Paul J. Northrop. Last updated 1 years ago.
clustered-dataclusterscomposite-likelihoodevdextreme-value-analysisextreme-value-statisticsextremesindependence-loglikelihoodloglikelihood-adjustmentmlepotregressionregression-modellingrobustsandwichsandwich-estimator
11.8 match 3 stars 4.29 score 13 scriptsconvexfi
fitHeavyTail:Mean and Covariance Matrix Estimation under Heavy Tails
Robust estimation methods for the mean vector, scatter matrix, and covariance matrix (if it exists) from data (possibly containing NAs) under multivariate heavy-tailed distributions such as angular Gaussian (via Tyler's method), Cauchy, and Student's t distributions. Additionally, a factor model structure can be specified for the covariance matrix. The latest revision also includes the multivariate skewed t distribution. The package is based on the papers: Sun, Babu, and Palomar (2014); Sun, Babu, and Palomar (2015); Liu and Rubin (1995); Zhou, Liu, Kumar, and Palomar (2019); Pascal, Ollila, and Palomar (2021).
Maintained by Daniel P. Palomar. Last updated 2 years ago.
cauchycovariance-estimationcovariance-matrixheavy-tailed-distributionsoutliersrobust-estimationstudent-ttyler
8.0 match 22 stars 6.27 score 28 scripts 1 dependentsbioc
msqrob2:Robust statistical inference for quantitative LC-MS proteomics
msqrob2 provides a robust linear mixed model framework for assessing differential abundance in MS-based Quantitative proteomics experiments. Our workflows can start from raw peptide intensities or summarised protein expression values. The model parameter estimates can be stabilized by ridge regression, empirical Bayes variance estimation and robust M-estimation. msqrob2's hurde workflow can handle missing data without having to rely on hard-to-verify imputation assumptions, and, outcompetes state-of-the-art methods with and without imputation for both high and low missingness. It builds on QFeature infrastructure for quantitative mass spectrometry data to store the model results together with the raw data and preprocessed data.
Maintained by Lieven Clement. Last updated 18 days ago.
proteomicsmassspectrometrydifferentialexpressionmultiplecomparisonregressionexperimentaldesignsoftwareimmunooncologynormalizationtimecoursepreprocessing
7.2 match 10 stars 6.94 score 83 scriptsstatdivlab
corncob:Count Regression for Correlated Observations with the Beta-Binomial
Statistical modeling for correlated count data using the beta-binomial distribution, described in Martin et al. (2020) <doi:10.1214/19-AOAS1283>. It allows for both mean and overdispersion covariates.
Maintained by Amy D Willis. Last updated 6 months ago.
5.1 match 105 stars 9.64 score 248 scripts 1 dependentsuncertaintyquantification
RobustGaSP:Robust Gaussian Stochastic Process Emulation
Robust parameter estimation and prediction of Gaussian stochastic process emulators. It allows for robust parameter estimation and prediction using Gaussian stochastic process emulator. It also implements the parallel partial Gaussian stochastic process emulator for computer model with massive outputs See the reference: Mengyang Gu and Jim Berger, 2016, Annals of Applied Statistics; Mengyang Gu, Xiaojing Wang and Jim Berger, 2018, Annals of Statistics.
Maintained by Mengyang Gu. Last updated 1 years ago.
20.5 match 2.35 score 75 scripts 1 dependentsmmaechler
sfsmisc:Utilities from 'Seminar fuer Statistik' ETH Zurich
Useful utilities ['goodies'] from Seminar fuer Statistik ETH Zurich, some of which were ported from S-plus in the 1990s. For graphics, have pretty (Log-scale) axes eaxis(), an enhanced Tukey-Anscombe plot, combining histogram and boxplot, 2d-residual plots, a 'tachoPlot()', pretty arrows, etc. For robustness, have a robust F test and robust range(). For system support, notably on Linux, provides 'Sys.*()' functions with more access to system and CPU information. Finally, miscellaneous utilities such as simple efficient prime numbers, integer codes, Duplicated(), toLatex.numeric() and is.whole().
Maintained by Martin Maechler. Last updated 5 months ago.
4.4 match 11 stars 10.87 score 566 scripts 119 dependentsbioc
limma:Linear Models for Microarray and Omics Data
Data analysis, linear models and differential expression for omics data.
Maintained by Gordon Smyth. Last updated 5 days ago.
exonarraygeneexpressiontranscriptionalternativesplicingdifferentialexpressiondifferentialsplicinggenesetenrichmentdataimportbayesianclusteringregressiontimecoursemicroarraymicrornaarraymrnamicroarrayonechannelproprietaryplatformstwochannelsequencingrnaseqbatcheffectmultiplecomparisonnormalizationpreprocessingqualitycontrolbiomedicalinformaticscellbiologycheminformaticsepigeneticsfunctionalgenomicsgeneticsimmunooncologymetabolomicsproteomicssystemsbiologytranscriptomics
3.4 match 13.81 score 16k scripts 585 dependentsphilchalmers
SimDesign:Structure for Organizing Monte Carlo Simulation Designs
Provides tools to safely and efficiently organize and execute Monte Carlo simulation experiments in R. The package controls the structure and back-end of Monte Carlo simulation experiments by utilizing a generate-analyse-summarise workflow. The workflow safeguards against common simulation coding issues, such as automatically re-simulating non-convergent results, prevents inadvertently overwriting simulation files, catches error and warning messages during execution, implicitly supports parallel processing with high-quality random number generation, and provides tools for managing high-performance computing (HPC) array jobs submitted to schedulers such as SLURM. For a pedagogical introduction to the package see Sigal and Chalmers (2016) <doi:10.1080/10691898.2016.1246953>. For a more in-depth overview of the package and its design philosophy see Chalmers and Adkins (2020) <doi:10.20982/tqmp.16.4.p248>.
Maintained by Phil Chalmers. Last updated 4 hours ago.
monte-carlo-simulationsimulationsimulation-framework
3.5 match 62 stars 13.36 score 253 scripts 46 dependentszdebruine
RcppML:Rcpp Machine Learning Library
Fast machine learning algorithms including matrix factorization and divisive clustering for large sparse and dense matrices.
Maintained by Zach DeBruine. Last updated 2 years ago.
clusteringmatrix-factorizationnmfrcpprcppeigensparse-matrixcppopenmp
4.5 match 104 stars 10.53 score 125 scripts 46 dependentskisungyou
Rdimtools:Dimension Reduction and Estimation Methods
We provide linear and nonlinear dimension reduction techniques. Intrinsic dimension estimation methods for exploratory analysis are also provided. For more details on the package, see the paper by You and Shung (2022) <doi:10.1016/j.simpa.2022.100414>.
Maintained by Kisung You. Last updated 2 years ago.
dimension-estimationdimension-reductionmanifold-learningsubspace-learningopenblascppopenmp
5.6 match 52 stars 8.37 score 186 scripts 8 dependentsunina-sfere
funcharts:Functional Control Charts
Provides functional control charts for statistical process monitoring of functional data, using the methods of Capezza et al. (2020) <doi:10.1002/asmb.2507>, Centofanti et al. (2021) <doi:10.1080/00401706.2020.1753581>, Capezza et al. (2024) <doi:10.1080/00401706.2024.2327346>, Capezza et al. (2024) <doi:10.1080/00224065.2024.2383674>, Centofanti et al. (2022) <doi:10.48550/arXiv.2205.06256>. The package is thoroughly illustrated in the paper of Capezza et al (2023) <doi:10.1080/00224065.2023.2219012>.
Maintained by Christian Capezza. Last updated 2 days ago.
7.0 match 2 stars 6.67 score 168 scriptsspkaluzny
robustarima:Robust ARIMA Modeling
Functions for fitting a linear regression model with ARIMA errors using a filtered tau-estimate. The methodology is described in Maronna et al (2017, ISBN:9781119214687).
Maintained by Stephen Kaluzny. Last updated 6 months ago.
arimarobust-statisticstime-series-analysisfortranopenblas
14.3 match 3.23 score 17 scriptss3alfisc
summclust:Module to Compute Influence and Leverage Statistics for Regression Models with Clustered Errors
Module to compute cluster specific information for regression models with clustered errors, including leverage and influence statistics. Models of type 'lm' and 'fixest'(from the 'stats' and 'fixest' packages) are supported. 'summclust' implements similar features as the user-written 'summclust.ado' Stata module (MacKinnon, Nielsen & Webb, 2022; <arXiv:2205.03288v1>).
Maintained by Alexander Fischer. Last updated 2 years ago.
clustered-standard-errorsfixestlinear-regressionrobust-inference
7.5 match 6 stars 6.16 score 53 scripts 3 dependentsjkurle
robust2sls:Outlier Robust Two-Stage Least Squares Inference and Testing
An implementation of easy tools for outlier robust inference in two-stage least squares (2SLS) models. The user specifies a reference distribution against which observations are classified as outliers or not. After removing the outliers, adjusted standard errors are automatically provided. Furthermore, several statistical tests for the false outlier detection rate can be calculated. The outlier removing algorithm can be iterated a fixed number of times or until the procedure converges. The algorithms and robust inference are described in more detail in Jiao (2019) <https://drive.google.com/file/d/1qPxDJnLlzLqdk94X9wwVASptf1MPpI2w/view>.
Maintained by Jonas Kurle. Last updated 2 years ago.
10.3 match 1 stars 4.43 score 18 scriptsdeclaredesign
estimatr:Fast Estimators for Design-Based Inference
Fast procedures for small set of commonly-used, design-appropriate estimators with robust standard errors and confidence intervals. Includes estimators for linear regression, instrumental variables regression, difference-in-means, Horvitz-Thompson estimation, and regression improving precision of experimental estimates by interacting treatment with centered pre-treatment covariates introduced by Lin (2013) <doi:10.1214/12-AOAS583>.
Maintained by Graeme Blair. Last updated 1 months ago.
3.9 match 133 stars 11.58 score 1.7k scripts 11 dependentsjohnnyzhz
rsem:Robust Structural Equation Modeling with Missing Data and Auxiliary Variables
A robust procedure is implemented to estimate means and covariance matrix of multiple variables with missing data using Huber weight and then to estimate a structural equation model.
Maintained by Zhiyong Zhang. Last updated 2 years ago.
15.7 match 2.89 score 13 scripts 2 dependentskbose28
FarmSelect:Factor Adjusted Robust Model Selection
Implements a consistent model selection strategy for high dimensional sparse regression when the covariate dependence can be reduced through factor models. By separating the latent factors from idiosyncratic components, the problem is transformed from model selection with highly correlated covariates to that with weakly correlated variables. It is appropriate for cases where we have many variables compared to the number of samples. Moreover, it implements a robust procedure to estimate distribution parameters wherever possible, hence being suitable for cases when the underlying distribution deviates from Gaussianity. See the paper on the 'FarmSelect' method, Fan et al.(2017) <arXiv:1612.08490>, for detailed description of methods and further references.
Maintained by Kaizheng Wang. Last updated 6 years ago.
10.0 match 7 stars 4.54 score 8 scriptsrstudio
shinytest2:Testing for Shiny Applications
Automated unit testing of Shiny applications through a headless 'Chromium' browser.
Maintained by Barret Schloerke. Last updated 1 months ago.
3.8 match 108 stars 12.08 score 704 scripts 1 dependentsbflammers
ANN2:Artificial Neural Networks for Anomaly Detection
Training of neural networks for classification and regression tasks using mini-batch gradient descent. Special features include a function for training autoencoders, which can be used to detect anomalies, and some related plotting functions. Multiple activation functions are supported, including tanh, relu, step and ramp. For the use of the step and ramp activation functions in detecting anomalies using autoencoders, see Hawkins et al. (2002) <doi:10.1007/3-540-46145-0_17>. Furthermore, several loss functions are supported, including robust ones such as Huber and pseudo-Huber loss, as well as L1 and L2 regularization. The possible options for optimization algorithms are RMSprop, Adam and SGD with momentum. The package contains a vectorized C++ implementation that facilitates fast training through mini-batch learning.
Maintained by Bart Lammers. Last updated 4 years ago.
anomaly-detectionartificial-neural-networksautoencodersneural-networksrobust-statisticsopenblascppopenmp
8.0 match 13 stars 5.59 score 60 scriptsrogih
tsqn:Applications of the Qn Estimator to Time Series (Univariate and Multivariate)
Time Series Qn is a package with applications of the Qn estimator of Rousseeuw and Croux (1993) <doi:10.1080/01621459.1993.10476408> to univariate and multivariate Time Series in time and frequency domains. More specifically, the robust estimation of autocorrelation or autocovariance matrix functions from Ma and Genton (2000, 2001) <doi:10.1111/1467-9892.00203>, <doi:10.1006/jmva.2000.1942> and Cotta (2017) <doi:10.13140/RG.2.2.14092.10883> are provided. The robust pseudo-periodogram of Molinares et. al. (2009) <doi:10.1016/j.jspi.2008.12.014> is also given. This packages also provides the M-estimator of the long-memory parameter d based on the robustification of the GPH estimator proposed by Reisen et al. (2017) <doi:10.1016/j.jspi.2017.02.008>.
Maintained by Higor Cotta. Last updated 6 years ago.
19.9 match 1 stars 2.23 score 17 scriptsmsalibian
RBF:Robust Backfitting
A robust backfitting algorithm for additive models based on (robust) local polynomial kernel smoothers. It includes both bounded and re-descending (kernel) M-estimators, and it computes predictions for points outside the training set if desired. See Boente, Martinez and Salibian-Barrera (2017) <doi:10.1080/10485252.2017.1369077> and Martinez and Salibian-Barrera (2021) <doi:10.21105/joss.02992> for details.
Maintained by Matias Salibian-Barrera. Last updated 2 years ago.
8.6 match 2 stars 5.11 score 13 scriptsjacob-long
jtools:Analysis and Presentation of Social Scientific Data
This is a collection of tools for more efficiently understanding and sharing the results of (primarily) regression analyses. There are also a number of miscellaneous functions for statistical and programming purposes. Support for models produced by the survey and lme4 packages are points of emphasis.
Maintained by Jacob A. Long. Last updated 6 months ago.
3.0 match 167 stars 14.48 score 4.0k scripts 14 dependentsbayesplay
bayesplay:The Bayes Factor Playground
A lightweight modelling syntax for defining likelihoods and priors and for computing Bayes factors for simple one parameter models. It includes functionality for computing and plotting priors, likelihoods, and model predictions. Additional functionality is included for computing and plotting posteriors.
Maintained by Lincoln John Colling. Last updated 1 years ago.
bayesbayesianbayesian-statistics
7.8 match 6 stars 5.54 score 23 scriptsubod
rococo:Robust Rank Correlation Coefficient and Test
Provides the robust gamma rank correlation coefficient as introduced by Bodenhofer, Krone, and Klawonn (2013) <DOI:10.1016/j.ins.2012.11.026> along with a permutation-based rank correlation test. The rank correlation coefficient and the test are explicitly designed for dealing with noisy numerical data.
Maintained by Ulrich Bodenhofer. Last updated 11 months ago.
9.9 match 4.32 score 21 scriptstobiasschoch
rsae:Robust Small Area Estimation
Empirical best linear unbiased prediction (EBLUP) and robust prediction of the area-level means under the basic unit-level model. The model can be fitted by maximum likelihood or a (robust) M-estimator. Mean square prediction error is computed by a parametric bootstrap.
Maintained by Tobias Schoch. Last updated 6 months ago.
11.5 match 1 stars 3.70 score 8 scriptsbioc
MsCoreUtils:Core Utils for Mass Spectrometry Data
MsCoreUtils defines low-level functions for mass spectrometry data and is independent of any high-level data structures. These functions include mass spectra processing functions (noise estimation, smoothing, binning, baseline estimation), quantitative aggregation functions (median polish, robust summarisation, ...), missing data imputation, data normalisation (quantiles, vsn, ...), misc helper functions, that are used across high-level data structure within the R for Mass Spectrometry packages.
Maintained by RforMassSpectrometry Package Maintainer. Last updated 4 days ago.
infrastructureproteomicsmassspectrometrymetabolomicsbioconductormass-spectrometryutils
4.0 match 16 stars 10.52 score 41 scripts 71 dependentsaalfons
robmedExtra:Extra Functionality for (Robust) Mediation Analysis
This companion package extends the package 'robmed' (Alfons, Ates & Groenen, 2022b; <doi:10.18637/jss.v103.i13>) in various ways. Most notably, it provides a graphical user interface for the robust bootstrap test ROBMED (Alfons, Ates & Groenen, 2022a; <doi:10.1177/1094428121999096>) to make the method more accessible to less proficient 'R' users, as well as functions to export the results as a table in a 'Microsoft Word' or 'Microsoft Powerpoint' document, or as a 'LaTeX' table. Furthermore, the package contains a 'shiny' app to compare various bootstrap procedures for mediation analysis on simulated data.
Maintained by Andreas Alfons. Last updated 4 months ago.
15.6 match 1 stars 2.70 scorevaleriapolicastro
robin:ROBustness in Network
Assesses the robustness of the community structure of a network found by one or more community detection algorithm to give indications about their reliability. It detects if the community structure found by a set of algorithms is statistically significant and compares the different selected detection algorithms on the same network. robin helps to choose among different community detection algorithms the one that better fits the network of interest. Reference in Policastro V., Righelli D., Carissimo A., Cutillo L., De Feis I. (2021) <https://journal.r-project.org/archive/2021/RJ-2021-040/index.html>.
Maintained by Valeria Policastro. Last updated 19 days ago.
5.5 match 18 stars 7.66 score 8 scriptstushiqi
MAnorm2:Tools for Normalizing and Comparing ChIP-seq Samples
Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the premier technology for profiling genome-wide localization of chromatin-binding proteins, including transcription factors and histones with various modifications. This package provides a robust method for normalizing ChIP-seq signals across individual samples or groups of samples. It also designs a self-contained system of statistical models for calling differential ChIP-seq signals between two or more biological conditions as well as for calling hypervariable ChIP-seq signals across samples. Refer to Tu et al. (2021) <doi:10.1101/gr.262675.120> and Chen et al. (2022) <doi:10.1186/s13059-022-02627-9> for associated statistical details.
Maintained by Shiqi Tu. Last updated 2 years ago.
chip-seqdifferential-analysisempirical-bayeswinsorize-values
7.7 match 32 stars 5.48 score 19 scriptsbioc
RPA:RPA: Robust Probabilistic Averaging for probe-level analysis
Probabilistic analysis of probe reliability and differential gene expression on short oligonucleotide arrays.
Maintained by Leo Lahti. Last updated 5 months ago.
geneexpressionmicroarraypreprocessingqualitycontrol
7.2 match 5.78 score 20 scripts 1 dependentsfbartos
RoBTT:Robust Bayesian T-Test
An implementation of Bayesian model-averaged t-tests that allows users to draw inferences about the presence versus absence of an effect, variance heterogeneity, and potential outliers. The 'RoBTT' package estimates ensembles of models created by combining competing hypotheses and applies Bayesian model averaging using posterior model probabilities. Users can obtain model-averaged posterior distributions and inclusion Bayes factors, accounting for uncertainty in the data-generating process (Maier et al., 2024, <doi:10.3758/s13423-024-02590-5>). The package also provides a truncated likelihood version of the model-averaged t-test, enabling users to exclude potential outliers without introducing bias (Godmann et al., 2024, <doi:10.31234/osf.io/j9f3s>). Users can specify a wide range of informative priors for all parameters of interest. The package offers convenient functions for summary, visualization, and fit diagnostics.
Maintained by František Bartoš. Last updated 4 months ago.
bayesianmodel-averagingoutlierst-testcpp
7.9 match 3 stars 5.26 score 9 scriptshenrikbengtsson
aroma.affymetrix:Analysis of Large Affymetrix Microarray Data Sets
A cross-platform R framework that facilitates processing of any number of Affymetrix microarray samples regardless of computer system. The only parameter that limits the number of chips that can be processed is the amount of available disk space. The Aroma Framework has successfully been used in studies to process tens of thousands of arrays. This package has actively been used since 2006.
Maintained by Henrik Bengtsson. Last updated 1 years ago.
infrastructureproprietaryplatformsexonarraymicroarrayonechannelguidataimportdatarepresentationpreprocessingqualitycontrolvisualizationreportwritingacghcopynumbervariantsdifferentialexpressiongeneexpressionsnptranscriptionaffymetrixanalysiscopy-numberdnaexpressionhpclarge-scalenotebookreproducibilityrna
7.2 match 10 stars 5.79 score 112 scripts 3 dependentsalexanderrobitzsch
sirt:Supplementary Item Response Theory Models
Supplementary functions for item response models aiming to complement existing R packages. The functionality includes among others multidimensional compensatory and noncompensatory IRT models (Reckase, 2009, <doi:10.1007/978-0-387-89976-3>), MCMC for hierarchical IRT models and testlet models (Fox, 2010, <doi:10.1007/978-1-4419-0742-4>), NOHARM (McDonald, 1982, <doi:10.1177/014662168200600402>), Rasch copula model (Braeken, 2011, <doi:10.1007/s11336-010-9190-4>; Schroeders, Robitzsch & Schipolowski, 2014, <doi:10.1111/jedm.12054>), faceted and hierarchical rater models (DeCarlo, Kim & Johnson, 2011, <doi:10.1111/j.1745-3984.2011.00143.x>), ordinal IRT model (ISOP; Scheiblechner, 1995, <doi:10.1007/BF02301417>), DETECT statistic (Stout, Habing, Douglas & Kim, 1996, <doi:10.1177/014662169602000403>), local structural equation modeling (LSEM; Hildebrandt, Luedtke, Robitzsch, Sommer & Wilhelm, 2016, <doi:10.1080/00273171.2016.1142856>).
Maintained by Alexander Robitzsch. Last updated 3 months ago.
item-response-theoryopenblascpp
4.1 match 23 stars 10.01 score 280 scripts 22 dependentsvalentint
rrcov3way:Robust Methods for Multiway Data Analysis, Applicable also for Compositional Data
Provides methods for multiway data analysis by means of Parafac and Tucker 3 models. Robust versions (Engelen and Hubert (2011) <doi:10.1016/j.aca.2011.04.043>) and versions for compositional data are also provided (Gallo (2015) <doi:10.1080/03610926.2013.798664>, Di Palma et al. (2018) <doi:10.1080/02664763.2017.1381669>). Several optimization methods alternative to ALS are available (Simonacci and Gallo (2019) <doi:10.1016/j.chemolab.2019.103822>, Simonacci and Gallo (2020) <doi:10.1007/s00500-019-04320-9>).
Maintained by Valentin Todorov. Last updated 1 years ago.
9.6 match 4.28 score 38 scriptsnhejazi
txshift:Efficient Estimation of the Causal Effects of Stochastic Interventions
Efficient estimation of the population-level causal effects of stochastic interventions on a continuous-valued exposure. Both one-step and targeted minimum loss estimators are implemented for the counterfactual mean value of an outcome of interest under an additive modified treatment policy, a stochastic intervention that may depend on the natural value of the exposure. To accommodate settings with outcome-dependent two-phase sampling, procedures incorporating inverse probability of censoring weighting are provided to facilitate the construction of inefficient and efficient one-step and targeted minimum loss estimators. The causal parameter and its estimation were first described by Díaz and van der Laan (2013) <doi:10.1111/j.1541-0420.2011.01685.x>, while the multiply robust estimation procedure and its application to data from two-phase sampling designs is detailed in NS Hejazi, MJ van der Laan, HE Janes, PB Gilbert, and DC Benkeser (2020) <doi:10.1111/biom.13375>. The software package implementation is described in NS Hejazi and DC Benkeser (2020) <doi:10.21105/joss.02447>. Estimation of nuisance parameters may be enhanced through the Super Learner ensemble model in 'sl3', available for download from GitHub using 'remotes::install_github("tlverse/sl3")'.
Maintained by Nima Hejazi. Last updated 6 months ago.
causal-effectscausal-inferencecensored-datamachine-learningrobust-statisticsstatisticsstochastic-interventionsstochastic-treatment-regimestargeted-learningtreatment-effectsvariable-importance
8.0 match 14 stars 5.12 score 19 scriptshelske
ramcmc:Robust Adaptive Metropolis Algorithm
Function for adapting the shape of the random walk Metropolis proposal as specified by robust adaptive Metropolis algorithm by Vihola (2012) <doi:10.1007/s11222-011-9269-5>. The package also includes fast functions for rank-one Cholesky update and downdate. These functions can be used directly from R or the corresponding C++ header files can be easily linked to other R packages.
Maintained by Jouni Helske. Last updated 3 years ago.
6.5 match 6 stars 6.21 score 8 scripts 12 dependentsandyliaw-mrk
locfit:Local Regression, Likelihood and Density Estimation
Local regression, likelihood and density estimation methods as described in the 1999 book by Loader.
Maintained by Andy Liaw. Last updated 11 days ago.
4.3 match 1 stars 9.40 score 428 scripts 606 dependentsaplantin
MiRKAT:Microbiome Regression-Based Kernel Association Tests
Test for overall association between microbiome composition data and phenotypes via phylogenetic kernels. The phenotype can be univariate continuous or binary (Zhao et al. (2015) <doi:10.1016/j.ajhg.2015.04.003>), survival outcomes (Plantinga et al. (2017) <doi:10.1186/s40168-017-0239-9>), multivariate (Zhan et al. (2017) <doi:10.1002/gepi.22030>) and structured phenotypes (Zhan et al. (2017) <doi:10.1111/biom.12684>). The package can also use robust regression (unpublished work) and integrated quantile regression (Wang et al. (2021) <doi:10.1093/bioinformatics/btab668>). In each case, the microbiome community effect is modeled nonparametrically through a kernel function, which can incorporate phylogenetic tree information.
Maintained by Anna Plantinga. Last updated 2 years ago.
8.4 match 3 stars 4.74 score 183 scriptsbioc
GARS:GARS: Genetic Algorithm for the identification of Robust Subsets of variables in high-dimensional and challenging datasets
Feature selection aims to identify and remove redundant, irrelevant and noisy variables from high-dimensional datasets. Selecting informative features affects the subsequent classification and regression analyses by improving their overall performances. Several methods have been proposed to perform feature selection: most of them relies on univariate statistics, correlation, entropy measurements or the usage of backward/forward regressions. Herein, we propose an efficient, robust and fast method that adopts stochastic optimization approaches for high-dimensional. GARS is an innovative implementation of a genetic algorithm that selects robust features in high-dimensional and challenging datasets.
Maintained by Mattia Chiesa. Last updated 5 months ago.
classificationfeatureextractionclusteringopenjdk
7.9 match 5.00 score 2 scriptscwatson
brainGraph:Graph Theory Analysis of Brain MRI Data
A set of tools for performing graph theory analysis of brain MRI data. It works with data from a Freesurfer analysis (cortical thickness, volumes, local gyrification index, surface area), diffusion tensor tractography data (e.g., from FSL) and resting-state fMRI data (e.g., from DPABI). It contains a graphical user interface for graph visualization and data exploration, along with several functions for generating useful figures.
Maintained by Christopher G. Watson. Last updated 1 years ago.
brain-connectivitybrain-imagingcomplex-networksconnectomeconnectomicsfmrigraph-theorymrinetwork-analysisneuroimagingneurosciencestatisticstractography
5.0 match 188 stars 7.86 score 107 scripts 3 dependentskloke
npsm:Nonparametric Statistical Methods
Accompanies the book "Nonparametric Statistical Methods Using R, 2nd Edition" by Kloke and McKean (2024, ISBN:9780367651350). Includes methods, datasets, and random number generation useful for the study of robust and/or nonparametric statistics. Emphasizes classical nonparametric methods for a variety of designs --- especially one-sample and two-sample problems. Includes methods for general scores, including estimation and testing for the two-sample location problem as well as Hogg's adaptive method.
Maintained by John Kloke. Last updated 9 months ago.
11.2 match 3.47 score 59 scriptsuniprjrc
fsdaR:Robust Data Analysis Through Monitoring and Dynamic Visualization
Provides interface to the 'MATLAB' toolbox 'Flexible Statistical Data Analysis (FSDA)' which is comprehensive and computationally efficient software package for robust statistics in regression, multivariate and categorical data analysis. The current R version implements tools for regression: (forward search, S- and MM-estimation, least trimmed squares (LTS) and least median of squares (LMS)), for multivariate analysis (forward search, S- and MM-estimation), for cluster analysis and cluster-wise regression. The distinctive feature of our package is the possibility of monitoring the statistics of interest as a function of breakdown point, efficiency or subset size, depending on the estimator. This is accompanied by a rich set of graphical features, such as dynamic brushing, linking, particularly useful for exploratory data analysis.
Maintained by Valentin Todorov. Last updated 1 years ago.
7.2 match 5 stars 5.37 score 93 scriptscran
rdrobust:Robust Data-Driven Statistical Inference in Regression-Discontinuity Designs
Regression-discontinuity (RD) designs are quasi-experimental research designs popular in social, behavioral and natural sciences. The RD design is usually employed to study the (local) causal effect of a treatment, intervention or policy. This package provides tools for data-driven graphical and analytical statistical inference in RD designs: rdrobust() to construct local-polynomial point estimators and robust confidence intervals for average treatment effects at the cutoff in Sharp, Fuzzy and Kink RD settings, rdbwselect() to perform bandwidth selection for the different procedures implemented, and rdplot() to conduct exploratory data analysis (RD plots).
Maintained by Sebastian Calonico. Last updated 1 years ago.
6.7 match 4 stars 5.70 score 638 scripts 6 dependentsfchamroukhi
meteorits:Mixture-of-Experts Modeling for Complex Non-Normal Distributions
Provides a unified mixture-of-experts (ME) modeling and estimation framework with several original and flexible ME models to model, cluster and classify heterogeneous data in many complex situations where the data are distributed according to non-normal, possibly skewed distributions, and when they might be corrupted by atypical observations. Mixtures-of-Experts models for complex and non-normal distributions ('meteorits') are originally introduced and written in 'Matlab' by Faicel Chamroukhi. The references are mainly the following ones. The references are mainly the following ones. Chamroukhi F., Same A., Govaert, G. and Aknin P. (2009) <doi:10.1016/j.neunet.2009.06.040>. Chamroukhi F. (2010) <https://chamroukhi.com/FChamroukhi-PhD.pdf>. Chamroukhi F. (2015) <arXiv:1506.06707>. Chamroukhi F. (2015) <https://chamroukhi.com/FChamroukhi-HDR.pdf>. Chamroukhi F. (2016) <doi:10.1109/IJCNN.2016.7727580>. Chamroukhi F. (2016) <doi:10.1016/j.neunet.2016.03.002>. Chamroukhi F. (2017) <doi:10.1016/j.neucom.2017.05.044>.
Maintained by Florian Lecocq. Last updated 5 years ago.
artificial-intelligenceclusteringem-algorithmmixture-of-expertsneural-networksnon-linear-regressionpredictionrobust-learningskew-normalskew-tskewed-datastatistical-inferencestatistical-learningt-distributionunsupervised-learningopenblascpp
7.5 match 3 stars 5.12 score 11 scriptsr-lib
lintr:A 'Linter' for R Code
Checks adherence to a given style, syntax errors and possible semantic issues. Supports on the fly checking of R code edited with 'RStudio IDE', 'Emacs', 'Vim', 'Sublime Text', 'Atom' and 'Visual Studio Code'.
Maintained by Michael Chirico. Last updated 8 days ago.
2.3 match 1.2k stars 17.00 score 916 scripts 33 dependentscran
ssmrob:Robust Estimation and Inference in Sample Selection Models
Package provides a set of tools for robust estimation and inference for models with sample selectivity and endogenous treatment model. For details, see Zhelonkin and Ronchetti (2021) <doi:10.18637/jss.v099.i04>.
Maintained by Mikhail Zhelonkin. Last updated 4 years ago.
38.2 match 1.00 score 9 scriptsjaak-s
rDEA:Robust Data Envelopment Analysis (DEA) for R
Data Envelopment Analysis for R, estimating robust DEA scores without and with environmental variables and doing returns-to-scale tests.
Maintained by Jaak Simm. Last updated 2 years ago.
7.7 match 24 stars 4.92 score 23 scriptschristiangoueguel
ConfidenceEllipse:Computation of 2D and 3D Elliptical Joint Confidence Regions
Computing elliptical joint confidence regions at a specified confidence level. It provides the flexibility to estimate either classical or robust confidence regions, which can be visualized in 2D or 3D plots. The classical approach assumes normality and uses the mean and covariance matrix to define the confidence regions. Alternatively, the robustified version employs estimators like minimum covariance determinant (MCD) and M-estimator, making them less sensitive to outliers and departures from normality. Furthermore, the functions allow users to group the dataset based on categorical variables and estimate separate confidence regions for each group. This capability is particularly useful for exploring potential differences or similarities across subgroups within a dataset. Varmuza and Filzmoser (2009, ISBN:978-1-4200-5947-2). Johnson and Wichern (2007, ISBN:0-13-187715-1). Raymaekers and Rousseeuw (2019) <DOI:10.1080/00401706.2019.1677270>.
Maintained by Christian L. Goueguel. Last updated 11 months ago.
confidence-ellipseconfidence-ellipsoidconfidence-regionmultivariate-distributionoutliers-detectionrobust-statistics
8.0 match 1 stars 4.70 scorehsnbulut
MVTests:Multivariate Hypothesis Tests
Multivariate hypothesis tests and the confidence intervals. It can be used to test the hypothesizes about mean vector or vectors (one-sample, two independent samples, paired samples), covariance matrix (one or more matrices), and the correlation matrix. Moreover, it can be used for robust Hotelling T^2 test at one sample case in high dimensional data. For this package, we have benefited from the studies Rencher (2003), Nel and Merwe (1986) <DOI: 10.1080/03610928608829342>, Tatlidil (1996), Tsagris (2014), Villasenor Alva and Estrada (2009) <DOI: 10.1080/03610920802474465>.
Maintained by Hasan Bulut. Last updated 5 months ago.
11.2 match 3.30 score 40 scriptskolesarm
dfadjust:Degrees of Freedom Adjustment for Robust Standard Errors
Computes small-sample degrees of freedom adjustment for heteroskedasticity robust standard errors, and for clustered standard errors in linear regression. See Imbens and Kolesár (2016) <doi:10.1162/REST_a_00552> for a discussion of these adjustments.
Maintained by Michal Kolesár. Last updated 3 months ago.
6.4 match 31 stars 5.75 score 12 scriptsbioc
flowClust:Clustering for Flow Cytometry
Robust model-based clustering using a t-mixture model with Box-Cox transformation. Note: users should have GSL installed. Windows users: 'consult the README file available in the inst directory of the source distribution for necessary configuration instructions'.
Maintained by Greg Finak. Last updated 5 months ago.
immunooncologyclusteringvisualizationflowcytometry
5.0 match 7.30 score 83 scripts 6 dependentscran
adamethods:Archetypoid Algorithms and Anomaly Detection
Collection of several algorithms to obtain archetypoids with small and large databases, and with both classical multivariate data and functional data (univariate and multivariate). Some of these algorithms also allow to detect anomalies (outliers). Please see Vinue and Epifanio (2020) <doi:10.1007/s11634-020-00412-9>.
Maintained by Guillermo Vinue. Last updated 5 years ago.
22.2 match 1.63 score 43 scriptssubroy13
rsvddpd:Robust Singular Value Decomposition using Density Power Divergence
Computing singular value decomposition with robustness is a challenging task. This package provides an implementation of computing robust SVD using density power divergence (<arXiv:2109.10680>). It combines the idea of robustness and efficiency in estimation based on a tuning parameter. It also provides utility functions to simulate various scenarios to compare performances of different algorithms.
Maintained by Subhrajyoty Roy. Last updated 2 years ago.
8.7 match 3 stars 4.18 score 6 scriptsantoinelucas64
amap:Another Multidimensional Analysis Package
Tools for Clustering and Principal Component Analysis (With robust methods, and parallelized functions).
Maintained by Antoine Lucas. Last updated 5 months ago.
4.7 match 7.66 score 460 scripts 26 dependentsbioc
LimROTS:A Hybrid Method Integrating Empirical Bayes and Reproducibility-Optimized Statistics for Robust Analysis of Proteomics and Metabolomics Data
Differential expression analysis is a prevalent method utilised in the examination of diverse biological data. The reproducibility-optimized test statistic (ROTS) modifies a t-statistic based on the data's intrinsic characteristics and ranks features according to their statistical significance for differential expression between two or more groups (f-statistic). Focussing on proteomics and metabolomics, the current ROTS implementation cannot account for technical or biological covariates such as MS batches or gender differences among the samples. Consequently, we developed LimROTS, which employs a reproducibility-optimized test statistic utilising the limma methodology to simulate complex experimental designs. LimROTS is a hybrid method integrating empirical bayes and reproducibility-optimized statistics for robust analysis of proteomics and metabolomics data.
Maintained by Ali Mostafa Anwar. Last updated 3 months ago.
softwaregeneexpressiondifferentialexpressionmicroarrayrnaseqproteomicsimmunooncologymetabolomicsmrnamicroarray
7.5 match 1 stars 4.70 score 1 scriptscran
bst:Gradient Boosting
Functional gradient descent algorithm for a variety of convex and non-convex loss functions, for both classical and robust regression and classification problems. See Wang (2011) <doi:10.2202/1557-4679.1304>, Wang (2012) <doi:10.3414/ME11-02-0020>, Wang (2018) <doi:10.1080/10618600.2018.1424635>, Wang (2018) <doi:10.1214/18-EJS1404>.
Maintained by Zhu Wang. Last updated 2 years ago.
8.3 match 4.17 score 5 dependentsericaponzi
RaJIVE:Robust Angle Based Joint and Individual Variation Explained
A robust alternative to the aJIVE (angle based Joint and Individual Variation Explained) method (Feng et al 2018: <doi:10.1016/j.jmva.2018.03.008>) for the estimation of joint and individual components in the presence of outliers in multi-source data. It decomposes the multi-source data into joint, individual and residual (noise) contributions. The decomposition is robust to outliers and noise in the data. The method is illustrated in Ponzi et al (2021) <arXiv:2101.09110>.
Maintained by Erica Ponzi. Last updated 4 years ago.
12.8 match 2.70 score 1 scriptsyunyishen
robustcov:Collection of Robust Covariance and (Sparse) Precision Matrix Estimators
Collection of methods for robust covariance and (sparse) precision matrix estimation based on Loh and Tan (2018) <doi:10.1214/18-EJS1427>.
Maintained by Yunyi Shen. Last updated 4 years ago.
precision-matrixrobust-estimatesopenblascppopenmp
12.8 match 1 stars 2.70 scorejamovi
walrus:Robust Statistical Methods
A toolbox of common robust statistical tests, including robust descriptives, robust t-tests, and robust ANOVA. It is also available as a module for 'jamovi' (see <https://www.jamovi.org> for more information). Walrus is based on the WRS2 package by Patrick Mair, which is in turn based on the scripts and work of Rand Wilcox. These analyses are described in depth in the book 'Introduction to Robust Estimation & Hypothesis Testing'.
Maintained by Jonathon Love. Last updated 2 years ago.
12.5 match 2 stars 2.68 score 12 scriptsbraverock
PortfolioAnalytics:Portfolio Analysis, Including Numerical Methods for Optimization of Portfolios
Portfolio optimization and analysis routines and graphics.
Maintained by Brian G. Peterson. Last updated 3 months ago.
2.9 match 81 stars 11.49 score 626 scripts 2 dependentsbioc
GENESIS:GENetic EStimation and Inference in Structured samples (GENESIS): Statistical methods for analyzing genetic data from samples with population structure and/or relatedness
The GENESIS package provides methodology for estimating, inferring, and accounting for population and pedigree structure in genetic analyses. The current implementation provides functions to perform PC-AiR (Conomos et al., 2015, Gen Epi) and PC-Relate (Conomos et al., 2016, AJHG). PC-AiR performs a Principal Components Analysis on genome-wide SNP data for the detection of population structure in a sample that may contain known or cryptic relatedness. Unlike standard PCA, PC-AiR accounts for relatedness in the sample to provide accurate ancestry inference that is not confounded by family structure. PC-Relate uses ancestry representative principal components to adjust for population structure/ancestry and accurately estimate measures of recent genetic relatedness such as kinship coefficients, IBD sharing probabilities, and inbreeding coefficients. Additionally, functions are provided to perform efficient variance component estimation and mixed model association testing for both quantitative and binary phenotypes.
Maintained by Stephanie M. Gogarten. Last updated 1 months ago.
snpgeneticvariabilitygeneticsstatisticalmethoddimensionreductionprincipalcomponentgenomewideassociationqualitycontrolbiocviews
3.2 match 36 stars 10.44 score 342 scripts 1 dependentsbioc
AlpsNMR:Automated spectraL Processing System for NMR
Reads Bruker NMR data directories both zipped and unzipped. It provides automated and efficient signal processing for untargeted NMR metabolomics. It is able to interpolate the samples, detect outliers, exclude regions, normalize, detect peaks, align the spectra, integrate peaks, manage metadata and visualize the spectra. After spectra proccessing, it can apply multivariate analysis on extracted data. Efficient plotting with 1-D data is also available. Basic reading of 1D ACD/Labs exported JDX samples is also available.
Maintained by Sergio Oller Moreno. Last updated 5 months ago.
softwarepreprocessingvisualizationclassificationcheminformaticsmetabolomicsdataimport
4.4 match 15 stars 7.59 score 12 scripts 1 dependentspecanproject
PEcAn.utils:PEcAn Functions Used for Ecological Forecasts and Reanalysis
The Predictive Ecosystem Carbon Analyzer (PEcAn) is a scientific workflow management tool that is designed to simplify the management of model parameterization, execution, and analysis. The goal of PEcAn is to streamline the interaction between data and models, and to improve the efficacy of scientific investigation.
Maintained by Rob Kooper. Last updated 2 days ago.
bayesiancyberinfrastructuredata-assimilationdata-scienceecosystem-modelecosystem-scienceforecastingmeta-analysisnational-science-foundationpecanplants
3.0 match 216 stars 10.92 score 218 scripts 35 dependentscran
metaplus:Robust Meta-Analysis and Meta-Regression
Performs meta-analysis and meta-regression using standard and robust methods with confidence intervals based on the profile likelihood. Robust methods are based on alternative distributions for the random effect, either the t-distribution (Lee and Thompson, 2008 <doi:10.1002/sim.2897> or Baker and Jackson, 2008 <doi:10.1007/s10729-007-9041-8>) or mixtures of normals (Beath, 2014 <doi:10.1002/jrsm.1114>).
Maintained by Ken Beath. Last updated 2 months ago.
8.2 match 3.96 score 34 scriptskolesarm
ebci:Robust Empirical Bayes Confidence Intervals
Computes empirical Bayes confidence estimators and confidence intervals in a normal means model. The intervals are robust in the sense that they achieve correct coverage regardless of the distribution of the means. If the means are treated as fixed, the intervals have an average coverage guarantee. The implementation is based on Armstrong, Kolesár and Plagborg-Møller (2022) <doi:10.3982/ECTA18597>.
Maintained by Michal Kolesár. Last updated 7 months ago.
6.5 match 10 stars 5.00 score 3 scriptsaadler
revss:Robust Estimation in Very Small Samples
Implements the estimation techniques described in Rousseeuw & Verboven (2002) <doi:10.1016/S0167-9473(02)00078-6> for the location and scale of very small samples.
Maintained by Avraham Adler. Last updated 9 months ago.
9.0 match 4 stars 3.60 scorebioc
marray:Exploratory analysis for two-color spotted microarray data
Class definitions for two-color spotted microarray data. Fuctions for data input, diagnostic plots, normalization and quality checking.
Maintained by Yee Hwa (Jean) Yang. Last updated 5 months ago.
microarraytwochannelpreprocessing
3.6 match 8.92 score 222 scripts 37 dependentsleelabsg
SKAT:SNP-Set (Sequence) Kernel Association Test
Functions for kernel-regression-based association tests including Burden test, SKAT and SKAT-O. These methods aggregate individual SNP score statistics in a SNP set and efficiently compute SNP-set level p-values.
Maintained by Seunggeun (Shawn) Lee. Last updated 1 months ago.
3.3 match 45 stars 9.70 score 268 scripts 16 dependentsdrjoze
drgee:Doubly Robust Generalized Estimating Equations
Fit restricted mean models for the conditional association between an exposure and an outcome, given covariates. Three methods are implemented: O-estimation, where a nuisance model for the association between the covariates and the outcome is used; E-estimation where a nuisance model for the association between the covariates and the exposure is used, and doubly robust (DR) estimation where both nuisance models are used. In DR-estimation, the estimates will be consistent when at least one of the nuisance models is correctly specified, not necessarily both. For more information, see Zetterqvist and Sjölander (2015) <doi:10.1515/em-2014-0021>.
Maintained by Johan Zetterqvist. Last updated 5 years ago.
9.1 match 3.50 score 35 scripts 3 dependentsandreasnordland
polle:Policy Learning
Package for evaluating user-specified finite stage policies and learning optimal treatment policies via doubly robust loss functions. Policy learning methods include doubly robust learning of the blip/conditional average treatment effect and sequential policy tree learning. The package also include methods for optimal subgroup analysis. See Nordland and Holst (2022) <doi:10.48550/arXiv.2212.02335> for documentation and references.
Maintained by Andreas Nordland. Last updated 9 days ago.
5.5 match 4 stars 5.76 score 6 scriptscran
SSDforR:Functions to Analyze Single System Data
Functions to visually and statistically analyze single system data.
Maintained by Charles Auerbach. Last updated 3 months ago.
12.9 match 2.44 score 80 scriptsmbannick
RobinCar:Robust Inference for Covariate Adjustment in Randomized Clinical Trials
Performs robust estimation and inference when using covariate adjustment and/or covariate-adaptive randomization in randomized clinical trials. Ting Ye, Jun Shao, Yanyao Yi, Qinyuan Zhao (2023) <doi:10.1080/01621459.2022.2049278>. Ting Ye, Marlena Bannick, Yanyao Yi, Jun Shao (2023) <doi:10.1080/24754269.2023.2205802>. Ting Ye, Jun Shao, Yanyao Yi (2023) <doi:10.1093/biomet/asad045>. Marlena Bannick, Jun Shao, Jingyi Liu, Yu Du, Yanyao Yi, Ting Ye (2024) <doi:10.48550/arXiv.2306.10213>.
Maintained by Marlena Bannick. Last updated 6 days ago.
7.0 match 6 stars 4.42 score 11 scriptsrogih
acfMPeriod:Robust Estimation of the ACF from the M-Periodogram
Non-robust and robust computations of the sample autocovariance (ACOVF) and sample autocorrelation functions (ACF) of univariate and multivariate processes. The methodology consists in reversing the diagonalization procedure involving the periodogram or the cross-periodogram and the Fourier transform vectors, and, thus, obtaining the ACOVF or the ACF as discussed in Fuller (1995) <doi:10.1002/9780470316917>. The robust version is obtained by fitting robust M-regressors to obtain the M-periodogram or M-cross-periodogram as discussed in Reisen et al. (2017) <doi:10.1016/j.jspi.2017.02.008>.
Maintained by Higor Cotta. Last updated 6 years ago.
15.4 match 2.00 scorerussellpierce
naptime:A Flexible and Robust Sys.sleep() Replacement
Provides a near drop-in replacement for base::Sys.sleep() that allows more types of input to produce delays in the execution of code and can silence/prevent typical sources of error.
Maintained by Russell S. Pierce. Last updated 7 months ago.
5.9 match 9 stars 5.21 score 12 scripts 1 dependentsunina-sfere
rofanova:Robust Functional Analysis of Variance
Implements the robust functional analysis of variance (RoFANOVA), described in Centofanti et al. (2021) <arXiv:2112.10643>. It allows testing mean differences among groups of functional data by being robust against the presence of outliers.
Maintained by Fabio Centofanti. Last updated 3 years ago.
9.5 match 3.22 score 11 scripts 1 dependentsshabbychef
fromo:Fast Robust Moments
Fast, numerically robust computation of weighted moments via 'Rcpp'. Supports computation on vectors and matrices, and Monoidal append of moments. Moments and cumulants over running fixed length windows can be computed, as well as over time-based windows. Moment computations are via a generalization of Welford's method, as described by Bennett et. (2009) <doi:10.1109/CLUSTR.2009.5289161>.
Maintained by Steven E. Pav. Last updated 4 months ago.
cumulantsmomentsrolling-statisticsstatisticscpp
5.8 match 3 stars 5.22 score 22 scriptsxmengju
RRBoost:A Robust Boosting Algorithm
An implementation of robust boosting algorithms for regression in R. This includes the RRBoost method proposed in the paper "Robust Boosting for Regression Problems" (Ju X and Salibian-Barrera M. 2020) <doi:10.1016/j.csda.2020.107065> (to appear in Computational Statistics and Data Science). It also implements previously proposed boosting algorithms in the simulation section of the paper: L2Boost, LADBoost, MBoost (Friedman, J. H. (2001) <10.1214/aos/1013203451>) and Robloss (Lutz et al. (2008) <10.1016/j.csda.2007.11.006>).
Maintained by Xiaomeng Ju. Last updated 4 months ago.
11.3 match 2.70 score 3 scriptslbb220
GWmodel:Geographically-Weighted Models
Techniques from a particular branch of spatial statistics,termed geographically-weighted (GW) models. GW models suit situations when data are not described well by some global model, but where there are spatial regions where a suitably localised calibration provides a better description. 'GWmodel' includes functions to calibrate: GW summary statistics (Brunsdon et al., 2002)<doi: 10.1016/s0198-9715(01)00009-6>, GW principal components analysis (Harris et al., 2011)<doi: 10.1080/13658816.2011.554838>, GW discriminant analysis (Brunsdon et al., 2007)<doi: 10.1111/j.1538-4632.2007.00709.x> and various forms of GW regression (Brunsdon et al., 1996)<doi: 10.1111/j.1538-4632.1996.tb00936.x>; some of which are provided in basic and robust (outlier resistant) forms.
Maintained by Binbin Lu. Last updated 6 months ago.
4.8 match 18 stars 6.38 score 266 scripts 4 dependentsquantumofmoose
complexlm:Linear Fitting for Complex Valued Data
Tools for linear fitting with complex variables. Includes ordinary least-squares (zlm()) and robust M-estimation (rzlm()), and complex methods for oft used generics. Originally adapted from the rlm() functions of 'MASS' and the lm() functions of 'stats'.
Maintained by William Ryan. Last updated 1 years ago.
complex-numbersfittinglinear-modelslinear-regressionrobust-statisticsstatistics
15.1 match 1 stars 2.00 score 6 scriptsbioc
ClustAll:ClustAll: Data driven strategy to robustly identify stratification of patients within complex diseases
Data driven strategy to find hidden groups of patients with complex diseases using clinical data. ClustAll facilitates the unsupervised identification of multiple robust stratifications. ClustAll, is able to overcome the most common limitations found when dealing with clinical data (missing values, correlated data, mixed data types).
Maintained by Asier Ortega-Legarreta. Last updated 5 months ago.
softwarestatisticalmethodclusteringdimensionreductionprincipalcomponent
8.0 match 3.78 score 1 scriptspaulsmirnov
robcor:Robust Correlations
Robust pairwise correlations based on estimates of scale, particularly on "FastQn" one-step M-estimate.
Maintained by Paul Smirnov. Last updated 3 years ago.
11.6 match 2.58 score 21 scripts 6 dependentsavehtari
aaltobda:Functionality and Data for the Aalto Course on Bayesian Data Analysis
Functionality and Data for the Aalto University Course on Bayesian Data Analysis.
Maintained by Aki Vehtari. Last updated 3 months ago.
bayesbayesianbayesian-data-analysisbayesian-inferencebayesian-methodsbayesian-workflow
3.4 match 2.2k stars 8.93 score 159 scriptsfvidoli
Compind:Composite Indicators Functions
A collection of functions to calculate Composite Indicators methods, focusing, in particular, on the normalisation and weighting-aggregation steps, as described in OECD Handbook on constructing composite indicators: methodology and user guide, 2008, 'Vidoli' and 'Fusco' and 'Mazziotta' <doi:10.1007/s11205-014-0710-y>, 'Mazziotta' and 'Pareto' (2016) <doi:10.1007/s11205-015-0998-2>, 'Van Puyenbroeck and 'Rogge' <doi:10.1016/j.ejor.2016.07.038> and other authors.
Maintained by Francesco Vidoli. Last updated 2 months ago.
10.3 match 1 stars 2.90 score 40 scriptsbioc
netSmooth:Network smoothing for scRNAseq
netSmooth is an R package for network smoothing of single cell RNA sequencing data. Using bio networks such as protein-protein interactions as priors for gene co-expression, netsmooth improves cell type identification from noisy, sparse scRNAseq data.
Maintained by Jonathan Ronen. Last updated 5 months ago.
networkgraphandnetworksinglecellrnaseqgeneexpressionsequencingtranscriptomicsnormalizationpreprocessingclusteringdimensionreductionbioinformaticsgenomicssingle-cell
4.0 match 27 stars 7.41 score 4 scriptswahani
saeRobust:Robust Small Area Estimation
Methods to fit robust alternatives to commonly used models used in Small Area Estimation. The methods here used are based on best linear unbiased predictions and linear mixed models. At this time available models include area level models incorporating spatial and temporal correlation in the random effects.
Maintained by Sebastian Warnholz. Last updated 1 years ago.
7.3 match 1 stars 4.03 score 12 scripts 3 dependentsmuschellij2
fslr:Wrapper Functions for 'FSL' ('FMRIB' Software Library) from Functional MRI of the Brain ('FMRIB')
Wrapper functions that interface with 'FSL' <http://fsl.fmrib.ox.ac.uk/fsl/fslwiki/>, a powerful and commonly-used 'neuroimaging' software, using system commands. The goal is to be able to interface with 'FSL' completely in R, where you pass R objects of class 'nifti', implemented by package 'oro.nifti', and the function executes an 'FSL' command and returns an R object of class 'nifti' if desired.
Maintained by John Muschelli. Last updated 1 months ago.
fslfslrneuroimagingneuroimaging-analysisneuroimaging-data-science
3.7 match 41 stars 8.01 score 420 scriptslaylaparast
Rsurrogate:Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information
Provides functions to estimate the proportion of treatment effect on the primary outcome that is explained by the treatment effect on the surrogate marker.
Maintained by Layla Parast. Last updated 2 years ago.
9.3 match 3.16 score 12 scripts 4 dependentsshanascogin
coxrobust:Fit Robustly Proportional Hazards Regression Model
An implementation of robust estimation in Cox model. Functionality includes fitting efficiently and robustly Cox proportional hazards regression model in its basic form, where explanatory variables are time independent with one event per subject. Method is based on a smooth modification of the partial likelihood.
Maintained by Shana Scogin. Last updated 3 years ago.
7.2 match 3 stars 4.05 score 21 scripts 2 dependentsamishra-stats
robregcc:Robust Regression with Compositional Covariates
We implement the algorithm estimating the parameters of the robust regression model with compositional covariates. The model simultaneously treats outliers and provides reliable parameter estimates. Publication reference: Mishra, A., Mueller, C.,(2019) <arXiv:1909.04990>.
Maintained by Aditya Mishra. Last updated 4 years ago.
7.1 match 6 stars 4.11 score 43 scriptsbioc
epimutacions:Robust outlier identification for DNA methylation data
The package includes some statistical outlier detection methods for epimutations detection in DNA methylation data. The methods included in the package are MANOVA, Multivariate linear models, isolation forest, robust mahalanobis distance, quantile and beta. The methods compare a case sample with a suspected disease against a reference panel (composed of healthy individuals) to identify epimutations in the given case sample. It also contains functions to annotate and visualize the identified epimutations.
Maintained by Dolors Pelegri-Siso. Last updated 5 months ago.
dnamethylationbiologicalquestionpreprocessingstatisticalmethodnormalizationcpp
6.8 match 4.23 score 28 scriptsdddlab
robsel:Robust Selection Algorithm
An implementation of algorithms for estimation of the graphical lasso regularization parameter described in Pedro Cisneros-Velarde, Alexander Petersen and Sang-Yun Oh (2020) <http://proceedings.mlr.press/v108/cisneros20a.html>.
Maintained by Chau Tran. Last updated 4 years ago.
6.8 match 2 stars 4.28 score 19 scriptsdakep
pyinit:Pena-Yohai Initial Estimator for Robust S-Regression
Deterministic Pena-Yohai initial estimator for robust S estimators of regression. The procedure is described in detail in Pena, D., & Yohai, V. (1999) <doi:10.2307/2670164>.
Maintained by David Kepplinger. Last updated 3 years ago.
5.3 match 1 stars 5.43 score 17 scripts 9 dependentsrfastofficial
Rfast2:A Collection of Efficient and Extremely Fast R Functions II
A collection of fast statistical and utility functions for data analysis. Functions for regression, maximum likelihood, column-wise statistics and many more have been included. C++ has been utilized to speed up the functions. References: Tsagris M., Papadakis M. (2018). Taking R to its limits: 70+ tips. PeerJ Preprints 6:e26605v1 <doi:10.7287/peerj.preprints.26605v1>.
Maintained by Manos Papadakis. Last updated 1 years ago.
3.5 match 38 stars 8.09 score 75 scripts 26 dependentsbioc
rbsurv:Robust likelihood-based survival modeling with microarray data
This package selects genes associated with survival.
Maintained by Soo-heang Eo. Last updated 5 months ago.
6.5 match 4.30 score 7 scriptsreckziegel
epo:Enhanced Portfolio Optimization (EPO)
Implements the Enhanced Portfolio Optimization (EPO) method as described in Pedersen, Babu and Levine (2021) <doi:10.2139/ssrn.3530390>.
Maintained by Bernardo Reckziegel. Last updated 1 years ago.
bayesian-optimizationblack-littermanmean-variance-optimizationprincipal-component-analysisrobust-optimization
7.5 match 10 stars 3.70 score 4 scriptscran
rlcv:Robust Likelihood Cross Validation Bandwidth Selection
Robust likelihood cross validation bandwidth for uni- and multi-variate kernel densities. It is robust against fat-tailed distributions and/or outliers. Based on "Robust Likelihood Cross-Validation for Kernel Density Estimation," Wu (2019) <doi:10.1080/07350015.2018.1424633>.
Maintained by Ximing Wu. Last updated 3 years ago.
10.3 match 2.70 scoresalvatoremangiafico
rcompanion:Functions to Support Extension Education Program Evaluation
Functions and datasets to support Summary and Analysis of Extension Program Evaluation in R, and An R Companion for the Handbook of Biological Statistics. Vignettes are available at <https://rcompanion.org>.
Maintained by Salvatore Mangiafico. Last updated 30 days ago.
3.4 match 4 stars 8.01 score 2.4k scripts 5 dependentsbioc
sccomp:Tests differences in cell-type proportion for single-cell data, robust to outliers
A robust and outlier-aware method for testing differences in cell-type proportion in single-cell data. This model can infer changes in tissue composition and heterogeneity, and can produce realistic data simulations based on any existing dataset. This model can also transfer knowledge from a large set of integrated datasets to increase accuracy further.
Maintained by Stefano Mangiola. Last updated 16 days ago.
bayesianregressiondifferentialexpressionsinglecellbatch-correctioncompositioncytofdifferential-proportionmicrobiomemultilevelproportionsrandom-effectssingle-cellunwanted-variation
3.3 match 99 stars 8.41 score 69 scriptsemilopezcano
SixSigma:Six Sigma Tools for Quality Control and Improvement
Functions and utilities to perform Statistical Analyses in the Six Sigma way. Through the DMAIC cycle (Define, Measure, Analyze, Improve, Control), you can manage several Quality Management studies: Gage R&R, Capability Analysis, Control Charts, Loss Function Analysis, etc. Data frames used in the books "Six Sigma with R" [ISBN 978-1-4614-3652-2] and "Quality Control with R" [ISBN 978-3-319-24046-6], are also included in the package.
Maintained by Emilio L. Cano. Last updated 2 years ago.
quality-controlquality-improvementsix-sigmaspc
3.5 match 15 stars 7.82 score 169 scripts 1 dependentsenricoschumann
NMOF:Numerical Methods and Optimization in Finance
Functions, examples and data from the first and the second edition of "Numerical Methods and Optimization in Finance" by M. Gilli, D. Maringer and E. Schumann (2019, ISBN:978-0128150658). The package provides implementations of optimisation heuristics (Differential Evolution, Genetic Algorithms, Particle Swarm Optimisation, Simulated Annealing and Threshold Accepting), and other optimisation tools, such as grid search and greedy search. There are also functions for the valuation of financial instruments such as bonds and options, for portfolio selection and functions that help with stochastic simulations.
Maintained by Enrico Schumann. Last updated 30 days ago.
black-scholesdifferential-evolutiongenetic-algorithmgrid-searchheuristicsimplied-volatilitylocal-searchoptimizationparticle-swarm-optimizationsimulated-annealingthreshold-accepting
2.9 match 36 stars 9.56 score 101 scripts 4 dependentsalexisderumigny
MMDCopula:Robust Estimation of Copulas by Maximum Mean Discrepancy
Provides functions for the robust estimation of parametric families of copulas using minimization of the Maximum Mean Discrepancy, following the article Alquier, Chérief-Abdellatif, Derumigny and Fermanian (2022) <doi:10.1080/01621459.2021.2024836>.
Maintained by Alexis Derumigny. Last updated 3 years ago.
6.2 match 5 stars 4.40 score 3 scriptsbioc
dearseq:Differential Expression Analysis for RNA-seq data through a robust variance component test
Differential Expression Analysis RNA-seq data with variance component score test accounting for data heteroscedasticity through precision weights. Perform both gene-wise and gene set analyses, and can deal with repeated or longitudinal data. Methods are detailed in: i) Agniel D & Hejblum BP (2017) Variance component score test for time-course gene set analysis of longitudinal RNA-seq data, Biostatistics, 18(4):589-604 ; and ii) Gauthier M, Agniel D, Thiébaut R & Hejblum BP (2020) dearseq: a variance component score test for RNA-Seq differential analysis that effectively controls the false discovery rate, NAR Genomics and Bioinformatics, 2(4):lqaa093.
Maintained by Boris P. Hejblum. Last updated 5 months ago.
biomedicalinformaticscellbiologydifferentialexpressiondnaseqgeneexpressiongeneticsgenesetenrichmentimmunooncologykeggregressionrnaseqsequencingsystemsbiologytimecoursetranscriptiontranscriptomics
4.4 match 8 stars 6.20 score 11 scripts 1 dependentsbioc
DESeq2:Differential gene expression analysis based on the negative binomial distribution
Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution.
Maintained by Michael Love. Last updated 11 days ago.
sequencingrnaseqchipseqgeneexpressiontranscriptionnormalizationdifferentialexpressionbayesianregressionprincipalcomponentclusteringimmunooncologyopenblascpp
1.7 match 375 stars 16.11 score 17k scripts 115 dependentsharveyklyne
drape:Doubly Robust Average Partial Effects
Doubly robust average partial effect estimation. This implementation contains methods for adding additional smoothness to plug-in regression procedures and for estimating score functions using smoothing splines. Details of the method can be found in Harvey Klyne and Rajen D. Shah (2023) <doi:10.48550/arXiv.2308.09207>.
Maintained by Harvey Klyne. Last updated 4 months ago.
6.7 match 2 stars 4.00 score 4 scriptszhuwang46
irboost:Iteratively Reweighted Boosting for Robust Analysis
Fit a predictive model using iteratively reweighted boosting (IRBoost) to minimize robust loss functions within the CC-family (concave-convex). This constitutes an application of iteratively reweighted convex optimization (IRCO), where convex optimization is performed using the functional descent boosting algorithm. IRBoost assigns weights to facilitate outlier identification. Applications include robust generalized linear models and robust accelerated failure time models. Wang (2025) <doi:10.6339/24-JDS1138>.
Maintained by Zhu Wang. Last updated 1 months ago.
8.9 match 3.00 scorejdgonzalezwork
ktaucenters:Robust Clustering Procedures
A clustering algorithm similar to K-Means is implemented, it has two main advantages, namely (a) The estimator is resistant to outliers, that means that results of estimator are still correct when there are atypical values in the sample and (b) The estimator is efficient, roughly speaking, if there are no outliers in the sample, results will be similar to those obtained by a classic algorithm (K-Means). Clustering procedure is carried out by minimizing the overall robust scale so-called tau scale. (see Gonzalez, Yohai and Zamar (2019) <arxiv:1906.08198>).
Maintained by Juan Domingo Gonzalez. Last updated 1 years ago.
8.3 match 3.18 score 5 scripts 1 dependentschedgala
lqr:Robust Linear Quantile Regression
It fits a robust linear quantile regression model using a new family of zero-quantile distributions for the error term. Missing values and censored observations can be handled as well. This family of distribution includes skewed versions of the Normal, Student's t, Laplace, Slash and Contaminated Normal distribution. It also performs logistic quantile regression for bounded responses as shown in Galarza et.al.(2020) <doi:10.1007/s13571-020-00231-0>. It provides estimates and full inference. It also provides envelopes plots for assessing the fit and confidences bands when several quantiles are provided simultaneously.
Maintained by Christian E. Galarza. Last updated 8 months ago.
12.6 match 1 stars 2.08 score 9 scripts 2 dependentsstopsack
risks:Estimate Risk Ratios and Risk Differences using Regression
Risk ratios and risk differences are estimated using regression models that allow for binary, categorical, and continuous exposures and confounders. Implemented are marginal standardization after fitting logistic models (g-computation) with delta-method and bootstrap standard errors, Miettinen's case-duplication approach (Schouten et al. 1993, <doi:10.1002/sim.4780121808>), log-binomial (Poisson) models with empirical variance (Zou 2004, <doi:10.1093/aje/kwh090>), binomial models with starting values from Poisson models (Spiegelman and Hertzmark 2005, <doi:10.1093/aje/kwi188>), and others.
Maintained by Konrad Stopsack. Last updated 11 months ago.
binomialbiostatisticsepidemiologyregression-models
5.3 match 5 stars 4.95 score 12 scriptsuncertaintyquantification
RobustCalibration:Robust Calibration of Imperfect Mathematical Models
Implements full Bayesian analysis for calibrating mathematical models with new methodology for modeling the discrepancy function. It allows for emulation, calibration and prediction using complex mathematical model outputs and experimental data. See the reference: Mengyang Gu and Long Wang, 2018, Journal of Uncertainty Quantification; Mengyang Gu, Fangzheng Xie and Long Wang, 2022, Journal of Uncertainty Quantification; Mengyang Gu, Kyle Anderson and Erika McPhillips, 2023, Technometrics.
Maintained by Mengyang Gu. Last updated 10 months ago.
21.3 match 1.23 score 17 scriptsbioc
openCyto:Hierarchical Gating Pipeline for flow cytometry data
This package is designed to facilitate the automated gating methods in sequential way to mimic the manual gating strategy.
Maintained by Mike Jiang. Last updated 5 months ago.
immunooncologyflowcytometrydataimportpreprocessingdatarepresentationcpp
3.4 match 7.62 score 404 scripts 1 dependentsxiaooupan
FarmTest:Factor-Adjusted Robust Multiple Testing
Performs robust multiple testing for means in the presence of known and unknown latent factors presented in Fan et al.(2019) "FarmTest: Factor-Adjusted Robust Multiple Testing With Approximate False Discovery Control" <doi:10.1080/01621459.2018.1527700>. Implements a series of adaptive Huber methods combined with fast data-drive tuning schemes proposed in Ke et al.(2019) "User-Friendly Covariance Estimation for Heavy-Tailed Distributions" <doi:10.1214/19-STS711> to estimate model parameters and construct test statistics that are robust against heavy-tailed and/or asymmetric error distributions. Extensions to two-sample simultaneous mean comparison problems are also included. As by-products, this package contains functions that compute adaptive Huber mean, covariance and regression estimators that are of independent interest.
Maintained by Xiaoou Pan. Last updated 4 years ago.
7.5 match 4 stars 3.48 score 15 scriptsaalfons
laeken:Estimation of Indicators on Social Exclusion and Poverty
Estimation of indicators on social exclusion and poverty, as well as Pareto tail modeling for empirical income distributions.
Maintained by Andreas Alfons. Last updated 1 years ago.
2.7 match 3 stars 9.57 score 300 scripts 30 dependentsbioc
TCC:TCC: Differential expression analysis for tag count data with robust normalization strategies
This package provides a series of functions for performing differential expression analysis from RNA-seq count data using robust normalization strategy (called DEGES). The basic idea of DEGES is that potential differentially expressed genes or transcripts (DEGs) among compared samples should be removed before data normalization to obtain a well-ranked gene list where true DEGs are top-ranked and non-DEGs are bottom ranked. This can be done by performing a multi-step normalization strategy (called DEGES for DEG elimination strategy). A major characteristic of TCC is to provide the robust normalization methods for several kinds of count data (two-group with or without replicates, multi-group/multi-factor, and so on) by virtue of the use of combinations of functions in depended packages.
Maintained by Jianqiang Sun. Last updated 5 months ago.
immunooncologysequencingdifferentialexpressionrnaseq
5.2 match 4.91 score 41 scriptskenaho1
asbio:A Collection of Statistical Tools for Biologists
Contains functions from: Aho, K. (2014) Foundational and Applied Statistics for Biologists using R. CRC/Taylor and Francis, Boca Raton, FL, ISBN: 978-1-4398-7338-0.
Maintained by Ken Aho. Last updated 2 months ago.
3.5 match 5 stars 7.32 score 310 scripts 3 dependentsmandymejia
fMRIscrub:Scrubbing and Other Data Cleaning Routines for fMRI
Data-driven fMRI denoising with projection scrubbing (Pham et al (2022) <doi:10.1016/j.neuroimage.2023.119972>). Also includes routines for DVARS (Derivatives VARianceS) (Afyouni and Nichols (2018) <doi:10.1016/j.neuroimage.2017.12.098>), motion scrubbing (Power et al (2012) <doi:10.1016/j.neuroimage.2011.10.018>), aCompCor (anatomical Components Correction) (Muschelli et al (2014) <doi:10.1016/j.neuroimage.2014.03.028>), detrending, and nuisance regression. Projection scrubbing is also applicable to other outlier detection tasks involving high-dimensional data.
Maintained by Amanda Mejia. Last updated 2 years ago.
5.5 match 4 stars 4.56 score 15 scripts 1 dependents