R-universe search: needs:tensorA

stan-dev

rstan:R Interface to Stan

User-facing R functions are provided to parse, compile, test, estimate, and analyze Stan models by accessing the header-only Stan library provided by the 'StanHeaders' package. The Stan project develops a probabilistic programming language that implements full Bayesian statistical inference via Markov Chain Monte Carlo, rough Bayesian inference via 'variational' approximation, and (optionally penalized) maximum likelihood estimation via optimization. In all three cases, automatic differentiation is used to quickly and accurately evaluate gradients without burdening the user with the need to derive the partial derivatives.

Maintained by Ben Goodrich. Last updated 1 days ago.

bayesian-data-analysis bayesian-inference bayesian-statistics mcmc stan cpp

1.1k stars 18.86 score 14k scripts 281 dependents

stan-dev

loo:Efficient Leave-One-Out Cross-Validation and WAIC for Bayesian Models

Efficient approximate leave-one-out cross-validation (LOO) for Bayesian models fit using Markov chain Monte Carlo, as described in Vehtari, Gelman, and Gabry (2017) <doi:10.1007/s11222-016-9696-4>. The approximation uses Pareto smoothed importance sampling (PSIS), a new procedure for regularizing importance weights. As a byproduct of the calculations, we also obtain approximate standard errors for estimated predictive errors and for the comparison of predictive errors between models. The package also provides methods for using stacking and other model weighting techniques to average Bayesian predictive distributions.

Maintained by Jonah Gabry. Last updated 16 days ago.

bayes bayesian bayesian-data-analysis bayesian-inference bayesian-methods bayesian-statistics cross-validation information-criterion model-comparison stan

152 stars 17.30 score 2.6k scripts 297 dependents

stan-dev

bayesplot:Plotting for Bayesian Models

Plotting functions for posterior analysis, MCMC diagnostics, prior and posterior predictive checks, and other visualizations to support the applied Bayesian workflow advocated in Gabry, Simpson, Vehtari, Betancourt, and Gelman (2019) <doi:10.1111/rssa.12378>. The package is designed not only to provide convenient functionality for users, but also a common set of functions that can be easily used by developers working on a variety of R packages for Bayesian modeling, particularly (but not exclusively) packages interfacing with 'Stan'.

Maintained by Jonah Gabry. Last updated 2 months ago.

bayesian ggplot2 mcmc pandoc stan statistical-graphics visualization

436 stars 16.69 score 6.5k scripts 98 dependents

paul-buerkner

brms:Bayesian Regression Models using 'Stan'

Fit Bayesian generalized (non-)linear multivariate multilevel models using 'Stan' for full Bayesian inference. A wide range of distributions and link functions are supported, allowing users to fit -- among others -- linear, robust linear, count data, survival, response times, ordinal, zero-inflated, hurdle, and even self-defined mixture models all in a multilevel context. Further modeling options include both theory-driven and data-driven non-linear terms, auto-correlation structures, censoring and truncation, meta-analytic standard errors, and quite a few more. In addition, all parameters of the response distribution can be predicted in order to perform distributional regression. Prior specifications are flexible and explicitly encourage users to apply prior distributions that actually reflect their prior knowledge. Models can easily be evaluated and compared using several methods assessing posterior or prior predictions. References: Bürkner (2017) <doi:10.18637/jss.v080.i01>; Bürkner (2018) <doi:10.32614/RJ-2018-017>; Bürkner (2021) <doi:10.18637/jss.v100.i05>; Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>.

Maintained by Paul-Christian Bürkner. Last updated 3 days ago.

bayesian-inference brms multilevel-models stan statistical-models

1.3k stars 16.64 score 13k scripts 35 dependents

stan-dev

posterior:Tools for Working with Posterior Distributions

Provides useful tools for both users and developers of packages for fitting Bayesian models or working with output from Bayesian models. The primary goals of the package are to: (a) Efficiently convert between many different useful formats of draws (samples) from posterior or prior distributions. (b) Provide consistent methods for operations commonly performed on draws, for example, subsetting, binding, or mutating draws. (c) Provide various summaries of draws in convenient formats. (d) Provide lightweight implementations of state of the art posterior inference diagnostics. References: Vehtari et al. (2021) <doi:10.1214/20-BA1221>.

Maintained by Paul-Christian Bürkner. Last updated 1 days ago.

bayes bayesian mcmc

168 stars 16.21 score 3.3k scripts 346 dependents

stan-dev

rstanarm:Bayesian Applied Regression Modeling via Stan

Estimates previously compiled regression models using the 'rstan' package, which provides the R interface to the Stan C++ library for Bayesian estimation. Users specify models via the customary R syntax with a formula and data.frame plus some additional arguments for priors.

Maintained by Ben Goodrich. Last updated 10 days ago.

bayesian bayesian-data-analysis bayesian-inference bayesian-methods bayesian-statistics multilevel-models rstan rstanarm stan statistical-modeling cpp

393 stars 15.70 score 5.0k scripts 13 dependents

facebook

prophet:Automatic Forecasting Procedure

Implements a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. Prophet is robust to missing data and shifts in the trend, and typically handles outliers well.

Maintained by Sean Taylor. Last updated 5 months ago.

forecasting python cpp

19k stars 15.59 score 976 scripts 13 dependents

mjskay

tidybayes:Tidy Data and 'Geoms' for Bayesian Models

Compose data for and extract, manipulate, and visualize posterior draws from Bayesian models ('JAGS', 'Stan', 'rstanarm', 'brms', 'MCMCglmm', 'coda', ...) in a tidy data format. Functions are provided to help extract tidy data frames of draws from Bayesian models and that generate point summaries and intervals in a tidy format. In addition, 'ggplot2' 'geoms' and 'stats' are provided for common visualization primitives like points with multiple uncertainty intervals, eye plots (intervals plus densities), and fit curves with multiple, arbitrary uncertainty bands.

Maintained by Matthew Kay. Last updated 6 months ago.

bayesian-data-analysis brms ggplot2 jags stan tidy-data visualization

733 stars 14.72 score 7.3k scripts 20 dependents

stan-dev

shinystan:Interactive Visual and Numerical Diagnostics and Posterior Analysis for Bayesian Models

A graphical user interface for interactive Markov chain Monte Carlo (MCMC) diagnostics and plots and tables helpful for analyzing a posterior sample. The interface is powered by the 'Shiny' web application framework from 'RStudio' and works with the output of MCMC programs written in any programming language (and has extended functionality for 'Stan' models fit using the 'rstan' and 'rstanarm' packages).

Maintained by Jonah Gabry. Last updated 3 years ago.

bayesian bayesian-data-analysis bayesian-inference bayesian-methods bayesian-statistics mcmc shiny-apps stan statistical-graphics

200 stars 13.13 score 1.6k scripts 15 dependents

bioc

microbiome:Microbiome Analytics

Utilities for microbiome analysis.

Maintained by Leo Lahti. Last updated 5 months ago.

metagenomics microbiome sequencing systemsbiology hitchip hitchip-atlas human-microbiome microbiology microbiome-analysis phyloseq population-study

293 stars 12.51 score 2.0k scripts 5 dependents

stan-dev

cmdstanr:R Interface to 'CmdStan'

A lightweight interface to 'Stan' <https://mc-stan.org>. The 'CmdStanR' interface is an alternative to 'RStan' that calls the command line interface for compilation and running algorithms instead of interfacing with C++ via 'Rcpp'. This has many benefits including always being compatible with the latest version of Stan, fewer installation errors, fewer unexpected crashes in RStudio, and a more permissive license.

Maintained by Andrew Johnson. Last updated 10 months ago.

bayes bayesian markov-chain-monte-carlo maximum-likelihood mcmc stan variational-inference

145 stars 12.27 score 5.2k scripts 9 dependents

epiforecasts

EpiNow2:Estimate Real-Time Case Counts and Time-Varying Epidemiological Parameters

Estimates the time-varying reproduction number, rate of spread, and doubling time using a range of open-source tools (Abbott et al. (2020) <doi:10.12688/wellcomeopenres.16006.1>), and current best practices (Gostic et al. (2020) <doi:10.1101/2020.06.18.20134858>). It aims to help users avoid some of the limitations of naive implementations in a framework that is informed by community feedback and is actively supported.

Maintained by Sebastian Funk. Last updated 1 months ago.

backcalculation covid-19 gaussian-processes open-source reproduction-number stan cpp

123 stars 11.86 score 210 scripts

ecmerkle

blavaan:Bayesian Latent Variable Analysis

Fit a variety of Bayesian latent variable models, including confirmatory factor analysis, structural equation models, and latent growth curve models. References: Merkle & Rosseel (2018) <doi:10.18637/jss.v085.i04>; Merkle et al. (2021) <doi:10.18637/jss.v100.i06>.

Maintained by Edgar Merkle. Last updated 10 days ago.

bayesian-statistics factor-analysis growth-curve-models latent-variables missing-data multilevel-models multivariate-analysis path-analysis psychometrics statistical-modeling structural-equation-modeling cpp

92 stars 10.84 score 183 scripts 3 dependents

cjvanlissa

tidySEM:Tidy Structural Equation Modeling

A tidy workflow for generating, estimating, reporting, and plotting structural equation models using 'lavaan', 'OpenMx', or 'Mplus'. Throughout this workflow, elements of syntax, results, and graphs are represented as 'tidy' data, making them easy to customize. Includes functionality to estimate latent class analyses, and to plot 'dagitty' and 'igraph' objects.

Maintained by Caspar J. van Lissa. Last updated 20 days ago.

58 stars 10.69 score 330 scripts 1 dependents

business-science

modeltime:The Tidymodels Extension for Time Series Modeling

The time series forecasting framework for use with the 'tidymodels' ecosystem. Models include ARIMA, Exponential Smoothing, and additional time series models from the 'forecast' and 'prophet' packages. Refer to "Forecasting Principles & Practice, Second edition" (<https://otexts.com/fpp2/>). Refer to "Prophet: forecasting at scale" (<https://research.facebook.com/blog/2017/02/prophet-forecasting-at-scale/>.).

Maintained by Matt Dancho. Last updated 5 months ago.

arima data-science deep-learning ets forecasting machine-learning machine-learning-algorithms modeltime prophet tbats tidymodeling tidymodels time time-series time-series-analysis timeseries timeseries-forecasting

551 stars 10.61 score 1.1k scripts 7 dependents

caseyyoungflesh

MCMCvis:Tools to Visualize, Manipulate, and Summarize MCMC Output

Performs key functions for MCMC analysis using minimal code - visualizes, manipulates, and summarizes MCMC output. Functions support simple and straightforward subsetting of model parameters within the calls, and produce presentable and 'publication-ready' output. MCMC output may be derived from Bayesian model output fit with Stan, NIMBLE, JAGS, and other software.

Maintained by Casey Youngflesh. Last updated 4 months ago.

38 stars 10.52 score 1.8k scripts 5 dependents

facebookexperimental

Robyn:Semi-Automated Marketing Mix Modeling (MMM) from Meta Marketing Science

Semi-Automated Marketing Mix Modeling (MMM) aiming to reduce human bias by means of ridge regression and evolutionary algorithms, enables actionable decision making providing a budget allocation and diminishing returns curves and allows ground-truth calibration to account for causation.

Maintained by Gufeng Zhou. Last updated 11 days ago.

adstocking budget-allocation cost-response-curve econometrics evolutionary-algorithm gradient-based-optimisation hyperparameter-optimization marketing-mix-modeling marketing-mix-modelling marketing-science mmm ridge-regression

1.3k stars 10.27 score 95 scripts

bioc

singleCellTK:Comprehensive and Interactive Analysis of Single Cell RNA-Seq Data

The Single Cell Toolkit (SCTK) in the singleCellTK package provides an interface to popular tools for importing, quality control, analysis, and visualization of single cell RNA-seq data. SCTK allows users to seamlessly integrate tools from various packages at different stages of the analysis workflow. A general "a la carte" workflow gives users the ability access to multiple methods for data importing, calculation of general QC metrics, doublet detection, ambient RNA estimation and removal, filtering, normalization, batch correction or integration, dimensionality reduction, 2-D embedding, clustering, marker detection, differential expression, cell type labeling, pathway analysis, and data exporting. Curated workflows can be used to run Seurat and Celda. Streamlined quality control can be performed on the command line using the SCTK-QC pipeline. Users can analyze their data using commands in the R console or by using an interactive Shiny Graphical User Interface (GUI). Specific analyses or entire workflows can be summarized and shared with comprehensive HTML reports generated by Rmarkdown. Additional documentation and vignettes can be found at camplab.net/sctk.

Maintained by Joshua David Campbell. Last updated 1 months ago.

singlecell geneexpression differentialexpression alignment clustering immunooncology batcheffect normalization qualitycontrol dataimport gui

182 stars 10.17 score 252 scripts

bioc

BASiCS:Bayesian Analysis of Single-Cell Sequencing data

Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model to perform statistical analyses of single-cell RNA sequencing datasets in the context of supervised experiments (where the groups of cells of interest are known a priori, e.g. experimental conditions or cell types). BASiCS performs built-in data normalisation (global scaling) and technical noise quantification (based on spike-in genes). BASiCS provides an intuitive detection criterion for highly (or lowly) variable genes within a single group of cells. Additionally, BASiCS can compare gene expression patterns between two or more pre-specified groups of cells. Unlike traditional differential expression tools, BASiCS quantifies changes in expression that lie beyond comparisons of means, also allowing the study of changes in cell-to-cell heterogeneity. The latter can be quantified via a biological over-dispersion parameter that measures the excess of variability that is observed with respect to Poisson sampling noise, after normalisation and technical noise removal. Due to the strong mean/over-dispersion confounding that is typically observed for scRNA-seq datasets, BASiCS also tests for changes in residual over-dispersion, defined by residual values with respect to a global mean/over-dispersion trend.

Maintained by Catalina Vallejos. Last updated 5 months ago.

immunooncology normalization sequencing rnaseq software geneexpression transcriptomics singlecell differentialexpression bayesian cellbiology bioconductor-package gene-expression rcpp rcpparmadillo scrna-seq single-cell openblas cpp openmp

83 stars 10.14 score 368 scripts 1 dependents

stan-dev

projpred:Projection Predictive Feature Selection

Performs projection predictive feature selection for generalized linear models (Piironen, Paasiniemi, and Vehtari, 2020, <doi:10.1214/20-EJS1711>) with or without multilevel or additive terms (Catalina, Bürkner, and Vehtari, 2022, <https://proceedings.mlr.press/v151/catalina22a.html>), for some ordinal and nominal regression models (Weber, Glass, and Vehtari, 2023, <arXiv:2301.01660>), and for many other regression models (using the latent projection by Catalina, Bürkner, and Vehtari, 2021, <arXiv:2109.04702>, which can also be applied to most of the former models). The package is compatible with the 'rstanarm' and 'brms' packages, but other reference models can also be used. See the vignettes and the documentation for more information and examples.

Maintained by Frank Weber. Last updated 1 days ago.

bayes bayesian bayesian-inference rstanarm stan statistics variable-selection openblas cpp

112 stars 10.12 score 241 scripts

nicholasjclark

mvgam:Multivariate (Dynamic) Generalized Additive Models

Fit Bayesian Dynamic Generalized Additive Models to multivariate observations. Users can build nonlinear State-Space models that can incorporate semiparametric effects in observation and process components, using a wide range of observation families. Estimation is performed using Markov Chain Monte Carlo with Hamiltonian Monte Carlo in the software 'Stan'. References: Clark & Wells (2023) <doi:10.1111/2041-210X.13974>.

Maintained by Nicholas J Clark. Last updated 0 hours ago.

bayesian-statistics dynamic-factor-models ecological-modelling forecasting gaussian-process generalised-additive-models generalized-additive-models joint-species-distribution-modelling multilevel-models multivariate-timeseries stan time-series-analysis timeseries vector-autoregression vectorautoregression cpp

148 stars 9.92 score 117 scripts

n8thangreen

BCEA:Bayesian Cost Effectiveness Analysis

Produces an economic evaluation of a sample of suitable variables of cost and effectiveness / utility for two or more interventions, e.g. from a Bayesian model in the form of MCMC simulations. This package computes the most cost-effective alternative and produces graphical summaries and probabilistic sensitivity analysis, see Baio et al (2017) <doi:10.1007/978-3-319-55718-2>.

Maintained by Gianluca Baio. Last updated 2 months ago.

bayesian cost-effectiveness

3 stars 9.90 score 243 scripts 3 dependents

donaldrwilliams

BGGM:Bayesian Gaussian Graphical Models

Fit Bayesian Gaussian graphical models. The methods are separated into two Bayesian approaches for inference: hypothesis testing and estimation. There are extensions for confirmatory hypothesis testing, comparing Gaussian graphical models, and node wise predictability. These methods were recently introduced in the Gaussian graphical model literature, including Williams (2019) <doi:10.31234/osf.io/x8dpr>, Williams and Mulder (2019) <doi:10.31234/osf.io/ypxd8>, Williams, Rast, Pericchi, and Mulder (2019) <doi:10.31234/osf.io/yt386>.

Maintained by Philippe Rast. Last updated 3 months ago.

bayes-factors bayesian-hypothesis-testing gaussian-graphical-models openblas cpp openmp

55 stars 9.61 score 102 scripts 1 dependents

cdriveraus

ctsem:Continuous Time Structural Equation Modelling

Hierarchical continuous (and discrete) time state space modelling, for linear and nonlinear systems measured by continuous variables, with limited support for binary data. The subject specific dynamic system is modelled as a stochastic differential equation (SDE) or difference equation, measurement models are typically multivariate normal factor models. Linear mixed effects SDE's estimated via maximum likelihood and optimization are the default. Nonlinearities, (state dependent parameters) and random effects on all parameters are possible, using either max likelihood / max a posteriori optimization (with optional importance sampling) or Stan's Hamiltonian Monte Carlo sampling. See <https://github.com/cdriveraus/ctsem/raw/master/vignettes/hierarchicalmanual.pdf> for details. Priors may be used. For the conceptual overview of the hierarchical Bayesian linear SDE approach, see <https://www.researchgate.net/publication/324093594_Hierarchical_Bayesian_Continuous_Time_Dynamic_Modeling>. Exogenous inputs may also be included, for an overview of such possibilities see <https://www.researchgate.net/publication/328221807_Understanding_the_Time_Course_of_Interventions_with_Continuous_Time_Dynamic_Models> . Stan based functions are not available on 32 bit Windows systems at present. <https://cdriver.netlify.app/> contains some tutorial blog posts.

Maintained by Charles Driver. Last updated 25 days ago.

stochastic-differential-equations time-series cpp

42 stars 9.58 score 366 scripts 1 dependents

bioc

scMerge:scMerge: Merging multiple batches of scRNA-seq data

Like all gene expression data, single-cell data suffers from batch effects and other unwanted variations that makes accurate biological interpretations difficult. The scMerge method leverages factor analysis, stably expressed genes (SEGs) and (pseudo-) replicates to remove unwanted variations and merge multiple single-cell data. This package contains all the necessary functions in the scMerge pipeline, including the identification of SEGs, replication-identification methods, and merging of single-cell data.

Maintained by Yingxin Lin. Last updated 5 months ago.

batcheffect geneexpression normalization rnaseq sequencing singlecell software transcriptomics bioinformatics single-cell

67 stars 9.52 score 137 scripts 1 dependents

dmphillippo

multinma:Bayesian Network Meta-Analysis of Individual and Aggregate Data

Network meta-analysis and network meta-regression models for aggregate data, individual patient data, and mixtures of both individual and aggregate data using multilevel network meta-regression as described by Phillippo et al. (2020) <doi:10.1111/rssa.12579>. Models are estimated in a Bayesian framework using 'Stan'.

Maintained by David M. Phillippo. Last updated 3 days ago.

statistics cpp

35 stars 9.34 score 163 scripts

microsoft

finnts:Microsoft Finance Time Series Forecasting Framework

Automated time series forecasting developed by Microsoft Finance. The Microsoft Finance Time Series Forecasting Framework, aka Finn, can be used to forecast any component of the income statement, balance sheet, or any other area of interest by finance. Any numerical quantity over time, Finn can be used to forecast it. While it can be applied outside of the finance domain, Finn was built to meet the needs of financial analysts to better forecast their businesses within a company, and has a lot of built in features that are specific to the needs of financial forecasters. Happy forecasting!

Maintained by Mike Tokic. Last updated 1 months ago.

business data-science feature-selection finance finnts forecasting machine-learning microsoft time-series

194 stars 9.30 score 39 scripts

brianstock

MixSIAR:Bayesian Mixing Models in R

Creates and runs Bayesian mixing models to analyze biological tracer data (i.e. stable isotopes, fatty acids), which estimate the proportions of source (prey) contributions to a mixture (consumer). 'MixSIAR' is not one model, but a framework that allows a user to create a mixing model based on their data structure and research questions, via options for fixed/ random effects, source data types, priors, and error terms. 'MixSIAR' incorporates several years of advances since 'MixSIR' and 'SIAR'.

Maintained by Brian Stock. Last updated 4 years ago.

jags cpp

98 stars 9.27 score 122 scripts

integrated-inferences

CausalQueries:Make, Update, and Query Binary Causal Models

Users can declare causal models over binary nodes, update beliefs about causal types given data, and calculate arbitrary queries. Updating is implemented in 'stan'. See Humphreys and Jacobs, 2023, Integrated Inferences (<DOI: 10.1017/9781316718636>) and Pearl, 2009 Causality (<DOI:10.1017/CBO9780511803161>).

Maintained by Till Tietz. Last updated 1 months ago.

bayes causal dags mixedmethods stan cpp

28 stars 9.02 score 54 scripts

bioc

scone:Single Cell Overview of Normalized Expression data

SCONE is an R package for comparing and ranking the performance of different normalization schemes for single-cell RNA-seq and other high-throughput analyses.

Maintained by Davide Risso. Last updated 1 months ago.

immunooncology normalization preprocessing qualitycontrol geneexpression rnaseq software transcriptomics sequencing singlecell coverage

53 stars 9.00 score 104 scripts

ropensci

nlrx:Setup, Run and Analyze 'NetLogo' Model Simulations from 'R' via 'XML'

Setup, run and analyze 'NetLogo' (<https://ccl.northwestern.edu/netlogo/>) model simulations in 'R'. 'nlrx' experiments use a similar structure as 'NetLogos' Behavior Space experiments. However, 'nlrx' offers more flexibility and additional tools for running and analyzing complex simulation designs and sensitivity analyses. The user defines all information that is needed in an intuitive framework, using class objects. Experiments are submitted from 'R' to 'NetLogo' via 'XML' files that are dynamically written, based on specifications defined by the user. By nesting model calls in future environments, large simulation design with many runs can be executed in parallel. This also enables simulating 'NetLogo' experiments on remote high performance computing machines. In order to use this package, 'Java' and 'NetLogo' (>= 5.3.1) need to be available on the executing system.

Maintained by Sebastian Hanss. Last updated 7 months ago.

agent-based-modeling individual-based-modelling netlogo peer-reviewed

78 stars 8.86 score 195 scripts

jarrodhadfield

MCMCglmm:MCMC Generalised Linear Mixed Models

Fits Multivariate Generalised Linear Mixed Models (and related models) using Markov chain Monte Carlo techniques (Hadfield 2010 J. Stat. Soft.).

Maintained by Jarrod Hadfield. Last updated 3 months ago.

cpp

2 stars 8.83 score 1.2k scripts 14 dependents

connordonegan

geostan:Bayesian Spatial Analysis

For spatial data analysis; provides exploratory spatial analysis tools, spatial regression, spatial econometric, and disease mapping models, model diagnostics, and special methods for inference with small area survey data (e.g., the America Community Survey (ACS)) and censored population health monitoring data. Models are pre-specified using the Stan programming language, a platform for Bayesian inference using Markov chain Monte Carlo (MCMC). References: Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>; Donegan (2021) <doi:10.31219/osf.io/3ey65>; Donegan (2022) <doi:10.21105/joss.04716>; Donegan, Chun and Hughes (2020) <doi:10.1016/j.spasta.2020.100450>; Donegan, Chun and Griffith (2021) <doi:10.3390/ijerph18136856>; Morris et al. (2019) <doi:10.1016/j.sste.2019.100301>.

Maintained by Connor Donegan. Last updated 3 months ago.

bayesian bayesian-inference bayesian-statistics epidemiology modeling public-health rspatial spatial stan cpp

80 stars 8.80 score 46 scripts

openpharma

brms.mmrm:Bayesian MMRMs using 'brms'

The mixed model for repeated measures (MMRM) is a popular model for longitudinal clinical trial data with continuous endpoints, and 'brms' is a powerful and versatile package for fitting Bayesian regression models. The 'brms.mmrm' R package leverages 'brms' to run MMRMs, and it supports a simplified interfaced to reduce difficulty and align with the best practices of the life sciences. References: Bürkner (2017) <doi:10.18637/jss.v080.i01>, Mallinckrodt (2008) <doi:10.1177/009286150804200402>.

Maintained by William Michael Landau. Last updated 6 months ago.

brms life-sciences mc-stan mmrm stan statistics

21 stars 8.80 score 13 scripts

data-edu

tidyLPA:Easily Carry Out Latent Profile Analysis (LPA) Using Open-Source or Commercial Software

Easily carry out latent profile analysis ("LPA"), determine the correct number of classes based on best practices, and tabulate and plot the results. Provides functionality to estimate commonly-specified models with free means, variances, and covariances for each profile. Follows a tidy approach, in that output is in the form of a data frame that can subsequently be computed on. Models can be estimated using the free open source 'R' packages 'Mclust' and 'OpenMx', or using the commercial program 'MPlus', via the 'MplusAutomation' package.

Maintained by Joshua M Rosenberg. Last updated 1 years ago.

58 stars 8.76 score 121 scripts

ccs-lab

hBayesDM:Hierarchical Bayesian Modeling of Decision-Making Tasks

Fit an array of decision-making tasks with computational models in a hierarchical Bayesian framework. Can perform hierarchical Bayesian analysis of various computational models with a single line of coding (Ahn et al., 2017) <doi:10.1162/CPSY_a_00002>.

Maintained by Woo-Young Ahn. Last updated 11 months ago.

bayesian computational decision-making hierarchical-bayesian-analysis modeling reinforcement-learning

237 stars 8.71 score 270 scripts

brockk

trialr:Clinical Trial Designs in 'rstan'

A collection of clinical trial designs and methods, implemented in 'rstan' and R, including: the Continual Reassessment Method by O'Quigley et al. (1990) <doi:10.2307/2531628>; EffTox by Thall & Cook (2004) <doi:10.1111/j.0006-341X.2004.00218.x>; the two-parameter logistic method of Neuenschwander, Branson & Sponer (2008) <doi:10.1002/sim.3230>; and the Augmented Binary method by Wason & Seaman (2013) <doi:10.1002/sim.5867>; and more. We provide functions to aid model-fitting and analysis. The 'rstan' implementations may also serve as a cookbook to anyone looking to extend or embellish these models. We hope that this package encourages the use of Bayesian methods in clinical trials. There is a preponderance of early phase trial designs because this is where Bayesian methods are used most. If there is a method you would like implemented, please get in touch.

Maintained by Kristian Brock. Last updated 1 years ago.

cpp

42 stars 8.56 score 106 scripts 3 dependents

bioc

M3Drop:Michaelis-Menten Modelling of Dropouts in single-cell RNASeq

This package fits a model to the pattern of dropouts in single-cell RNASeq data. This model is used as a null to identify significantly variable (i.e. differentially expressed) genes for use in downstream analysis, such as clustering cells. Also includes an method for calculating exact Pearson residuals in UMI-tagged data using a library-size aware negative binomial model.

Maintained by Tallulah Andrews. Last updated 5 months ago.

rnaseq sequencing transcriptomics geneexpression software differentialexpression dimensionreduction featureextraction human-cell-atlas rna-seq single-cell single-cell-rna-seq

29 stars 8.53 score 119 scripts 2 dependents

tidymodels

tidyposterior:Bayesian Analysis to Compare Models using Resampling Statistics

Bayesian analysis used here to answer the question: "when looking at resampling results, are the differences between models 'real'?" To answer this, a model can be created were the performance statistic is the resampling statistics (e.g. accuracy or RMSE). These values are explained by the model types. In doing this, we can get parameter estimates for each model's affect on performance and make statistical (and practical) comparisons between models. The methods included here are similar to Benavoli et al (2017) <https://jmlr.org/papers/v18/16-305.html>.

Maintained by Max Kuhn. Last updated 5 months ago.

102 stars 8.44 score 273 scripts

gbradburd

conStruct:Models Spatially Continuous and Discrete Population Genetic Structure

A method for modeling genetic data as a combination of discrete layers, within each of which relatedness may decay continuously with geographic distance. This package contains code for running analyses (which are implemented in the modeling language 'rstan') and visualizing and interpreting output. See the paper for more details on the model and its utility.

Maintained by Gideon Bradburd. Last updated 1 years ago.

cpp

35 stars 8.39 score 70 scripts

florale

multilevelcoda:Estimate Bayesian Multilevel Models for Compositional Data

Implement Bayesian Multilevel Modelling for compositional data in a multilevel framework. Compute multilevel compositional data and Isometric log ratio (ILR) at between and within-person levels, fit Bayesian multilevel models for compositional predictors and outcomes, and run post-hoc analyses such as isotemporal substitution models. References: Le, Stanford, Dumuid, and Wiley (2024) <doi:10.48550/arXiv.2405.03985>, Le, Dumuid, Stanford, and Wiley (2024) <doi:10.48550/arXiv.2411.12407>.

Maintained by Flora Le. Last updated 3 days ago.

bayesian-inference compositional-data-analysis multilevel-models multilevelcoda

15 stars 8.34 score 118 scripts

jsilve24

fido:Bayesian Multinomial Logistic Normal Regression

Provides methods for fitting and inspection of Bayesian Multinomial Logistic Normal Models using MAP estimation and Laplace Approximation as developed in Silverman et. Al. (2022) <https://www.jmlr.org/papers/v23/19-882.html>. Key functionality is implemented in C++ for scalability. 'fido' replaces the previous package 'stray'.

Maintained by Justin Silverman. Last updated 1 months ago.

cpp openmp

20 stars 8.31 score 103 scripts

business-science

modeltime.ensemble:Ensemble Algorithms for Time Series Forecasting with Modeltime

A 'modeltime' extension that implements time series ensemble forecasting methods including model averaging, weighted averaging, and stacking. These techniques are popular methods to improve forecast accuracy and stability.

Maintained by Matt Dancho. Last updated 8 months ago.

ensemble ensemble-learning forecast forecasting modeltime stacking stacking-ensemble tidymodels time time-series timeseries

77 stars 8.30 score 143 scripts

n-kall

priorsense:Prior Diagnostics and Sensitivity Analysis

Provides functions for prior and likelihood sensitivity analysis in Bayesian models. Currently it implements methods to determine the sensitivity of the posterior to power-scaling perturbations of the prior and likelihood.

Maintained by Noa Kallioinen. Last updated 26 days ago.

bayes bayesian bayesian-data-analysis bayesian-methods prior-distribution sensitivity-analysis stan

59 stars 8.27 score 70 scripts

fate-ewi

bayesdfa:Bayesian Dynamic Factor Analysis (DFA) with 'Stan'

Implements Bayesian dynamic factor analysis with 'Stan'. Dynamic factor analysis is a dimension reduction tool for multivariate time series. 'bayesdfa' extends conventional dynamic factor models in several ways. First, extreme events may be estimated in the latent trend by modeling process error with a student-t distribution. Second, alternative constraints (including proportions are allowed). Third, the estimated dynamic factors can be analyzed with hidden Markov models to evaluate support for latent regimes.

Maintained by Eric J. Ward. Last updated 13 days ago.

cpp

28 stars 8.27 score 101 scripts

jomulder

BFpack:Flexible Bayes Factor Testing of Scientific Expectations

Implementation of default Bayes factors for testing statistical hypotheses under various statistical models. The package is intended for applied quantitative researchers in the social and behavioral sciences, medical research, and related fields. The Bayes factor tests can be executed for statistical models such as univariate and multivariate normal linear models, correlation analysis, generalized linear models, special cases of linear mixed models, survival models, relational event models. Parameters that can be tested are location parameters (e.g., group means, regression coefficients), variances (e.g., group variances), and measures of association (e.g,. polychoric/polyserial/biserial/tetrachoric/product moments correlations), among others. The statistical underpinnings are described in O'Hagan (1995) <DOI:10.1111/j.2517-6161.1995.tb02017.x>, De Santis and Spezzaferri (2001) <DOI:10.1016/S0378-3758(00)00240-8>, Mulder and Xin (2022) <DOI:10.1080/00273171.2021.1904809>, Mulder and Gelissen (2019) <DOI:10.1080/02664763.2021.1992360>, Mulder (2016) <DOI:10.1016/j.jmp.2014.09.004>, Mulder and Fox (2019) <DOI:10.1214/18-BA1115>, Mulder and Fox (2013) <DOI:10.1007/s11222-011-9295-3>, Boeing-Messing, van Assen, Hofman, Hoijtink, and Mulder (2017) <DOI:10.1037/met0000116>, Hoijtink, Mulder, van Lissa, and Gu (2018) <DOI:10.1037/met0000201>, Gu, Mulder, and Hoijtink (2018) <DOI:10.1111/bmsp.12110>, Hoijtink, Gu, and Mulder (2018) <DOI:10.1111/bmsp.12145>, and Hoijtink, Gu, Mulder, and Rosseel (2018) <DOI:10.1037/met0000187>. When using the packages, please refer to the package Mulder et al. (2021) <DOI:10.18637/jss.v100.i18> and the relevant methodological papers.

Maintained by Joris Mulder. Last updated 2 months ago.

fortran openblas

15 stars 8.24 score 55 scripts 3 dependents

brockk

escalation:A Modular Approach to Dose-Finding Clinical Trials

Methods for working with dose-finding clinical trials. We provide implementations of many dose-finding clinical trial designs, including the continual reassessment method (CRM) by O'Quigley et al. (1990) <doi:10.2307/2531628>, the toxicity probability interval (TPI) design by Ji et al. (2007) <doi:10.1177/1740774507079442>, the modified TPI (mTPI) design by Ji et al. (2010) <doi:10.1177/1740774510382799>, the Bayesian optimal interval design (BOIN) by Liu & Yuan (2015) <doi:10.1111/rssc.12089>, EffTox by Thall & Cook (2004) <doi:10.1111/j.0006-341X.2004.00218.x>; the design of Wages & Tait (2015) <doi:10.1080/10543406.2014.920873>, and the 3+3 described by Korn et al. (1994) <doi:10.1002/sim.4780131802>. All designs are implemented with a common interface. We also offer optional additional classes to tailor the behaviour of all designs, including avoiding skipping doses, stopping after n patients have been treated at the recommended dose, stopping when a toxicity condition is met, or demanding that n patients are treated before stopping is allowed. By daisy-chaining together these classes using the pipe operator from 'magrittr', it is simple to tailor the behaviour of a dose-finding design so it behaves how the trialist wants. Having provided a flexible interface for specifying designs, we then provide functions to run simulations and calculate dose-paths for future cohorts of patients.

Maintained by Kristian Brock. Last updated 1 days ago.

15 stars 8.16 score 67 scripts

open-aims

bayesnec:A Bayesian No-Effect- Concentration (NEC) Algorithm

Implementation of No-Effect-Concentration estimation that uses 'brms' (see Burkner (2017)<doi:10.18637/jss.v080.i01>; Burkner (2018)<doi:10.32614/RJ-2018-017>; Carpenter 'et al.' (2017)<doi:10.18637/jss.v076.i01> to fit concentration(dose)-response data using Bayesian methods for the purpose of estimating 'ECx' values, but more particularly 'NEC' (see Fox (2010)<doi:10.1016/j.ecoenv.2009.09.012>), 'NSEC' (see Fisher and Fox (2023)<doi:10.1002/etc.5610>), and 'N(S)EC (see Fisher et al. 2023<doi:10.1002/ieam.4809>). A full description of this package can be found in Fisher 'et al.' (2024)<doi:10.18637/jss.v110.i05>. This package expands and supersedes an original version implemented in 'R2jags' (see Su and Yajima (2020)<https://CRAN.R-project.org/package=R2jags>; Fisher et al. (2020)<doi:10.5281/ZENODO.3966864>).

Maintained by Rebecca Fisher. Last updated 7 months ago.

bayesian-inference concentration-response ecotoxicology no-effect-concentration non-linear-decay threshold-derivation toxicology

13 stars 8.15 score 360 scripts

bayes-rules

bayesrules:Datasets and Supplemental Functions from Bayes Rules! Book

Provides datasets and functions used for analysis and visualizations in the Bayes Rules! book (<https://www.bayesrulesbook.com>). The package contains a set of functions that summarize and plot Bayesian models from some conjugate families and another set of functions for evaluation of some Bayesian models.

Maintained by Mine Dogucu. Last updated 3 years ago.

bayesian-statistics data

72 stars 8.06 score 466 scripts

bioc

netZooR:Unified methods for the inference and analysis of gene regulatory networks

netZooR unifies the implementations of several Network Zoo methods (netzoo, netzoo.github.io) into a single package by creating interfaces between network inference and network analysis methods. Currently, the package has 3 methods for network inference including PANDA and its optimized implementation OTTER (network reconstruction using mutliple lines of biological evidence), LIONESS (single-sample network inference), and EGRET (genotype-specific networks). Network analysis methods include CONDOR (community detection), ALPACA (differential community detection), CRANE (significance estimation of differential modules), MONSTER (estimation of network transition states). In addition, YARN allows to process gene expresssion data for tissue-specific analyses and SAMBAR infers missing mutation data based on pathway information.

Maintained by Tara Eicher. Last updated 11 days ago.

networkinference network generegulation geneexpression transcription microarray graphandnetwork gene-regulatory-network transcription-factors

105 stars 7.98 score

novartis

RBesT:R Bayesian Evidence Synthesis Tools

Tool-set to support Bayesian evidence synthesis. This includes meta-analysis, (robust) prior derivation from historical data, operating characteristics and analysis (1 and 2 sample cases). Please refer to Weber et al. (2021) <doi:10.18637/jss.v100.i19> for details on applying this package while Neuenschwander et al. (2010) <doi:10.1177/1740774509356002> and Schmidli et al. (2014) <doi:10.1111/biom.12242> explain details on the methodology.

Maintained by Sebastian Weber. Last updated 2 months ago.

bayesian clinical historical-data meta-analysis cpp

23 stars 7.94 score 115 scripts 5 dependents

ropensci

dynamite:Bayesian Modeling and Causal Inference for Multivariate Longitudinal Data

Easy-to-use and efficient interface for Bayesian inference of complex panel (time series) data using dynamic multivariate panel models by Helske and Tikka (2024) <doi:10.1016/j.alcr.2024.100617>. The package supports joint modeling of multiple measurements per individual, time-varying and time-invariant effects, and a wide range of discrete and continuous distributions. Estimation of these dynamic multivariate panel models is carried out via 'Stan'. For an in-depth tutorial of the package, see (Tikka and Helske, 2024) <doi:10.48550/arXiv.2302.01607>.

Maintained by Santtu Tikka. Last updated 4 days ago.

bayesian-inference panel-data stan statistical-models

29 stars 7.90 score 20 scripts

biodiverse

ubms:Bayesian Models for Data from Unmarked Animals using 'Stan'

Fit Bayesian hierarchical models of animal abundance and occurrence via the 'rstan' package, the R interface to the 'Stan' C++ library. Supported models include single-season occupancy, dynamic occupancy, and N-mixture abundance models. Covariates on model parameters are specified using a formula-based interface similar to package 'unmarked', while also allowing for estimation of random slope and intercept terms. References: Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>; Fiske and Chandler (2011) <doi:10.18637/jss.v043.i10>.

Maintained by Ken Kellner. Last updated 1 months ago.

distance-sampling hierarchical-models n-mixture-model occupancy stan openblas cpp

36 stars 7.90 score 73 scripts

genentech

psborrow2:Bayesian Dynamic Borrowing Analysis and Simulation

Bayesian dynamic borrowing is an approach to incorporating external data to supplement a randomized, controlled trial analysis in which external data are incorporated in a dynamic way (e.g., based on similarity of outcomes); see Viele 2013 <doi:10.1002/pst.1589> for an overview. This package implements the hierarchical commensurate prior approach to dynamic borrowing as described in Hobbes 2011 <doi:10.1111/j.1541-0420.2011.01564.x>. There are three main functionalities. First, 'psborrow2' provides a user-friendly interface for applying dynamic borrowing on the study results handles the Markov Chain Monte Carlo sampling on behalf of the user. Second, 'psborrow2' provides a simulation framework to compare different borrowing parameters (e.g. full borrowing, no borrowing, dynamic borrowing) and other trial and borrowing characteristics (e.g. sample size, covariates) in a unified way. Third, 'psborrow2' provides a set of functions to generate data for simulation studies, and also allows the user to specify their own data generation process. This package is designed to use the sampling functions from 'cmdstanr' which can be installed from <https://stan-dev.r-universe.dev>.

Maintained by Matt Secrest. Last updated 2 months ago.

bayesian-dynamic-borrowing psborrow2 simulation-study

18 stars 7.87 score 16 scripts

epinowcast

epinowcast:Flexible Hierarchical Nowcasting

Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.

Maintained by Sam Abbott. Last updated 12 months ago.

cmdstanr effective-reproduction-number-estimation epidemiology infectious-disease-surveillance nowcasting outbreak-analysis pandemic-preparedness real-time-infectious-disease-modelling stan

61 stars 7.79 score 71 scripts

danheck

metaBMA:Bayesian Model Averaging for Random and Fixed Effects Meta-Analysis

Computes the posterior model probabilities for standard meta-analysis models (null model vs. alternative model assuming either fixed- or random-effects, respectively). These posterior probabilities are used to estimate the overall mean effect size as the weighted average of the mean effect size estimates of the random- and fixed-effect model as proposed by Gronau, Van Erp, Heck, Cesario, Jonas, & Wagenmakers (2017, <doi:10.1080/23743603.2017.1326760>). The user can define a wide range of non-informative or informative priors for the mean effect size and the heterogeneity coefficient. Moreover, using pre-compiled Stan models, meta-analysis with continuous and discrete moderators with Jeffreys-Zellner-Siow (JZS) priors can be fitted and tested. This allows to compute Bayes factors and perform Bayesian model averaging across random- and fixed-effects meta-analysis with and without moderators. For a primer on Bayesian model-averaged meta-analysis, see Gronau, Heck, Berkhout, Haaf, & Wagenmakers (2021, <doi:10.1177/25152459211031256>).

Maintained by Daniel W. Heck. Last updated 1 years ago.

bayes bayes-factor bayesian-inference evidence-synthesis meta-analysis model-averaging stan cpp

28 stars 7.75 score 54 scripts 4 dependents

theoreticalecology

sjSDM:Scalable Joint Species Distribution Modeling

A scalable and fast method for estimating joint Species Distribution Models (jSDMs) for big community data, including eDNA data. The package estimates a full (i.e. non-latent) jSDM with different response distributions (including the traditional multivariate probit model). The package allows to perform variation partitioning (VP) / ANOVA on the fitted models to separate the contribution of environmental, spatial, and biotic associations. In addition, the total R-squared can be further partitioned per species and site to reveal the internal metacommunity structure, see Leibold et al., <doi:10.1111/oik.08618>. The internal structure can then be regressed against environmental and spatial distinctiveness, richness, and traits to analyze metacommunity assembly processes. The package includes support for accounting for spatial autocorrelation and the option to fit responses using deep neural networks instead of a standard linear predictor. As described in Pichler & Hartig (2021) <doi:10.1111/2041-210X.13687>, scalability is achieved by using a Monte Carlo approximation of the joint likelihood implemented via 'PyTorch' and 'reticulate', which can be run on CPUs or GPUs.

Maintained by Maximilian Pichler. Last updated 1 months ago.

deep-learning gpu-acceleration machine-learning species-distribution-modelling species-interactions

69 stars 7.64 score 70 scripts

andrewcparnell

simmr:A Stable Isotope Mixing Model

Fits Stable Isotope Mixing Models (SIMMs) and is meant as a longer term replacement to the previous widely-used package SIAR. SIMMs are used to infer dietary proportions of organisms consuming various food sources from observations on the stable isotope values taken from the organisms' tissue samples. However SIMMs can also be used in other scenarios, such as in sediment mixing or the composition of fatty acids. The main functions are simmr_load() and simmr_mcmc(). The two vignettes contain a quick start and a full listing of all the features. The methods used are detailed in the papers Parnell et al 2010 <doi:10.1371/journal.pone.0009672>, and Parnell et al 2013 <doi:10.1002/env.2221>.

Maintained by Emma Govan. Last updated 12 months ago.

openblas cpp jags

32 stars 7.59 score 81 scripts

spsanderson

healthyR.ts:The Time Series Modeling Companion to 'healthyR'

Hospital time series data analysis workflow tools, modeling, and automations. This library provides many useful tools to review common administrative time series hospital data. Some of these include average length of stay, and readmission rates. The aim is to provide a simple and consistent verb framework that takes the guesswork out of everything.

Maintained by Steven Sanderson. Last updated 6 months ago.

ai arima-forecasting arima-model ets forecasting ggplot2 machine-learning modeling prophet time-series time-series-analysis workflows

19 stars 7.58 score 56 scripts 1 dependents

epiforecasts

epinowcast:Flexible Hierarchical Nowcasting

Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.

Maintained by Sam Abbott. Last updated 12 months ago.

cmdstanr effective-reproduction-number-estimation epidemiology infectious-disease-surveillance nowcasting outbreak-analysis pandemic-preparedness real-time-infectious-disease-modelling stan

61 stars 7.58 score 65 scripts

hsbadr

bayesian:Bindings for Bayesian TidyModels

Fit Bayesian models using 'brms'/'Stan' with 'parsnip'/'tidymodels' via 'bayesian' <doi:10.5281/zenodo.4426836>. 'tidymodels' is a collection of packages for machine learning; see Kuhn and Wickham (2020) <https://www.tidymodels.org>). The technical details of 'brms' and 'Stan' are described in Bürkner (2017) <doi:10.18637/jss.v080.i01>, Bürkner (2018) <doi:10.32614/RJ-2018-017>, and Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>.

Maintained by Hamada S. Badr. Last updated 2 months ago.

bayesian brms stan tidymodels

46 stars 7.47 score 18 scripts

leoegidi

footBayes:Fitting Bayesian and MLE Football Models

This is the first package allowing for the estimation, visualization and prediction of the most well-known football models: double Poisson, bivariate Poisson, Skellam, student_t, diagonal-inflated bivariate Poisson, and zero-inflated Skellam. It supports both maximum likelihood estimation (MLE, for 'static' models only) and Bayesian inference. For Bayesian methods, it incorporates several techniques: MCMC sampling with Hamiltonian Monte Carlo, variational inference using either the Pathfinder algorithm or Automatic Differentiation Variational Inference (ADVI), and the Laplace approximation. The package compiles all the 'CmdStan' models once during installation using the 'instantiate' package. The model construction relies on the most well-known football references, such as Dixon and Coles (1997) <doi:10.1111/1467-9876.00065>, Karlis and Ntzoufras (2003) <doi:10.1111/1467-9884.00366> and Egidi, Pauli and Torelli (2018) <doi:10.1177/1471082X18798414>.

Maintained by Leonardo Egidi. Last updated 13 days ago.

42 stars 7.43 score 53 scripts

paulnorthrop

revdbayes:Ratio-of-Uniforms Sampling for Bayesian Extreme Value Analysis

Provides functions for the Bayesian analysis of extreme value models. The 'rust' package <https://cran.r-project.org/package=rust> is used to simulate a random sample from the required posterior distribution. The functionality of 'revdbayes' is similar to the 'evdbayes' package <https://cran.r-project.org/package=evdbayes>, which uses Markov Chain Monte Carlo ('MCMC') methods for posterior simulation. In addition, there are functions for making inferences about the extremal index, using the models for threshold inter-exceedance times of Suveges and Davison (2010) <doi:10.1214/09-AOAS292> and Holesovsky and Fusek (2020) <doi:10.1007/s10687-020-00374-3>. Also provided are d,p,q,r functions for the Generalised Extreme Value ('GEV') and Generalised Pareto ('GP') distributions that deal appropriately with cases where the shape parameter is very close to zero.

Maintained by Paul J. Northrop. Last updated 7 months ago.

analysis bayesian extreme extreme-value-statistics extremes generalized-pareto-distribution gev inference nhpp point-process posterior predictive rcpp value openblas cpp

4 stars 7.41 score 58 scripts 4 dependents

spsanderson

healthyR.ai:The Machine Learning and AI Modeling Companion to 'healthyR'

Hospital machine learning and ai data analysis workflow tools, modeling, and automations. This library provides many useful tools to review common administrative hospital data. Some of these include predicting length of stay, and readmits. The aim is to provide a simple and consistent verb framework that takes the guesswork out of everything.

Maintained by Steven Sanderson. Last updated 2 months ago.

ai artificial-intelligence healthcareanalytics healthyr healthyverse machine-learning

16 stars 7.37 score 36 scripts 1 dependents

saudiwin

ordbetareg:Ordered Beta Regression Models with 'brms'

Implements ordered beta regression models, which are for modeling continuous variables with upper and lower bounds, such as survey sliders, dose-response relationships and indexes. For more information, see Kubinec (2023) <doi:10.31235/osf.io/2sx6y>. The package is a front-end to the R package 'brms', which facilitates a range of regression specifications, including hierarchical, dynamic and multivariate modeling.

Maintained by Robert Kubinec. Last updated 2 months ago.

21 stars 7.35 score 38 scripts

wwiecek

baggr:Bayesian Aggregate Treatment Effects

Running and comparing meta-analyses of data with hierarchical Bayesian models in Stan, including convenience functions for formatting data, plotting and pooling measures specific to meta-analysis. This implements many models from Meager (2019) <doi:10.1257/app.20170299>.

Maintained by Witold Wiecek. Last updated 5 days ago.

bayesian-statistics meta-analysis quantile-regression stan treatment-effects cpp

49 stars 7.24 score 88 scripts

jschoeley

tricolore:A Flexible Color Scale for Ternary Compositions

Compositional data consisting of three-parts can be color mapped with a ternary color scale. Such a scale is provided by the tricolore packages with options for discrete and continuous colors, mean-centering and scaling. See Jonas Schöley (2021) "The centered ternary balance scheme. A technique to visualize surfaces of unbalanced three-part compositions" <doi:10.4054/DemRes.2021.44.19>, Jonas Schöley, Frans Willekens (2017) "Visualizing compositional data on the Lexis surface" <doi:10.4054/DemRes.2017.36.21>, and Ilya Kashnitsky, Jonas Schöley (2018) "Regional population structures at a glance" <doi:10.1016/S0140-6736(18)31194-2>.

Maintained by Jonas Schöley. Last updated 11 months ago.

151 stars 7.11 score 85 scripts

cbergmeir

Rlgt:Bayesian Exponential Smoothing Models with Trend Modifications

An implementation of a number of Global Trend models for time series forecasting that are Bayesian generalizations and extensions of some Exponential Smoothing models. The main differences/additions include 1) nonlinear global trend, 2) Student-t error distribution, and 3) a function for the error size, so heteroscedasticity. The methods are particularly useful for short time series. When tested on the well-known M3 dataset, they are able to outperform all classical time series algorithms. The models are fitted with MCMC using the 'rstan' package.

Maintained by Christoph Bergmeir. Last updated 8 months ago.

cpp

20 stars 7.05 score 31 scripts

drizopoulos

JMbayes:Joint Modeling of Longitudinal and Time-to-Event Data under a Bayesian Approach

Shared parameter models for the joint modeling of longitudinal and time-to-event data using MCMC; Dimitris Rizopoulos (2016) <doi:10.18637/jss.v072.i07>.

Maintained by Dimitris Rizopoulos. Last updated 4 years ago.

joint-models longitudinal-responses prediction-model survival-analysis openblas cpp openmp jags

59 stars 6.97 score 80 scripts

ropensci

jagstargets:Targets for JAGS Pipelines

Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the 'jagstargets' R package is leverages 'targets' and 'R2jags' to ease this burden. 'jagstargets' makes it super easy to set up scalable JAGS pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than 'targets' alone. For the underlying methodology, please refer to the documentation of 'targets' <doi:10.21105/joss.02959> and 'JAGS' (Plummer 2003) <https://www.r-project.org/conferences/DSC-2003/Proceedings/Plummer.pdf>.

Maintained by William Michael Landau. Last updated 4 months ago.

bayesian high-performance-computing jags make r-targetopia reproducibility rjags statistics targets cpp

10 stars 6.95 score 32 scripts

paul-buerkner

thurstonianIRT:Thurstonian IRT Models

Fit Thurstonian Item Response Theory (IRT) models in R. This package supports fitting Thurstonian IRT models and its extensions using 'Stan', 'lavaan', or 'Mplus' for the model estimation. Functionality for extracting results, making predictions, and simulating data is provided as well. References: Brown & Maydeu-Olivares (2011) <doi:10.1177/0013164410375112>; Bürkner et al. (2019) <doi:10.1177/0013164419832063>.

Maintained by Paul-Christian Bürkner. Last updated 11 months ago.

cpp

32 stars 6.94 score 15 scripts 1 dependents

boehringer-ingelheim

BayesianMCPMod:Simulate, Evaluate, and Analyze Dose Finding Trials with Bayesian MCPMod

Bayesian MCPMod (Fleischer et al. (2022) <doi:10.1002/pst.2193>) is an innovative method that improves the traditional MCPMod by systematically incorporating historical data, such as previous placebo group data. This R package offers functions for simulating, analyzing, and evaluating Bayesian MCPMod trials with normally distributed endpoints. It enables the assessment of trial designs incorporating historical data across various true dose-response relationships and sample sizes. Robust mixture prior distributions, such as those derived with the Meta-Analytic-Predictive approach (Schmidli et al. (2014) <doi:10.1111/biom.12242>), can be specified for each dose group. Resulting mixture posterior distributions are used in the Bayesian Multiple Comparison Procedure and modeling steps. The modeling step also includes a weighted model averaging approach (Pinheiro et al. (2014) <doi:10.1002/sim.6052>). Estimated dose-response relationships can be bootstrapped and visualized.

Maintained by Stephan Wojciekowski. Last updated 22 days ago.

9 stars 6.94 score 4 scripts

asael697

bayesforecast:Bayesian Time Series Modeling with Stan

Fit Bayesian time series models using 'Stan' for full Bayesian inference. A wide range of distributions and models are supported, allowing users to fit Seasonal ARIMA, ARIMAX, Dynamic Harmonic Regression, GARCH, t-student innovation GARCH models, asymmetric GARCH, Random Walks, stochastic volatility models for univariate time series. Prior specifications are flexible and explicitly encourage users to apply prior distributions that actually reflect their beliefs. Model fit can easily be assessed and compared with typical visualization methods, information criteria such as loglik, AIC, BIC WAIC, Bayes factor and leave-one-out cross-validation methods. References: Hyndman (2017) <doi:10.18637/jss.v027.i03>; Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>.

Maintained by Asael Alonzo Matamoros. Last updated 1 years ago.

bayesian-inference forecasting-models mcmc stan time-series-analysis cpp

45 stars 6.92 score 62 scripts

datacloning

dclone:Data Cloning and MCMC Tools for Maximum Likelihood Methods

Low level functions for implementing maximum likelihood estimating procedures for complex models using data cloning and Bayesian Markov chain Monte Carlo methods as described in Solymos 2010 <doi:10.32614/RJ-2010-011>. Sequential and parallel MCMC support for 'JAGS', 'WinBUGS', 'OpenBUGS', and 'Stan'.

Maintained by Peter Solymos. Last updated 6 months ago.

jags cpp

7 stars 6.91 score 215 scripts 4 dependents

ropensci

stantargets:Targets for Stan Workflows

Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the 'stantargets' R package leverages 'targets' and 'cmdstanr' to ease these burdens. 'stantargets' makes it super easy to set up scalable Stan pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than 'targets' alone. 'stantargets' can access all of 'cmdstanr''s major algorithms (MCMC, variational Bayes, and optimization) and it supports both single-fit workflows and multi-rep simulation studies. For the statistical methodology, please refer to 'Stan' documentation (Stan Development Team 2020) <https://mc-stan.org/>.

Maintained by William Michael Landau. Last updated 2 months ago.

bayesian high-performance-computing make r-targetopia reproducibility stan statistics targets

49 stars 6.85 score 180 scripts

wjakethompson

measr:Bayesian Psychometric Measurement Using 'Stan'

Estimate diagnostic classification models (also called cognitive diagnostic models) with 'Stan'. Diagnostic classification models are confirmatory latent class models, as described by Rupp et al. (2010, ISBN: 978-1-60623-527-0). Automatically generate 'Stan' code for the general loglinear cognitive diagnostic diagnostic model proposed by Henson et al. (2009) <doi:10.1007/s11336-008-9089-5> and other subtypes that introduce additional model constraints. Using the generated 'Stan' code, estimate the model evaluate the model's performance using model fit indices, information criteria, and reliability metrics.

Maintained by W. Jake Thompson. Last updated 1 days ago.

bayesian cdm cmdstanr cognitive-diagnosis cognitive-diagnostic-models dcm diagnostic-classification-models psychometrics rstan stan cpp

10 stars 6.81 score 31 scripts

oucru-modelling

serosv:Model Infectious Disease Parameters from Serosurveys

An easy-to-use and efficient tool to estimate infectious diseases parameters using serological data. Implemented models include SIR models (basic_sir_model(), static_sir_model(), mseir_model(), sir_subpops_model()), parametric models (polynomial_model(), fp_model()), nonparametric models (lp_model()), semiparametric models (penalized_splines_model()), hierarchical models (hierarchical_bayesian_model()). The package is based on the book "Modeling Infectious Disease Parameters Based on Serological and Social Contact Data: A Modern Statistical Perspective" (Hens, Niel & Shkedy, Ziv & Aerts, Marc & Faes, Christel & Damme, Pierre & Beutels, Philippe., 2013) <doi:10.1007/978-1-4614-4072-7>.

Maintained by Anh Phan Truong Quynh. Last updated 9 days ago.

cpp

6.77 score 24 scripts

lindeloev

mcp:Regression with Multiple Change Points

Flexible and informed regression with Multiple Change Points. 'mcp' can infer change points in means, variances, autocorrelation structure, and any combination of these, as well as the parameters of the segments in between. All parameters are estimated with uncertainty and prediction intervals are supported - also near the change points. 'mcp' supports hypothesis testing via Savage-Dickey density ratio, posterior contrasts, and cross-validation. 'mcp' is described in Lindeløv (submitted) <doi:10.31219/osf.io/fzqxv> and generalizes the approach described in Carlin, Gelfand, & Smith (1992) <doi:10.2307/2347570> and Stephens (1994) <doi:10.2307/2986119>.

Maintained by Jonas Kristoffer Lindeløv. Last updated 6 months ago.

jags cpp

108 stars 6.74 score 85 scripts 1 dependents

seananderson

glmmfields:Generalized Linear Mixed Models with Robust Random Fields for Spatiotemporal Modeling

Implements Bayesian spatial and spatiotemporal models that optionally allow for extreme spatial deviations through time. 'glmmfields' uses a predictive process approach with random fields implemented through a multivariate-t distribution instead of the usual multivariate normal. Sampling is conducted with 'Stan'. References: Anderson and Ward (2019) <doi:10.1002/ecy.2403>.

Maintained by Sean C. Anderson. Last updated 1 years ago.

ecology extremes spatial-analysis spatiotemporal cpp

50 stars 6.74 score 55 scripts

handcock

reldist:Relative Distribution Methods

Tools for the comparison of distributions. This includes nonparametric estimation of the relative distribution PDF and CDF and numerical summaries as described in "Relative Distribution Methods in the Social Sciences" by Mark S. Handcock and Martina Morris, Springer-Verlag, 1999, Springer-Verlag, ISBN 0387987789.

Maintained by Mark S. Handcock. Last updated 2 years ago.

1 stars 6.70 score 344 scripts 7 dependents

koenderks

jfa:Statistical Methods for Auditing

Provides statistical methods for auditing as implemented in JASP for Audit (Derks et al., 2021 <doi:10.21105/joss.02733>). First, the package makes it easy for an auditor to plan a statistical sample, select the sample from the population, and evaluate the misstatement in the sample compliant with international auditing standards. Second, the package provides statistical methods for auditing data, including tests of digit distributions and repeated values. Finally, the package includes methods for auditing algorithms on the aspect of fairness and bias. Next to classical statistical methodology, the package implements Bayesian equivalents of these methods whose statistical underpinnings are described in Derks et al. (2021) <doi:10.1111/ijau.12240>, Derks et al. (2024) <doi:10.2308/AJPT-2021-086>, Derks et al. (2022) <doi:10.31234/osf.io/8nf3e> Derks et al. (2024) <doi:10.31234/osf.io/tgq5z>, and Derks et al. (2025) <doi:10.31234/osf.io/b8tu2>.

Maintained by Koen Derks. Last updated 14 days ago.

algorithm-auditing audit audit-sampling bayesian data-auditing jasp jasp-for-audit statistical-audit statistics cpp

8 stars 6.69 score 17 scripts

gsk-biostatistics

beastt:Bayesian Evaluation, Analysis, and Simulation Software Tools for Trials

Bayesian dynamic borrowing with covariate adjustment via inverse probability weighting for simulations and data analyses in clinical trials. This makes it easy to use propensity score methods to balance covariate distributions between external and internal data.

Maintained by Christina Fillmore. Last updated 4 days ago.

cpp

3 stars 6.65 score 4 scripts

pavlakrotka

NCC:Simulation and Analysis of Platform Trials with Non-Concurrent Controls

Design and analysis of flexible platform trials with non-concurrent controls. Functions for data generation, analysis, visualization and running simulation studies are provided. The implemented analysis methods are described in: Bofill Roig et al. (2022) <doi:10.1186/s12874-022-01683-w>, Saville et al. (2022) <doi:10.1177/17407745221112013> and Schmidli et al. (2014) <doi:10.1111/biom.12242>.

Maintained by Pavla Krotka. Last updated 19 days ago.

clinical-trials platform-trials simulation statistical-inference jags cpp

5 stars 6.64 score 29 scripts

business-science

modeltime.resample:Resampling Tools for Time Series Forecasting

A 'modeltime' extension that implements forecast resampling tools that assess time-based model performance and stability for a single time series, panel data, and cross-sectional time series analysis.

Maintained by Matt Dancho. Last updated 1 years ago.

accuracy-metrics backtesting bootstrap bootstrapping cross-validation forecasting modeltime modeltime-resample resampling statistics tidymodels time-series

19 stars 6.64 score 38 scripts 1 dependents

theomichelot

hmmTMB:Fit Hidden Markov Models using Template Model Builder

Fitting hidden Markov models using automatic differentiation and Laplace approximation, allowing for fast inference and flexible covariate effects (including random effects and smoothing splines) on model parameters. The package is described by Michelot (2022) <arXiv:2211.14139>.

Maintained by Theo Michelot. Last updated 1 days ago.

cpp

54 stars 6.62 score 64 scripts

bioc

CiteFuse:CiteFuse: multi-modal analysis of CITE-seq data

CiteFuse pacakage implements a suite of methods and tools for CITE-seq data from pre-processing to integrative analytics, including doublet detection, network-based modality integration, cell type clustering, differential RNA and protein expression analysis, ADT evaluation, ligand-receptor interaction analysis, and interactive web-based visualisation of the analyses.

Maintained by Yingxin Lin. Last updated 5 months ago.

singlecell geneexpression bioinformatics single-cell cpp

27 stars 6.59 score 18 scripts

thomasasmith

CRTspat:Workflow for Cluster Randomised Trials with Spillover

Design, workflow and statistical analysis of Cluster Randomised Trials of (health) interventions where there may be spillover between the arms (see <https://thomasasmith.github.io/index.html>).

Maintained by Thomas Smith. Last updated 11 days ago.

jags cpp

4 stars 6.57 score 24 scripts

danielcfurr

edstan:Stan Models for Item Response Theory

Streamlines the fitting of common Bayesian item response models using Stan.

Maintained by Daniel C. Furr. Last updated 5 days ago.

8 stars 6.56 score 25 scripts 2 dependents

epinowcast

epidist:Estimate Epidemiological Delay Distributions With brms

Understanding and accurately estimating epidemiological delay distributions is important for public health policy. These estimates influence epidemic situational awareness, control strategies, and resource allocation. This package provides methods to address the key challenges in estimating these distributions, including truncation, interval censoring, and dynamical biases. These issues are frequently overlooked, resulting in biased conclusions. Built on top of 'brms', it allows for flexible modelling including time-varying spatial components and partially pooled estimates of demographic characteristics.

Maintained by Sam Abbott. Last updated 22 days ago.

14 stars 6.52 score 7 scripts

jsocolar

flocker:Flexible Occupancy Estimation with Stan

Fit occupancy models in 'Stan' via 'brms'. The full variety of 'brms' formula-based effects structures are available to use in multiple classes of occupancy model, including single-season models, models with data augmentation for never-observed species, dynamic (multiseason) models with explicit colonization and extinction processes, and dynamic models with autologistic occupancy dynamics. Formulas can be specified for all relevant distributional terms, including detection and one or more of occupancy, colonization, extinction, and autologistic depending on the model type. Several important forms of model post-processing are provided. References: Bürkner (2017) <doi:10.18637/jss.v080.i01>; Carpenter et al. (2017) <doi:10.18637/jss.v076.i01>; Socolar & Mills (2023) <doi:10.1101/2023.10.26.564080>.

Maintained by Jacob B. Socolar. Last updated 2 months ago.

30 stars 6.48 score 20 scripts

mitchelloharawild

fable.prophet:Prophet Modelling Interface for 'fable'

Allows prophet models from the 'prophet' package to be used in a tidy workflow with the modelling interface of 'fabletools'. This extends 'prophet' to provide enhanced model specification and management, performance evaluation methods, and model combination tools.

Maintained by Mitchell OHara-Wild. Last updated 1 years ago.

56 stars 6.48 score 107 scripts

jazznbass

scan:Single-Case Data Analyses for Single and Multiple Baseline Designs

A collection of procedures for analysing, visualising, and managing single-case data. These include piecewise linear regression models, multilevel models, overlap indices ('PND', 'PEM', 'PAND', 'PET', 'tau-u', 'baseline corrected tau', 'CDC'), and randomization tests. Data preparation functions support outlier detection, handling missing values, scaling, and custom transformations. An export function helps to generate html, word, and latex tables in a publication friendly style. More details can be found in the online book 'Analyzing single-case data with R and scan', Juergen Wilbert (2025) <https://jazznbass.github.io/scan-Book/>.

Maintained by Juergen Wilbert. Last updated 11 days ago.

4 stars 6.47 score 62 scripts 1 dependents

bstatcomp

bayes4psy:User Friendly Bayesian Data Analysis for Psychology

Contains several Bayesian models for data analysis of psychological tests. A user friendly interface for these models should enable students and researchers to perform professional level Bayesian data analysis without advanced knowledge in programming and Bayesian statistics. This package is based on the Stan platform (Carpenter et el. 2017 <doi:10.18637/jss.v076.i01>).

Maintained by Jure Demšar. Last updated 1 years ago.

cpp

14 stars 6.44 score 33 scripts

helske

bssm:Bayesian Inference of Non-Linear and Non-Gaussian State Space Models

Efficient methods for Bayesian inference of state space models via Markov chain Monte Carlo (MCMC) based on parallel importance sampling type weighted estimators (Vihola, Helske, and Franks, 2020, <doi:10.1111/sjos.12492>), particle MCMC, and its delayed acceptance version. Gaussian, Poisson, binomial, negative binomial, and Gamma observation densities and basic stochastic volatility models with linear-Gaussian state dynamics, as well as general non-linear Gaussian models and discretised diffusion models are supported. See Helske and Vihola (2021, <doi:10.32614/RJ-2021-103>) for details.

Maintained by Jouni Helske. Last updated 7 months ago.

bayesian-inference cpp markov-chain-monte-carlo particle-filter state-space time-series openblas cpp openmp

42 stars 6.43 score 11 scripts

helske

walker:Bayesian Generalized Linear Models with Time-Varying Coefficients

Efficient Bayesian generalized linear models with time-varying coefficients as in Helske (2022, <doi:10.1016/j.softx.2022.101016>). Gaussian, Poisson, and binomial observations are supported. The Markov chain Monte Carlo (MCMC) computations are done using Hamiltonian Monte Carlo provided by Stan, using a state space representation of the model in order to marginalise over the coefficients for efficient sampling. For non-Gaussian models, the package uses the importance sampling type estimators based on approximate marginal MCMC as in Vihola, Helske, Franks (2020, <doi:10.1111/sjos.12492>).

Maintained by Jouni Helske. Last updated 7 months ago.

bayesian generalized-linear-models mcmc stan time-series openblas cpp

44 stars 6.42 score 15 scripts

ygeunkim

bvhar:Bayesian Vector Heterogeneous Autoregressive Modeling

Tools to model and forecast multivariate time series including Bayesian Vector heterogeneous autoregressive (VHAR) model by Kim & Baek (2023) (<doi:10.1080/00949655.2023.2281644>). 'bvhar' can model Vector Autoregressive (VAR), VHAR, Bayesian VAR (BVAR), and Bayesian VHAR (BVHAR) models.

Maintained by Young Geun Kim. Last updated 30 days ago.

bayesian bayesian-econometrics bvar eigen forecasting har pybind11 python rcppeigen time-series vector-autoregression cpp openmp

6 stars 6.42 score 25 scripts

cran

compositions:Compositional Data Analysis

Provides functions for the consistent analysis of compositional data (e.g. portions of substances) and positive numbers (e.g. concentrations) in the way proposed by J. Aitchison and V. Pawlowsky-Glahn.

Maintained by K. Gerald van den Boogaart. Last updated 1 years ago.

openblas

1 stars 6.35 score 36 dependents

yoshidk6

rstanemax:Emax Model Analysis with 'Stan'

Perform sigmoidal Emax model fit using 'Stan' in a formula notation, without writing 'Stan' model code.

Maintained by Kenta Yoshida. Last updated 1 months ago.

cpp

5 stars 6.34 score 21 scripts 1 dependents

cole-monnahan-noaa

adnuts:No-U-Turn MCMC Sampling for 'ADMB' Models

Bayesian inference using the no-U-turn (NUTS) algorithm by Hoffman and Gelman (2014) <https://www.jmlr.org/papers/v15/hoffman14a.html>. Designed for 'AD Model Builder' ('ADMB') models, or when R functions for log-density and log-density gradient are available, such as 'Template Model Builder' models and other special cases. Functionality is similar to 'Stan', and the 'rstan' and 'shinystan' packages are used for diagnostics and inference.

Maintained by Cole Monnahan. Last updated 1 years ago.

24 stars 6.33 score 59 scripts

ethan-alt

hdbayes:Bayesian Analysis of Generalized Linear Models with Historical Data

User-friendly functions for leveraging (multiple) historical data set(s) for generalized linear models (GLMs) and survival models. The package provides functions for sampling from the posterior distribution under various informative priors, including the prior induced by the Bayesian hierarchical model, power prior by Ibrahim and Chen (2000) <doi:10.1214/ss/1009212673>, normalized power prior by Duan et al. (2006) <doi:10.1002/env.752>, normalized asymptotic power prior by Ibrahim et al. (2015) <doi:10.1002/sim.6728>, commensurate prior by Hobbs et al. (2011) <doi:10.1111/j.1541-0420.2011.01564.x>, robust meta-analytic-predictive prior by Schmidli et al. (2014) <doi:10.1111/biom.12242>, latent exchangeability prior by Alt et al. (2024) <doi:10.1093/biomtc/ujae083>, and a normal (or half-normal) prior. In addition to GLMs, the package supports survival models including: (1) accelerated failure time (AFT) models, (2) piecewise exponential (PWE) models, i.e., proportional hazards models with piecewise constant baseline hazards, and (3) mixture cure rate models that assume a common probability of cure across subjects, paired with a PWE model for the non-cured population. Functions for computing the marginal log-likelihood under each implemented prior are also included. The package compiles all the 'CmdStan' models once during installation using the 'instantiate' package.

Maintained by Ethan M. Alt. Last updated 1 days ago.

5 stars 6.32 score 7 scripts

bioc

scMET:Bayesian modelling of cell-to-cell DNA methylation heterogeneity

High-throughput single-cell measurements of DNA methylomes can quantify methylation heterogeneity and uncover its role in gene regulation. However, technical limitations and sparse coverage can preclude this task. scMET is a hierarchical Bayesian model which overcomes sparsity, sharing information across cells and genomic features to robustly quantify genuine biological heterogeneity. scMET can identify highly variable features that drive epigenetic heterogeneity, and perform differential methylation and variability analyses. We illustrate how scMET facilitates the characterization of epigenetically distinct cell populations and how it enables the formulation of novel hypotheses on the epigenetic regulation of gene expression.

Maintained by Andreas C. Kapourani. Last updated 5 months ago.

immunooncology dnamethylation differentialmethylation differentialexpression geneexpression generegulation epigenetics genetics clustering featureextraction regression bayesian sequencing coverage singlecell bayesian-inference generalised-linear-models heterogeneity hierarchical-models methylation-analysis single-cell cpp

20 stars 6.23 score 42 scripts

david-barnett

microViz:Microbiome Data Analysis and Visualization

Microbiome data visualization and statistics tools built upon phyloseq.

Maintained by David Barnett. Last updated 3 months ago.

microbiome microbiome-analysis microbiota

114 stars 6.22 score 480 scripts

epiverse-trace

serofoi:Bayesian Estimation of the Force of Infection from Serological Data

Estimating the force of infection from time varying, age varying, or constant serocatalytic models from population based seroprevalence studies using a Bayesian framework, including data simulation functions enabling the generation of serological surveys based on this models. This tool also provides a flexible prior specification syntax for the force of infection and the seroreversion rate, as well as methods to assess model convergence and comparison criteria along with useful visualisation functions.

Maintained by Zulma M. Cucunubá. Last updated 1 days ago.

antibodies bayesian-methods epidemiology epiverse serological-surveys stan-language cpp

19 stars 6.20 score 10 scripts

noaa-nwfsc

zoid:Bayesian Zero-and-One Inflated Dirichlet Regression Modelling

Fits Dirichlet regression and zero-and-one inflated Dirichlet regression with Bayesian methods implemented in Stan. These models are sometimes referred to as trinomial mixture models; covariates and overdispersion can optionally be included.

Maintained by Eric J. Ward. Last updated 12 days ago.

mixture-models nwfsc-cb stan cpp

8 stars 6.19 score 12 scripts

frmunoz

ecolottery:Coalescent-Based Simulation of Ecological Communities

Coalescent-Based Simulation of Ecological Communities as proposed by Munoz et al. (2018) <doi:10.1111/2041-210X.12918>. The package includes a tool for estimating parameters of community assembly by using Approximate Bayesian Computation.

Maintained by François Munoz. Last updated 2 years ago.

15 stars 6.18 score 17 scripts 1 dependents

vivianalobo

lnmixsurv:Bayesian Mixture Log-Normal Survival Model

Bayesian Survival models via the mixture of Log-Normal distribution extends the well-known survival models and accommodates different behaviour over time and considers higher censored survival times. The proposal combines mixture distributions Fruhwirth-Schnatter(2006) <doi:10.1007/s11336-009-9121-4>, and data augmentation techniques Tanner and Wong (1987) <doi:10.1080/01621459.1987.10478458>.

Maintained by Victor Hugo Soares Ney. Last updated 24 days ago.

gsl openblas cpp

2 stars 6.16 score 18 scripts

cefasrepres

EcoEnsemble:A General Framework for Combining Ecosystem Models

Fit and sample from the ensemble model described in Spence et al (2018): "A general framework for combining ecosystem models"<doi:10.1111/faf.12310>.

Maintained by Michael A. Spence. Last updated 11 days ago.

cpp

1 stars 6.15 score 19 scripts

grvanderploeg

parafac4microbiome:Parallel Factor Analysis Modelling of Longitudinal Microbiome Data

Creation and selection of PARAllel FACtor Analysis (PARAFAC) models of longitudinal microbiome data. You can import your own data with our import functions or use one of the example datasets to create your own PARAFAC models. Selection of the optimal number of components can be done using assessModelQuality() and assessModelStability(). The selected model can then be plotted using plotPARAFACmodel(). The Parallel Factor Analysis method was originally described by Caroll and Chang (1970) <doi:10.1007/BF02310791> and Harshman (1970) <https://www.psychology.uwo.ca/faculty/harshman/wpppfac0.pdf>.

Maintained by Geert Roelof van der Ploeg. Last updated 8 hours ago.

dimensionality-reduction microbiome microbiome-data multiway multiway-algorithms parallel-factor-analysis

6 stars 6.15 score 13 scripts

jwiley

brmsmargins:Bayesian Marginal Effects for 'brms' Models

Calculate Bayesian marginal effects, average marginal effects, and marginal coefficients (also called population averaged coefficients) for models fit using the 'brms' package including fixed effects, mixed effects, and location scale models. These are based on marginal predictions that integrate out random effects if necessary (see for example <doi:10.1186/s12874-015-0046-6> and <doi:10.1111/biom.12707>).

Maintained by Joshua F. Wiley. Last updated 2 months ago.

openblas cpp openmp

20 stars 6.14 score 42 scripts

simonlabcode

bakR:Analyze and Compare Nucleotide Recoding RNA Sequencing Datasets

Several implementations of a novel Bayesian hierarchical statistical model of nucleotide recoding RNA-seq experiments (NR-seq; TimeLapse-seq, SLAM-seq, TUC-seq, etc.) for analyzing and comparing NR-seq datasets (see 'Vock and Simon' (2023) <doi:10.1261/rna.079451.122>). NR-seq is a powerful extension of RNA-seq that provides information about the kinetics of RNA metabolism (e.g., RNA degradation rate constants), which is notably lacking in standard RNA-seq data. The statistical model makes maximal use of these high-throughput datasets by sharing information across transcripts to significantly improve uncertainty quantification and increase statistical power. 'bakR' includes a maximally efficient implementation of this model for conservative initial investigations of datasets. 'bakR' also provides more highly powered implementations using the probabilistic programming language 'Stan' to sample from the full posterior distribution. 'bakR' performs multiple-test adjusted statistical inference with the output of these model implementations to help biologists separate signal from background. Methods to automatically visualize key results and detect batch effects are also provided.

Maintained by Isaac Vock. Last updated 4 months ago.

cpp

6 stars 6.12 score 21 scripts

jeremygelb

geocmeans:Implementing Methods for Spatial Fuzzy Unsupervised Classification

Provides functions to apply spatial fuzzy unsupervised classification, visualize and interpret results. This method is well suited when the user wants to analyze data with a fuzzy clustering algorithm and to account for the spatial dimension of the dataset. In addition, indexes for estimating the spatial consistency and classification quality are proposed. The methods were originally proposed in the field of brain imagery (seed Cai and al. 2007 <doi:10.1016/j.patcog.2006.07.011> and Zaho and al. 2013 <doi:10.1016/j.dsp.2012.09.016>) and recently applied in geography (see Gelb and Apparicio <doi:10.4000/cybergeo.36414>).

Maintained by Jeremy Gelb. Last updated 4 months ago.

clustering cmeans fuzzy-classification-algorithms spatial-analysis spatial-fuzzy-cmeans unsupervised-learning cpp openmp

28 stars 6.10 score 90 scripts

google

bsynth:Bayesian Synthetic Control

Implements the Bayesian Synthetic Control method for causal inference in comparative case studies. This package provides tools for estimating treatment effects in settings with a single treated unit and multiple control units, allowing for uncertainty quantification and flexible modeling of time-varying effects. The methodology is based on the paper by Vives and Martinez (2022) <doi:10.48550/arXiv.2206.01779>.

Maintained by Ignacio Martinez. Last updated 13 days ago.

cpp

18 stars 6.03 score 4 scripts

bonstats

tidytreatment:Tidy Methods for Bayesian Treatment Effect Models

Functions for extracting tidy data from Bayesian treatment effect models, in particular BART, but extensions are possible. Functionality includes extracting tidy posterior summaries as in 'tidybayes' <https://github.com/mjskay/tidybayes>, estimating (average) treatment effects, common support calculations, and plotting useful summaries of these.

Maintained by Joshua J Bon. Last updated 3 months ago.

bayesian tidy treatment-effects

16 stars 5.97 score 13 scripts 1 dependents

bioc

ClustIRR:Clustering of immune receptor repertoires

ClustIRR analyzes repertoires of B- and T-cell receptors. It starts by identifying communities of immune receptors with similar specificities, based on the sequences of their complementarity-determining regions (CDRs). Next, it employs a Bayesian probabilistic models to quantify differential community occupancy (DCO) between repertoires, allowing the identification of expanding or contracting communities in response to e.g. infection or cancer treatment.

Maintained by Simo Kitanovski. Last updated 30 days ago.

clustering immunooncology singlecell software classification b-cell-receptor bioinformatics immunoinformatics immunology quantitative-methods rep-seq repertoire-analysis t-cell-receptor cpp

2 stars 5.95 score 2 scripts

leoegidi

pivmet:Pivotal Methods for Bayesian Relabelling and k-Means Clustering

Collection of pivotal algorithms for: relabelling the MCMC chains in order to undo the label switching problem in Bayesian mixture models; fitting sparse finite mixtures; initializing the centers of the classical k-means algorithm in order to obtain a better clustering solution. For further details see Egidi, Pappadà, Pauli and Torelli (2018b)<ISBN:9788891910233>.

Maintained by Leonardo Egidi. Last updated 10 months ago.

jags cpp

5 stars 5.94 score 25 scripts

jtimonen

lgpr:Longitudinal Gaussian Process Regression

Interpretable nonparametric modeling of longitudinal data using additive Gaussian process regression. Contains functionality for inferring covariate effects and assessing covariate relevances. Models are specified using a convenient formula syntax, and can include shared, group-specific, non-stationary, heterogeneous and temporally uncertain effects. Bayesian inference for model parameters is performed using 'Stan'. The modeling approach and methods are described in detail in Timonen et al. (2021) <doi:10.1093/bioinformatics/btab021>.

Maintained by Juho Timonen. Last updated 7 months ago.

bayesian-inference gaussian-processes longitudinal-data stan cpp

25 stars 5.94 score 69 scripts

venpopov

bmm:Easy and Accessible Bayesian Measurement Models Using 'brms'

Fit computational and measurement models using full Bayesian inference. The package provides a simple and accessible interface by translating complex domain-specific models into 'brms' syntax, a powerful and flexible framework for fitting Bayesian regression models using 'Stan'. The package is designed so that users can easily apply state-of-the-art models in various research fields, and so that researchers can use it as a new model development framework. References: Frischkorn and Popov (2023) <doi:10.31234/osf.io/umt57>.

Maintained by Vencislav Popov. Last updated 26 days ago.

15 stars 5.92 score 35 scripts

bioc

IgGeneUsage:Differential gene usage in immune repertoires

Detection of biases in the usage of immunoglobulin (Ig) genes is an important task in immune repertoire profiling. IgGeneUsage detects aberrant Ig gene usage between biological conditions using a probabilistic model which is analyzed computationally by Bayes inference. With this IgGeneUsage also avoids some common problems related to the current practice of null-hypothesis significance testing.

Maintained by Simo Kitanovski. Last updated 5 months ago.

differentialexpression regression genetics bayesian biomedicalinformatics immunooncology mathematicalbiology b-cell-receptor bcr-repertoire differential-analysis differential-gene-expression high-throughput-sequencing immune-repertoire immune-repertoire-analysis immune-repertoires immunogenomics immunoglobulin immunoinformatics immunological-bioinformatics immunology tcr-repertoire vdj-recombination cpp

6 stars 5.92 score 1 scripts

matthieu-bruneaux

isotracer:Isotopic Tracer Analysis Using MCMC

Implements Bayesian models to analyze data from tracer addition experiments. The implemented method was originally described in the article "A New Method to Reconstruct Quantitative Food Webs and Nutrient Flows from Isotope Tracer Addition Experiments" by López-Sepulcre et al. (2020) <doi:10.1086/708546>.

Maintained by Matthieu Bruneaux. Last updated 5 months ago.

cpp

5.92 score 60 scripts

munchfab

mlts:Multilevel Latent Time Series Models with 'R' and 'Stan'

Fit multilevel manifest or latent time-series models, including popular Dynamic Structural Equation Models (DSEM). The models can be set up and modified with user-friendly functions and are fit to the data using 'Stan' for Bayesian inference. Path models and formulas for user-defined models can be easily created with functions using 'knitr'. Asparouhov, Hamaker, & Muthen (2018) <doi:10.1080/10705511.2017.1406803>.

Maintained by Kenneth Koslowski. Last updated 9 days ago.

cpp

2 stars 5.92 score 9 scripts

jeffreypullin

rater:Statistical Models of Repeated Categorical Rating Data

Fit statistical models based on the Dawid-Skene model - Dawid and Skene (1979) <doi:10.2307/2346806> - to repeated categorical rating data. Full Bayesian inference for these models is supported through the Stan modelling language. 'rater' also allows the user to extract and plot key parameters of these models.

Maintained by Jeffrey Pullin. Last updated 2 years ago.

annotations bayesian bayesian-statistics stan cpp

17 stars 5.83 score 20 scripts

mvuorre

bmlm:Bayesian Multilevel Mediation

Easy estimation of Bayesian multilevel mediation models with Stan.

Maintained by Matti Vuorre. Last updated 4 months ago.

bayesian-data-analysis multilevel-mediation-models statistics cpp

42 stars 5.81 score 34 scripts

boehringer-ingelheim

oncomsm:Bayesian Multi-State Models for Early Oncology

Implements methods to fit a parametric Bayesian multi-state model to tumor response data. The model can be used to sample from the predictive distribution to impute missing data and calculate probability of success for custom decision criteria in early clinical trials during an ongoing trial. The inference is implemented using 'stan'.

Maintained by Kevin Kunzmann. Last updated 2 years ago.

cpp

8 stars 5.80 score 13 scripts

bioc

benchdamic:Benchmark of differential abundance methods on microbiome data

Starting from a microbiome dataset (16S or WMS with absolute count values) it is possible to perform several analysis to assess the performances of many differential abundance detection methods. A basic and standardized version of the main differential abundance analysis methods is supplied but the user can also add his method to the benchmark. The analyses focus on 4 main aspects: i) the goodness of fit of each method's distributional assumptions on the observed count data, ii) the ability to control the false discovery rate, iii) the within and between method concordances, iv) the truthfulness of the findings if any apriori knowledge is given. Several graphical functions are available for result visualization.

Maintained by Matteo Calgaro. Last updated 4 months ago.

metagenomics microbiome differentialexpression multiplecomparison normalization preprocessing software benchmark differential-abundance-methods

8 stars 5.78 score 8 scripts

jwiley

multilevelTools:Multilevel and Mixed Effects Model Diagnostics and Effect Sizes

Effect sizes, diagnostics and performance metrics for multilevel and mixed effects models. Includes marginal and conditional 'R2' estimates for linear mixed effects models based on Johnson (2014) <doi:10.1111/2041-210X.12225>.

Maintained by Joshua F. Wiley. Last updated 4 days ago.

4 stars 5.74 score 136 scripts

paulnorthrop

threshr:Threshold Selection and Uncertainty for Extreme Value Analysis

Provides functions for the selection of thresholds for use in extreme value models, based mainly on the methodology in Northrop, Attalides and Jonathan (2017) <doi:10.1111/rssc.12159>. It also performs predictive inferences about future extreme values, based either on a single threshold or on a weighted average of inferences from multiple thresholds, using the 'revdbayes' package <https://cran.r-project.org/package=revdbayes>. At the moment only the case where the data can be treated as independent identically distributed observations is considered.

Maintained by Paul J. Northrop. Last updated 2 months ago.

extreme-value-statistics extremes generalized inference pareto plot prediction threshold threshold-selection uncertainty

6 stars 5.72 score 29 scripts 1 dependents

bioc

ppcseq:Probabilistic Outlier Identification for RNA Sequencing Generalized Linear Models

Relative transcript abundance has proven to be a valuable tool for understanding the function of genes in biological systems. For the differential analysis of transcript abundance using RNA sequencing data, the negative binomial model is by far the most frequently adopted. However, common methods that are based on a negative binomial model are not robust to extreme outliers, which we found to be abundant in public datasets. So far, no rigorous and probabilistic methods for detection of outliers have been developed for RNA sequencing data, leaving the identification mostly to visual inspection. Recent advances in Bayesian computation allow large-scale comparison of observed data against its theoretical distribution given in a statistical model. Here we propose ppcseq, a key quality-control tool for identifying transcripts that include outlier data points in differential expression analysis, which do not follow a negative binomial distribution. Applying ppcseq to analyse several publicly available datasets using popular tools, we show that from 3 to 10 percent of differentially abundant transcripts across algorithms and datasets had statistics inflated by the presence of outliers.

Maintained by Stefano Mangiola. Last updated 5 months ago.

rnaseq differentialexpression geneexpression normalization clustering qualitycontrol sequencing transcription transcriptomics bayesian-inference deseq2 edger negative-binomial outlier stan cpp

8 stars 5.71 score 16 scripts

giopogg

webSDM:Including Known Interactions in Species Distribution Models

A collection of tools to fit and work with trophic Species Distribution Models. Trophic Species Distribution Models combine knowledge of trophic interactions with Bayesian structural equation models that model each species as a function of its prey (or predators) and environmental conditions. It exploits the topological ordering of the known trophic interaction network to predict species distribution in space and/or time, where the prey (or predator) distribution is unavailable. The method implemented by the package is described in Poggiato, Andréoletti, Pollock and Thuiller (2022) <doi:10.22541/au.166853394.45823739/v1>.

Maintained by Giovanni Poggiato. Last updated 9 months ago.

17 stars 5.71 score 9 scripts

tmsalab

hmcdm:Hidden Markov Cognitive Diagnosis Models for Learning

Fitting hidden Markov models of learning under the cognitive diagnosis framework. The estimation of the hidden Markov diagnostic classification model, the first order hidden Markov model, the reduced-reparameterized unified learning model, and the joint learning model for responses and response times.

Maintained by Sunbeom Kwon. Last updated 2 years ago.

cognitive-diagnostic-models psychometrics rcpp rcpparmadillo openblas cpp openmp

7 stars 5.70 score 12 scripts

atsa-es

atsar:Stan Routines For Univariate And Multivariate Time Series

Bundles univariate and multivariate STAN scripts for FISH 507 class.

Maintained by Eric J. Ward. Last updated 10 months ago.

bayesian stan time-series cpp

48 stars 5.68 score 33 scripts

sthomas522

hmclearn:Fit Statistical Models Using Hamiltonian Monte Carlo

Provide users with a framework to learn the intricacies of the Hamiltonian Monte Carlo algorithm with hands-on experience by tuning and fitting their own models. All of the code is written in R. Theoretical references are listed below:. Neal, Radford (2011) "Handbook of Markov Chain Monte Carlo" ISBN: 978-1420079418, Betancourt, Michael (2017) "A Conceptual Introduction to Hamiltonian Monte Carlo" <arXiv:1701.02434>, Thomas, S., Tu, W. (2020) "Learning Hamiltonian Monte Carlo in R" <arXiv:2006.16194>, Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013) "Bayesian Data Analysis" ISBN: 978-1439840955, Agresti, Alan (2015) "Foundations of Linear and Generalized Linear Models ISBN: 978-1118730034, Pinheiro, J., Bates, D. (2006) "Mixed-effects Models in S and S-Plus" ISBN: 978-1441903174.

Maintained by Samuel Thomas. Last updated 4 years ago.

11 stars 5.64 score 16 scripts

paulnorthrop

bang:Bayesian Analysis, No Gibbs

Provides functions for the Bayesian analysis of some simple commonly-used models, without using Markov Chain Monte Carlo (MCMC) methods such as Gibbs sampling. The 'rust' package <https://cran.r-project.org/package=rust> is used to simulate a random sample from the required posterior distribution, using the generalized ratio-of-uniforms method. See Wakefield, Gelfand and Smith (1991) <DOI:10.1007/BF01889987> for details. At the moment three conjugate hierarchical models are available: beta-binomial, gamma-Poisson and a 1-way analysis of variance (ANOVA).

Maintained by Paul J. Northrop. Last updated 2 months ago.

anova bayesian beta binomial gamma gibbs hierarchical poisson

3 stars 5.62 score 35 scripts

mrcieu

mrbayes:Bayesian Summary Data Models for Mendelian Randomization Studies

Bayesian estimation of inverse variance weighted (IVW), Burgess et al. (2013) <doi:10.1002/gepi.21758>, and MR-Egger, Bowden et al. (2015) <doi:10.1093/ije/dyv080>, summary data models for Mendelian randomization analyses.

Maintained by Tom Palmer. Last updated 13 days ago.

cpp

4 stars 5.56 score 2 scripts

traitecoevo

hmde:Hierarchical Methods for Differential Equations

Wrapper for Stan that offers a number of in-built models to implement a hierarchical Bayesian longitudinal model for repeat observation data. Model choice selects the differential equation that is fit to the observations. Single and multi-individual models are available.

Maintained by Tess OBrien. Last updated 5 days ago.

bayesian-inverse-problems bayesian-methods differential-equations hierarchical-models rstan stan cpp

3 stars 5.56 score 10 scripts

krisrs1128

multimedia:Multimodal Mediation Analysis

Multimodal mediation analysis is an emerging problem in microbiome data analysis. Multimedia make advanced mediation analysis techniques easy to use, ensuring that all statistical components are transparent and adaptable to specific problem contexts. The package provides a uniform interface to direct and indirect effect estimation, synthetic null hypothesis testing, bootstrap confidence interval construction, and sensitivity analysis. More details are available in Jiang et al. (2024) "multimedia: Multimodal Mediation Analysis of Microbiome Data" <doi:10.1101/2024.03.27.587024>.

Maintained by Kris Sankaran. Last updated 1 months ago.

coverage microbiome regression sequencing software statisticalmethod structuralequationmodels causal-inference data-integration mediation-analysis

1 stars 5.49 score 13 scripts

cmilando

WhiteLabRt:Novel Methods for Reproduction Number Estimation, Back-Calculation, and Forecasting

A collection of functions related to novel methods for estimating R(t), created by the lab of Professor Laura White. Currently implemented methods include two-step Bayesian back-calculation and now-casting for line-list data with missing reporting delays, adapted in 'STAN' from Li (2021) <doi:10.1371/journal.pcbi.1009210>, and calculation of time-varying reproduction number assuming a flux between various adjacent states, adapted into 'STAN' from Zhou (2021) <doi:10.1371/journal.pcbi.1010434>.

Maintained by Chad Milando. Last updated 13 days ago.

cpp

6 stars 5.48 score 4 scripts

sandhu-ss

bsitar:Bayesian Super Imposition by Translation and Rotation Growth Curve Analysis

The Super Imposition by Translation and Rotation (SITAR) model is a shape-invariant nonlinear mixed effect model that fits a natural cubic spline mean curve to the growth data and aligns individual-specific growth curves to the underlying mean curve via a set of random effects (see Cole, 2010 <doi:10.1093/ije/dyq115> for details). The non-Bayesian version of the SITAR model can be fit by using the already available R package 'sitar'. While the 'sitar' package allows modelling of a single outcome only, the 'bsitar' package offers great flexibility in fitting models of varying complexities, including joint modelling of multiple outcomes such as height and weight (multivariate model). Additionally, the 'bsitar' package allows for the simultaneous analysis of an outcome separately for subgroups defined by a factor variable such as gender. This is achieved by fitting separate models for each subgroup (for example males and females for gender variable). An advantage of this approach is that posterior draws for each subgroup are part of a single model object, making it possible to compare coefficients across subgroups and test hypotheses. Since the 'bsitar' package is a front-end to the R package 'brms', it offers excellent support for post-processing of posterior draws via various functions that are directly available from the 'brms' package. In addition, the 'bsitar' package includes various customized functions that allow for the visualization of distance (increase in size with age) and velocity (change in growth rate as a function of age), as well as the estimation of growth spurt parameters such as age at peak growth velocity and peak growth velocity.

Maintained by Satpal Sandhu. Last updated 8 days ago.

5.46 score 7 scripts

staffanbetner

rethinking:Statistical Rethinking book package

Utilities for fitting and comparing models

Maintained by Richard McElreath. Last updated 4 months ago.

5.42 score 4.4k scripts

edgarsantos-fernandez

SSNbayes:Bayesian Spatio-Temporal Analysis in Stream Networks

Fits Bayesian spatio-temporal models and makes predictions on stream networks using the approach by Santos-Fernandez, Edgar, et al. (2022)."Bayesian spatio-temporal models for stream networks" and Santos-Fernandez, Edgar, et al. (2023). "SSNbayes: An R Package for Bayesian Spatio-Temporal Modelling on Stream Networks". In these models, spatial dependence is captured using stream distance and flow connectivity, while temporal autocorrelation is modelled using vector autoregression methods.

Maintained by Edgar Santos-Fernandez. Last updated 2 months ago.

17 stars 5.41 score 6 scripts

dmenne

gastempt:Analyzing Gastric Emptying from MRI or Scintigraphy

Fits gastric emptying time series from MRI or 'scintigraphic' measurements using nonlinear mixed-model population fits with 'nlme' and Bayesian methods with Stan; computes derived parameters such as t50 and AUC.

Maintained by Dieter Menne. Last updated 3 months ago.

cpp

3 stars 5.41 score 17 scripts

cjvanlissa

pema:Penalized Meta-Analysis

Conduct penalized meta-analysis, see Van Lissa, Van Erp, & Clapper (2023) <doi:10.31234/osf.io/6phs5>. In meta-analysis, there are often between-study differences. These can be coded as moderator variables, and controlled for using meta-regression. However, if the number of moderators is large relative to the number of studies, such an analysis may be overfit. Penalized meta-regression is useful in these cases, because it shrinks the regression slopes of irrelevant moderators towards zero.

Maintained by Caspar J van Lissa. Last updated 7 hours ago.

cpp

5.40 score 21 scripts

angabrio

missingHE:Missing Outcome Data in Health Economic Evaluation

Contains a suite of functions for health economic evaluations with missing outcome data. The package can fit different types of statistical models under a fully Bayesian approach using the software 'JAGS' (which should be installed locally and which is loaded in 'missingHE' via the 'R' package 'R2jags'). Three classes of models can be fitted under a variety of missing data assumptions: selection models, pattern mixture models and hurdle models. In addition to model fitting, 'missingHE' provides a set of specialised functions to assess model convergence and fit, and to summarise the statistical and economic results using different types of measures and graphs. The methods implemented are described in Mason (2018) <doi:10.1002/hec.3793>, Molenberghs (2000) <doi:10.1007/978-1-4419-0300-6_18> and Gabrio (2019) <doi:10.1002/sim.8045>.

Maintained by Andrea Gabrio. Last updated 2 years ago.

cost-effectiveness-analysis health-economic-evaluation individual-level-data jags missing-data parametric-modelling sensitivity-analysis cpp

5 stars 5.38 score 24 scripts

ericdunipace

causalOT:Optimal Transport Weights for Causal Inference

Uses optimal transport distances to find probabilistic matching estimators for causal inference. These methods are described in Dunipace, Eric (2021) <arXiv:2109.01991>. The package will build the weights, estimate treatment effects, and calculate confidence intervals via the methods described in the paper. The package also supports several other methods as described in the help files.

Maintained by Eric Dunipace. Last updated 8 months ago.

cpp

6 stars 5.38 score 20 scripts

bioc

MetaboDynamics:Bayesian analysis of longitudinal metabolomics data

MetaboDynamics is an R-package that provides a framework of probabilistic models to analyze longitudinal metabolomics data. It enables robust estimation of mean concentrations despite varying spread between timepoints and reports differences between timepoints as well as metabolite specific dynamics profiles that can be used for identifying "dynamics clusters" of metabolites of similar dynamics. Provides probabilistic over-representation analysis of KEGG functional modules and pathways as well as comparison between clusters of different experimental conditions.

Maintained by Katja Danielzik. Last updated 4 days ago.

software metabolomics bayesian functionalprediction multiplecomparison kegg pathways dynamics functional-analysis longitudinal-analysis metabolomics-data metabolomics-pipeline cpp

5 stars 5.30 score 3 scripts

cotima

CoTiMA:Continuous Time Meta-Analysis ('CoTiMA')

The 'CoTiMA' package performs meta-analyses of correlation matrices of repeatedly measured variables taken from studies that used different time intervals. Different time intervals between measurement occasions impose problems for meta-analyses because the effects (e.g. cross-lagged effects) cannot be simply aggregated, for example, by means of common fixed or random effects analysis. However, continuous time math, which is applied in 'CoTiMA', can be used to extrapolate or intrapolate the results from all studies to any desired time lag. By this, effects obtained in studies that used different time intervals can be meta-analyzed. 'CoTiMA' fits models to empirical data using the structural equation model (SEM) package 'ctsem', the effects specified in a SEM are related to parameters that are not directly included in the model (i.e., continuous time parameters; together, they represent the continuous time structural equation model, CTSEM). Statistical model comparisons and significance tests are then performed on the continuous time parameter estimates. 'CoTiMA' also allows analysis of publication bias (Egger's test, PET-PEESE estimates, zcurve analysis etc.) and analysis of statistical power (post hoc power, required sample sizes). See Dormann, C., Guthier, C., & Cortina, J. M. (2019) <doi:10.1177/1094428119847277>. and Guthier, C., Dormann, C., & Voelkle, M. C. (2020) <doi:10.1037/bul0000304>.

Maintained by Markus Homberg. Last updated 5 days ago.

4 stars 5.26 score

fbartos

RoBTT:Robust Bayesian T-Test

An implementation of Bayesian model-averaged t-tests that allows users to draw inferences about the presence versus absence of an effect, variance heterogeneity, and potential outliers. The 'RoBTT' package estimates ensembles of models created by combining competing hypotheses and applies Bayesian model averaging using posterior model probabilities. Users can obtain model-averaged posterior distributions and inclusion Bayes factors, accounting for uncertainty in the data-generating process (Maier et al., 2024, <doi:10.3758/s13423-024-02590-5>). The package also provides a truncated likelihood version of the model-averaged t-test, enabling users to exclude potential outliers without introducing bias (Godmann et al., 2024, <doi:10.31234/osf.io/j9f3s>). Users can specify a wide range of informative priors for all parameters of interest. The package offers convenient functions for summary, visualization, and fit diagnostics.

Maintained by František Bartoš. Last updated 5 months ago.

bayesian model-averaging outliers t-test cpp

3 stars 5.26 score 9 scripts

epiforecasts

forecast.vocs:Forecast Case and Sequence Notifications using Variant of Concern Strain Dynamics

Contains models and tools to produce short-term forecasts for both case and sequence notifications assuming circulation of either one or two variants. Tools are also provided to allow the evaluation of the use of sequence data for short-term forecasts in both real-world settings and in user generated scenarios.

Maintained by Sam Abbott. Last updated 2 years ago.

8 stars 5.25 score 22 scripts

maeveupton

reslr:Modelling Relative Sea Level Data

The Bayesian modelling of relative sea-level data using a comprehensive approach that incorporates various statistical models within a unifying framework. Details regarding each statistical models; linear regression (Ashe et al 2019) <doi:10.1016/j.quascirev.2018.10.032>, change point models (Cahill et al 2015) <doi:10.1088/1748-9326/10/8/084002>, integrated Gaussian process models (Cahill et al 2015) <doi:10.1214/15-AOAS824>, temporal splines (Upton et al 2023) <arXiv:2301.09556>, spatio-temporal splines (Upton et al 2023) <arXiv:2301.09556> and generalised additive models (Upton et al 2023) <arXiv:2301.09556>. This package facilitates data loading, model fitting and result summarisation. Notably, it accommodates the inherent measurement errors found in relative sea-level data across multiple dimensions, allowing for their inclusion in the statistical models.

Maintained by Maeve Upton. Last updated 1 years ago.

jags cpp

4 stars 5.23 score 28 scripts

hannahcomiskey

mcmsupply:Estimating Public and Private Sector Contraceptive Market Supply Shares

Family Planning programs and initiatives typically use nationally representative surveys to estimate key indicators of a country’s family planning progress. However, in recent years, routinely collected family planning services data (Service Statistics) have been used as a supplementary data source to bridge gaps in the surveys. The use of service statistics comes with the caveat that adjustments need to be made for missing private sector contributions to the contraceptive method supply chain. Evaluating the supply source of modern contraceptives often relies on Demographic Health Surveys (DHS), where many countries do not have recent data beyond 2015/16. Fortunately, in the absence of recent surveys we can rely on statistical model-based estimates and projections to fill the knowledge gap. We present a Bayesian, hierarchical, penalized-spline model with multivariate-normal spline coefficients, to account for across method correlations, to produce country-specific,annual estimates for the proportion of modern contraceptive methods coming from the public and private sectors. This package provides a quick and convenient way for users to access the DHS modern contraceptive supply share data at national and subnational administration levels, estimate, evaluate and plot annual estimates with uncertainty for a sample of low- and middle-income countries. Methods for the estimation of method supply shares at the national level are described in Comiskey, Alkema, Cahill (2022) <arXiv:2212.03844>.

Maintained by Hannah Comiskey. Last updated 12 months ago.

jags cpp

2 stars 5.15 score 20 scripts

spsanderson

healthyverse:Easily Install and Load the 'healthyverse'

The 'healthyverse' is a set of packages that work in harmony because they share common data representations and 'API' design. This package is designed to make it easy to install and load multiple 'healthyverse' packages in a single step.

Maintained by Steven Sanderson. Last updated 6 months ago.

analytics healthcare healthcare-application installation installer meta packages

11 stars 5.12 score 24 scripts

fndemarqui

bellreg:Count Regression Models Based on the Bell Distribution

Bell regression models for count data with overdispersion. The implemented models account for ordinary and zero-inflated regression models under both frequentist and Bayesian approaches. Theoretical details regarding the models implemented in the package can be found in Castellares et al. (2018) <doi:10.1016/j.apm.2017.12.014> and Lemonte et al. (2020) <doi:10.1080/02664763.2019.1636940>.

Maintained by Fabio Demarqui. Last updated 5 months ago.

cpp

5.10 score 21 scripts 1 dependents

cran

ggtern:An Extension to 'ggplot2', for the Creation of Ternary Diagrams

Extends the functionality of 'ggplot2', providing the capability to plot ternary diagrams for (subset of) the 'ggplot2' geometries. Additionally, 'ggtern' has implemented several NEW geometries which are unavailable to the standard 'ggplot2' release. For further examples and documentation, please proceed to the 'ggtern' website.

Maintained by Nicholas Hamilton. Last updated 1 years ago.

4 stars 5.09 score 6 dependents

gunhanb

MetaStan:Bayesian Meta-Analysis via 'Stan'

Performs Bayesian meta-analysis, meta-regression and model-based meta-analysis using 'Stan'. Includes binomial-normal hierarchical models and option to use weakly informative priors for the heterogeneity parameter and the treatment effect parameter which are described in Guenhan, Roever, and Friede (2020) <doi:10.1002/jrsm.1370>.

Maintained by Burak Kuersad Guenhan. Last updated 2 years ago.

cpp

8 stars 5.08 score 7 scripts

jbolstad

hbamr:Hierarchical Bayesian Aldrich-McKelvey Scaling via 'Stan'

Perform hierarchical Bayesian Aldrich-McKelvey scaling using Hamiltonian Monte Carlo via 'Stan'. Aldrich-McKelvey ('AM') scaling is a method for estimating the ideological positions of survey respondents and political actors on a common scale using positional survey data. The hierarchical versions of the Bayesian 'AM' model included in this package outperform other versions both in terms of yielding meaningful posterior distributions for respondent positions and in terms of recovering true respondent positions in simulations. The package contains functions for preparing data, fitting models, extracting estimates, plotting key results, and comparing models using cross-validation. The original version of the default model is described in Bølstad (2024) <doi:10.1017/pan.2023.18>.

Maintained by Jørgen Bølstad. Last updated 4 days ago.

bayesian bayesian-inference ideal-point-estimation stan survey-analysis cpp

2 stars 5.04 score

vinhdizzo

IRexamples:Collection of Practical Institutional Research Examples and Tutorials

Provides examples of code for analyzing data or accomplishing tasks that may be useful to institutional or educational researchers.

Maintained by Vinh Nguyen. Last updated 2 years ago.

4 stars 5.00 score 4 scripts

emmagovan

cosimmr:Fast Fitting of Stable Isotope Mixing Models with Covariates

Fast fitting of Stable Isotope Mixing Models in R. Allows for the inclusion of covariates. Also has built-in summary functions and plot functions which allow for the creation of isospace plots. Variational Bayes is used to fit these models, methods as described in: Tran et al., (2021) <doi:10.48550/arXiv.2103.01327>.

Maintained by Emma Govan. Last updated 7 months ago.

openblas cpp openmp jags

1 stars 5.00 score 7 scripts

bioc

coseq:Co-Expression Analysis of Sequencing Data

Co-expression analysis for expression profiles arising from high-throughput sequencing data. Feature (e.g., gene) profiles are clustered using adapted transformations and mixture models or a K-means algorithm, and model selection criteria (to choose an appropriate number of clusters) are provided.

Maintained by Andrea Rau. Last updated 5 months ago.

geneexpression rnaseq sequencing software immunooncology

4.98 score 16 scripts

connordonegan

surveil:Time Series Models for Disease Surveillance

Fits time trend models for routine disease surveillance tasks and returns probability distributions for a variety of quantities of interest, including age-standardized rates, period and cumulative percent change, and measures of health inequality. The models are appropriate for count data such as disease incidence and mortality data, employing a Poisson or binomial likelihood and the first-difference (random-walk) prior for unknown risk. Optionally add a covariance matrix for multiple, correlated time series models. Inference is completed using Markov chain Monte Carlo via the Stan modeling language. References: Donegan, Hughes, and Lee (2022) <doi:10.2196/34589>; Stan Development Team (2021) <https://mc-stan.org>; Theil (1972, ISBN:0-444-10378-3).

Maintained by Connor Donegan. Last updated 9 months ago.

bayesian-statistics cancer health-equity public-health rstan cpp

2 stars 4.98 score 12 scripts

sujit-sahu

bmstdr:Bayesian Modeling of Spatio-Temporal Data with R

Fits, validates and compares a number of Bayesian models for spatial and space time point referenced and areal unit data. Model fitting is done using several packages: 'rstan', 'INLA', 'spBayes', 'spTimer', 'spTDyn', 'CARBayes' and 'CARBayesST'. Model comparison is performed using the DIC and WAIC, and K-fold cross-validation where the user is free to select their own subset of data rows for validation. Sahu (2022) <doi:10.1201/9780429318443> describes the methods in detail.

Maintained by Sujit K. Sahu. Last updated 1 days ago.

bayesian modelling spatio-temporal-data cpp

16 stars 4.98 score 12 scripts

bgoussen

BeeGUTS:General Unified Threshold Model of Survival for Bees using Bayesian Inference

Tools to calibrate, validate, and make predictions with the General Unified Threshold model of Survival adapted for Bee species. The model is presented in the publication from Baas, J., Goussen, B., Miles, M., Preuss, T.G., Roessing, I. (2022) <doi:10.1002/etc.5423> and Baas, J., Goussen, B., Taenzler, V., Roeben, V., Miles, M., Preuss, T.G., van den Berg, S., Roessink, I. (2024) <doi:10.1002/etc.5871>, and is based on the GUTS framework Jager, T., Albert, C., Preuss, T.G. and Ashauer, R. (2011) <doi:10.1021/es103092a>. The authors are grateful to Bayer A.G. for its financial support.

Maintained by Benoit Goussen. Last updated 5 months ago.

cpp

3 stars 4.95 score 6 scripts

bioc

decontX:Decontamination of single cell genomics data

This package contains implementation of DecontX (Yang et al. 2020), a decontamination algorithm for single-cell RNA-seq, and DecontPro (Yin et al. 2023), a decontamination algorithm for single cell protein expression data. DecontX is a novel Bayesian method to computationally estimate and remove RNA contamination in individual cells without empty droplet information. DecontPro is a Bayesian method that estimates the level of contamination from ambient and background sources in CITE-seq ADT dataset and decontaminate the dataset.

Maintained by Joshua Campbell. Last updated 1 months ago.

singlecell bayesian cpp

4.94 score 29 scripts

lnalborczyk

ESTER:Efficient Sequential Testing with Evidence Ratios

An implementation of sequential testing that uses evidence ratios computed from the weights of a set of models. These weights correspond either to Akaike weights computed from the Akaike Information Criterion (AIC) or the Bayesian Information Criterion (BIC) and following Burnham & Anderson (2004, <doi:10.1177/0049124104268644>) recommendations, or to pseudo-BMA weights computed from the WAIC or the LOOIC of models fitted with 'brms' and following Yao et al. (2017, <arXiv:1704.02030v3>).

Maintained by Ladislas Nalborczyk. Last updated 7 years ago.

2 stars 4.92 score 21 scripts

tspsyched

autoFC:Automatic Construction of Forced-Choice Tests

Forced-choice (FC) response has gained increasing popularity and interest for its resistance to faking when well-designed (Cao & Drasgow, 2019 <doi:10.1037/apl0000414>). To established well-designed FC scales, typically each item within a block should measure different trait and have similar level of social desirability (Zhang et al., 2020 <doi:10.1177/1094428119836486>). Recent study also suggests the importance of high inter-item agreement of social desirability between items within a block (Pavlov et al., 2021 <doi:10.31234/osf.io/hmnrc>). In addition to this, FC developers may also need to maximize factor loading differences (Brown & Maydeu-Olivares, 2011 <doi:10.1177/0013164410375112>) or minimize item location differences (Cao & Drasgow, 2019 <doi:10.1037/apl0000414>) depending on scoring models. Decision of which items should be assigned to the same block, termed item pairing, is thus critical to the quality of an FC test. This pairing process is essentially an optimization process which is currently carried out manually. However, given that we often need to simultaneously meet multiple objectives, manual pairing becomes impractical or even not feasible once the number of latent traits and/or number of items per trait are relatively large. To address these problems, autoFC is developed as a practical tool for facilitating the automatic construction of FC tests (Li et al., 2022 <doi:10.1177/01466216211051726>), essentially exempting users from the burden of manual item pairing and reducing the computational costs and biases induced by simple ranking methods. Given characteristics of each item (and item responses), FC tests can be automatically constructed based on user-defined pairing criteria and weights as well as customized optimization behavior. Users can also construct parallel forms of the same test following the same pairing rules.

Maintained by Mengtong Li. Last updated 18 days ago.

4 stars 4.90 score 3 scripts

bdj34

cloneRate:Estimate Growth Rates from Phylogenetic Trees

Quickly estimate the net growth rate of a population or clone whose growth can be approximated by a birth-death branching process. Input should be phylogenetic tree(s) of clone(s) with edge lengths corresponding to either time or mutations. Based on coalescent results in Johnson et al. (2023) <doi:10.1093/bioinformatics/btad561>. Simulation techniques as well as growth rate methods build on prior work from Lambert A. (2018) <doi:10.1016/j.tpb.2018.04.005> and Stadler T. (2009) <doi:10.1016/j.jtbi.2009.07.018>.

Maintained by Brian Johnson. Last updated 12 months ago.

cpp

4 stars 4.90 score 8 scripts

nhanhocu

metamicrobiomeR:an R package for analysis of microbiome relative abundance data using zero inflated beta GAMLSS and meta-analysis across studies using random effect model

The metamicrobiomeR package implements Generalized Additive Model for Location, Scale and Shape (GAMLSS) with zero inflated beta (BEZI) family for analysis of microbiome relative abundance data (with various options for data transformation/normalization to address compositional effects) and random effect meta-analysis models for meta-analysis pooling estimates across microbiome studies. Random Forest model to predict microbiome age based on relative abundances of shared bacterial genera with the Bangladesh data (Subramanian et al 2014), comparison of multiple diversity indexes using linear/linear mixed effect models and some data display/visualization are also implemented.

Maintained by Nhan Ho. Last updated 4 years ago.

33 stars 4.90 score 12 scripts

ropensci

eDNAjoint:Joint Modeling of Traditional and Environmental DNA Survey Data in a Bayesian Framework

Models integrate environmental DNA (eDNA) detection data and traditional survey data to jointly estimate species catch rate (see package vignette: <https://ednajoint.netlify.app/>). Models can be used with count data via traditional survey methods (i.e., trapping, electrofishing, visual) and replicated eDNA detection/nondetection data via polymerase chain reaction (i.e., PCR or qPCR) from multiple survey locations. Estimated parameters include probability of a false positive eDNA detection, a site-level covariates that scale the sensitivity of eDNA surveys relative to traditional surveys, and gear scaling coefficients for traditional gear types. Models are implemented with a Bayesian framework (Markov chain Monte Carlo) using the 'Stan' probabilistic programming language.

Maintained by Abigail G. Keller. Last updated 1 days ago.

cpp

5 stars 4.89 score 8 scripts

r-forge

EasyABC:Efficient Approximate Bayesian Computation Sampling Schemes

Enables launching a series of simulations of a computer code from the R session, and to retrieve the simulation outputs in an appropriate format for post-processing treatments. Five sequential sampling schemes and three coupled-to-MCMC schemes are implemented.

Maintained by Nicolas Dumoulin. Last updated 3 months ago.

4.88 score 140 scripts 3 dependents

benjamin-w-campbell

fergm:Estimation and Fit Assessment of Frailty Exponential Random Graph Models

Frailty Exponential Random Graph Models estimated through pseudo likelihood with frailty terms estimated using 'Stan' as per Box-Steffensmeier et. al (2017) <doi:10.7910/DVN/K3D1M2>. Goodness of fit for Frailty Exponential Random Graph Models is also available, with easy visualizations for comparison to fit Exponential Random Graph Models.

Maintained by Benjamin W. Campbell. Last updated 3 years ago.

4 stars 4.86 score 18 scripts

bioc

mfa:Bayesian hierarchical mixture of factor analyzers for modelling genomic bifurcations

MFA models genomic bifurcations using a Bayesian hierarchical mixture of factor analysers.

Maintained by Kieran Campbell. Last updated 5 months ago.

immunooncology rnaseq geneexpression bayesian singlecell cpp

4.85 score 35 scripts

dalenbe2

bayesMeanScale:Bayesian Post-Estimation on the Mean Scale

Computes Bayesian posterior distributions of predictions, marginal effects, and differences of marginal effects for various generalized linear models. Importantly, the posteriors are on the mean (response) scale, allowing for more natural interpretation than summaries on the link scale. Also, predictions and marginal effects of the count probabilities for Poisson and negative binomial models can be computed.

Maintained by David M. Dalenberg. Last updated 3 months ago.

4.85 score 1 scripts

cstewartgh

QFASA:Quantitative Fatty Acid Signature Analysis

Accurate estimates of the diets of predators are required in many areas of ecology, but for many species current methods are imprecise, limited to the last meal, and often biased. The diversity of fatty acids and their patterns in organisms, coupled with the narrow limitations on their biosynthesis, properties of digestion in monogastric animals, and the prevalence of large storage reservoirs of lipid in many predators, led to the development of quantitative fatty acid signature analysis (QFASA) to study predator diets.

Maintained by Connie Stewart. Last updated 7 months ago.

1 stars 4.83 score 17 scripts

angusmclure

PoolTestR:Prevalence and Regression for Pool-Tested (Group-Tested) Data

An easy-to-use tool for working with presence/absence tests on 'pooled' or 'grouped' samples. The primary application is for estimating prevalence of a marker in a population based on the results of tests on pooled specimens. This sampling method is often employed in surveillance of rare conditions in humans or animals (e.g. molecular xenomonitoring). The package was initially conceived as an R-based alternative to the molecular xenomonitoring software, 'PoolScreen' <https://sites.uab.edu/statgenetics/software/>. However, it goes further, allowing for estimates of prevalence to be adjusted for hierarchical sampling frames, and perform flexible mixed-effect regression analyses (McLure et al. Environmental Modelling and Software. <DOI:10.1016/j.envsoft.2021.105158>). The package is currently in early stages, however more features are planned or in the works: e.g. adjustments for imperfect test specificity/sensitivity, functions for helping with optimal experimental design, and functions for spatial modelling.

Maintained by Angus McLure. Last updated 3 months ago.

cpp

5 stars 4.83 score 8 scripts

chjackson

survextrap:Survival Extrapolation with a Flexible Parametric Model and External Data

Survival analysis using a flexible Bayesian model for individual-level right-censored data, optionally combined with aggregate data on counts of survivors in different periods of time. An M-spline is used to describe the hazard function, with a hierarchical prior on the coefficients to control overfitting. Proportional hazards or flexible non-proportional hazards models can be used to relate survival to predictors. Mixture cure models, additive hazards (relative survival) models and waning treatment effects models are also supported. Priors can be customised and calibrated to substantive beliefs. Posterior distributions are estimated using Stan, and outputs are arranged in a tidy format. See See Jackson (2023) <doi:10.48550/arXiv.2306.03957>.

Maintained by Christopher Jackson. Last updated 1 months ago.

cpp

10 stars 4.82 score 11 scripts

saulo-chaves

ProbBreed:Probability Theory for Selecting Candidates in Plant Breeding

Use probability theory under the Bayesian framework for calculating the risk of selecting candidates in a multi-environment context. Contained are functions used to fit a Bayesian multi-environment model (based on the available presets), extract posterior values and maximum posterior values, compute the variance components, check the model’s convergence, and calculate the probabilities. For both across and within-environments scopes, the package computes the probability of superior performance and the pairwise probability of superior performance. Furthermore, the probability of superior stability and the pairwise probability of superior stability across environments is estimated. A joint probability of superior performance and stability is also provided.

Maintained by Saulo Chaves. Last updated 12 days ago.

cpp

8 stars 4.81 score 4 scripts

genentech

BayesERtools:Bayesian Exposure-Response Analysis Tools

Suite of tools that facilitate exposure-response analysis using Bayesian methods. The package provides a streamlined workflow for fitting types of models that are commonly used in exposure-response analysis - linear and Emax for continuous endpoints, logistic linear and logistic Emax for binary endpoints, as well as performing simulation and visualization. Learn more about the workflow at <https://genentech.github.io/BayesERbook/>.

Maintained by Kenta Yoshida. Last updated 1 months ago.

2 stars 4.78 score 20 scripts

vivienroussez

autoTS:Automatic Model Selection and Prediction for Univariate Time Series

Offers a set of functions to easily make predictions for univariate time series. 'autoTS' is a wrapper of existing functions of the 'forecast' and 'prophet' packages, harmonising their outputs in tidy dataframes and using default values for each. The core function getBestModel() allows the user to effortlessly benchmark seven algorithms along with a bagged estimator to identify which one performs the best for a given time series.

Maintained by Vivien Roussez. Last updated 5 years ago.

10 stars 4.78 score 12 scripts

lbbe-software

rbioacc:Inference and Prediction of ToxicoKinetic (TK) Models

The MOSAICbioacc application is a turnkey package providing bioaccumulation factors (BCF/BMF/BSAF) from a toxicokinetic (TK) model fitted to accumulation-depuration data. It is designed to fulfil the requirements of regulators when examining applications for market authorization of active substances. See Ratier et al. (2021) <doi:10.1101/2021.09.08.459421>.

Maintained by Virgile Baudrot. Last updated 1 years ago.

cpp

4.78 score 8 scripts

c7rishi

BAMBI:Bivariate Angular Mixture Models

Fit (using Bayesian methods) and simulate mixtures of univariate and bivariate angular distributions. Chakraborty and Wong (2021) <doi:10.18637/jss.v099.i11>.

Maintained by Saptarshi Chakraborty. Last updated 5 months ago.

cpp

3 stars 4.77 score 65 scripts 1 dependents

cubiczebra

TPMplt:Tool-Kit for Dynamic Materials Model and Thermal Processing Maps

Provides a simple approach for constructing dynamic materials modeling suggested by Prasad and Gegel (1984) <doi:10.1007/BF02664902>. It can easily generate various processing-maps based on this model as well. The calculation result in this package contains full materials constants, information about power dissipation efficiency factor, and rheological properties, can be exported completely also, through which further analysis and customized plots will be applicable as well.

Maintained by Chen Zhang. Last updated 6 months ago.

2 stars 4.76 score 29 scripts

squidgroup

squid:Statistical Quantification of Individual Differences

A simulation-based tool made to help researchers to become familiar with multilevel variations, and to build up sampling designs for their study. This tool has two main objectives: First, it provides an educational tool useful for students, teachers and researchers who want to learn to use mixed-effects models. Users can experience how the mixed-effects model framework can be used to understand distinct biological phenomena by interactively exploring simulated multilevel data. Second, it offers research opportunities to those who are already familiar with mixed-effects models, as it enables the generation of data sets that users may download and use for a range of simulation-based statistical analyses such as power and sensitivity analysis of multilevel and multivariate data [Allegue, H., Araya-Ajoy, Y.G., Dingemanse, N.J., Dochtermann N.A., Garamszegi, L.Z., Nakagawa, S., Reale, D., Schielzeth, H. and Westneat, D.F. (2016) <doi: 10.1111/2041-210X.12659>].

Maintained by Hassen Allegue. Last updated 3 years ago.

linear-mixed-effects-modelling multilevel-data personality phenotypic-equation phenotypic-plasticity reaction-norm repeatability simulations variance-components

34 stars 4.76 score 17 scripts

jonathansmart

BayesGrowth:Estimate Fish Growth Using MCMC Analysis

Estimate fish length-at-age models using MCMC analysis with 'rstan' models. This package allows a multimodel approach to growth fitting to be applied to length-at-age data and is supported by further analyses to determine model selection and result presentation. The core methods of this package are presented in Smart and Grammer (2021) "Modernising fish and shark growth curves with Bayesian length-at-age models". PLOS ONE 16(2): e0246734 <doi:10.1371/journal.pone.0246734>.

Maintained by Jonathan Smart. Last updated 1 years ago.

cpp

11 stars 4.74 score 8 scripts

epiforecasts

EpiNow:Estimate Realtime Case Counts and Time-varying Epidemiological Parameters

To identify changes in the reproduction number, rate of spread, and doubling time during the course of outbreaks whilst accounting for potential biases due to delays in case reporting.

Maintained by Sam Abbott. Last updated 5 years ago.

cpp

33 stars 4.74 score 111 scripts

dcgerard

hwep:Hardy-Weinberg Equilibrium in Polyploids

Inference concerning equilibrium and random mating in autopolyploids. Methods are available to test for equilibrium and random mating at any even ploidy level (>2) in the presence of double reduction at biallelic loci. For autopolyploid populations in equilibrium, methods are available to estimate the degree of double reduction. We also provide functions to calculate genotype frequencies at equilibrium, or after one or several rounds of random mating, given rates of double reduction. The main function is hwefit(). This material is based upon work supported by the National Science Foundation under Grant No. 2132247. The opinions, findings, and conclusions or recommendations expressed are those of the author and do not necessarily reflect the views of the National Science Foundation. For details of these methods, see Gerard (2023a) <doi:10.1111/biom.13722> and Gerard (2023b) <doi:10.1111/1755-0998.13856>.

Maintained by David Gerard. Last updated 2 years ago.

cpp

3 stars 4.72 score 35 scripts

sshpa

bayesvl:Visually Learning the Graphical Structure of Bayesian Networks and Performing MCMC with 'Stan'

Provides users with its associated functions for pedagogical purposes in visually learning Bayesian networks and Markov chain Monte Carlo (MCMC) computations. It enables users to: a) Create and examine the (starting) graphical structure of Bayesian networks; b) Create random Bayesian networks using a dataset with customized constraints; c) Generate Stan code for structures of Bayesian networks for sampling the data and learning parameters; d) Plot the network graphs; e) Perform Markov chain Monte Carlo computations and produce graphs for posteriors checks. The package refers to one reference item, which describes the methods and algorithms: Vuong, Quan-Hoang and La, Viet-Phuong (2019) <doi:10.31219/osf.io/w5dx6> The 'bayesvl' R package. Open Science Framework (May 18).

Maintained by Viet-Phuong La. Last updated 3 years ago.

bayesian mcmc

20 stars 4.70 score 3 scripts

zjg540066169

AuxSurvey:Survey Analysis with Auxiliary Discretized Variables

Probability surveys often use auxiliary continuous data from administrative records, but the utility of this data is diminished when it is discretized for confidentiality. We provide a set of survey estimators to make full use of information from the discretized variables. See Williams, S.Z., Zou, J., Liu, Y., Si, Y., Galea, S. and Chen, Q. (2024), Improving Survey Inference Using Administrative Records Without Releasing Individual-Level Continuous Data. Statistics in Medicine, 43: 5803-5813. <doi:10.1002/sim.10270> for details.

Maintained by Jungang Zou. Last updated 3 months ago.

auxilary-variables categorical-variables survey-analysis

1 stars 4.70 score 5 scripts

rh8liuqy

GUD:Bayesian Modal Regression Based on the GUD Family

Provides probability density functions and sampling algorithms for three key distributions from the General Unimodal Distribution (GUD) family: the Flexible Gumbel (FG) distribution, the Double Two-Piece (DTP) Student-t distribution, and the Two-Piece Scale (TPSC) Student-t distribution. Additionally, this package includes a function for Bayesian linear modal regression, leveraging these three distributions for model fitting. The details of the Bayesian modal regression model based on the GUD family can be found at Liu, Huang, and Bai (2024) <doi:10.1016/j.csda.2024.108012>.

Maintained by Qingyang Liu. Last updated 9 months ago.

cpp

5 stars 4.70 score 2 scripts

audreyqyfu

MRPC:PC Algorithm with the Principle of Mendelian Randomization

A PC Algorithm with the Principle of Mendelian Randomization. This package implements the MRPC (PC with the principle of Mendelian randomization) algorithm to infer causal graphs. It also contains functions to simulate data under a certain topology, to visualize a graph in different ways, and to compare graphs and quantify the differences. See Badsha and Fu (2019) <doi:10.3389/fgene.2019.00460>,Badsha, Martin and Fu (2021) <doi:10.3389/fgene.2021.651812>.

Maintained by Audrey Fu. Last updated 3 years ago.

8 stars 4.68 score 20 scripts

pboesu

moultmcmc:Bayesian Inference for Avian Moult Phenology Models

Precompiled Stan models to conduct full Bayesian inference in regression models for the phenology of avian primary moult based on the approach of Underhill & Zucchini (1988) <DOI:10.1111/j.1474-919X.1988.tb00993.x> and related hierarchical models to accommodate repeated-measures data (within-season recaptures of individuals).

Maintained by Philipp Boersch-Supan. Last updated 2 years ago.

cpp

4 stars 4.67 score 26 scripts 1 dependents

yxlin

ggdmc:Cognitive Models

The package provides tools to fit the LBA, DDM, PM and 2-D diffusion models, using the population-based Markov Chain Monte Carlo.

Maintained by Yi-Shin Lin. Last updated 8 months ago.

openblas cpp

19 stars 4.66 score 24 scripts

mlampros

fitbitViz:'Fitbit' Visualizations

Connection to the 'Fitbit' Web API <https://dev.fitbit.com/build/reference/web-api/> by including 'ggplot2' Visualizations, 'Leaflet' and 3-dimensional 'Rayshader' Maps. The 3-dimensional 'Rayshader' Map requires the installation of the 'CopernicusDEM' R package which includes the 30- and 90-meter elevation data.

Maintained by Lampros Mouselimis. Last updated 1 years ago.

blogdown fitbit fitbit-api github-actions visualization

9 stars 4.65 score 1 scripts

cubiczebra

VBTree:Vector Binary Tree to Make Your Data Management More Efficient

Vector binary tree provides a new data structure, to make your data visiting and management more efficient. If the data has structured column names, it can read these names and factorize them through specific split pattern, then build the mappings within double list, vector binary tree, array and tensor mutually, through which the batched data processing is achievable easily. The methods of array and tensor are also applicable. Detailed methods are described in Chen Zhang et al. (2020) <doi:10.35566/isdsa2019c8>.

Maintained by Chen Zhang. Last updated 1 years ago.

4.65 score 30 scripts 1 dependents

krisrs1128

miniLNM:Miniature Logistic-Normal Multinomial Models

Logistic-normal Multinomial (LNM) models are common in problems with multivariate count data. This package gives a simple implementation with a 30 line 'Stan' script. This lightweight implementation makes it an easy starting point for other projects, in particular for downstream tasks that require analysis of "compositional" data. It can be applied whenever a multinomial probability parameter is thought to depend linearly on inputs in a transformed, log ratio space. Additional utilities make it easy to inspect, create predictions, and draw samples using the fitted models. More about the LNM can be found in Xia et al. (2013) "A Logistic Normal Multinomial Regression Model for Microbiome Compositional Data Analysis" <doi:10.1111/biom.12079> and Sankaran and Holmes (2023) "Generative Models: An Interdisciplinary Perspective" <doi:10.1146/annurev-statistics-033121-110134>.

Maintained by Kris Sankaran. Last updated 7 months ago.

cpp

4.65 score 5 scripts 1 dependents

bbsbayes

bbsBayes2:Hierarchical Bayesian Analysis of North American BBS Data

The North American Breeding Bird Survey (BBS) is a long-running program that seeks to monitor the status and trends of the breeding birds in North America. Since its start in 1966, the BBS has accumulated over 50 years of data for over 500 species of North American Birds. Given the temporal and spatial structure of the data, hierarchical Bayesian models are used to assess the status and trends of these 500+ species of birds. 'bbsBayes2' allows you to perform hierarchical Bayesian analysis of BBS data. You can run a full model analysis for one or more species that you choose, or you can take more control and specify how the data should be stratified, prepared for 'Stan', or modelled.

Maintained by Brandon P.M. Edwards. Last updated 2 months ago.

8 stars 4.64 score 121 scripts

andrjohns

StanEstimators:Estimate Parameters for Arbitrary R Functions using 'Stan'

Allows for the estimation of parameters for 'R' functions using the various algorithms implemented in the 'Stan' probabilistic programming language.

Maintained by Andrew R. Johnson. Last updated 9 months ago.

cpp

25 stars 4.62 score 11 scripts

jessieyeung

rcbayes:Estimate Rogers-Castro Migration Age Schedules with Bayesian Models

A collection of functions to estimate Rogers-Castro migration age schedules using 'Stan'. This model which describes the fundamental relationship between migration and age in the form of a flexible multi-exponential migration model was most notably proposed in Rogers and Castro (1978) <doi:10.1068/a100475>.

Maintained by Jessie Yeung. Last updated 1 years ago.

cpp

2 stars 4.60 score 8 scripts

datacloning

dcmle:Hierarchical Models Made Easy with Data Cloning

S4 classes around infrastructure provided by the 'coda' and 'dclone' packages to make package development easy as a breeze with data cloning for hierarchical models.

Maintained by Peter Solymos. Last updated 6 months ago.

jags cpp

4.60 score 66 scripts 2 dependents

sooyongl

flps:Fully-Latent Principal Stratification

Simulation and analysis of Fully-Latent Principal Stratification (FLPS) with measurement models. Lee, Adam, Kang, & Whittaker (2023). <doi:10.1007/978-3-031-27781-8_25>. This package is supported by the Institute of Education Sciences, U.S. Department of Education, through Grant R305D210036.

Maintained by Sooyong Lee. Last updated 11 months ago.

2 stars 4.58 score 19 scripts

fndemarqui

rsurv:Random Generation of Survival Data

Random generation of survival data from a wide range of regression models, including accelerated failure time (AFT), proportional hazards (PH), proportional odds (PO), accelerated hazard (AH), Yang and Prentice (YP), and extended hazard (EH) models. The package 'rsurv' also stands out by its ability to generate survival data from an unlimited number of baseline distributions provided that an implementation of the quantile function of the chosen baseline distribution is available in R. Another nice feature of the package 'rsurv' lies in the fact that linear predictors are specified via a formula-based approach, facilitating the inclusion of categorical variables and interaction terms. The functions implemented in the package 'rsurv' can also be employed to simulate survival data with more complex structures, such as survival data with different types of censoring mechanisms, survival data with cure fraction, survival data with random effects (frailties), multivariate survival data, and competing risks survival data. Details about the R package 'rsurv' can be found in Demarqui (2024) <doi:10.48550/arXiv.2406.01750>.

Maintained by Fabio Demarqui. Last updated 2 months ago.

1 stars 4.56 score 24 scripts

paulnorthrop

lite:Likelihood-Based Inference for Time Series Extremes

Performs likelihood-based inference for stationary time series extremes. The general approach follows Fawcett and Walshaw (2012) <doi:10.1002/env.2133>. Marginal extreme value inferences are adjusted for cluster dependence in the data using the methodology in Chandler and Bate (2007) <doi:10.1093/biomet/asm015>, producing an adjusted log-likelihood for the model parameters. A log-likelihood for the extremal index is produced using the K-gaps model of Suveges and Davison (2010) <doi:10.1214/09-AOAS292>. These log-likelihoods are combined to make inferences about extreme values. Both maximum likelihood and Bayesian approaches are available.

Maintained by Paul J. Northrop. Last updated 9 months ago.

clustered extremal-index extreme-value-statistics extremes frequentist generalised-pareto inference likelihood log-likelihood threshold time-series

3 stars 4.56 score 12 scripts

falkcarl

multilevelmediation:Utility Functions for Multilevel Mediation Analysis

The ultimate goal is to support 2-2-1, 2-1-1, and 1-1-1 models for multilevel mediation, the option of a moderating variable for either the a, b, or both paths, and covariates. Currently the 1-1-1 model is supported and several options of random effects; the initial code for bootstrapping was evaluated in simulations by Falk, Vogel, Hammami, and Miočević (2024) <doi:10.3758/s13428-023-02079-4>. Support for Bayesian estimation using 'brms' comprises ongoing work. Currently only continuous mediators and outcomes are supported. Factors for any predictors must be numerically represented.

Maintained by Carl F. Falk. Last updated 3 months ago.

6 stars 4.56 score 2 scripts