Showing 162 of 162 results
gssrdoc:Document General Social Survey Variable
The General Social Survey (GSS) is a long-running, mostly annual survey of US households. It is administered by the National Opinion Research Center (NORC). This package contains a tibble with information on the survey variables, together with every variable documented as an R help page. For more information on the GSS see \url{}.
Maintained by Kieran Healy. Last updated 11 months ago.
90.2 match 2.28 score 38 scriptssiacus
sde:Simulation and Inference for Stochastic Differential Equations
Companion package to the book Simulation and Inference for Stochastic Differential Equations With R Examples, ISBN 978-0-387-75838-1, Springer, NY.
Maintained by Stefano Maria Iacus. Last updated 2 years ago.
17.3 match 7.02 score 178 scripts 15 dependentssmoeding
usl:Analyze System Scalability with the Universal Scalability Law
The Universal Scalability Law (Gunther 2007) <doi:10.1007/978-3-540-31010-5> is a model to predict hardware and software scalability. It uses system capacity as a function of load to forecast the scalability for the system.
Maintained by Stefan Moeding. Last updated 2 years ago.
18.9 match 36 stars 6.32 score 117 scriptsradicalcommecol
cxr:A Toolbox for Modelling Species Coexistence in R
Recent developments in modern coexistence theory have advanced our understanding of how species are able to persist and co-occur with other species at varying abundances. However, applying this mathematical framework to empirical data is still challenging, precluding wider adoption by empiricists of the theoretical tools that have been developed. This package provides a complete toolbox for modelling interaction effects between species and calculating fitness and niche differences. The functions are flexible, may accept covariates, and different fitting algorithms can be used. A full description of the underlying methods is available in García-Callejas, D., Godoy, O., and Bartomeus, I. (2020) <doi:10.1111/2041-210X.13443>. Furthermore, the package provides a series of functions to calculate dynamics for stage-structured populations across sites.
Maintained by David Garcia-Callejas. Last updated 1 month ago.
18.3 match 10 stars 6.51 score 27 scriptsigraph
igraph:Network Analysis and Visualization
Routines for simple graphs and network analysis. It can handle large graphs very well and provides functions for generating random and regular graphs, graph visualization, centrality methods and much more.
Maintained by Kirill Müller. Last updated 8 hours ago.
5.6 match 582 stars 21.11 score 31k scripts 1.9k dependentsmpascariu
MortalityLaws:Parametric Mortality Models, Life Tables and HMD
Fit the most popular human mortality 'laws', and construct full and abridged life tables given various input indices. A mortality law is a parametric function that describes the dying-out process of individuals in a population during a significant portion of their life spans. For a comprehensive review of the most important mortality laws see Tabeau (2001) <doi:10.1007/0-306-47562-6_1>. Practical functions for downloading data from various human mortality databases are provided as well.
Maintained by Marius D. Pascariu. Last updated 1 year ago.
15.9 match 32 stars 7.00 score 103 scripts 1 dependentsschochastics
networkdata:Repository of Network Datasets
The package contains a large collection of network datasets from different contexts. This includes social networks, animal networks and movie networks. All datasets are in 'igraph' format.
Maintained by David Schoch. Last updated 12 months ago.
21.8 match 143 stars 5.01 score 143 scriptssor16
bdrc:Bayesian Discharge Rating Curves
Fits a discharge rating curve based on the power-law and the generalized power-law from data on paired stage and discharge measurements in a given river using a Bayesian hierarchical model as described in Hrafnkelsson et al. (2020) <arXiv:2010.04769>.
Maintained by Rafael Daníel Vias. Last updated 6 months ago.
17.7 match 12 stars 6.07 score 11 scriptscsgillespie
poweRlaw:Analysis of Heavy Tailed Distributions
An implementation of maximum likelihood estimators for a variety of heavy tailed distributions, including both the discrete and continuous power law distributions. Additionally, a goodness-of-fit based approach is used to estimate the lower cut-off for the scaling region.
Maintained by Colin Gillespie. Last updated 1 month ago.
7.9 match 112 stars 12.79 score 332 scripts 32 dependentsmichalovadek
eurlex:Retrieve Data on European Union Law
Access to data on European Union laws and court decisions made easy with pre-defined 'SPARQL' queries and 'GET' requests. See Ovadek (2021) <doi:10.1080/2474736X.2020.1870150>.
Maintained by Michal Ovadek. Last updated 7 months ago.
14.5 match 36 stars 6.18 score 21 scriptsgeobosh
StableEstim:Estimate the Four Parameters of Stable Laws using Different Methods
Estimate the four parameters of stable laws using maximum likelihood method, generalised method of moments with finite and continuum number of points, iterative Koutrouvelis regression and Kogon-McCulloch method. The asymptotic properties of the estimators (covariance matrix, confidence intervals) are also provided.
Maintained by Georgi N. Boshnakov. Last updated 5 months ago.
15.8 match 3.73 score 18 scripts 2 dependentsbiooss
sensitivity:Global Sensitivity Analysis of Model Outputs and Importance Measures
A collection of functions for sensitivity analysis of model outputs (factor screening, global sensitivity analysis and robustness analysis), for variable importance measures of data, as well as for interpretability of machine learning models. Most of the functions have to be applied on scalar output, but several functions support multi-dimensional outputs.
Maintained by Bertrand Iooss. Last updated 7 months ago.
8.4 match 17 stars 6.74 score 472 scripts 8 dependentsscottkosty
bootstrap:Functions for the Book "An Introduction to the Bootstrap"
Software (bootstrap, cross-validation, jackknife) and data for the book "An Introduction to the Bootstrap" by B. Efron and R. Tibshirani, 1993, Chapman and Hall. This package is primarily provided for projects already based on it, and for support of the book. New projects should preferentially use the recommended package "boot".
Maintained by Scott Kostyshak. Last updated 6 years ago.
7.2 match 7.62 score 890 scripts 30 dependentssbgraves237
Ecdat:Data Sets for Econometrics
Data sets for econometrics, including political science.
Maintained by Spencer Graves. Last updated 4 months ago.
7.2 match 2 stars 7.25 score 740 scripts 3 dependentsmassimoaria
bibliometrix:Comprehensive Science Mapping Analysis
Tool for quantitative research in scientometrics and bibliometrics. It implements the comprehensive workflow for science mapping analysis proposed in Aria M. and Cuccurullo C. (2017) <doi:10.1016/j.joi.2017.08.007>. 'bibliometrix' provides various routines for importing bibliographic data from 'SCOPUS', 'Clarivate Analytics Web of Science', 'Digital Science Dimensions', 'OpenAlex', 'Cochrane Library', 'Lens', and 'PubMed' databases, performing bibliometric analysis and building networks for co-citation, coupling, scientific collaboration and co-word analysis.
Maintained by Massimo Aria. Last updated 7 days ago.
4.1 match 545 stars 12.54 score 518 scripts 2 dependentsoxfordihtm
africalaws:Interface to the Laws.Africa API
Laws.Africa endeavours to help African governments offer sustainable, free access to dependable digital laws. It aims to achieve this by ensuring that the laws are easily accessible, user-friendly, educational, and reusable. This initiative seeks to empower citizens with knowledge of their legal rights and obligations while promoting transparency and accountability within the legal system. Laws.Africa offers a content application programming interface (API) to fetch legislative content and metadata. The API is a read-only API for listing and fetching published versions of legislative works. This package interfaces with this API to allow access using R.
Maintained by Ernest Guevarra. Last updated 5 months ago.
17.7 match 3 stars 2.88 scorechgigot
epiphy:Analysis of Plant Disease Epidemics
A toolbox to make it easy to analyze plant disease epidemics. It provides a common framework for plant disease intensity data recorded over time and/or space. Implemented statistical methods are currently mainly focused on spatial pattern analysis (e.g., aggregation indices, Taylor and binary power laws, distribution fitting, SADIE and 'mapcomp' methods). See Laurence V. Madden, Gareth Hughes, Franck van den Bosch (2007) <doi:10.1094/9780890545058> for further information on these methods. Several data sets that were mainly published in plant disease epidemiology literature are also included in this package.
Maintained by Christophe Gigot. Last updated 1 year ago.
8.0 match 15 stars 6.05 score 37 scriptsvlyubchich
lawstat:Tools for Biostatistics, Public Policy, and Law
Statistical tests widely utilized in biostatistics, public policy, and law. Along with the well-known tests for equality of means and variances, randomness, and measures of relative variability, the package contains new robust tests of symmetry, omnibus and directional tests of normality, and their graphical counterparts such as robust QQ plot, robust trend tests for variances, etc. All implemented tests and methods are illustrated by simulations and real-life examples from legal statistics, economics, and biostatistics.
Maintained by Yulia R. Gel. Last updated 2 years ago.
6.8 match 7.17 score 484 scripts 6 dependentsepivec
TDLM:Systematic Comparison of Trip Distribution Laws and Models
The main purpose of this package is to propose a rigorous framework to fairly compare trip distribution laws and models as described in Lenormand et al. (2016) <doi:10.1016/j.jtrangeo.2015.12.008>.
Maintained by Maxime Lenormand. Last updated 10 days ago.
9.9 match 2 stars 4.85 score 3 scriptspharmaverse
admiral:ADaM in R Asset Library
A toolbox for programming Clinical Data Interchange Standards Consortium (CDISC) compliant Analysis Data Model (ADaM) datasets in R. ADaM datasets are a mandatory part of any New Drug or Biologics License Application submitted to the United States Food and Drug Administration (FDA). Analysis derivations are implemented in accordance with the "Analysis Data Model Implementation Guide" (CDISC Analysis Data Model Team, 2021).
Maintained by Ben Straub. Last updated 4 days ago.
3.3 match 236 stars 13.89 score 486 scripts 4 dependentsb-rodrigues
chronicler:Add Logging To Functions
Decorate functions to make them return enhanced output. The enhanced output consists of an object of type 'chronicle' containing the result of the function applied to its arguments, as well as a log detailing when the function was run, what its inputs were, what errors occurred (if the function failed to run) and other useful information. Tools to handle decorated functions are included, such as a forward pipe operator that makes chaining decorated functions possible.
Maintained by Bruno Rodrigues. Last updated 11 months ago.
6.0 match 51 stars 7.51 score 35 scriptsmatloff
qeML:Quick and Easy Machine Learning Tools
The letters 'qe' in the package title stand for "quick and easy," alluding to the convenience goal of the package. We bring together a variety of machine learning (ML) tools from standard R packages, providing wrappers with a simple, convenient, and uniform interface.
Maintained by Norm Matloff. Last updated 26 days ago.
5.0 match 41 stars 8.41 score 48 scripts 1 dependentsclement-lee
crandep:Network Analysis of Dependencies of CRAN Packages
The dependencies of CRAN packages can be analysed in a network fashion. For each package we can obtain the packages that it depends on, imports, suggests, etc. By iterating this procedure over a number of packages, we can build, visualise, and analyse the dependency network, enabling us to have a bird's-eye view of the CRAN ecosystem. One aspect of interest is the number of reverse dependencies of the packages, or equivalently the in-degree distribution of the dependency network. This can be fitted by the power law and/or an extreme value mixture distribution <doi:10.1111/stan.12355>, for which functions are provided.
Maintained by Clement Lee. Last updated 7 months ago.
6.7 match 8 stars 6.23 score 20 scriptskainhofer
MortalityTables:A Framework for Various Types of Mortality / Life Tables
Classes to implement, analyze and plot cohort life tables for actuarial calculations. Birth-year dependent cohort mortality tables using a yearly trend to extrapolate from a base year are implemented, as well as period life table, cohort life tables using an age shift, and merged life tables. Additionally, several data sets from various countries are included to provide widely-used tables out of the box.
Maintained by Reinhold Kainhofer. Last updated 1 year ago.
6.8 match 1 stars 5.70 score 84 scripts 2 dependentsyuimaproject
yuima:The YUIMA Project Package for SDEs
Simulation and Inference for SDEs and Other Stochastic Processes.
Maintained by Stefano M. Iacus. Last updated 3 days ago.
5.3 match 9 stars 7.26 score 92 scripts 2 dependentsjamesliley
OptHoldoutSize:Estimation of Optimal Size for a Holdout Set for Updating a Predictive Score
Predictive scores must be updated with care, because actions taken on the basis of existing risk scores cause bias in risk estimates from the updated score. A holdout set is a straightforward way to manage this problem: a proportion of the population is 'held-out' from computation of the previous risk score. This package provides tools to estimate a size for this holdout set and associated errors. Comprehensive vignettes are included. Please see: Haidar-Wehbe S, Emerson SR, Aslett LJM, Liley J (2022) <arXiv:2202.06374> for details of methods.
Maintained by James Liley. Last updated 3 years ago.
10.9 match 3.18 score 10 scriptskenaho1
asbio:A Collection of Statistical Tools for Biologists
Contains functions from: Aho, K. (2014) Foundational and Applied Statistics for Biologists using R. CRC/Taylor and Francis, Boca Raton, FL, ISBN: 978-1-4398-7338-0.
Maintained by Ken Aho. Last updated 2 months ago.
4.7 match 5 stars 7.32 score 310 scripts 3 dependentssizespectrum
mizer:Dynamic Multi-Species Size Spectrum Modelling
A set of classes and methods to set up and run multi-species, trait based and community size spectrum ecological models, focused on the marine environment.
Maintained by Gustav Delius. Last updated 2 months ago.
3.6 match 38 stars 9.43 score 207 scriptsstatnet
ergm.multi:Fit, Simulate and Diagnose Exponential-Family Models for Multiple or Multilayer Networks
A set of extensions for the 'ergm' package to fit multilayer/multiplex/multirelational networks and samples of multiple networks. 'ergm.multi' is a part of the Statnet suite of packages for network analysis. See Krivitsky, Koehly, and Marcum (2020) <doi:10.1007/s11336-020-09720-7> and Krivitsky, Coletti, and Hens (2023) <doi:10.1080/01621459.2023.2242627>.
Maintained by Pavel N. Krivitsky. Last updated 4 months ago.
3.5 match 14 stars 9.67 score 11 scripts 5 dependentsjoaquinauza
DetLifeInsurance:Life Insurance Premium and Reserves Valuation
Methods for valuation of life insurance premiums and reserves (including variable-benefit and fractional coverage) based on "Actuarial Mathematics" by Bowers, H.U. Gerber, J.C. Hickman, D.A. Jones and C.J. Nesbitt (1997, ISBN: 978-0938959465), "Actuarial Mathematics for Life Contingent Risks" by Dickson, David C. M., Hardy, Mary R. and Waters, Howard R (2009) <doi:10.1017/CBO9780511800146> and "Life Contingencies" by Jordan, C. W (1952) <doi:10.1017/S002026810005410X>. It also contains functions for equivalent interest and discount rate calculation, present and future values of annuities, and loan amortization schedule.
Maintained by Joaquin Auza. Last updated 5 years ago.
7.1 match 10 stars 4.70 score 3 scriptsdrizopoulos
ltm:Latent Trait Models under IRT
Analysis of multivariate dichotomous and polytomous data using latent trait models under the Item Response Theory approach. It includes the Rasch, the Two-Parameter Logistic, Birnbaum's Three-Parameter, the Graded Response, and the Generalized Partial Credit Models.
Maintained by Dimitris Rizopoulos. Last updated 3 years ago.
3.4 match 30 stars 9.59 score 1.0k scripts 27 dependentscarloscinelli
benford.analysis:Benford Analysis for Data Validation and Forensic Analytics
Provides tools that make it easier to validate data using Benford's Law.
Maintained by Carlos Cinelli. Last updated 6 years ago.
5.7 match 62 stars 5.66 score 74 scriptsruben0dewitte
distributionsrd:Distribution Fitting and Evaluation
A library of density, distribution function, quantile function, (bounded) raw moments and random generation for a collection of distributions relevant for the firm size literature. Additionally, the package contains tools to fit these distributions using maximum likelihood and evaluate these distributions based on (i) log-likelihood ratio and (ii) deviations between the empirical and parametrically implied moments of the distributions. We add flexibility by allowing the considered distributions to be combined into piecewise composite or finite mixture distributions, as well as to be used when truncated. See Dewitte (2020) for a description and application of methods available in this package.
Maintained by Ruben Dewitte. Last updated 5 years ago.
19.1 match 1.70 score 10 scriptstidyverse
lubridate:Make Dealing with Dates a Little Easier
Functions to work with date-times and time-spans: fast and user friendly parsing of date-time data, extraction and updating of components of a date-time (years, months, days, hours, minutes, and seconds), algebraic manipulation on date-time and time-span objects. The 'lubridate' package has a consistent and memorable syntax that makes working with dates easy and fun.
Maintained by Vitalie Spinu. Last updated 3 months ago.
1.5 match 757 stars 20.95 score 135k scripts 1.9k dependentsasarafoglou
multibridge:Evaluating Multinomial Order Restrictions with Bridge Sampling
Evaluate hypotheses concerning the distribution of multinomial proportions using bridge sampling. The bridge sampling routine is able to compute Bayes factors for hypotheses that entail inequality constraints, equality constraints, free parameters, and mixtures of all three. These hypotheses are tested against the encompassing hypothesis, that all parameters vary freely or against the null hypothesis that all category proportions are equal. For more information see Sarafoglou et al. (2020) <doi:10.31234/>.
Maintained by Alexandra Sarafoglou. Last updated 2 years ago.
7.3 match 4.32 score 14 scriptsalanarnholt
BSDA:Basic Statistics and Data Analysis
Data sets for the book "Basic Statistics and Data Analysis" by Larry J. Kitchens.
Maintained by Alan T. Arnholt. Last updated 2 years ago.
3.4 match 7 stars 9.11 score 1.3k scripts 6 dependentsbayes-rules
bayesrules:Datasets and Supplemental Functions from Bayes Rules! Book
Provides datasets and functions used for analysis and visualizations in the Bayes Rules! book. The package contains a set of functions that summarize and plot Bayesian models from some conjugate families and another set of functions for evaluation of some Bayesian models.
Maintained by Mine Dogucu. Last updated 3 years ago.
3.8 match 72 stars 8.06 score 466 scriptsspatial-ews
spatialwarnings:Spatial Early Warning Signals of Ecosystem Degradation
Tools to compute and assess significance of early-warning signals (EWS) of ecosystem degradation on raster data sets. EWS are spatial metrics derived from raster data -- e.g. spatial autocorrelation -- that increase before an ecosystem undergoes a non-linear transition (Genin et al. (2018) <doi:10.1111/2041-210X.13058>).
Maintained by Alexandre Genin. Last updated 6 months ago.
5.5 match 15 stars 5.32 score 46 scriptsr-forge
Sleuth3:Data Sets from Ramsey and Schafer's "Statistical Sleuth (3rd Ed)"
Data sets from Ramsey, F.L. and Schafer, D.W. (2013), "The Statistical Sleuth: A Course in Methods of Data Analysis (3rd ed)", Cengage Learning.
Maintained by Berwin A Turlach. Last updated 1 year ago.
4.5 match 6.38 score 522 scriptstermehs
netropy:Statistical Entropy Analysis of Network Data
Statistical entropy analysis of network data as introduced by Frank and Shafie (2016) <doi:10.1177/0759106315615511>, and in a textbook which is in progress.
Maintained by Termeh Shafie. Last updated 5 months ago.
4.5 match 12 stars 6.26 score 9 scriptsmucollective
multiverse:Create 'multiverse analysis' in R
Implement 'multiverse' style analyses (Steegen S., Tuerlinckx F, Gelman A., Vanpaemal, W., 2016) <doi:10.1177/1745691616658637> to show the robustness of statistical inference. 'Multiverse analysis' is a philosophy of statistical reporting where paper authors report the outcomes of many different statistical analyses in order to show how fragile or robust their findings are. The 'multiverse' package (Sarma A., Kale A., Moon M., Taback N., Chevalier F., Hullman J., Kay M., 2021) <doi:10.31219/> allows users to concisely and flexibly implement 'multiverse-style' analyses, which involve declaring alternate ways of performing an analysis step, in R and R Notebooks.
Maintained by Abhraneel Sarma. Last updated 4 months ago.
3.3 match 62 stars 8.37 score 42 scriptsskeydan
torchaudio:R Interface to 'pytorch''s 'torchaudio'
Provides access to datasets, models and processing facilities for deep learning in audio.
Maintained by Sigrid Keydana. Last updated 2 years ago.
7.8 match 3.46 score 58 scriptsigorlaltuf
dail:Data from Access to Information Law
Downloads the public data available from the Brazilian Access to Information Law and performs a search on information requests and appeals made since 2015.
Maintained by Igor Laltuf. Last updated 1 month ago.
7.1 match 11 stars 3.74 score 10 scriptscran
astrochron:A Computational Tool for Astrochronology
Routines for astrochronologic testing, astronomical time scale construction, and time series analysis <doi:10.1016/j.earscirev.2018.11.015>. Also included are a range of statistical analysis and modeling routines that are relevant to time scale development and paleoclimate analysis.
Maintained by Stephen Meyers. Last updated 6 months ago.
6.7 match 5 stars 3.85 score 141 scriptsrspatial
geosphere:Spherical Trigonometry
Spherical trigonometry for geographic applications. That is, compute distances and related measures for angular (longitude/latitude) locations.
Maintained by Robert J. Hijmans. Last updated 6 months ago.
1.8 match 36 stars 13.79 score 5.7k scripts 116 dependentskjhealy
socviz:Utility Functions and Data Sets for Data Visualization
Supporting materials for a course and book on data visualization. It contains utility functions for graphs and several sample data sets. See Healy (2019) <ISBN 978-0691181622>.
Maintained by Kieran Healy. Last updated 5 years ago.
3.5 match 190 stars 7.09 score 628 scriptsyihui
animation:A Gallery of Animations in Statistics and Utilities to Create Animations
Provides functions for animations in statistics, covering topics in probability theory, mathematical statistics, multivariate statistics, non-parametric statistics, sampling survey, linear models, time series, computational statistics, data mining and machine learning. These functions may be helpful in teaching statistics and data analysis. Also provided in this package are a series of functions to save animations to various formats, e.g. Flash, 'GIF', HTML pages, 'PDF' and videos. 'PDF' animations can be inserted into 'Sweave' / 'knitr' easily.
Maintained by Yihui Xie. Last updated 2 years ago.
1.9 match 208 stars 12.08 score 2.5k scripts 29 dependentsbioc
plgem:Detect differential expression in microarray and proteomics datasets with the Power Law Global Error Model (PLGEM)
The Power Law Global Error Model (PLGEM) has been shown to faithfully model the variance-versus-mean dependence that exists in a variety of genome-wide datasets, including microarray and proteomics data. The use of PLGEM has been shown to improve the detection of differentially expressed genes or proteins in these datasets.
Maintained by Norman Pavelka. Last updated 5 months ago.
4.9 match 4.38 score 8 scripts 1 dependentscran
BenfordTests:Statistical Tests for Evaluating Conformity to Benford's Law
Several specialized statistical tests and support functions for determining if numerical data could conform to Benford's law.
Maintained by Dieter William Joenssen. Last updated 10 years ago.
21.0 match 1.00 scorebioc
limma:Linear Models for Microarray and Omics Data
Data analysis, linear models and differential expression for omics data.
Maintained by Gordon Smyth. Last updated 5 days ago.
1.5 match 13.81 score 16k scripts 585 dependentsbioc
HiContacts:Analysing cool files in R with HiContacts
HiContacts provides a collection of tools to analyse and visualize Hi-C datasets imported in R by HiCExperiment.
Maintained by Jacques Serizay. Last updated 5 months ago.
3.5 match 12 stars 5.95 score 49 scriptsjverzani
UsingR:Data Sets, Etc. for the Text "Using R for Introductory Statistics", Second Edition
A collection of data sets to accompany the textbook "Using R for Introductory Statistics," second edition.
Maintained by John Verzani. Last updated 3 years ago.
4.0 match 1 stars 4.97 score 1.4k scriptsstatnet
lolog:Latent Order Logistic Graph Models
Estimation of Latent Order Logistic (LOLOG) Models for Networks. LOLOGs are a flexible and fully general class of statistical graph models. This package provides functions for performing MOM, GMM and variational inference. Visual diagnostics and goodness of fit metrics are provided. See Fellows (2018) <arXiv:1804.04583> for a detailed description of the methods.
Maintained by Ian E. Fellows. Last updated 1 year ago.
3.4 match 5 stars 5.56 score 72 scriptsbioc
BioNAR:Biological Network Analysis in R
The R package BioNAR was developed for step-by-step analysis of PPI networks. The aim is to quantify and rank each protein's simultaneous impact on multiple complexes based on network topology and clustering. The package also enables estimation of the co-occurrence of diseases across the network and specific clusters, pointing towards shared/common mechanisms.
Maintained by Anatoly Sorokin. Last updated 19 days ago.
3.1 match 3 stars 5.90 score 35 scriptszauster
npExact:Exact Nonparametric Hypothesis Tests for the Mean, Variance and Stochastic Inequality
Provides several novel exact hypothesis tests with minimal assumptions on the errors. The tests are exact, meaning that their p-values are correct for the given sample sizes (the p-values are not derived from asymptotic analysis). The test for stochastic inequality is for ordinal comparisons based on two independent samples and requires no assumptions on the errors. The other tests include tests for the mean and variance of a single sample and comparing means in independent samples. All these tests only require that the data has known bounds (such as percentages that lie in [0,100]). These bounds are part of the input.
Maintained by Oliver Reiter. Last updated 6 years ago.
6.7 match 2.70 score 8 scriptslightbluetitan
crimedatasets:A Comprehensive Collection of Crime-Related Datasets
A comprehensive collection of datasets exclusively focused on crimes, criminal activities, and related topics. This package serves as a valuable resource for researchers, analysts, and students interested in crime analysis, criminology, social and economic studies related to criminal behavior. Datasets span global and local contexts, with a mix of tabular and spatial data.
Maintained by Renzo Caceres Rossi. Last updated 3 months ago.
3.6 match 8 stars 4.90 score 3 scriptsfdzul
denhotspots:A package for calculating gi and hi local spatial statistics
The denhotspots package calculates the gi and hi local spatial statistics for areal data.
Maintained by The package maintainer. Last updated 6 days ago.
4.0 match 1 stars 4.38 score 6 scriptsanespinosa
netmem:Social Network Measures using Matrices
Measures to describe and manipulate networks using matrices.
Maintained by Alejandro Espinosa-Rada. Last updated 5 days ago.
4.0 match 11 stars 4.33 score 13 scriptsbioc
Glimma:Interactive visualizations for gene expression analysis
This package produces interactive visualizations for RNA-seq data analysis, utilizing output from limma, edgeR, or DESeq2. It produces interactive htmlwidgets versions of popular RNA-seq analysis plots to enhance the exploration of analysis results by overlaying interactive features. The plots can be viewed in a web browser or embedded in notebook documents.
Maintained by Shian Su. Last updated 1 month ago.
1.6 match 32 stars 10.58 score 600 scripts 1 dependentskimberlywebb
COMBO:Correcting Misclassified Binary Outcomes in Association Studies
Use frequentist and Bayesian methods to estimate parameters from a binary outcome misclassification model. These methods correct for the problem of "label switching" by assuming that the sum of outcome sensitivity and specificity is at least 1. A description of the analysis methods is available in Hochstedler and Wells (2023) <doi:10.48550/arXiv.2303.10215>.
Maintained by Kimberly Hochstedler Webb. Last updated 19 days ago.
3.2 match 1 stars 5.08 score 4 scriptsedwbaker
sonicscrewdriver:Bioacoustic Analysis and Publication Tools
Provides tools for manipulating sound files for bioacoustic analysis, and preparing these analyses for publication. The package validates that values are physically possible wherever feasible.
Maintained by Ed Baker. Last updated 1 month ago.
2.3 match 6 stars 7.12 score 26 scriptsmikmart
monad:Operators and Generics for Monads
Compose generic monadic function pipelines with %>>% and %>-% based on implementing the 'S7' generics fmap() and bind(). Methods are provided for the built-in list type and the maybe class from the 'maybe' package. The concepts are modelled directly after the Monad typeclass in Haskell, but adapted for idiomatic use in R.
Maintained by Mikko Marttila. Last updated 5 months ago.
4.5 match 6 stars 3.48 score 3 scriptsinsightsengineering Random ADaM Datasets
A set of functions to create random Analysis Data Model (ADaM) datasets and cached datasets. ADaM dataset specifications are described by the Clinical Data Interchange Standards Consortium (CDISC) Analysis Data Model Team.
Maintained by Joe Zhu. Last updated 5 months ago.
1.8 match 33 stars 8.60 score 52 scriptsfcheysson
hawkesbow:Estimation of Hawkes Processes from Binned Observations
Implements an estimation method for Hawkes processes when count data are only observed in discrete time, using a spectral approach derived from the Bartlett spectrum, see Cheysson and Lang (2020) <arXiv:2003.04314>. Some general use functions for Hawkes processes are also included: simulation of (in)homogeneous Hawkes process, maximum likelihood estimation, residual analysis, etc.
Maintained by Felix Cheysson. Last updated 1 year ago.
3.3 match 7 stars 4.54 score 4 scriptsausgis
geosimilarity:Geographically Optimal Similarity
Understanding spatial association is essential for spatial statistical inference, including factor exploration and spatial prediction. The geographically optimal similarity (GOS) model is an effective method for spatial prediction, as described in Yongze Song (2022) <doi:10.1007/s11004-022-10036-8>. GOS was developed based on the geographical similarity principle, as described in Axing Zhu (2018) <doi:10.1080/19475683.2018.1534890>. GOS offers more accurate spatial prediction using fewer samples, with critically reduced prediction uncertainty.
Maintained by Wenbo Lv. Last updated 1 month ago.
2.8 match 6 stars 5.38 score 5 scriptssmac-group
simts:Time Series Analysis Tools
A system containing easy-to-use tools as a support for time series analysis courses. In particular, it incorporates a technique called Generalized Method of Wavelet Moments (GMWM) as well as its robust implementation for fast and robust parameter estimation of time series models which is described, for example, in Guerrier et al. (2013) <doi:10.1080/01621459.2013.799920>. More details can also be found in the paper linked to via the URL below.
Maintained by Stéphane Guerrier. Last updated 2 years ago.
1.9 match 15 stars 7.68 score 59 scripts 4 dependentscran
fBasics:Rmetrics - Markets and Basic Statistics
Provides a collection of functions to explore and to investigate basic properties of financial returns and related quantities. The covered fields include techniques of explorative data analysis and the investigation of distributional properties, including parameter estimation and hypothesis testing. In addition, there are several utility functions for data handling and management.
Maintained by Georgi N. Boshnakov. Last updated 7 months ago.
2.0 match 2 stars 7.11 score 129 dependentscdr-er
ZLAvian:Zipf's Law of Abbreviation in Animal Vocalisations
Assesses evidence for Zipf's Law of Abbreviation in animal vocalisation using IDs, note class and note duration. The package also provides a web plot function for visualisation.
Maintained by CD Durrant. Last updated 10 months ago.
7.1 match 2.00 scorecran
BSS:Brownian Semistationary Processes
Efficient simulation of Brownian semistationary (BSS) processes using the hybrid simulation scheme, as described in Bennedsen, Lunde, Pakkanen (2017) <arXiv:1507.03004v4>, as well as functions to fit BSS processes to data, and functions to estimate the stochastic volatility process of a BSS process.
Maintained by Phillip Murray. Last updated 5 years ago.
7.0 match 2.00 score 2 scriptsrobinhankin
hyper2:The Hyperdirichlet Distribution, Mark 2
A suite of routines for the hyperdirichlet distribution and reified Bradley-Terry; supersedes the 'hyperdirichlet' package; uses 'disordR' discipline <doi:10.48550/ARXIV.2210.03856>. To cite in publications please use Hankin 2017 <doi:10.32614/rj-2017-061>, and for Generalized Plackett-Luce likelihoods use Hankin 2024 <doi:10.18637/jss.v109.i08>.
Maintained by Robin K. S. Hankin. Last updated 3 days ago.
2.3 match 5 stars 6.01 score 38 scripts 1 dependentslarssnip
micropan:Microbial Pan-Genome Analysis
A collection of functions for computations and visualizations of microbial pan-genomes.
Maintained by Lars Snipen. Last updated 3 years ago.
2.0 match 21 stars 6.15 score 67 scriptspharmaverse
sdtmchecks:Data Quality Checks for Study Data Tabulation Model (SDTM) Datasets
A series of checks to identify common issues in Study Data Tabulation Model (SDTM) datasets. These checks are intended to be generalizable, actionable, and meaningful for analysis.
Maintained by Will Harris. Last updated 3 months ago.
1.6 match 21 stars 7.66 score 15 scriptsbioc
plyinteractions:Extending tidy verbs to genomic interactions
Operate on `GInteractions` objects as tabular data using `dplyr`-like verbs. The functions and methods in `plyinteractions` provide a grammatical approach to manipulate `GInteractions`, to facilitate their integration in genomic analysis workflows.
Maintained by Jacques Serizay. Last updated 5 months ago.
2.5 match 4.75 score 14 scriptsr-box
boxr:Interface for the ' API'
An R interface for the remote file hosting service 'Box' (<>). In addition to uploading and downloading files, this package includes functions which mirror base R operations for local files, (e.g. box_load(), box_save(), box_read(), box_setwd(), etc.), as well as 'git' style functions for entire directories (e.g. box_fetch(), box_push()).
Maintained by Ian Lyttle. Last updated 11 months ago.
1.3 match 63 stars 8.65 score 238 scriptsadimajo
glmtree:Logistic Regression Trees
A logistic regression tree is a decision tree with logistic regressions at its leaves. A particular stochastic expectation maximization algorithm is used to draw a few good trees, that are then assessed via the user's criterion of choice among BIC / AIC / test set Gini. The formal development is given in a PhD chapter, see Ehrhardt (2019) <>.
Maintained by Adrien Ehrhardt. Last updated 1 years ago.
2.4 match 6 stars 4.78 score 3 scriptsbioc
miaSim:Microbiome Data Simulation
Microbiome time series simulation with generalized Lotka-Volterra model, Self-Organized Instability (SOI), and other models. Hubbell's Neutral model is used to determine the abundance matrix. The resulting abundance matrix is applied to (Tree)SummarizedExperiment objects.
Maintained by Yagmur Simsek. Last updated 5 months ago.
1.7 match 21 stars 6.64 score 23 scriptstesselle
folio:Datasets for Teaching Archaeology and Paleontology
Datasets for teaching quantitative approaches and modeling in archaeology and paleontology. This package provides several types of data related to broad topics (cultural evolution, radiocarbon dating, paleoenvironments, etc.), which can be used to illustrate statistical methods in the classroom (multivariate data analysis, compositional data analysis, diversity measurement, etc.).
Maintained by Nicolas Frerebeau. Last updated 1 months ago.
3.8 match 3.02 score 2 scripts 1 dependentscran
erer:Empirical Research in Economics with R
Several functions, datasets, and sample codes related to empirical research in economics are included. They cover the marginal effects for binary or ordered choice models, static and dynamic Almost Ideal Demand System (AIDS) models, and a typical event analysis in finance.
Maintained by Changyou Sun. Last updated 6 months ago.
3.3 match 3 stars 3.34 score 211 scripts 1 dependentscran
HTMLUtils:Facilitates Automated HTML Report Creation
Facilitates automated HTML report creation, in particular framed HTML pages and dynamically sortable tables.
Maintained by "Markus Loecher, Berlin School of Economics and Law (BSEL)". Last updated 1 years ago.
7.4 match 1.48 score 9 scripts 1 dependentsusepa
ctxR:Utilities for Interacting with the 'CTX' APIs
Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <>. 'ctxR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <>.
Maintained by Paul Kruse. Last updated 2 months ago.
1.3 match 10 stars 8.02 score 13 scripts 1 dependentscran
RobPer:Robust Periodogram and Periodicity Detection Methods
Calculates periodograms based on (robustly) fitting periodic functions to light curves (irregularly observed time series, possibly with measurement accuracies, occurring in astroparticle physics). Three main functions are included: RobPer() calculates the periodogram. Outlying periodogram bars (indicating a period) can be detected with betaCvMfit(). Artificial light curves can be generated using the function tsgen(). For more details see the corresponding article: Thieler, Fried and Rathjens (2016), Journal of Statistical Software 69(9), 1-36, <doi:10.18637/jss.v069.i09>.
Maintained by Jonathan Rathjens. Last updated 3 years ago.
3.6 match 3 stars 2.95 score 1 dependentsropensci
skynet:Generates Networks from BTS Data
A flexible tool that allows generating bespoke air transport statistics for urban studies based on publicly available data from the Bureau of Transport Statistics (BTS) in the United States <>.
Maintained by Filipe Teixeira. Last updated 6 months ago.
2.3 match 11 stars 4.67 score 43 scriptsdanforthcenter
pcvr:Plant Phenotyping and Bayesian Statistics
Analyse common types of plant phenotyping data, provide a simplified interface to longitudinal growth modeling and select Bayesian statistics, and streamline use of 'PlantCV' output. Several Bayesian methods and reporting guidelines for Bayesian methods are described in Kruschke (2018) <doi:10.1177/2515245918771304>, Kruschke (2013) <doi:10.1037/a0029146>, and Kruschke (2021) <doi:10.1038/s41562-021-01177-7>.
Maintained by Josh Sumner. Last updated 5 days ago.
1.5 match 4 stars 6.99 score 39 scriptssujit-sahu
ipsRdbs:Introduction to Probability, Statistics and R for Data-Based Sciences
Contains data sets, programmes and illustrations discussed in the book, "Introduction to Probability, Statistics and R: Foundations for Data-Based Sciences." Sahu (2024, isbn:9783031378645) describes the methods in detail.
Maintained by Sujit K. Sahu. Last updated 11 months ago.
2.8 match 1 stars 3.70 score 2 scriptsjustinmshea
wooldridge:115 Data Sets from "Introductory Econometrics: A Modern Approach, 7e" by Jeffrey M. Wooldridge
Students learning both econometrics and R may find the introduction to both challenging. The wooldridge data package aims to lighten the task by efficiently loading any data set found in the text with a single command. Data sets have been compressed to a fraction of their original size. Documentation files contain page numbers, the original source, time of publication, and notes from the author suggesting avenues for further analysis and research. If one needs an introduction to R model syntax, a vignette contains solutions to examples from chapters of the text. Data sets are from the 7th edition (Wooldridge 2020, ISBN-13 978-1-337-55886-0), and are backwards compatible with all previous versions of the text.
Maintained by Justin M. Shea. Last updated 3 months ago.
1.1 match 203 stars 9.38 score 1.4k scriptsalmutveraart
trawl:Estimation and Simulation of Trawl Processes
Contains R functions for simulating and estimating integer-valued trawl processes as described in the article Veraart (2019),"Modeling, simulation and inference for multivariate time series of counts using trawl processes", Journal of Multivariate Analysis, 169, pages 110-129, <doi:10.1016/j.jmva.2018.08.012> and for simulating random vectors from the bivariate negative binomial and the bi- and trivariate logarithmic series distributions.
Maintained by Almut E. D. Veraart. Last updated 4 years ago.
3.5 match 2.81 score 32 scriptsasa12138
MetaNet:Network Analysis for Omics Data
Comprehensive network analysis package. Calculates correlation networks quickly and accelerates many analyses through parallel computing. Supports multi-omics data and searching for sub-networks. Handles larger data, with more than 10,000 nodes in each omics layer. Offers various layout methods for multi-omics networks and interfaces to other software ('Gephi', 'Cytoscape', 'ggplot2') for easy visualization. Provides comprehensive calculation of topology indexes, including ecological network stability.
Maintained by Chen Peng. Last updated 11 days ago.
dataimport, network analysis, omics, software, visualization
1.8 match 13 stars 5.51 score 9 scriptsasgr
celestial:Collection of Common Astronomical Conversion Routines and Functions
Contains a number of common astronomy utility functions for cosmology and angular coordinates.
Maintained by Aaron Robotham. Last updated 1 years ago.
1.9 match 9 stars 5.22 score 68 scripts 9 dependentsmarcodvisser
aprof:Amdahl's Profiler, Directed Optimization Made Easy
Assists the evaluation of whether and where to focus code optimization, using Amdahl's law and visual aids based on line profiling. Amdahl's profiler organizes profiling output files (including memory profiling) in a visually appealing way. It is meant to help to balance development vs. execution time by helping to identify the most promising sections of code to optimize and projecting potential gains. The package is an addition to R's standard profiling tools and is not a wrapper for them.
Maintained by Marco D. Visser. Last updated 29 days ago.
2.3 match 21 stars 4.02 score 8 scriptsflr
FLSAM:An Implementation of the State-Space Assessment Model for FLR
This package provides an FLR wrapper to the SAM state-space assessment model.
Maintained by N.T. Hintzen. Last updated 3 months ago.
2.0 match 4 stars 4.51 score 406 scriptsagandy
systemicrisk:Systemic Risk and Network Reconstruction
Analysis of risk through liability matrices. Contains a Gibbs sampler for network reconstruction, where only row and column sums of the liabilities matrix as well as some other fixed entries are observed, following the methodology of Gandy & Veraart (2016) <doi:10.1287/mnsc.2016.2546>. It also incorporates models that use a power law distribution on the degree distribution.
Maintained by Axel Gandy. Last updated 11 months ago.
2.3 match 5 stars 3.88 score 51 scriptslafaye
ConvergenceConcepts:Seeing Convergence Concepts in Action
This is a pedagogical package, designed to help students understand convergence of random variables. It provides a way to investigate interactively various modes of convergence (in probability, almost surely, in law and in mean) of a sequence of i.i.d. random variables. Visualisation of simulated sample paths is possible through interactive plots. The approach is illustrated by examples and exercises through the function 'investigate', as described in Lafaye de Micheaux and Liquet (2009) <doi:10.1198/tas.2009.0032>. The user can study his/her own sequences of random variables.
Maintained by Pierre Lafaye De Micheaux. Last updated 3 years ago.
8.8 match 1.00 score 10 scriptsusepa
ccdR:Utilities for Interacting with the 'CTX' APIs
Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <>. 'ccdR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <>.
Maintained by Paul Kruse. Last updated 8 months ago.
1.3 match 2 stars 6.38 score 7 scriptslanedrew
ldmppr:Estimate and Simulate from Location Dependent Marked Point Processes
A suite of tools for estimating, assessing model fit, simulating from, and visualizing location dependent marked point processes characterized by regularity in the pattern. You provide a reference marked point process, a set of raster images containing location specific covariates, and select the estimation algorithm and type of mark model. 'ldmppr' estimates the process and mark models and allows you to check the appropriateness of the model using a variety of diagnostic tools. Once a satisfactory model fit is obtained, you can simulate from the model and visualize the results. Documentation for the package 'ldmppr' is available in the form of a vignette.
Maintained by Lane Drew. Last updated 20 days ago.
1.7 match 1 stars 5.00 score 2 scriptsjqveenstra
arfima:Fractional ARIMA (and Other Long Memory) Time Series Modeling
Simulates, fits, and predicts long-memory and anti-persistent time series, possibly mixed with ARMA, regression, transfer-function components. Exact methods (MLE, forecasting, simulation) are used.
Maintained by JQ Veenstra. Last updated 1 years ago.
1.5 match 14 stars 5.31 score 81 scripts 1 dependentsjsegrestin
comstab:Partitioning the Drivers of Stability of Ecological Communities
Contains the basic functions to apply the unified framework for partitioning the drivers of stability of ecological communities. Segrestin et al. (2024) <doi:10.1111/geb.13828>.
Maintained by Jules Segrestin. Last updated 8 months ago.
1.9 match 6 stars 4.18 scorecdalitz
moonboot:m-Out-of-n Bootstrap Functions
Functions and examples based on the m-out-of-n bootstrap suggested by Politis, D.N. and Romano, J.P. (1994) <doi:10.1214/aos/1176325770>. Additionally there are functions to estimate the scaling factor tau and the subsampling size m. For a detailed description and a full list of references, see Dalitz, C. and Lögler, F. (2024) <doi:10.48550/arXiv.2412.05032>.
Maintained by Christoph Dalitz. Last updated 23 days ago.
2.0 match 2 stars 3.78 score 1 scriptspaulgovan
ReliaGrowR:Reliability Growth Analysis
Modeling and plotting functions for Reliability Growth Analysis (RGA). Models include the Duane (1962) <doi:10.1109/TA.1964.4319640>, Non-Homogeneous Poisson Process (NHPP) by Crow (1975) <>, Piecewise Weibull NHPP by Guo et al. (2010) <doi:10.1109/RAMS.2010.5448029>, and Piecewise Weibull NHPP with Change Point Detection based on the 'segmented' package by Muggeo (2024) <>.
Maintained by Paul Govan. Last updated 4 months ago.
1.3 match 1 stars 5.64 score 12 scripts 3 dependentsjwb133
InformativeCensoring:Multiple Imputation for Informative Censoring
Multiple Imputation for Informative Censoring. This package implements two methods. Gamma Imputation described in <DOI:10.1002/sim.6274> and Risk Score Imputation described in <DOI:10.1002/sim.3480>.
Maintained by Jonathan Bartlett. Last updated 2 years ago.
1.6 match 4.78 score 9 scripts 1 dependentsalan-turing-institute
PosteriorBootstrap:Non-Parametric Sampling with Parallel Monte Carlo
An implementation of a non-parametric statistical model using a parallelised Monte Carlo sampling scheme. The method implemented in this package allows non-parametric inference to be regularized for small sample sizes, while also being more accurate than approximations such as variational Bayes. The concentration parameter is an effective sample size parameter, determining the faith we have in the model versus the data. When the concentration is low, the samples are close to the exact Bayesian logistic regression method; when the concentration is high, the samples are close to the simplified variational Bayes logistic regression. The method is described in full in the paper Lyddon, Walker, and Holmes (2018), "Nonparametric learning from Bayesian models with randomized objective functions" <arXiv:1806.11544>.
Maintained by James Robinson. Last updated 2 years ago.
1.5 match 4 stars 4.78 scorecran
bioSNR:Bioacoustic Basic Operations with Decibels and the Passive Sonar Equation
A beginner's toolbox to help those in ecology who want to deepen their understanding of bioacoustics or use it in their work. The package has a number of uses, from calculating frequency from a waveform and performing operations in dB to determining the acoustic range of recorders. The majority of this package is based on key concepts learned from the K. Lisa Yang Center for Conservation Bioacoustics at Cornell University and their associated course, Introduction to Bioacoustics. More information can be found within the walk through vignettes at <>.
Maintained by Matthew Duggan. Last updated 2 years ago.
2.3 match 3.18 scorecran
sonar:Fundamental Formulas for Sonar
Formulas for calculating sound velocity, water pressure, depth, density, absorption and sonar equations.
Maintained by Jose Gama. Last updated 9 years ago.
7.1 match 1.00 scorecran
PAmeasures:Prediction and Accuracy Measures for Nonlinear Models and for Right-Censored Time-to-Event Data
We propose a pair of summary measures for the predictive power of a prediction function based on a regression model. The regression model can be linear or nonlinear, parametric, semi-parametric, or nonparametric, and correctly specified or mis-specified. The first measure, R-squared, is an extension of the classical R-squared statistic for a linear model, quantifying the prediction function's ability to capture the variability of the response. The second measure, L-squared, quantifies the prediction function's bias for predicting the mean regression function. When used together, they give a complete summary of the predictive power of a prediction function. Please refer to Gang Li and Xiaoyan Wang (2016) <arXiv:1611.03063> for more details.
Maintained by Xiaoyan Wang. Last updated 7 years ago.
4.0 match 1 stars 1.78 score 2 dependentscran
REAT:Regional Economic Analysis Toolbox
Collection of models and analysis methods used in regional and urban economics and (quantitative) economic geography, e.g. measures of inequality, regional disparities and convergence, regional specialization as well as accessibility and spatial interaction models.
Maintained by Thomas Wieland. Last updated 4 years ago.
1.9 match 3 stars 3.62 score 140 scriptsmaximilianaxer
quaxnat:Estimation of Natural Regeneration Potential
Functions for estimating the potential dispersal of tree species using regeneration densities and dispersal distances to nearest seed trees. A quantile regression is implemented to determine the dispersal potential. Spatial prediction can be used to identify natural regeneration potential for forest restoration as described in Axer et al (2021) <doi:10.1016/j.foreco.2020.118802>.
Maintained by Maximilian Axer. Last updated 5 months ago.
1.9 match 1 stars 3.54 score 2 scriptscran
SuessR:Suess and Laws Corrections for Marine Stable Carbon Isotope Data
Generates region-specific Suess and Laws corrections for stable carbon isotope data from marine organisms collected between 1850 and 2023. Version 0.1.6 of 'SuessR' contains four built-in regions: the Bering Sea ('Bering Sea'), the Aleutian archipelago ('Aleutian Islands'), the Gulf of Alaska ('Gulf of Alaska'), and the subpolar North Atlantic ('Subpolar North Atlantic'). Users can supply their own environmental data for regions currently not built into the package to generate corrections for those regions.
Maintained by Casey Clark. Last updated 25 days ago.
6.6 match 1.00 score 2 scriptsjhudsl
crsra:Tidying and Analyzing 'Coursera' Research Export Data
Tidies and performs preliminary analysis of 'Coursera' research export data. These export data can be downloaded by anyone who has classes on Coursera and wants to analyze the data. Coursera is one of the leading providers of MOOCs and was launched in January 2012. With over 25 million learners, Coursera is the most popular provider in the world, followed by EdX, the MOOC provider that resulted from a collaboration between Harvard University and MIT, with over 10 million users. Coursera has over 150 university partners from 29 countries and offers a total of 2000+ courses from computer science to philosophy. In addition, Coursera offers 180+ specializations (Coursera's credential system) and four fully online Master's degrees. For more information about Coursera check Coursera's About page on <>.
Maintained by Aboozar Hadavand. Last updated 4 years ago.
1.6 match 2 stars 4.15 score 14 scriptsnetworkgroupr
fastnet:Large-Scale Social Network Analysis
We present an implementation of the algorithms required to simulate large-scale social networks and retrieve their most relevant metrics.
Maintained by Nazrul Shaikh. Last updated 8 years ago.
1.7 match 5 stars 3.37 score 47 scriptsantoinerollandlyon
voteSim:Generate Simulated Data for Voting Rules using Evaluations
Provide functions to generate random simulated evaluations on candidates by voters for evaluation-based elections. Functions are based on several models for continuous or discrete evaluations.
Maintained by Antoine Rolland. Last updated 1 years ago.
3.3 match 1.70 score 1 scriptscran
fairml:Fair Models in Machine Learning
Fair machine learning regression models which take sensitive attributes into account in model estimation. Currently implementing Komiyama et al. (2018) <>, Zafar et al. (2019) <> and my own approach from Scutari, Panero and Proissl (2022) <> that uses ridge regression to enforce fairness.
Maintained by Marco Scutari. Last updated 2 years ago.
3.6 match 1 stars 1.52 score 1 dependentsdcnorris
DTAT:Dose Titration Algorithm Tuning
Dose Titration Algorithm Tuning (DTAT) is a methodologic framework allowing dose individualization to be conceived as a continuous learning process that begins in early-phase clinical trials and continues throughout drug development, on into clinical practice. This package includes code that researchers may use to reproduce or extend key results of the DTAT research programme, plus tools for trialists to design and simulate a '3+3/PC' dose-finding study. Please see Norris (2017a) <doi:10.12688/f1000research.10624.3> and Norris (2017c) <doi:10.1101/240846>.
Maintained by David C. Norris. Last updated 10 months ago.
1.9 match 2.90 score 20 scriptsnicolasv-dev
drimmR:Estimation, Simulation and Reliability of Drifting Markov Models
Fits drifting Markov models (DMM), which are non-homogeneous Markov models designed to model the heterogeneities of sequences more flexibly than homogeneous Markov chains or even hidden Markov models. This R package is dedicated to the estimation, simulation and exact computation of the associated reliability of drifting Markov models. The implemented methods are described in Vergne, N. (2008) <doi:10.2202/1544-6115.1326> and Barbu, V.S., Vergne, N. (2019) <doi:10.1007/s11009-018-9682-8>.
Maintained by Nicolas Vergne. Last updated 4 years ago.
5.4 match 1.00 scoremabe0033
SimEUCartelLaw:Simulation of Legal Exemption System for European Cartel Law
Monte Carlo simulations of a game-theoretic model for the legal exemption system of the European cartel law are implemented in order to estimate the (mean) deterrent effect of this system. The input and output parameters of the simulated cartel opportunities can be visualized by three-dimensional projections. A description of the model is given in Moritz et al. (2018) <doi:10.1515/bejeap-2017-0235>.
Maintained by Martin Becker. Last updated 3 years ago.
5.1 match 1.00 score 7 scriptsrwrandles
washex:Washington State Legislative Explorer
Gets data from the Washington State Legislature.
Maintained by Rohnin Randles. Last updated 3 years ago.
1.9 match 2.70 score 2 scriptsropenspain
senadoRES:Information About the Senate of Spain
Retrieve and parse information about the Spanish Congress.
Maintained by Lluís Revilla Sancho. Last updated 3 months ago.
2.3 match 1 stars 2.18 score 3 scriptswjbraun
MiscMath:Miscellaneous Mathematical Tools
Some basic math calculators for finding angles for triangles and for finding the greatest common divisor of two numbers and so on.
Maintained by W.J. Braun. Last updated 2 months ago.
4.5 match 1.00 score 2 scriptsgrosssbm
sbm:Stochastic Blockmodels
A collection of tools and functions to adjust a variety of stochastic blockmodels (SBM). Currently supports Simple, Bipartite, 'Multipartite' and Multiplex SBM (undirected or directed with Bernoulli, Poisson or Gaussian emission laws on the edges, and possibly covariates for Simple and Bipartite SBM). See Léger (2016) <doi:10.48550/arXiv.1602.07587>, 'Barbillon et al.' (2020) <doi:10.1111/rssa.12193> and 'Bar-Hen et al.' (2020) <doi:10.48550/arXiv.1807.10138>.
Maintained by Julien Chiquet. Last updated 6 months ago.
0.5 match 16 stars 8.27 score 98 scripts 2 dependentsabjur
abjutils:Useful Tools for Jurimetrical Analysis Used by the Brazilian Jurimetrics Association
The Brazilian Jurimetrics Association (ABJ in Portuguese, see <> for more information) is a non-profit organization which aims to investigate and promote the use of statistics and probability in the study of Law and its institutions. This package implements general purpose tools used by ABJ, such as functions for sampling and basic manipulation of Brazilian lawsuits identification number. It also implements functions for text cleaning, such as accentuation removal.
Maintained by Caio Lente. Last updated 1 years ago.
0.5 match 55 stars 6.76 score 78 scripts 1 dependentscran
gllm:Generalised log-Linear Model
Routines for log-linear models of incomplete contingency tables, including some latent class models, via EM and Fisher scoring approaches. Allows bootstrapping. See Espeland and Hui (1987) <doi:10.2307/2531553> for general approach.
Maintained by David Duffy. Last updated 2 years ago.
3.4 match 1.00 scorepachadotdev
chilemapas:Mapas de las Divisiones Politicas y Administrativas de Chile (Maps of the Political and Administrative Divisions of Chile)
Mapas terrestres con topologias simplificadas. Estos mapas no tienen precision geodesica, por lo que aplica el DFL-83 de 1979 de la Republica de Chile y se consideran referenciales sin validez legal. No se incluyen los territorios antarticos y bajo ningun evento estos mapas significan que exista una cesion u ocupacion de territorios soberanos en contra del Derecho Internacional por parte de Chile. Este paquete esta documentado intencionalmente en castellano asciificado para que funcione sin problema en diferentes plataformas. (Terrestrial maps with simplified topologies. These maps lack geodesic precision; therefore DFL-83 of 1979 of the Republic of Chile applies, and they are considered referential with no legal validity. Antarctic territories are excluded, and under no circumstances do these maps imply a cession or occupation of sovereign territories by Chile against international law. This package was intentionally documented in asciified Spanish so that it works without problems on different platforms.)
Maintained by Mauricio Vargas. Last updated 3 years ago.
0.5 match 34 stars 6.20 score 93 scriptsr-forge
zipfR:Statistical Models for Word Frequency Distributions
Statistical models and utilities for the analysis of word frequency distributions. The utilities include functions for loading, manipulating and visualizing word frequency data and vocabulary growth curves. The package also implements several statistical models for the distribution of word frequencies in a population. (The name of this package derives from the most famous word frequency distribution, Zipf's law.)
Maintained by Stefan Evert. Last updated 4 years ago.
0.5 match 5.94 score 188 scripts 12 dependentsevanbiederstedt
RMTstat:Distributions, Statistics and Tests Derived from Random Matrix Theory
Functions for working with the Tracy-Widom laws and other distributions related to the eigenvalues of large Wishart matrices. The tables for computing the Tracy-Widom densities and distribution functions were computed by Momar Dieng's MATLAB package "RMLab". This package is part of a collaboration between Iain Johnstone, Zongming Ma, Patrick Perry, and Morteza Shahram.
Maintained by Evan Biederstedt. Last updated 3 years ago.
0.5 match 6 stars 5.39 score 30 scripts 9 dependentsaftonsteps
ggalignment:Plots 'D&D'-Style Alignment Charts
'D&D' alignment charts show 9 boxes with values for good through evil and values for chaotic through lawful. This package easily creates these alignment charts from user-provided image paths and alignment values.
Maintained by Afton Coombs. Last updated 15 days ago.
0.5 match 10 stars 5.30 score 6 scriptsabjur
abjData:Databases Used Routinely by the Brazilian Jurimetrics Association
The Brazilian Jurimetrics Association (ABJ in Portuguese, see <> for more information) is a non-profit organization which aims to investigate and promote the use of statistics and probability in the study of Law and its institutions. This package has a set of datasets commonly used in our book.
Maintained by Julio Trecenti. Last updated 2 years ago.
0.5 match 19 stars 5.32 score 55 scriptscb4ds
DGEobj.utils:Differential Gene Expression (DGE) Analysis Utility Toolkit
Provides a function toolkit to facilitate reproducible RNA-Seq Differential Gene Expression (DGE) analysis (Law (2015) <doi:10.12688/f1000research.9005.3>). The tools include both analysis work-flow and utility functions: mapping/unit conversion, count normalization, accounting for unknown covariates, and more. This is a complement/cohort to the 'DGEobj' package that provides a flexible container to manage and annotate Differential Gene Expression analysis results.
Maintained by Connie Brett. Last updated 2 months ago.
0.5 match 2 stars 5.26 score 30 scripts 1 dependentsjpnolan
gensphere:Generalized Spherical Distributions
Define and compute with generalized spherical distributions - multivariate probability laws that are specified by a star shaped contour (directional behavior) and a radial component. The methods are described in Nolan (2016) <doi:10.1186/s40488-016-0053-0>.
Maintained by John P Nolan. Last updated 4 years ago.
2.2 match 1.00 score 2 scriptsbioc
HybridMTest:Hybrid Multiple Testing
Performs hybrid multiple testing that incorporates method selection and assumption evaluations into the analysis using empirical Bayes probability (EBP) estimates obtained by Grenander density estimation. For instance, for a 3-group comparison analysis, hybrid multiple testing computes weighted EBPs between the F-test and the H-test, with EBPs from the Shapiro-Wilk test of normality as the weight. Instead of using EBPs from the F-test only or the H-test only, this methodology combines both types of EBPs through EBPs from the Shapiro-Wilk test of normality. The methodology then uses the law of total EBPs.
Maintained by Demba Fofana. Last updated 5 months ago.
0.5 match 4.38 score 5 scripts 1 dependentsmarcellochiodi
etasFLP:Mixed FLP and ML Estimation of ETAS Space-Time Point Processes for Earthquake Description
Estimation of the components of an ETAS (Epidemic Type Aftershock Sequence) model for earthquake description. Non-parametric background seismicity can be estimated through FLP (Forward Likelihood Predictive). New version 2.0.0: covariates have been introduced to explain the effects of external factors on the induced seismicity; the parametrization has been changed; Chiodi, Adelfio (2017) <doi:10.18637/jss.v076.i03>.
Maintained by Marcello Chiodi. Last updated 2 years ago.
1.8 match 1 stars 1.20 score 16 scriptsims-fhs
simtimer:Datetimes as Integers for Discrete-Event Simulations
Handles datetimes as integers for use inside Discrete-Event Simulations (DES). The conversion is made using the internal generic function as.numeric() of the base package. DES is described in Simulation Modeling and Analysis by Averill Law and David Kelton (1999) <doi:10.2307/2288169>.
Maintained by Adrian Staempfli. Last updated 6 years ago.
0.5 match 3.88 score 15 scriptsjmcurran
fitPS:Fit Zeta Distributions to Forensic Data
Fits Zeta distributions (discrete power laws) to data that arises from forensic surveys of clothing on the presence of glass and paint in various populations. The general method is described to some extent in Coulson, S.A., Buckleton, J.S., Gummer, A.B., and Triggs, C.M. (2001) <doi:10.1016/S1355-0306(01)71847-3>, although the implementation differs.
Maintained by James Curran. Last updated 3 days ago.
0.5 match 3.90 score 6 scriptsreviewburner
AnimalSequences:Analyse Animal Sequential Behaviour and Communication
All animal behaviour occurs sequentially. The package has a number of functions to format sequence data from different sources, to analyse sequential behaviour and communication in animals. It also has functions to plot the data and to calculate the entropy of sequences.
Maintained by Alex Mielke. Last updated 6 months ago.
1.9 match 1.00 scorejcatwood
VeccTMVN:Multivariate Normal Probabilities using Vecchia Approximation
Under a different representation of the multivariate normal (MVN) probability, we can use the Vecchia approximation to sample the integrand at a linear complexity with respect to n. Additionally, both the SOV algorithm from Genz (92) and the exponential-tilting method from Botev (2017) can be adapted to linear complexity. The reference for the method implemented in this package is Jian Cao and Matthias Katzfuss (2024) "Linear-Cost Vecchia Approximation of Multivariate Normal Probabilities" <doi:10.48550/arXiv.2311.09426>. Two major references for the development of our method are Alan Genz (1992) "Numerical Computation of Multivariate Normal Probabilities" <doi:10.1080/10618600.1992.10477010> and Z. I. Botev (2017) "The Normal Law Under Linear Restrictions: Simulation and Estimation via Minimax Tilting" <doi:10.48550/arXiv.1603.04166>.
Maintained by Jian Cao. Last updated 4 months ago.
0.5 match 2 stars 3.56 score 36 scriptsranbi1990
ssizeRNA:Sample Size Calculation for RNA-Seq Experimental Design
We propose a procedure for sample size calculation while controlling false discovery rate for RNA-seq experimental design. Our procedure depends on the Voom method proposed for RNA-seq data analysis by Law et al. (2014) <DOI:10.1186/gb-2014-15-2-r29> and the sample size calculation method proposed for microarray experiments by Liu and Hwang (2007) <DOI:10.1093/bioinformatics/btl664>. We develop a set of functions that calculates appropriate sample sizes for two-sample t-test for RNA-seq experiments with fixed or varied set of parameters. The outputs also contain a plot of power versus sample size, a table of power at different sample sizes, and a table of critical test values at different sample sizes. To install this package, please use 'source(""); biocLite("ssizeRNA")'. For R version 3.5 or greater, please use 'if(!requireNamespace("BiocManager", quietly = TRUE)){install.packages("BiocManager")}; BiocManager::install("ssizeRNA")'.
Maintained by Ran Bi. Last updated 6 years ago.
0.5 match 1 stars 3.53 score 28 scripts 1 dependentscran
QCApro:Advanced Functionality for Performing and Evaluating Qualitative Comparative Analysis
Provides advanced functionality for performing configurational comparative research with Qualitative Comparative Analysis (QCA), including crisp-set, multi-value, and fuzzy-set QCA. It also offers advanced tools for sensitivity diagnostics and methodological evaluations of QCA.
Maintained by Alrik Thiem. Last updated 7 years ago.
1.7 match 1 stars 1.00 scoremarkusloecher
MultiJoin:Enables Efficient Joining of Data File on Common Fields using the Unix Utility Join
Wrapper around the Unix join facility which is more efficient than the built-in R routine merge(). The package enables the joining of multiple files on disk at once. The files can be compressed and various filters can be deployed before joining. Compiles only under Unix.
Maintained by Markus Loecher. Last updated 6 years ago.
1.6 match 1.00 score 6 scriptskuriwaki
ddi:The Data Defect Index for Samples that May not be IID
Implements Meng's data defect index (ddi), which represents the degree of sample bias relative to an iid sample. The data defect correlation (ddc) represents the correlation between the outcome of interest and the selection into the sample; when the sample selection is independent across the population, the ddc is zero. Details are in Meng (2018) <doi:10.1214/18-AOAS1161SF>, "Statistical Paradises and Paradoxes in Big Data (I): Law of Large Populations, Big Data Paradox, and the 2016 US Presidential Election." Survey estimates from the Cooperative Congressional Election Study (CCES) are included to replicate the article's results.
Maintained by Shiro Kuriwaki. Last updated 5 years ago.
0.5 match 3 stars 3.18 score 4 scriptsdavharris
blender:Analyze biotic homogenization of landscapes
Tools for assessing exotic species' contributions to landscape homogeneity using average pairwise Jaccard similarity and an analytical approximation derived in Harris et al. (2011, "Occupancy is nine-tenths of the law," The American Naturalist). Also includes a randomization method for assessing sources of model error.
Maintained by David J. Harris. Last updated 13 years ago.
0.5 match 3.00 score 4 scriptscran
ragtop:Pricing Equity Derivatives with Extensions of Black-Scholes
Algorithms to price American and European equity options, convertible bonds and a variety of other financial derivatives. It uses an extension of the usual Black-Scholes model in which jump to default may occur at a probability specified by a power-law link between stock price and hazard rate as found in the paper by Takahashi, Kobayashi, and Nakagawa (2001) <doi:10.3905/jfi.2001.319302>. We use ideas and techniques from Andersen and Buffum (2002) <doi:10.2139/ssrn.355308> and Linetsky (2006) <doi:10.1111/j.1467-9965.2006.00271.x>.
Maintained by Brian K. Boonstra. Last updated 5 years ago.
0.5 match 2.70 scoreadamtclark
ecostatscale:Statistical Scaling Functions for Ecological Systems
Implementation of the scaling functions presented in "General statistical scaling laws for stability in ecological systems" by Clark et al in Ecology Letters <DOI:10.1111/ele.13760>. Includes functions for extrapolating variability, resistance, and resilience across spatial and ecological scales, as well as a basic simulation function for producing time series, and a regression routine for generating unbiased parameter estimates. See the main text of the paper for more details.
Maintained by Adam Clark. Last updated 1 year ago.
0.5 match 3 stars 2.48 scorecran
probs:Elementary Probability on Finite Sample Spaces
Performs elementary probability calculations on finite sample spaces, which may be represented by data frames or lists. This package is meant to rescue some widely used functions from the archived 'prob' package (see <>). Functionality includes setting up sample spaces, counting tools, defining probability spaces, performing set algebra, calculating probability and conditional probability, tools for simulation and checking the law of large numbers, adding random variables, and finding marginal distributions. Characteristic functions for all base R distributions are included.
Maintained by Joe gr. Schlarmann. Last updated 9 months ago.
0.5 match 1.70 scorecran
Dpit:Distribution Pitting
Compares distributions with one another in terms of their fit to each sample in a dataset that contains multiple samples, as described in Joo, Aguinis, and Bradley (in press). Users can examine the fit of seven distributions per sample: pure power law, lognormal, exponential, power law with an exponential cutoff, normal, Poisson, and Weibull. Automation features allow the user to compare all distributions for all samples with a single command line, which creates a separate row containing results for each sample until the entire dataset has been analyzed.
Maintained by Harry Joo. Last updated 8 years ago.
0.8 match 1.00 score 2 scriptslivinoa1980
powerindexR:Measuring the Power in Voting Systems
This R package allows the determination of some distributions of the voters' power when passing laws in weighted voting situations.
Maintained by Livino M. Armijos-Toro. Last updated 10 months ago.
0.5 match 1.00 scorejpnolan
ecdfHT:Empirical CDF for Heavy Tailed Data
Computes and plots a transformed empirical CDF (ecdf) as a diagnostic for heavy tailed data, specifically data with power law decay on the tails. Routines for annotating the plot, comparing data to a model, fitting a nonparametric model, and some multivariate extensions are given.
Maintained by John P Nolan. Last updated 9 years ago.
0.5 match 1.00 score 5 scriptscran
InvasionCorrection:Invasion Correction
The correction is achieved under the assumption that non-migrating cells of the assay approximately form a quadratic flow profile due to frictional effects; compare the law of Hagen-Poiseuille for flow in a tube. The script fits a conical plane to give xyz-coordinates of the cells. It outputs the number of migrated cells and the new corrected coordinates.
Maintained by Marcus Rosenblatt. Last updated 8 years ago.
0.5 match 1.00 scoreimranshakoor
DDPM:Data Sets for Discrete Probability Models
A wide collection of univariate discrete data sets from various applied domains related to distribution theory. The functions allow quick, easy, and efficient access to 100 univariate discrete data sets. The data are related to different applied domains, including medical, reliability analysis, engineering, manufacturing, occupational safety, geological sciences, terrorism, psychology, agriculture, environmental sciences, road traffic accidents, demography, actuarial science, law, and justice. The documentation, along with associated references for further details and uses, is presented.
Maintained by Muhammad Imran. Last updated 2 years ago.
0.5 match 1.00 scoreebner-kit
gofgamma:Goodness-of-Fit Tests for the Gamma Distribution
We implement various classical tests for the composite hypothesis of testing the fit to the family of gamma distributions, such as the Kolmogorov-Smirnov test, the Cramer-von Mises test, the Anderson-Darling test and the Watson test. For each test a parametric bootstrap procedure is implemented, as considered in Henze, Meintanis & Ebner (2012) <doi:10.1080/03610926.2010.542851>. The recent procedures presented in Henze, Meintanis & Ebner (2012) <doi:10.1080/03610926.2010.542851> and Betsch & Ebner (2019) <doi:10.1007/s00184-019-00708-7> are implemented. Estimation of the parameters of the gamma law is implemented using the method of Bhattacharya (2001) <doi:10.1080/00949650108812100>.
Maintained by Bruno Ebner. Last updated 5 years ago.
0.5 match 1.00 score 2 scripts