R-universe search: breakdown

pbiecek

breakDown:Model Agnostic Explainers for Individual Predictions

Model agnostic tool for decomposition of predictions from black boxes. Break Down Table shows contributions of every variable to a final prediction. Break Down Plot presents variable contributions in a concise graphical way. This package work for binary classifiers and general regression models.

Maintained by Przemyslaw Biecek. Last updated 1 years ago.

data-science iml interpretability machine-learning visual-explanations xai

76.0 match 103 stars 8.90 score 91 scripts 2 dependents

bioc

amplican:Automated analysis of CRISPR experiments

`amplican` performs alignment of the amplicon reads, normalizes gathered data, calculates multiple statistics (e.g. cut rates, frameshifts) and presents results in form of aggregated reports. Data and statistics can be broken down by experiments, barcodes, user defined groups, guides and amplicons allowing for quick identification of potential problems.

Maintained by Eivind Valen. Last updated 5 months ago.

immunooncology technology alignment qpcr crispr cpp

16.5 match 10 stars 7.54 score 41 scripts

modeloriented

iBreakDown:Model Agnostic Instance Level Variable Attributions

Model agnostic tool for decomposition of predictions from black boxes. Supports additive attributions and attributions with interactions. The Break Down Table shows contributions of every variable to a final prediction. The Break Down Plot presents variable contributions in a concise graphical way. This package works for classification and regression models. It is an extension of the 'breakDown' package (Staniak and Biecek 2018) <doi:10.32614/RJ-2018-072>, with new and faster strategies for orderings. It supports interactions in explanations and has interactive visuals (implemented with 'D3.js' library). The methodology behind is described in the 'iBreakDown' article (Gosiewska and Biecek 2019) <arXiv:1903.11420> This package is a part of the 'DrWhy.AI' universe (Biecek 2018) <arXiv:1806.08915>.

Maintained by Przemyslaw Biecek. Last updated 1 years ago.

breakdown iml interpretability shapley xai

11.5 match 84 stars 10.07 score 56 scripts 22 dependents

dmurdoch

plotrix:Various Plotting Functions

Lots of plots, various labeling, axis and color scaling functions. The author/maintainer died in September 2023.

Maintained by Duncan Murdoch. Last updated 1 years ago.

5.4 match 5 stars 11.31 score 9.2k scripts 361 dependents

hms-dbmi

UpSetR:A More Scalable Alternative to Venn and Euler Diagrams for Visualizing Intersecting Sets

Creates visualizations of intersecting sets using a novel matrix design, along with visualizations of several common set, element and attribute related tasks (Conway 2017) <doi:10.1093/bioinformatics/btx364>.

Maintained by Jake Conway. Last updated 4 years ago.

gehlenborglab ggplot2 upset upsetr visualization

3.8 match 781 stars 15.33 score 4.8k scripts 42 dependents

kjhealy

gssrdoc:Document General Social Survey Variable

The General Social Survey (GSS) is a long-running, mostly annual survey of US households. It is administered by the National Opinion Research Center (NORC). This package contains the a tibble with information on the survey variables, together with every variable documented as an R help page. For more information on the GSS see \url{http://gss.norc.org}.

Maintained by Kieran Healy. Last updated 11 months ago.

25.6 match 2.28 score 38 scripts

jsjuni

rollupTree:Perform Recursive Computations

Mass rollup for a Bill of Materials is an example of a class of computations in which elements are arranged in a tree structure and some property of each element is a computed function of the corresponding values of its child elements. Leaf elements, i.e., those with no children, have values assigned. In many cases, the combining function is simple arithmetic sum; in other cases (e.g., mass properties), the combiner may involve other information such as the geometric relationship between parent and child, or statistical relations such as root-sum-of-squares (RSS). This package implements a general function for such problems. It is adapted to specific recursive computations by functional programming techniques; the caller passes a function as the update parameter to rollup() (or, at a lower level, passes functions as the get, set, combine, and override parameters to update_prop()) at runtime to specify the desired operations. The implementation relies on graph-theoretic algorithms from the 'igraph' package of Csárdi, et al. (2006 <doi:10.5281/zenodo.7682609>).

Maintained by James Steven Jenkins. Last updated 9 days ago.

10.7 match 2 stars 5.32 score 1 dependents

thijsjanzen

GenomeAdmixR:Simulate Admixture of Genomes

Individual-based simulations forward in time, simulating how patterns in ancestry along the genome change after admixture. Full description can be found in Janzen (2021) <doi:10.1111/2041-210X.13612>.

Maintained by Thijs Janzen. Last updated 1 years ago.

cpp

8.4 match 5 stars 5.24 score 14 scripts

robinhankin

hyper2:The Hyperdirichlet Distribution, Mark 2

A suite of routines for the hyperdirichlet distribution and reified Bradley-Terry; supersedes the 'hyperdirichlet' package; uses 'disordR' discipline <doi:10.48550/ARXIV.2210.03856>. To cite in publications please use Hankin 2017 <doi:10.32614/rj-2017-061>, and for Generalized Plackett-Luce likelihoods use Hankin 2024 <doi:10.18637/jss.v109.i08>.

Maintained by Robin K. S. Hankin. Last updated 3 days ago.

cpp

7.2 match 5 stars 6.01 score 38 scripts 1 dependents

spatstat

spatstat.data:Datasets for 'spatstat' Family

Contains all the datasets for the 'spatstat' family of packages.

Maintained by Adrian Baddeley. Last updated 22 hours ago.

kernel-density point-process spatial-analysis spatial-data spatial-data-analysis spatstat statistical-analysis statistical-methods statistical-tests statistics

3.8 match 6 stars 11.02 score 186 scripts 228 dependents

laresbernardo

lares:Analytics & Machine Learning Sidekick

Auxiliary package for better/faster analytics, visualization, data mining, and machine learning tasks. With a wide variety of family functions, like Machine Learning, Data Wrangling, Marketing Mix Modeling (Robyn), Exploratory, API, and Scrapper, it helps the analyst or data scientist to get quick and robust results, without the need of repetitive coding or advanced R programming skills.

Maintained by Bernardo Lares. Last updated 23 days ago.

analytics api automation automl data-science descriptive-statistics h2o machine-learning marketing mmm predictive-modeling puzzle rlanguage robyn visualization

4.0 match 233 stars 9.84 score 185 scripts 1 dependents

humaniverse

asylum:Data on Asylum and Resettlement for the UK

Data on Asylum and Resettlement for the UK, provided by the Home Office <https://www.gov.uk/government/statistical-data-sets/immigration-system-statistics-data-tables>.

Maintained by Matthew Gwynfryn Thomas. Last updated 17 days ago.

7.8 match 3 stars 4.99 score 36 scripts

valentint

rrcov:Scalable Robust Estimators with High Breakdown Point

Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point: principal component analysis (Filzmoser and Todorov (2013), <doi:10.1016/j.ins.2012.10.017>), linear and quadratic discriminant analysis (Todorov and Pires (2007)), multivariate tests (Todorov and Filzmoser (2010) <doi:10.1016/j.csda.2009.08.015>), outlier detection (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>). See also Todorov and Filzmoser (2009) <urn:isbn:978-3838108148>, Todorov and Filzmoser (2010) <doi:10.18637/jss.v032.i03> and Boudt et al. (2019) <doi:10.1007/s11222-019-09869-x>.

Maintained by Valentin Todorov. Last updated 7 months ago.

fortran openblas

3.4 match 2 stars 10.51 score 484 scripts 96 dependents

schochastics

networkdata:Repository of Network Datasets

The package contains a large collection of network dataset with different context. This includes social networks, animal networks and movie networks. All datasets are in 'igraph' format.

Maintained by David Schoch. Last updated 12 months ago.

dataset network-analysis

6.6 match 143 stars 5.01 score 143 scripts

mayer79

flashlight:Shed Light on Black Box Machine Learning Models

Shed light on black box machine learning models by the help of model performance, variable importance, global surrogate models, ICE profiles, partial dependence (Friedman J. H. (2001) <doi:10.1214/aos/1013203451>), accumulated local effects (Apley D. W. (2016) <arXiv:1612.08468>), further effects plots, interaction strength, and variable contribution breakdown (Gosiewska and Biecek (2019) <arxiv:1903.11420>). All tools are implemented to work with case weights and allow for stratified analysis. Furthermore, multiple flashlights can be combined and analyzed together.

Maintained by Michael Mayer. Last updated 8 months ago.

interpretability interpretable-machine-learning machine-learning xai

5.3 match 22 stars 6.25 score 54 scripts 1 dependents

billpetti

baseballr:Acquiring and Analyzing Baseball Data

Provides numerous utilities for acquiring and analyzing baseball data from online sources such as 'Baseball Reference' <https://www.baseball-reference.com/>, 'FanGraphs' <https://www.fangraphs.com/>, and the 'MLB Stats' API <https://www.mlb.com/>.

Maintained by Saiem Gilani. Last updated 4 months ago.

baseball pitchfx sabermetrics statcast

3.5 match 380 stars 8.98 score 582 scripts

alanarnholt

BSDA:Basic Statistics and Data Analysis

Data sets for book "Basic Statistics and Data Analysis" by Larry J. Kitchens.

Maintained by Alan T. Arnholt. Last updated 2 years ago.

3.4 match 7 stars 9.11 score 1.3k scripts 6 dependents

r-forge

robustbase:Basic Robust Statistics

"Essential" Robust Statistics. Tools allowing to analyze data with robust methods. This includes regression methodology including model selections and multivariate statistics where we strive to cover the book "Robust Statistics, Theory and Methods" by 'Maronna, Martin and Yohai'; Wiley 2006.

Maintained by Martin Maechler. Last updated 4 months ago.

fortran openblas

1.7 match 13.33 score 1.7k scripts 480 dependents

r-forge

Sleuth3:Data Sets from Ramsey and Schafer's "Statistical Sleuth (3rd Ed)"

Data sets from Ramsey, F.L. and Schafer, D.W. (2013), "The Statistical Sleuth: A Course in Methods of Data Analysis (3rd ed)", Cengage Learning.

Maintained by Berwin A Turlach. Last updated 1 years ago.

3.5 match 6.38 score 522 scripts

r-forge

Sleuth2:Data Sets from Ramsey and Schafer's "Statistical Sleuth (2nd Ed)"

Data sets from Ramsey, F.L. and Schafer, D.W. (2002), "The Statistical Sleuth: A Course in Methods of Data Analysis (2nd ed)", Duxbury.

Maintained by Berwin A Turlach. Last updated 1 years ago.

3.5 match 5.70 score 191 scripts

appliedstat

rQCC:Robust Quality Control Chart

Constructs various robust quality control charts based on the median or Hodges-Lehmann estimator (location) and the median absolute deviation (MAD) or Shamos estimator (scale). The estimators used for the robust control charts are all unbiased with a sample of finite size. For more details, see Park, Kim and Wang (2022) <doi:10.1080/03610918.2019.1699114>. In addition, using this R package, the conventional quality control charts such as X-bar, S, R, p, np, u, c, g, h, and t charts are also easily constructed. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1A2C1091319).

Maintained by Chanseok Park. Last updated 1 years ago.

control-chart goodness-of-fit r-language weibull

4.1 match 2 stars 4.70 score 3 scripts

hardin47

biwt:Functions to Compute the Biweight Mean Vector and Covariance and Correlation Matrices

The base functions compute multivariate location, scale, and correlation estimates based on Tukey's biweight M-estimator. Using the base function, the computations can be applied to a large number of observations to create either a matrix of biweight distances or biweight correlations.

Maintained by Johanna Hardin. Last updated 6 months ago.

3.4 match 5.58 score 16 scripts 2 dependents

jverzani

UsingR:Data Sets, Etc. for the Text "Using R for Introductory Statistics", Second Edition

A collection of data sets to accompany the textbook "Using R for Introductory Statistics," second edition.

Maintained by John Verzani. Last updated 3 years ago.

3.8 match 1 stars 4.97 score 1.4k scripts

jaseziv

worldfootballR:Extract and Clean World Football (Soccer) Data

Allow users to obtain clean and tidy football (soccer) game, team and player data. Data is collected from a number of popular sites, including 'FBref', transfer and valuations data from 'Transfermarkt'<https://www.transfermarkt.com/> and shooting location and other match stats data from 'Understat'<https://understat.com/>. It gives users the ability to access data more efficiently, rather than having to export data tables to files before being able to complete their analysis.

Maintained by Jason Zivkovic. Last updated 30 days ago.

fbref football football-data soccer-data sports-data transfermarkt understat

1.8 match 499 stars 10.22 score 516 scripts 2 dependents

rsoc

soc.ca:Specific Correspondence Analysis for the Social Sciences

Specific and class specific multiple correspondence analysis on survey-like data. Soc.ca is optimized to the needs of the social scientist and presents easily interpretable results in near publication ready quality.

Maintained by Anton Grau Larsen. Last updated 1 years ago.

4.3 match 14 stars 4.15 score 50 scripts

stevenmmortimer

rdfp:An Implementation of the 'DoubleClick for Publishers' API

Functions to interact with the 'Google DoubleClick for Publishers (DFP)' API <https://developers.google.com/ad-manager/api/start> (recently renamed to 'Google Ad Manager'). This package is automatically compiled from the API WSDL (Web Service Description Language) files to dictate how the API is structured. Theoretically, all API actions are possible using this package; however, care must be taken to format the inputs correctly and parse the outputs correctly. Please see the 'Google Ad Manager' API reference <https://developers.google.com/ad-manager/api/rel_notes> and this package's website <https://stevenmmortimer.github.io/rdfp/> for more information, documentation, and examples.

Maintained by Steven M. Mortimer. Last updated 6 years ago.

api-client api-wrapper dfp dfp-api doubleclick doubleclick-for-publishers google-dfp

2.5 match 16 stars 6.93 score 214 scripts

ropensci

rredlist:'IUCN' Red List Client

'IUCN' Red List (<https://api.iucnredlist.org/>) client. The 'IUCN' Red List is a global list of threatened and endangered species. Functions cover all of the Red List 'API' routes. An 'API' key is required.

Maintained by William Gearty. Last updated 1 months ago.

iucn biodiversity api web-services traits habitat species conservation api-wrapper iucn-red-list taxize

1.5 match 53 stars 11.49 score 195 scripts 24 dependents

immunogenomics

harmony:Fast, Sensitive, and Accurate Integration of Single Cell Data

Implementation of the Harmony algorithm for single cell integration, described in Korsunsky et al <doi:10.1038/s41592-019-0619-0>. Package includes a standalone Harmony function and interfaces to external frameworks.

Maintained by Ilya Korsunsky. Last updated 4 months ago.

algorithm data-integration scrna-seq openblas cpp

1.3 match 554 stars 13.74 score 5.5k scripts 8 dependents

gluc

data.tree:General Purpose Hierarchical Data Structure

Create tree structures from hierarchical data, and traverse the tree in various orders. Aggregate, cumulate, print, plot, convert to and from data.frame and more. Useful for decision trees, machine learning, finance, conversion from and to JSON, and many other applications.

Maintained by Christoph Glur. Last updated 5 months ago.

1.3 match 209 stars 12.84 score 1.1k scripts 88 dependents

business-science

timetk:A Tool Kit for Working with Time Series

Easy visualization, wrangling, and feature engineering of time series data for forecasting and machine learning prediction. Consolidates and extends time series functionality from packages including 'dplyr', 'stats', 'xts', 'forecast', 'slider', 'padr', 'recipes', and 'rsample'.

Maintained by Matt Dancho. Last updated 1 years ago.

coercion coercion-functions data-mining dplyr forecast forecasting forecasting-models machine-learning series-decomposition series-signature tibble tidy tidyquant tidyverse time time-series timeseries

1.1 match 625 stars 14.15 score 4.0k scripts 16 dependents

craig-parylo

cvdprevent:Wrapper for the 'CVD Prevent' Application Programming Interface

Provides an R wrapper to the 'CVD Prevent' application programming interface (API). Users can make API requests through built-in R functions. The Cardiovascular Disease Prevention Audit (CVDPREVENT) is an England-wide primary care audit that automatically extracts routinely held GP health data. <https://bmchealthdocs.atlassian.net/wiki/spaces/CP/pages/317882369/CVDPREVENT+API+Documentation>.

Maintained by Craig Parylo. Last updated 1 months ago.

3.2 match 3 stars 5.02 score 4 scripts

bioc

GenomicDataCommons:NIH / NCI Genomic Data Commons Access

Programmatically access the NIH / NCI Genomic Data Commons RESTful service.

Maintained by Sean Davis. Last updated 1 months ago.

dataimport sequencing api-client bioconductor bioinformatics cancer core-services data-science genomics nci tcga vignette

1.2 match 87 stars 11.94 score 238 scripts 12 dependents

thijsjanzen

junctions:The Breakdown of Genomic Ancestry Blocks in Hybrid Lineages

Individual based simulations of hybridizing populations, where the accumulation of junctions is tracked. Furthermore, mathematical equations are provided to verify simulation outcomes. Both simulations and mathematical equations are based on Janzen (2018, <doi:10.1101/058107>) and Janzen (2022, <doi:10.1111/1755-0998.13519>).

Maintained by Thijs Janzen. Last updated 3 days ago.

cpp

2.9 match 2 stars 4.71 score 17 scripts

mountainmath

cmhc:Access, Retrieve, and Work with CMHC Data

Wrapper around the Canadian Mortgage and Housing Corporation (CMHC) web interface. It enables programmatic and reproducible access to a wide variety of housing data from CMHC.

Maintained by Jens von Bergmann. Last updated 1 months ago.

1.9 match 20 stars 7.09 score 68 scripts

hafen

geofacet:'ggplot2' Faceting Utilities for Geographical Data

Provides geographical faceting functionality for 'ggplot2'. Geographical faceting arranges a sequence of plots of data for different geographical entities into a grid that preserves some of the geographical orientation.

Maintained by Ryan Hafen. Last updated 7 months ago.

geography ggplot2 visualization

1.3 match 340 stars 9.78 score 1.5k scripts 4 dependents

valentint

robust:Port of the S+ "Robust Library"

Methods for robust statistics, a state of the art in the early 2000s, notably for robust regression and robust multivariate analysis.

Maintained by Valentin Todorov. Last updated 7 months ago.

fortran openblas

1.7 match 7.52 score 572 scripts 8 dependents

giocomai

plausibler:Access Plausible Analytics API

Access Plausible Analytics API.

Maintained by Giorgio Comai. Last updated 3 days ago.

api-wrapper plausible-analytics

5.3 match 5 stars 2.40 score

valentint

rrcovNA:Scalable Robust Estimators with High Breakdown Point for Incomplete Data

Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point for Incomplete Data (missing values) (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>).

Maintained by Valentin Todorov. Last updated 3 months ago.

fortran

3.3 match 1 stars 3.77 score 59 scripts

uniprjrc

fsdaR:Robust Data Analysis Through Monitoring and Dynamic Visualization

Provides interface to the 'MATLAB' toolbox 'Flexible Statistical Data Analysis (FSDA)' which is comprehensive and computationally efficient software package for robust statistics in regression, multivariate and categorical data analysis. The current R version implements tools for regression: (forward search, S- and MM-estimation, least trimmed squares (LTS) and least median of squares (LMS)), for multivariate analysis (forward search, S- and MM-estimation), for cluster analysis and cluster-wise regression. The distinctive feature of our package is the possibility of monitoring the statistics of interest as a function of breakdown point, efficiency or subset size, depending on the estimator. This is accompanied by a rich set of graphical features, such as dynamic brushing, linking, particularly useful for exploratory data analysis.

Maintained by Valentin Todorov. Last updated 1 years ago.

openjdk

2.1 match 5 stars 5.37 score 93 scripts

selesnow

rfacebookstat:Load Data from Facebook API Marketing

Load data by campaigns, ads, ad sets and insights, ad account and business manager from Facebook Marketing API into R. For more details see official documents by Facebook Marketing API <https://developers.facebook.com/docs/marketing-apis/>.

Maintained by Alexey Seleznev. Last updated 26 days ago.

1.5 match 31 stars 7.43 score 48 scripts 1 dependents

ropensci

phylotaR:Automated Phylogenetic Sequence Cluster Identification from 'GenBank'

A pipeline for the identification, within taxonomic groups, of orthologous sequence clusters from 'GenBank' <https://www.ncbi.nlm.nih.gov/genbank/> as the first step in a phylogenetic analysis. The pipeline depends on a local alignment search tool and is, therefore, not dependent on differences in gene naming conventions and naming errors.

Maintained by Shixiang Wang. Last updated 8 months ago.

blastn genbank peer-reviewed phylogenetics sequence-alignment

1.9 match 23 stars 5.86 score 156 scripts

christopherkenny

censable:Making Census Data More Usable

Creates a common framework for organizing, naming, and gathering population, age, race, and ethnicity data from the Census Bureau. Accesses the API <https://www.census.gov/data/developers/data-sets.html>. Provides tools for adding information to existing data to line up with Census data.

Maintained by Christopher T. Kenny. Last updated 10 months ago.

1.9 match 8 stars 5.78 score 42 scripts 4 dependents

r4epi

epikit:Miscellaneous Helper Tools for Epidemiologists

Contains tools for formatting inline code, renaming redundant columns, aggregating age categories, adding survey weights, finding the earliest date of an event, plotting z-curves, generating population counts and calculating proportions with confidence intervals. This is part of the 'R4Epis' project <https://r4epis.netlify.app/>.

Maintained by Zhian N. Kamvar. Last updated 1 months ago.

1.7 match 10 stars 6.32 score 22 scripts 2 dependents

cran

NISTnls:Nonlinear least squares examples from NIST

Datasets for testing nonlinear regression routines.

Maintained by Douglas Bates. Last updated 13 years ago.

4.0 match 2.69 score 99 scripts

peterkdunn

GLMsData:Generalized Linear Model Data Sets

Data sets from the book Generalized Linear Models with Examples in R by Dunn and Smyth.

Maintained by Peter K. Dunn. Last updated 3 years ago.

4.0 match 2.61 score 220 scripts

odelmarcelle

sentopics:Tools for Joint Sentiment and Topic Analysis of Textual Data

A framework that joins topic modeling and sentiment analysis of textual data. The package implements a fast Gibbs sampling estimation of Latent Dirichlet Allocation (Griffiths and Steyvers (2004) <doi:10.1073/pnas.0307752101>) and Joint Sentiment/Topic Model (Lin, He, Everson and Ruger (2012) <doi:10.1109/TKDE.2011.48>). It offers a variety of helpers and visualizations to analyze the result of topic modeling. The framework also allows enriching topic models with dates and externally computed sentiment measures. A flexible aggregation scheme enables the creation of time series of sentiment or topical proportions from the enriched topic models. Moreover, a novel method jointly aggregates topic proportions and sentiment measures to derive time series of topical sentiment.

Maintained by Olivier Delmarcelle. Last updated 2 months ago.

openblas cpp openmp

1.9 match 8 stars 5.38 score 5 scripts

bpulidob

ehymet:Methodologies for Functional Data Based on the Epigraph and Hypograph Indices

Implements methods for functional data analysis based on the epigraph and hypograph indices. These methods transform functional datasets, whether in one or multiple dimensions, into multivariate datasets. The transformation involves applying the epigraph, hypograph, and their modified versions to both the original curves and their first and second derivatives. The calculation of these indices is tailored to the dimensionality of the functional dataset, with special considerations for dependencies between dimensions in multidimensional cases. This approach extends traditional multivariate data analysis techniques to the functional data setting. A key application of this package is the EHyClus method, which enhances clustering analysis for functional data across one or multiple dimensions using the epigraph and hypograph indices. See Pulido et al. (2023) <doi:10.1007/s11222-023-10213-7> and Pulido et al. (2024) <doi:10.48550/arXiv.2307.16720>.

Maintained by Belen Pulido. Last updated 4 months ago.

1.5 match 5.04 score 8 scripts

johnchaston

MAGNAMWAR:A Pipeline for Meta-Genome Wide Association

Correlates variation within the meta-genome to target species phenotype variations in meta-genome with association studies. Follows the pipeline described in Chaston, J.M. et al. (2014) <doi:10.1128/mBio.01631-14>.

Maintained by John Chaston. Last updated 7 years ago.

1.9 match 3.90 score 16 scripts

girelaignacio

mpitbR:Calculate Alkire-Foster Multidimensional Poverty Measures

Estimate Multidimensional Poverty Indices disaggregated by population subgroups based on the Alkire and Foster method (2011) <doi:10.1016/j.jpubeco.2010.11.006>. This includes the calculation of standard errors and confidence intervals. Other partial indices such as incidence, intensity and indicator-specific measures as well as intertemporal changes analysis can also be estimated. The standard errors and confidence intervals are calculated considering the complex survey design.

Maintained by Ignacio Girela. Last updated 1 months ago.

1.5 match 4.85 score 1 scripts

tmaturi

NPIstats:Nonparametric Predictive Inference

An implementation of the Nonparametric Predictive Inference approach in R. It provides tools for quantifying uncertainty via lower and upper probabilities. It includes useful functions for pairwise and multiple comparisons: comparing two groups with and without terminated tails, selecting the best group, selecting the subset of best groups, selecting the subset including the best group.

Maintained by Tahani Coolen-Maturi. Last updated 4 years ago.

3.6 match 2.00 score 4 scripts

fifis

pnd:Parallel Numerical Derivatives, Gradients, Jacobians, and Hessians of Arbitrary Accuracy Order

Numerical derivatives through finite-difference approximations can be calculated using the 'pnd' package with parallel capabilities and optimal step-size selection to improve accuracy. These functions facilitate efficient computation of derivatives, gradients, Jacobians, and Hessians, allowing for more evaluations to reduce the mathematical and machine errors. Designed for compatibility with the 'numDeriv' package, which has not received updates in several years, it introduces advanced features such as computing derivatives of arbitrary order, improving the accuracy of Hessian approximations by avoiding repeated differencing, and parallelising slow functions on Windows, Mac, and Linux.

Maintained by Andreï Victorovitch Kostyrka. Last updated 5 days ago.

1.3 match 1 stars 5.22 score 5 scripts

c-monaghan

lwc2022:Langa-Weir Classification of Cognitive Function for 2022 HRS Data

Generates the Langa-Weir classification of cognitive function for the 2022 Health and Retirement Study (HRS) cognition data. It is particularly useful for researchers studying cognitive aging who wish to work with the most recent release of HRS data. The package provides user-friendly functions for data preprocessing, scoring, and classification allowing users to easily apply the Langa-Weir classification system. For details regarding the; HRS <https://hrsdata.isr.umich.edu/> and Langa-Weir classifications <https://hrsdata.isr.umich.edu/data-products/langa-weir-classification-cognitive-function-1995-2020>.

Maintained by Cormac Monaghan. Last updated 4 months ago.

1.5 match 4.48 score 4 scripts

nicholasjcooper

NCmisc:Miscellaneous Functions for Creating Adaptive Functions and Scripts

A set of handy functions. Includes a versatile one line progress bar, one line function timer with detailed output, time delay function, text histogram, object preview, CRAN package search, simpler package installer, Linux command install check, a flexible Mode function, top function, simulation of correlated data, and more.

Maintained by Nicholas Cooper. Last updated 2 years ago.

1.7 match 3.86 score 172 scripts 5 dependents

sakoehler7

eesim:Simulate and Evaluate Time Series for Environmental Epidemiology

Provides functions to create simulated time series of environmental exposures (e.g., temperature, air pollution) and health outcomes for use in power analysis and simulation studies in environmental epidemiology. This package also provides functions to evaluate the results of simulation studies based on these simulated time series. This work was supported by a grant from the National Institute of Environmental Health Sciences (R00ES022631) and a fellowship from the Colorado State University Programs for Research and Scholarly Excellence.

Maintained by Brooke Anderson. Last updated 8 years ago.

1.2 match 8 stars 5.23 score 42 scripts

selesnow

racademyocean:Client for 'AcademyOcean API'

Provide function for work with 'AcademyOcean API' <https://academyocean.com/api>.

Maintained by Alexey Seleznev. Last updated 7 months ago.

1.9 match 3.00 score 3 scripts

tonyfischetti

libbib:Various Utilities for Library Science/Assessment and Cataloging

Provides functions for validating and normalizing bibliographic codes such as ISBN, ISSN, and LCCN. Also includes functions to communicate with the WorldCat API, translate Call numbers (Library of Congress and Dewey Decimal) to their subject classifications or subclassifications, and provides various loadable data files such call number / subject crosswalks and code tables.

Maintained by Tony Fischetti. Last updated 2 years ago.

1.7 match 3.20 score 32 scripts

jimlemon

prettyR:Pretty Descriptive Stats

Functions for conventionally formatting descriptive stats, reshaping data frames and formatting R output as HTML.

Maintained by Jim Lemon. Last updated 6 years ago.

1.8 match 3.13 score 207 scripts 1 dependents

christopherkenny

ppmf:Read Census Privacy Protected Microdata Files

Implements data processing described in <doi:10.1126/sciadv.abk3283> to align modern differentially private data with formatting of older US Census data releases. The primary goal is to read in Census Privacy Protected Microdata Files data in a reproducible way. This includes tools for aggregating to relevant levels of geography by creating geographic identifiers which match the US Census Bureau's numbering. Additionally, there are tools for grouping race numeric identifiers into categories, consistent with OMB (Office of Management and Budget) classifications. Functions exist for downloading and linking to existing sources of privacy protected microdata.

Maintained by Christopher T. Kenny. Last updated 2 years ago.

2.0 match 1 stars 2.70 score 3 scripts

inventionate

TimeSpaceAnalysis:Statistical tools for time-space analysis

Use Geometric Data Analysis approaches (e.g. MCA or MFA), time pattern analysis (see "time sequence clustering") and places chronologies (see "time geography") analysis.

Maintained by Fabian Mundt. Last updated 6 days ago.

1.8 match 2.48 score 2 scripts

gowerc

diffdf:Dataframe Difference Tool

Functions for comparing two data.frames against each other. The core functionality is to provide a detailed breakdown of any differences between two data.frames as well as providing utility functions to help narrow down the source of problems and differences.

Maintained by Craig Gower-Page. Last updated 6 months ago.

0.5 match 48 stars 7.55 score 135 scripts

imranshakoor

DataSetsUni:A Collection of Univariate Data Sets

A collection of widely used univariate data sets of various applied domains on applications of distribution theory. The functions allow researchers and practitioners to quickly, easily, and efficiently access and use these data sets. The data are related to different applied domains and as follows: Bio-medical, survival analysis, medicine, reliability analysis, hydrology, actuarial science, operational research, meteorology, extreme values, quality control, engineering, finance, sports and economics. The total 100 data sets are documented along with associated references for further details and uses.

Maintained by Muhammad Imran. Last updated 2 years ago.

3.6 match 1.00 score 1 scripts

fauvernierma

survPen:Multidimensional Penalized Splines for (Excess) Hazard Models, Relative Mortality Ratio Models and Marginal Intensity Models

Fits (excess) hazard, relative mortality ratio or marginal intensity models with multidimensional penalized splines allowing for time-dependent effects, non-linear effects and interactions between several continuous covariates. In survival and net survival analysis, in addition to modelling the effect of time (via the baseline hazard), one has often to deal with several continuous covariates and model their functional forms, their time-dependent effects, and their interactions. Model specification becomes therefore a complex problem and penalized regression splines represent an appealing solution to that problem as splines offer the required flexibility while penalization limits overfitting issues. Current implementations of penalized survival models can be slow or unstable and sometimes lack some key features like taking into account expected mortality to provide net survival and excess hazard estimates. In contrast, survPen provides an automated, fast, and stable implementation (thanks to explicit calculation of the derivatives of the likelihood) and offers a unified framework for multidimensional penalized hazard and excess hazard models. Later versions (>2.0.0) include penalized models for relative mortality ratio, and marginal intensity in recurrent event setting. survPen may be of interest to those who 1) analyse any kind of time-to-event data: mortality, disease relapse, machinery breakdown, unemployment, etc 2) wish to describe the associated hazard and to understand which predictors impact its dynamics, 3) wish to model the relative mortality ratio between a cohort and a reference population, 4) wish to describe the marginal intensity for recurrent event data. See Fauvernier et al. (2019a) <doi:10.21105/joss.01434> for an overview of the package and Fauvernier et al. (2019b) <doi:10.1111/rssc.12368> for the method.

Maintained by Mathieu Fauvernier. Last updated 3 months ago.

cpp

0.5 match 12 stars 6.82 score 85 scripts 1 dependents

cran

rpnf:Point and Figure Package

A set of functions to analyze and print the development of a commodity using the Point and Figure (P&F) approach. A P&F processor can be used to calculate daily statistics for the time series. These statistics can be used for deeper investigations as well as to create plots. Plots can be generated as well known X/O Plots in plain text format, and additionally in a more graphical format.

Maintained by Sascha Herrmann. Last updated 9 years ago.

1.6 match 3 stars 1.95 score 30 scripts

pbiecek

ceterisParibus:Ceteris Paribus Profiles

Ceteris Paribus Profiles (What-If Plots) are designed to present model responses around selected points in a feature space. For example around a single prediction for an interesting observation. Plots are designed to work in a model-agnostic fashion, they are working for any predictive Machine Learning model and allow for model comparisons. Ceteris Paribus Plots supplement the Break Down Plots from 'breakDown' package.

Maintained by Przemyslaw Biecek. Last updated 5 years ago.

0.5 match 42 stars 5.48 score 36 scripts

macmillancontentscience

morphemepiece:Morpheme Tokenization

Tokenize text into morphemes. The morphemepiece algorithm uses a lookup table to determine the morpheme breakdown of words, and falls back on a modified wordpiece tokenization algorithm for words not found in the lookup table.

Maintained by Jonathan Bratt. Last updated 3 years ago.

0.5 match 11 stars 5.04 score 8 scripts

muschellij2

glassdoor:Interface to 'Glassdoor' API

Interacts with the 'Glassdoor' API <https://www.glassdoor.com/developer/index.htm>. Allows the user to search job statistics, employer statistics, and job progression, where 'Glassdoor' provides a breakdown of other jobs a person did after their current one.

Maintained by John Muschelli. Last updated 6 years ago.

0.5 match 6 stars 4.59 score 13 scripts

luqqe

mblm:Median-Based Linear Models

Provides linear models based on Theil-Sen single median and Siegel repeated medians. They are very robust (29 or 50 percent breakdown point, respectively), and if no outliers are present, the estimators are very similar to OLS.

Maintained by Lukasz Komsta. Last updated 6 years ago.

0.5 match 4.38 score 150 scripts 5 dependents

gcohenfr

riv:Robust Instrumental Variables Estimator

Finds a robust instrumental variables estimator using a high breakdown point S-estimator of multivariate location and scatter matrix.

Maintained by Gabriela Cohen-Freue. Last updated 7 years ago.

0.5 match 1.00 score 7 scripts