Showing 200 of 438 total results
gssrdoc:Document General Social Survey Variable
The General Social Survey (GSS) is a long-running, mostly annual survey of US households. It is administered by the National Opinion Research Center (NORC). This package contains a tibble with information on the survey variables, together with every variable documented as an R help page. For more information on the GSS see \url{}.
Maintained by Kieran Healy. Last updated 11 months ago.
734.8 match 2.28 score 38 scripts
jaredhuling
personalized:Estimation and Validation Methods for Subgroup Identification and Personalized Medicine
Provides functions for fitting and validation of models for subgroup identification and personalized medicine / precision medicine under the general subgroup identification framework of Chen et al. (2017) <doi:10.1111/biom.12676>. This package is intended for use for both randomized controlled trials and observational studies and is described in detail in Huling and Yu (2021) <doi:10.18637/jss.v098.i05>.
Maintained by Jared Huling. Last updated 3 years ago.
68.8 match 32 stars 7.38 score 125 scripts 1 dependent
ropensci
charlatan:Make Fake Data
Make fake data that looks realistic, supporting addresses, person names, dates, times, colors, coordinates, currencies, digital object identifiers ('DOIs'), jobs, phone numbers, 'DNA' sequences, doubles and integers from distributions and within a range.
Maintained by Roel M. Hogervorst. Last updated 1 month ago.
48.5 match 296 stars 10.06 score 180 scripts 1 dependent
alanarnholt
BSDA:Basic Statistics and Data Analysis
Data sets for book "Basic Statistics and Data Analysis" by Larry J. Kitchens.
Maintained by Alan T. Arnholt. Last updated 2 years ago.
27.1 match 7 stars 9.11 score 1.3k scripts 6 dependents
revelle
psychTools:Tools to Accompany the 'psych' Package for Psychological Research
Support functions, data sets, and vignettes for the 'psych' package. Contains several of the biggest data sets for the 'psych' package as well as four vignettes. A few helper functions for file manipulation are included as well. For more information, see the <> web page.
Maintained by William Revelle. Last updated 12 months ago.
38.2 match 5.89 score 178 scripts 5 dependents
therneau
survival:Survival Analysis
Contains the core survival analysis routines, including definition of Surv objects, Kaplan-Meier and Aalen-Johansen (multi-state) curves, Cox models, and parametric accelerated failure time models.
Maintained by Terry M Therneau. Last updated 3 months ago.
10.9 match 400 stars 20.43 score 29k scripts 3.9k dependents
jansteinfeld
PP:Person Parameter Estimation
The PP package includes estimation of (MLE, WLE, MAP, EAP, ROBUST) person parameters for the 1,2,3,4-PL model and the GPCM (generalized partial credit model). The parameters are estimated under the assumption that the item parameters are known and fixed. The package is useful e.g. in the case that items from an item pool / item bank with known item parameters are administered to a new population of test-takers and an ability estimation for every test-taker is needed.
Maintained by Jan Steinfeld. Last updated 2 years ago.
31.6 match 1 star 6.76 score 43 scripts 1 dependent
microsoft
wpa:Tools for Analysing and Visualising Viva Insights Data
Opinionated functions that enable easier and faster analysis of Viva Insights data. There are three main types of functions in 'wpa': (1) Standard functions create a 'ggplot' visual or a summary table based on a specific Viva Insights metric; (2) Report Generation functions generate HTML reports on a specific analysis area, e.g. Collaboration; (3) Other miscellaneous functions cover more specific applications (e.g. Subject Line text mining) of Viva Insights data. This package adheres to 'tidyverse' principles and works well with the pipe syntax. 'wpa' is built with the beginner-to-intermediate R users in mind, and is optimised for simplicity.
Maintained by Martin Chan. Last updated 4 months ago.
27.4 match 30 stars 6.69 score 39 scripts 1 dependent
alexanderrobitzsch
sirt:Supplementary Item Response Theory Models
Supplementary functions for item response models aiming to complement existing R packages. The functionality includes among others multidimensional compensatory and noncompensatory IRT models (Reckase, 2009, <doi:10.1007/978-0-387-89976-3>), MCMC for hierarchical IRT models and testlet models (Fox, 2010, <doi:10.1007/978-1-4419-0742-4>), NOHARM (McDonald, 1982, <doi:10.1177/014662168200600402>), Rasch copula model (Braeken, 2011, <doi:10.1007/s11336-010-9190-4>; Schroeders, Robitzsch & Schipolowski, 2014, <doi:10.1111/jedm.12054>), faceted and hierarchical rater models (DeCarlo, Kim & Johnson, 2011, <doi:10.1111/j.1745-3984.2011.00143.x>), ordinal IRT model (ISOP; Scheiblechner, 1995, <doi:10.1007/BF02301417>), DETECT statistic (Stout, Habing, Douglas & Kim, 1996, <doi:10.1177/014662169602000403>), local structural equation modeling (LSEM; Hildebrandt, Luedtke, Robitzsch, Sommer & Wilhelm, 2016, <doi:10.1080/00273171.2016.1142856>).
Maintained by Alexander Robitzsch. Last updated 3 months ago.
14.3 match 23 stars 10.01 score 280 scripts 22 dependents
microsoft
vivainsights:Analyze and Visualize Data from 'Microsoft Viva Insights'
Provides a versatile range of functions, including exploratory data analysis, time-series analysis, organizational network analysis, and data validation, while also implementing a set of best practices for analyzing and visualizing data specific to 'Microsoft Viva Insights'.
Maintained by Martin Chan. Last updated 24 days ago.
23.0 match 11 stars 6.12 score 68 scripts
ropengov
hetu:Structural Handling of Finnish Personal Identity Codes
Structural handling of Finnish identity codes (natural persons and organizations); extract information, check ID validity and diagnostics.
Maintained by Pyry Kantanen. Last updated 3 months ago.
22.9 match 2 stars 4.86 score 18 scripts
paws-r
paws:Amazon Web Services Software Development Kit
Interface to Amazon Web Services, including storage, database, and compute services, such as 'Simple Storage Service' ('S3'), 'DynamoDB' 'NoSQL' database, and 'Lambda' functions-as-a-service.
Maintained by Dyfan Jones. Last updated 4 days ago.
9.3 match 332 stars 11.25 score 177 scripts 12 dependents
person-c
easybio:Comprehensive Single-Cell Annotation and Transcriptomic Analysis Toolkit
Provides a comprehensive toolkit for single-cell annotation with the 'CellMarker2.0' database (see Xia Li, Peng Wang, Yunpeng Zhang (2023) <doi: 10.1093/nar/gkac947>). Streamlines biological label assignment in single-cell RNA-seq data and facilitates transcriptomic analysis, including preparation of TCGA and GEO datasets, differential expression analysis and visualization of enrichment analysis results. Additional utility functions support various bioinformatics workflows. See Wei Cui (2024) <doi: 10.1101/2024.09.14.609619> for more details.
Maintained by Wei Cui. Last updated 13 days ago.
15.0 match 10 stars 6.62 score 35 scripts
obiba
opalr:'Opal' Data Repository Client and 'DataSHIELD' Utils
Data integration Web application for biobanks by 'OBiBa'. 'Opal' is the core database application for biobanks. Participant data, once collected from any data source, must be integrated and stored in a central data repository under a uniform model. 'Opal' is such a central repository. It can import, process, validate, query, analyze, report, and export data. 'Opal' is typically used in a research center to analyze the data acquired at assessment centres. Its ultimate purpose is to achieve seamless data-sharing among biobanks. This 'Opal' client allows users to interact with 'Opal' web services and to perform operations on the R server side. 'DataSHIELD' administration tools are also provided.
Maintained by Yannick Marcon. Last updated 2 months ago.
12.7 match 3 stars 7.76 score 179 scripts 2 dependents
jorgetendeiro
PerFit:Person Fit
Several person-fit statistics (PFSs; Meijer and Sijtsma, 2001, <doi:10.1177/01466210122031957>) are offered. These statistics allow assessing whether individual response patterns to tests or questionnaires are (im)plausible given the other respondents in the sample or given a specified item response theory model. Some PFSs apply to dichotomous data, such as the likelihood-based PFSs (lz, lz*) and the group-based PFSs (personal biserial correlation, caution index, (normed) number of Guttman errors, agreement/disagreement/dependability statistics, U3, ZU3, NCI, Ht). PFSs suitable to polytomous data include extensions of lz, U3, and (normed) number of Guttman errors.
Maintained by Jorge N. Tendeiro. Last updated 3 years ago.
28.7 match 1 star 3.36 score 46 scripts
sjentsch
jmvReadWrite:Read and Write 'jamovi' Files ('.omv')
The free and open statistical spreadsheet 'jamovi' aims to make statistical analyses easy and intuitive. 'jamovi' produces syntax that can directly be used in R (in connection with the R-package 'jmv'). Having import / export routines for the data files 'jamovi' produces ('.omv') permits an easy transfer of data and analyses between 'jamovi' and R.
Maintained by Sebastian Jentschke. Last updated 1 day ago.
15.2 match 5 stars 6.09 score 32 scripts
g-corbelli
persval:Computing Personal Values Scores
Compute personal values scores from various questionnaires based on the theoretical constructs proposed by professor Shalom H. Schwartz. Designed for researchers and practitioners in psychology, sociology, and related fields, the package facilitates the quantification of different dimensions related to personal values from survey data. It incorporates the recommended statistical adjustment to enhance the accuracy and interpretation of the results. Note: The package 'persval' is independently developed based on the personal values theoretical framework, and is not directly endorsed by professor Schwartz.
Maintained by Giuseppe Corbelli. Last updated 7 months ago.
28.1 match 2 stars 3.30 score
jhhmuc
pairwise:Rasch Model Parameters by Pairwise Algorithm
Performs the explicit calculation -- not estimation! -- of the Rasch item parameters for dichotomous and polytomous item responses, using a pairwise comparison approach. Person parameters (WLE) are calculated according to Warm's weighted likelihood approach.
Maintained by Joerg-Henrik Heine. Last updated 2 years ago.
23.2 match 3.96 score 38 scripts 1 dependent
florale
multilevelcoda:Estimate Bayesian Multilevel Models for Compositional Data
Implement Bayesian Multilevel Modelling for compositional data in a multilevel framework. Compute multilevel compositional data and Isometric log ratio (ILR) at between and within-person levels, fit Bayesian multilevel models for compositional predictors and outcomes, and run post-hoc analyses such as isotemporal substitution models. References: Le, Stanford, Dumuid, and Wiley (2024) <doi:10.48550/arXiv.2405.03985>, Le, Dumuid, Stanford, and Wiley (2024) <doi:10.48550/arXiv.2411.12407>.
Maintained by Flora Le. Last updated 3 days ago.
10.8 match 14 stars 8.31 score 118 scripts
friendly
heplots:Visualizing Hypothesis Tests in Multivariate Linear Models
Provides HE plot and other functions for visualizing hypothesis tests in multivariate linear models. HE plots represent sums-of-squares-and-products matrices for linear hypotheses and for error using ellipses (in two dimensions) and ellipsoids (in three dimensions). The related 'candisc' package provides visualizations in a reduced-rank canonical discriminant space when there are more than a few response variables.
Maintained by Michael Friendly. Last updated 9 days ago.
7.3 match 9 stars 11.49 score 1.1k scripts 7 dependents
paws-r
paws.machine.learning:'Amazon Web Services' Machine Learning Services
Interface to 'Amazon Web Services' machine learning services, including 'SageMaker' managed machine learning service, natural language processing, speech recognition, translation, and more.
Maintained by Dyfan Jones. Last updated 4 days ago.
9.3 match 332 stars 9.05 score 14 dependents
marsicofl
mispitools:Missing Person Identification Tools
An open source software package written in R statistical language. It consists of a set of decision-making tools to conduct missing person searches. Particularly, it allows computing optimal LR threshold for declaring potential matches in DNA-based database search. More recently 'mispitools' incorporates preliminary investigation data based LRs. Statistical weight of different traces of evidence such as biological sex, age and hair color are presented. For citing mispitools please use the following references: Marsico and Caridi, 2023 <doi:10.1016/j.fsigen.2023.102891> and Marsico, Vigeland et al. 2021 <doi:10.1016/j.fsigen.2021.102519>.
Maintained by Franco Marsico. Last updated 3 months ago.
12.2 match 35 stars 6.74 score 19 scripts 1 dependent
magnusdv
forrel:Forensic Pedigree Analysis and Relatedness Inference
Forensic applications of pedigree analysis, including likelihood ratios for relationship testing, general relatedness inference, marker simulation, and power analysis. 'forrel' is part of the 'pedsuite', a collection of packages for pedigree analysis, further described in the book 'Pedigree Analysis in R' (Vigeland, 2021, ISBN:9780128244302). Several functions deal specifically with power analysis in missing person cases, implementing methods described in Vigeland et al. (2020) <doi:10.1016/j.fsigen.2020.102376>. Data import from the 'Familias' software (Egeland et al. (2000) <doi:10.1016/S0379-0738(00)00147-X>) is supported through the 'pedFamilias' package.
Maintained by Magnus Dehli Vigeland. Last updated 5 days ago.
11.6 match 11 stars 6.98 score 63 scripts 7 dependents
cddesja
profileR:Profile Analysis of Multivariate Data in R
A suite of multivariate methods and data visualization tools to implement profile analysis and cross-validation techniques described in Davison & Davenport (2002) <DOI: 10.1037/1082-989X.7.4.468>, Bulut (2013), and other published and unpublished resources. The package includes routines to perform criterion-related profile analysis, profile analysis via multidimensional scaling, moderated profile analysis, generalizability theory, profile analysis by group, and a within-person factor model to derive score profiles.
Maintained by Christopher David Desjardins. Last updated 2 years ago.
14.1 match 3 stars 5.65 score 50 scripts
ludovikcoba
rrecsys:Environment for Evaluating Recommender Systems
Processes standard recommendation datasets (e.g., a user-item rating matrix) as input and generates rating predictions and lists of recommended items. Standard algorithm implementations which are included in this package are the following: Global/Item/User-Average baselines, Weighted Slope One, Item-Based KNN, User-Based KNN, FunkSVD, BPR and weighted ALS. They can be assessed according to the standard offline evaluation methodology (Shani, et al. (2011) <doi:10.1007/978-0-387-85820-3_8>) for recommender systems using measures such as MAE, RMSE, Precision, Recall, F1, AUC, NDCG, RankScore and coverage measures. The package (Coba, et al. (2017) <doi: 10.1007/978-3-319-60042-0_36>) is intended for rapid prototyping of recommendation algorithms and educational purposes.
Maintained by Ludovik Çoba. Last updated 3 years ago.
11.0 match 23 stars 6.84 score 25 scripts
r-lib
gh:'GitHub' 'API'
Minimal client to access the 'GitHub' 'API'.
Maintained by Gábor Csárdi. Last updated 1 month ago.
4.8 match 224 stars 15.55 score 444 scripts 401 dependents
mwheymans
psfmi:Prediction Model Pooling, Selection and Performance Evaluation Across Multiply Imputed Datasets
Pooling, backward and forward selection of linear, logistic and Cox regression models in multiply imputed datasets. Backward and forward selection can be done from the pooled model using Rubin's Rules (RR), the D1, D2, D3, D4 and the median p-values method. This is also possible for Mixed models. The models can contain continuous, dichotomous, categorical and restricted cubic spline predictors and interaction terms between all these types of predictors. The stability of the models can be evaluated using (cluster) bootstrapping. The package further contains functions to pool model performance measures such as ROC/AUC, Reclassification, R-squared, scaled Brier score, H&L test and calibration plots for logistic regression models. Internal validation can be done across multiply imputed datasets with cross-validation or bootstrapping. The adjusted intercept after shrinkage of pooled regression coefficients can be obtained. Backward and forward selection as part of internal validation is possible. A function to externally validate logistic prediction models in multiply imputed datasets is available, as is a function to compare models. For Cox models a strata variable can be included. Eekhout (2017) <doi:10.1186/s12874-017-0404-7>. Wiel (2009) <doi:10.1093/biostatistics/kxp011>. Marshall (2009) <doi:10.1186/1471-2288-9-57>.
Maintained by Martijn Heymans. Last updated 2 years ago.
10.0 match 10 stars 7.17 score 70 scripts
richardli
SUMMER:Small-Area-Estimation Unit/Area Models and Methods for Estimation in R
Provides methods for spatial and spatio-temporal smoothing of demographic and health indicators using survey data, with particular focus on estimating and projecting under-five mortality rates, described in Mercer et al. (2015) <doi:10.1214/15-AOAS872>, Li et al. (2019) <doi:10.1371/journal.pone.0210645>, Wu et al. (DHS Spatial Analysis Reports No. 21, 2021), and Li et al. (2023) <doi:10.48550/arXiv.2007.05117>.
Maintained by Zehang R Li. Last updated 2 months ago.
6.9 match 23 stars 10.28 score 134 scripts 2 dependents
erossiter
catSurv:Computerized Adaptive Testing for Survey Research
Provides methods of computerized adaptive testing for survey researchers. See Montgomery and Rossiter (2020) <doi:10.1093/jssam/smz027>. Includes functionality for data fit with the classic item response methods including the latent trait model, Birnbaum's three-parameter model, the graded response model, and the generalized partial credit model. Additionally, includes several ability parameter estimation and item selection routines. During item selection, all calculations are done in compiled C++ code.
Maintained by Erin Rossiter. Last updated 10 months ago.
15.1 match 12 stars 4.68 score 3 scripts
pmair78
eRm:Extended Rasch Modeling
Fits Rasch models (RM), linear logistic test models (LLTM), rating scale model (RSM), linear rating scale models (LRSM), partial credit models (PCM), and linear partial credit models (LPCM). Missing values are allowed in the data matrix. Additional features are the ML estimation of the person parameters, Andersen's LR-test, item-specific Wald test, Martin-Loef-Test, nonparametric Monte-Carlo Tests, itemfit and personfit statistics including infit and outfit measures, ICC and other plots, automated stepwise item elimination, simulation module for various binary data matrices.
Maintained by Patrick Mair. Last updated 1 year ago.
11.0 match 4 stars 6.42 score 182 scripts 5 dependents
hojsgaard
doBy:Groupwise Statistics, LSmeans, Linear Estimates, Utilities
Utility package containing: 1) Facilities for working with grouped data: 'do' something to data stratified 'by' some variables. 2) LSmeans (least-squares means), general linear estimates. 3) Restrict functions to a smaller domain. 4) Miscellaneous other utilities.
Maintained by Søren Højsgaard. Last updated 4 days ago.
4.5 match 1 star 14.94 score 3.2k scripts 939 dependents
rstudio
gt:Easily Create Presentation-Ready Display Tables
Build display tables from tabular data with an easy-to-use set of functions. With its progressive approach, we can construct display tables with a cohesive set of table parts. Table values can be formatted using any of the included formatting functions. Footnotes and cell styles can be precisely added through a location targeting system. The way in which 'gt' handles things for you means that you don't often have to worry about the fine details.
Maintained by Richard Iannone. Last updated 11 days ago.
3.6 match 2.1k stars 18.36 score 20k scripts 112 dependents
trinker
qdap:Bridging the Gap Between Qualitative Data and Quantitative Analysis
Automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis; however, many functions are applicable to other areas of Text Mining / Natural Language Processing.
Maintained by Tyler Rinker. Last updated 4 years ago.
6.7 match 176 stars 9.61 score 1.3k scripts 3 dependents
drizopoulos
JMbayes2:Extended Joint Models for Longitudinal and Time-to-Event Data
Fit joint models for longitudinal and time-to-event data under the Bayesian approach. Multiple longitudinal outcomes of mixed type (continuous/categorical) and multiple event times (competing risks and multi-state processes) are accommodated. Rizopoulos (2012, ISBN:9781439872864).
Maintained by Dimitris Rizopoulos. Last updated 11 days ago.
7.5 match 84 stars 8.27 score 264 scripts 2 dependents
egeulgen
PANACEA:Personalized Network-Based Anti-Cancer Therapy Evaluation
Identification of the most appropriate pharmacotherapy for each patient based on genomic alterations is a major challenge in personalized oncology. 'PANACEA' is a collection of personalized anti-cancer drug prioritization approaches utilizing network methods. The methods utilize personalized "driverness" scores from 'driveR' to rank drugs, mapping these onto a protein-protein interaction network. The "distance-based" method scores each drug based on these scores and distances between drugs and genes to rank given drugs. The "RWR" method propagates these scores via a random-walk with restart framework to rank the drugs. The methods are described in detail in Ulgen E, Ozisik O, Sezerman OU. 2023. PANACEA: network-based methods for pharmacotherapy prioritization in personalized oncology. Bioinformatics <doi:10.1093/bioinformatics/btad022>.
Maintained by Ege Ulgen. Last updated 2 years ago.
13.0 match 10 stars 4.70 score 3 scripts
lbbe-software
fitdistrplus:Help to Fit of a Parametric Distribution to Non-Censored or Censored Data
Extends the fitdistr() function (of the MASS package) with several functions to help the fit of a parametric distribution to non-censored or censored data. Censored data may contain left censored, right censored and interval censored values, with several lower and upper bounds. In addition to maximum likelihood estimation (MLE), the package provides moment matching (MME), quantile matching (QME), maximum goodness-of-fit estimation (MGE) and maximum spacing estimation (MSE) methods (available only for non-censored data). Weighted versions of MLE, MME, QME and MSE are available. See e.g. Casella & Berger (2002), Statistical inference, Pacific Grove, for a general introduction to parametric estimation.
Maintained by Aurélie Siberchicot. Last updated 13 days ago.
3.6 match 54 stars 16.15 score 4.5k scripts 153 dependents
tilltnet
egor:Import and Analyse Ego-Centered Network Data
Tools for importing, analyzing and visualizing ego-centered network data. Supports several data formats, including the export formats of 'EgoNet', 'EgoWeb 2.0' and 'openeddi'. An interactive (shiny) app for the intuitive visualization of ego-centered networks is provided. Also included are procedures for creating and visualizing Clustered Graphs (Lerner 2008 <DOI:10.1109/PACIFICVIS.2008.4475458>).
Maintained by Till Krenz. Last updated 13 days ago.
6.7 match 24 stars 8.64 score 76 scripts 2 dependents
laresbernardo
lares:Analytics & Machine Learning Sidekick
Auxiliary package for better/faster analytics, visualization, data mining, and machine learning tasks. With a wide variety of family functions, like Machine Learning, Data Wrangling, Marketing Mix Modeling (Robyn), Exploratory, API, and Scrapper, it helps the analyst or data scientist to get quick and robust results, without the need of repetitive coding or advanced R programming skills.
Maintained by Bernardo Lares. Last updated 24 days ago.
5.7 match 233 stars 9.84 score 185 scripts 1 dependent
jl5000
tidyged:Handle GEDCOM Files Using Tidyverse Principles
Create and summarise family tree GEDCOM files using tidy dataframes.
Maintained by Jamie Lendrum. Last updated 3 years ago.
8.9 match 8 stars 5.96 score 23 scripts 3 dependents
fatelarico
FinNet:Quickly Build and Manipulate Financial Networks
Provides classes, methods, and functions to deal with financial networks. Users can easily store information about both physical and legal persons by using pre-made classes that are designed for integration with scraping packages such as 'rvest' and 'RSelenium'. Moreover, the package assists in creating various types of financial networks depending on the relation under scrutiny (ownership, board interlocks, etc.) and the desired tie type (valued or binary), and renders them in the most common formats (adjacency matrix, incidence matrix, edge list, 'igraph', 'network'). There are also ad-hoc functions for the Fiedler value, global network efficiency, and cascade-failure analysis.
Maintained by Fabio Ashtar Telarico. Last updated 5 months ago.
10.7 match 2 stars 4.78 score 7 scripts
wadpac
GGIR:Raw Accelerometer Data Analysis
A tool to process and analyse data collected with wearable raw acceleration sensors as described in Migueles and colleagues (JMPB 2019), and van Hees and colleagues (JApplPhysiol 2014; PLoSONE 2015). The package has been developed and tested for binary data from 'GENEActiv', binary (.gt3x) and .csv-export data from 'Actigraph' devices, and binary (.cwa) and .csv-export data from 'Axivity'. These devices are currently widely used in research on human daily physical activity. Further, the package can handle accelerometer data files from any other sensor brand, provided that the data is stored in csv format. The package also allows for external function embedding.
Maintained by Vincent T van Hees. Last updated 2 days ago.
3.8 match 109 stars 13.20 score 342 scripts 3 dependents
robjhyndman
fpp2:Data for "Forecasting: Principles and Practice" (2nd Edition)
All data sets required for the examples and exercises in the book "Forecasting: principles and practice" (2nd ed, 2018) by Rob J Hyndman and George Athanasopoulos. All packages required to run the examples are also loaded.
Maintained by Rob Hyndman. Last updated 2 years ago.
5.8 match 106 stars 8.57 score 1.8k scripts 1 dependent
bxc147
Epi:Statistical Analysis in Epidemiology
Functions for demographic and epidemiological analysis in the Lexis diagram, i.e. register and cohort follow-up data. In particular representation, manipulation, rate estimation and simulation for multistate data - the Lexis suite of functions, which includes interfaces to 'mstate', 'etm' and 'cmprsk' packages. Contains functions for Age-Period-Cohort and Lee-Carter modeling and a function for interval censored data and some useful functions for tabulation and plotting, as well as a number of epidemiological data sets.
Maintained by Bendix Carstensen. Last updated 2 months ago.
5.1 match 4 stars 9.65 score 708 scripts 11 dependents
ropensci
cffr:Generate Citation File Format ('cff') Metadata for R Packages
The Citation File Format version 1.2.0 <doi:10.5281/zenodo.5171937> is a human and machine readable file format which provides citation metadata for software. This package provides core utilities to generate and validate this metadata.
Maintained by Diego Hernangómez. Last updated 4 days ago.
5.0 match 26 stars 9.74 score 116 scripts 3 dependents
rolkra
explore:Simplifies Exploratory Data Analysis
Interactive data exploration with one line of code, automated reporting, or an easy-to-remember set of tidy functions for low-code exploratory data analysis.
Maintained by Roland Krasser. Last updated 3 months ago.
4.2 match 228 stars 11.43 score 221 scripts 1 dependent
squidgroup
squid:Statistical Quantification of Individual Differences
A simulation-based tool made to help researchers to become familiar with multilevel variations, and to build up sampling designs for their study. This tool has two main objectives: First, it provides an educational tool useful for students, teachers and researchers who want to learn to use mixed-effects models. Users can experience how the mixed-effects model framework can be used to understand distinct biological phenomena by interactively exploring simulated multilevel data. Second, it offers research opportunities to those who are already familiar with mixed-effects models, as it enables the generation of data sets that users may download and use for a range of simulation-based statistical analyses such as power and sensitivity analysis of multilevel and multivariate data [Allegue, H., Araya-Ajoy, Y.G., Dingemanse, N.J., Dochtermann N.A., Garamszegi, L.Z., Nakagawa, S., Reale, D., Schielzeth, H. and Westneat, D.F. (2016) <doi: 10.1111/2041-210X.12659>].
Maintained by Hassen Allegue. Last updated 3 years ago.
10.0 match 34 stars 4.76 score 17 scripts
dietrichson
ProPublicaR:Access Functions for ProPublica's APIs
Provides wrapper functions to access ProPublica's Congress and Campaign Finance APIs. The Congress API provides near real-time access to legislative data from the House of Representatives, the Senate and the Library of Congress. The Campaign Finance API provides data from United States Federal Election Commission filings and other sources. The API covers summary information for candidates and committees, as well as certain types of itemized data. For more information about these APIs go to: <>.
Maintained by Aleksander Dietrichson. Last updated 2 years ago.
10.6 match 12 stars 4.38 score 1 script
programgirl
PopulateR:Create Data Frames for the Micro-Simulation of Human Populations
Tools for constructing detailed synthetic human populations from frequency tables. Add ages based on age groups and sex, create households, add students to education facilities, create employers, add employers to employees, and create interpersonal networks.
Maintained by Michelle Gosse. Last updated 1 month ago.
11.4 match 1 star 3.88 score
cran
LTASR:Functions to Replicate the Center for Disease Control and Prevention's 'LTAS' Software in R
A suite of functions for reading in a rate file in XML format, stratifying a cohort, and calculating 'SMRs' from the stratified cohort and rate file.
Maintained by Stephen Bertke. Last updated 7 months ago.
10.5 match 4.11 score 32 scripts
flujoo
personr:Test Your Personality
An R-package-version of an open online science-based personality test from <>, providing a better-designed interface and a more detailed report. The core command launch_test() opens a personality test in your browser, and generates a report after you click "Submit". In this report, your results are compared with other people's, to show what these results mean. Other people's data is from <>.
Maintained by Renfei Mao. Last updated 4 years ago.
15.9 match 2.70 score 2 scripts
luca-scr
qcc:Quality Control Charts
Shewhart quality control charts for continuous, attribute and count data. Cusum and EWMA charts. Operating characteristic curves. Process capability analysis. Pareto chart and cause-and-effect chart. Multivariate control charts.
Maintained by Luca Scrucca. Last updated 2 years ago.
3.8 match 46 stars 11.29 score 730 scripts 6 dependents
indrag49
QGameTheory:Quantum Game Theory Simulator
General purpose toolbox for simulating quantum versions of game theoretic models (Flitney and Abbott 2002) <arXiv:quant-ph/0208069>. Quantum (Nielsen and Chuang 2010, ISBN:978-1-107-00217-3) versions of models that have been handled are: Penny Flip Game (David A. Meyer 1998) <arXiv:quant-ph/9804010>, Prisoner's Dilemma (J. Orlin Grabbe 2005) <arXiv:quant-ph/0506219>, Two Person Duel (Flitney and Abbott 2004) <arXiv:quant-ph/0305058>, Battle of the Sexes (Nawaz and Toor 2004) <arXiv:quant-ph/0110096>, Hawk and Dove Game (Nawaz and Toor 2010) <arXiv:quant-ph/0108075>, Newcomb's Paradox (Piotrowski and Sladkowski 2002) <arXiv:quant-ph/0202074> and Monty Hall Problem (Flitney and Abbott 2002) <arXiv:quant-ph/0109035>.
Maintained by Indranil Ghosh. Last updated 5 months ago.
11.3 match 11 stars 3.74 score 1 scripts
patriciamar
ShinyItemAnalysis:Test and Item Analysis via Shiny
Package including functions and interactive shiny application for the psychometric analysis of educational tests, psychological assessments, health-related and other types of multi-item measurements, or ratings from multiple raters.
Maintained by Patricia Martinkova. Last updated 1 months ago.
5.3 match 44 stars 7.88 score 105 scripts 3 dependents
hojsgaard
gRbase:A Package for Graphical Modelling in R
The 'gRbase' package provides graphical modelling features used by e.g. the packages 'gRain', 'gRim' and 'gRc'. 'gRbase' implements graph algorithms including (i) maximum cardinality search (for marked and unmarked graphs), (ii) moralization, (iii) triangulation, and (iv) creation of a junction tree. 'gRbase' facilitates array operations and implements functions for testing for conditional independence. 'gRbase' illustrates how hierarchical log-linear models may be implemented and describes the concept of graphical meta data. The facilities of the package are documented in the book by Højsgaard, Edwards and Lauritzen (2012, <doi:10.1007/978-1-4614-2299-0>) and in the paper by Dethlefsen and Højsgaard (2005, <doi:10.18637/jss.v014.i17>). Please see 'citation("gRbase")' for citation details.
Maintained by Søren Højsgaard. Last updated 4 months ago.
4.5 match 3 stars 9.24 score 241 scripts 20 dependents
smac-group
simts:Time Series Analysis Tools
A system of easy-to-use tools to support time series analysis courses. In particular, it incorporates a technique called Generalized Method of Wavelet Moments (GMWM) as well as its robust implementation for fast and robust parameter estimation of time series models, which is described, for example, in Guerrier et al. (2013) <doi:10.1080/01621459.2013.799920>. More details can also be found in the paper linked to via the URL below.
Maintained by Stéphane Guerrier. Last updated 2 years ago.
5.3 match 15 stars 7.68 score 59 scripts 4 dependents
mpjashby
sfhotspot:Hot-Spot Analysis with Simple Features
Identify and understand clusters of points (typically representing the locations of places or events) stored in simple-features (SF) objects. This is useful for analysing, for example, hot-spots of crime events. The package emphasises producing results from point SF data in a single step using reasonable default values for all other arguments, to aid rapid data analysis by users who are starting out. Functions available include kernel density estimation (for details, see Yip (2020) <doi:10.22224/gistbok/2020.1.12>), analysis of spatial association (Getis and Ord (1992) <doi:10.1111/j.1538-4632.1992.tb00261.x>) and hot-spot classification (Chainey (2020) ISBN:158948584X).
Maintained by Matt Ashby. Last updated 24 days ago.
7.3 match 12 stars 5.56 score 30 scripts
henrikbengtsson
port4me:Get the Same, Personal, Free 'TCP' Port over and over
An R implementation of the cross-platform, language-independent "port4me" algorithm (<>), which (1) finds a free Transmission Control Protocol ('TCP') port in [1024,65535] that the user can open, (2) is designed to work in multi-user environments, (3) gives different users different ports, (4) gives the user the same port over time with high probability, (5) gives different ports for different software tools, and (6) requires no configuration.
Maintained by Henrik Bengtsson. Last updated 1 years ago.
7.6 match 13 stars 5.11 score 5 scripts
drizopoulos
ltm:Latent Trait Models under IRT
Analysis of multivariate dichotomous and polytomous data using latent trait models under the Item Response Theory approach. It includes the Rasch, the Two-Parameter Logistic, the Birnbaum's Three-Parameter, the Graded Response, and the Generalized Partial Credit Models.
Maintained by Dimitris Rizopoulos. Last updated 3 years ago.
4.0 match 30 stars 9.59 score 1.0k scripts 27 dependents
silvadenisson
electionsBR:R Functions to Download and Clean Brazilian Electoral Data
Offers a set of functions to easily download and clean Brazilian electoral data from the Superior Electoral Court and 'CepespData' websites. Among other features, the package retrieves data on local and federal elections for all positions (city councilor, mayor, state deputy, federal deputy, governor, and president) aggregated by state, city, and electoral zones.
Maintained by Denisson Silva. Last updated 4 months ago.
5.1 match 65 stars 7.54 score 66 scripts
trinker
qdapRegex:Regular Expression Removal, Extraction, and Replacement Tools
A collection of regular expression tools associated with the 'qdap' package that may be useful outside of the context of discourse analysis. Tools include removal/extraction/replacement of abbreviations, dates, dollar amounts, email addresses, hash tags, numbers, percentages, citations, person tags, phone numbers, times, and zip codes.
Maintained by Tyler Rinker. Last updated 1 years ago.
4.1 match 50 stars 9.48 score 502 scripts 41 dependents
ropengov
sweidnumbr:Handling of Swedish Identity Numbers
Structural handling of identity numbers used in the Swedish administration such as personal identity numbers ('personnummer') and organizational identity numbers ('organisationsnummer').
Maintained by Mans Magnusson. Last updated 1 years ago.
7.0 match 8 stars 5.38 score
philchalmers
mirt:Multidimensional Item Response Theory
Analysis of discrete response data using unidimensional and multidimensional item analysis models under the Item Response Theory paradigm (Chalmers (2012) <doi:10.18637/jss.v048.i06>). Exploratory and confirmatory item factor analysis models are estimated with quadrature (EM) or stochastic (MHRM) methods. Confirmatory bi-factor and two-tier models are available for modeling item testlets using dimension reduction EM algorithms, while multiple group analyses and mixed effects designs are included for detecting differential item, bundle, and test functioning, and for modeling item and person covariates. Finally, latent class models such as the DINA, DINO, multidimensional latent class, mixture IRT models, and zero-inflated response models are supported, as well as a wide family of probabilistic unfolding models.
Maintained by Phil Chalmers. Last updated 11 days ago.
2.5 match 210 stars 14.98 score 2.5k scripts 40 dependents
dexter-psychometrics
dexter:Data Management and Analysis of Tests
A system for the management, assessment, and psychometric analysis of data from educational and psychological tests.
Maintained by Jesse Koops. Last updated 6 days ago.
4.1 match 8 stars 8.97 score 135 scripts 2 dependents
bayesball
ProbBayes:Probability and Bayesian Modeling
Functions and datasets to accompany J. Albert and J. Hu, "Probability and Bayesian Modeling", CRC Press, (2019, ISBN: 1138492566).
Maintained by Jim Albert. Last updated 4 years ago.
8.5 match 5 stars 4.30 score 80 scripts
alexrecuenco
TexExamRandomizer:Personalizes and Randomizes Exams Written in 'LaTeX'
Randomizing exams with 'LaTeX'. If you can compile your main document with 'LaTeX', the program should be able to compile the randomized versions without much extra effort when creating the document.
Maintained by Alejandro Gonzalez Recuenco. Last updated 1 years ago.
8.0 match 1 stars 4.52 score 22 scripts
skvrnami
hlidacr:Access Data from the 'Hlídač Státu' API
Provides access to datasets published by 'Hlídač státu' <>, a Czech watchdog, via their API.
Maintained by Michael Škvrňák. Last updated 3 years ago.
7.8 match 8 stars 4.60 score 6 scripts
lisaannyu
lifelogr:Life Logging
Provides a framework for combining self-data (exercise, sleep, etc.) from multiple sources (fitbit, Apple Health), creating visualizations, and experimenting on oneself.
Maintained by Lisa Ann Yu. Last updated 8 years ago.
12.1 match 2.93 score 43 scripts
andreacapozio
TMDb:Access to TMDb API
Provides an R-interface to the TMDb API (see TMDb API on <>). The Movie Database (TMDb) is a popular user editable database for movies and TV shows (see <>).
Maintained by Andrea Capozio. Last updated 5 years ago.
17.2 match 2.00 score 99 scripts
dragosmg
rocnp:Work with Romanian Personal Numeric Codes PNC / CNP
A set of tools for working with Romanian personal numeric codes. The core is a validation function which applies several verification criteria to assess the validity of numeric codes. This is accompanied by functionality for extracting the different components of a personal numeric code. A personal numeric code is issued to all Romanian residents either at birth or when they obtain a residence permit.
Maintained by Dragoș Moldovan-Grünfeld. Last updated 3 years ago.
12.6 match 2.70 score 2 scripts
gesistsa
webtrackR:Preprocessing and Analyzing Web Tracking Data
Data structures and methods to work with web tracking data. The functions cover data preprocessing steps, enriching web tracking data with external information and methods for the analysis of digital behavior as used in several academic papers (e.g., Clemm von Hohenberg et al., 2023 <doi:10.17605/OSF.IO/M3U9P>; Stier et al., 2022 <doi:10.1017/S0003055421001222>).
Maintained by David Schoch. Last updated 3 months ago.
5.6 match 9 stars 6.03 score 8 scripts
humaniverse
asylum:Data on Asylum and Resettlement for the UK
Data on Asylum and Resettlement for the UK, provided by the Home Office <>.
Maintained by Matthew Gwynfryn Thomas. Last updated 17 days ago.
6.7 match 3 stars 4.99 score 36 scripts
philchalmers
mirtCAT:Computerized Adaptive Testing with Multidimensional Item Response Theory
Provides tools to generate HTML interfaces for adaptive and non-adaptive tests using the shiny package (Chalmers (2016) <doi:10.18637/jss.v071.i05>). Suitable for applying unidimensional and multidimensional computerized adaptive tests (CAT) using item response theory methodology and for creating simple questionnaire forms to collect response data directly in R. Additionally, optimal test designs (e.g., "shadow testing") are supported for tests that contain a large number of item selection constraints. Finally, the package contains tools useful for performing Monte Carlo simulations for studying test item banks.
Maintained by Phil Chalmers. Last updated 5 months ago.
3.5 match 95 stars 9.41 score 62 scripts 3 dependents
donaldrwilliams
BGGM:Bayesian Gaussian Graphical Models
Fit Bayesian Gaussian graphical models. The methods are separated into two Bayesian approaches for inference: hypothesis testing and estimation. There are extensions for confirmatory hypothesis testing, comparing Gaussian graphical models, and node wise predictability. These methods were recently introduced in the Gaussian graphical model literature, including Williams (2019) <doi:10.31234/>, Williams and Mulder (2019) <doi:10.31234/>, Williams, Rast, Pericchi, and Mulder (2019) <doi:10.31234/>.
Maintained by Philippe Rast. Last updated 3 months ago.
3.4 match 55 stars 9.64 score 102 scripts 1 dependents
lightbluetitan
timeSeriesDataSets:Time Series Data Sets
Provides a diverse collection of time series datasets spanning various fields such as economics, finance, energy, healthcare, and more. Designed to support time series analysis in R by offering datasets from multiple disciplines, making it a valuable resource for researchers and analysts.
Maintained by Renzo Caceres Rossi. Last updated 6 months ago.
5.8 match 10 stars 5.71 score 103 scripts
mplex
multiplex:Algebraic Tools for the Analysis of Multiple Social Networks
Algebraic procedures for analyses of multiple social networks are delivered with this package as described in Ostoic (2020) <DOI:10.18637/jss.v092.i11>. 'multiplex' makes it possible, among other things, to create and manipulate multiplex, multimode, and multilevel network data with different formats. Effective ways are available to treat multiple networks with routines that combine algebraic systems like the partially ordered semigroup with decomposition procedures or semiring structures with the relational bundles occurring in different types of multivariate networks. 'multiplex' also provides an algebraic approach for affiliation networks through Galois derivations between families of the pairs of subsets in the two domains of the network with visualization options.
Maintained by Antonio Rivero Ostoic. Last updated 2 months ago.
4.0 match 23 stars 8.12 score 69 scripts 2 dependents
jemus42
tRakt:Get Data from ''
A wrapper for the <> API to retrieve data about shows and movies, including user ratings, credits and related metadata. Additional functions retrieve user-specific information including collections and history of watched items. A full API reference is available at <>.
Maintained by Lukas Burk. Last updated 6 hours ago.
5.3 match 22 stars 6.07 score 33 scripts
alexanderrobitzsch
TAM:Test Analysis Modules
Includes marginal maximum likelihood estimation and joint maximum likelihood estimation for unidimensional and multidimensional item response models. The package functionality covers the Rasch model, 2PL model, 3PL model, generalized partial credit model, multi-faceted Rasch model, nominal item response model, structured latent class model, mixture distribution IRT models, and located latent class models. Latent regression models and plausible value imputation are also supported. For details see Adams, Wilson and Wang, 1997 <doi:10.1177/0146621697211001>, Adams, Wilson and Wu, 1997 <doi:10.3102/10769986022001047>, Formann, 1982 <doi:10.1002/bimj.4710240209>, Formann, 1992 <doi:10.1080/01621459.1992.10475229>.
Maintained by Alexander Robitzsch. Last updated 6 months ago.
3.6 match 16 stars 8.93 score 258 scripts 25 dependents
jakobraymaekers
cellWise:Analyzing Data with Cellwise Outliers
Tools for detecting cellwise outliers and robust methods to analyze data which may contain them. Contains the implementation of the algorithms described in Rousseeuw and Van den Bossche (2018) <doi:10.1080/00401706.2017.1340909> (open access), Hubert et al. (2019) <doi:10.1080/00401706.2018.1562989> (open access), Raymaekers and Rousseeuw (2021) <doi:10.1080/00401706.2019.1677270> (open access), Raymaekers and Rousseeuw (2021) <doi:10.1007/s10994-021-05960-5> (open access), Raymaekers and Rousseeuw (2021) <doi:10.52933/jdssv.v1i3.18> (open access), Raymaekers and Rousseeuw (2022) <arXiv:2207.13493> (open access), and Rousseeuw (2022) <doi:10.1016/j.ecosta.2023.01.007> (open access). Examples can be found in the vignettes: "DDC_examples", "MacroPCA_examples", "wrap_examples", "transfo_examples", "DI_examples", "cellMCD_examples", "Correspondence_analysis_examples", and "cellwise_weights_examples".
Maintained by Jakob Raymaekers. Last updated 1 years ago.
5.2 match 2 stars 6.06 score 54 scripts 16 dependents
moosa-r
rbioapi:User-Friendly R Interface to Biologic Web Services' API
Currently fully supports Enrichr, JASPAR, miEAA, PANTHER, Reactome, STRING, and UniProt! The goal of rbioapi is to provide a user-friendly and consistent interface to biological databases and services, in a way that insulates the user from the technicalities of using web services API and creates a unified and easy-to-use interface to biological and medical web services. This is an ongoing project; new databases and services will be added periodically. Feel free to suggest any databases or services you often use.
Maintained by Moosa Rezwani. Last updated 1 months ago.
4.1 match 20 stars 7.60 score 55 scripts
alexanderrobitzsch
CDM:Cognitive Diagnosis Modeling
Functions for cognitive diagnosis modeling and multidimensional item response modeling for dichotomous and polytomous item responses. This package enables the estimation of the DINA and DINO model (Junker & Sijtsma, 2001, <doi:10.1177/01466210122032064>), the multiple group (polytomous) GDINA model (de la Torre, 2011, <doi:10.1007/s11336-011-9207-7>), the multiple choice DINA model (de la Torre, 2009, <doi:10.1177/0146621608320523>), the general diagnostic model (GDM; von Davier, 2008, <doi:10.1348/000711007X193957>), the structured latent class model (SLCA; Formann, 1992, <doi:10.1080/01621459.1992.10475229>) and regularized latent class analysis (Chen, Li, Liu, & Ying, 2017, <doi:10.1007/s11336-016-9545-6>). See George, Robitzsch, Kiefer, Gross, and Uenlue (2017) <doi:10.18637/jss.v074.i02> or Robitzsch and George (2019, <doi:10.1007/978-3-030-05584-4_26>) for further details on estimation and the package structure. For tutorials on how to use the CDM package see George and Robitzsch (2015, <doi:10.20982/tqmp.11.3.p189>) as well as Ravand and Robitzsch (2015).
Maintained by Alexander Robitzsch. Last updated 9 months ago.
3.5 match 22 stars 8.76 score 138 scripts 28 dependents
r-lib
usethis:Automate Package and Project Setup
Automate package and project setup tasks that are otherwise performed manually. This includes setting up unit testing, test coverage, continuous integration, Git, 'GitHub', licenses, 'Rcpp', 'RStudio' projects, and more.
Maintained by Jennifer Bryan. Last updated 11 days ago.
1.8 match 869 stars 17.54 score 5.6k scripts 336 dependents
jibarozzo
nplyr:A Grammar of Nested Data Manipulation
Provides functions for manipulating nested data frames in a list-column using 'dplyr' <> syntax. Rather than unnesting, then manipulating a data frame, 'nplyr' allows users to manipulate each nested data frame directly. 'nplyr' is a wrapper for 'dplyr' functions that provide tools for common data manipulation steps: filtering rows, selecting columns, summarising grouped data, among others.
Maintained by Bolívar Aponte Rolón. Last updated 1 months ago.
4.7 match 120 stars 6.56 score 1 dependents
paulhendricks
generator:Generate Data Containing Fake Personally Identifiable Information
Allows users to quickly and easily generate fake data containing Personally Identifiable Information (PII) through convenience functions.
Maintained by Paul Hendricks. Last updated 8 years ago.
5.1 match 24 stars 5.99 score 81 scripts
mikldk
malan:MAle Lineage ANalysis
MAle Lineage ANalysis by simulating genealogies backwards and imposing short tandem repeats (STR) mutations forwards. Intended for forensic Y chromosomal STR (Y-STR) haplotype analyses. Numerous analyses are possible, e.g. number of matches and meiotic distance to matches. Refer to papers mentioned in citation("malan") (DOIs: <doi:10.1371/journal.pgen.1007028>, <doi:10.21105/joss.00684> and <doi:10.1016/j.fsigen.2018.10.004>).
Maintained by Mikkel Meyer Andersen. Last updated 1 years ago.
6.7 match 4.48 score 6 scripts
mrcaseb
personalr:Automated Personal Package Setup
Functions to setup a personal R package that attaches given libraries and exports personal helper functions.
Maintained by Sebastian Carl. Last updated 3 years ago.
7.9 match 13 stars 3.81 score 1 scripts
jimbrig
jimstools:Tools for R
What the package does (one paragraph).
Maintained by Jimmy Briggs. Last updated 3 years ago.
10.0 match 2 stars 3.00 score 2 scripts
marjoleinf
pre:Prediction Rule Ensembles
Derives prediction rule ensembles (PREs). Largely follows the procedure for deriving PREs as described in Friedman & Popescu (2008; <DOI:10.1214/07-AOAS148>), with adjustments and improvements. The main function pre() derives prediction rule ensembles consisting of rules and/or linear terms for continuous, binary, count, multinomial, and multivariate continuous responses. Function gpe() derives generalized prediction ensembles, consisting of rules, hinge and linear functions of the predictor variables.
Maintained by Marjolein Fokkema. Last updated 9 months ago.
3.5 match 58 stars 8.49 score 98 scripts 1 dependents
le-huynh
lehuynh:Le-Huynh Truc-Ly's R Code and Templates
Miscellaneous R functions (for graphics, data import, data transformation, and general utilities) and templates (for exploratory analysis, Bayesian modeling, and crafting scientific manuscripts).
Maintained by Truc-Ly Le-Huynh. Last updated 9 months ago.
7.5 match 3 stars 3.88 score 4 scripts
sbgraves237
Ecdat:Data Sets for Econometrics
Data sets for econometrics, including political science.
Maintained by Spencer Graves. Last updated 4 months ago.
4.0 match 2 stars 7.25 score 740 scripts 3 dependents
ohdsi
CohortConstructor:Build and Manipulate Study Cohorts Using a Common Data Model
Create and manipulate study cohorts in data mapped to the Observational Medical Outcomes Partnership Common Data Model.
Maintained by Edward Burn. Last updated 4 days ago.
3.0 match 2 stars 9.71 score 207 scripts 2 dependents
venelin
PCMBaseCpp:Fast Likelihood Calculation for Phylogenetic Comparative Models
Provides a C++ backend for multivariate phylogenetic comparative models implemented in the R-package 'PCMBase'. Can be used in combination with 'PCMBase' to enable fast and parallel likelihood calculation. Implements the pruning likelihood calculation algorithm described in Mitov et al. (2018) <arXiv:1809.09014>. Uses the 'SPLITT' C++ library for parallel tree traversal described in Mitov and Stadler (2018) <doi:10.1111/2041-210X.13136>.
Maintained by Venelin Mitov. Last updated 5 years ago.
6.6 match 4.19 score 31 scripts
paulhendricks
detector:Detect Data Containing Personally Identifiable Information
Allows users to quickly and easily detect data containing Personally Identifiable Information (PII) through convenience functions.
Maintained by Paul Hendricks. Last updated 8 years ago.
5.2 match 15 stars 5.34 score 29 scripts
r4goodacademy
R4GoodPersonalFinances:Make Better Financial Decisions
Make informed, data-driven decisions for your personal or household finances. Use tools and methods that are selected carefully to align with academic consensus, bridging the gap between theoretical knowledge and practical application. They assist you in finding optimal asset allocation, preparing for retirement or financial independence, calculating optimal spending, and more. For more details see: Haghani V., White J. (2023, ISBN:978-1-119-74791-8), Idzorek T., Kaplan P. (2024, ISBN:9781952927379).
Maintained by Kamil Wais. Last updated 3 days ago.
8.0 match 1 stars 3.40 score
lcbc-uio
questionnaires:Package with functions to calculate components and sums for LCBC questionnaires
Creates summaries and factorials of answers to questionnaires.
Maintained by Athanasia Mo Mowinckel. Last updated 2 years ago.
5.9 match 3 stars 4.63 score 13 scripts
long39ng
remmy:API Client for 'Lemmy'
An HTTP API client for 'Lemmy' (<>) in R. Code and documentation are generated from the official 'JavaScript' client source (<>).
Maintained by Long Nguyen. Last updated 2 years ago.
10.0 match 2.70 score 4 scripts
sigbertklinke
plot.matrix:Visualizes a Matrix as Heatmap
Visualizes a matrix object plainly as heatmap. It provides S3 functions to plot simple matrices and loading matrices.
Maintained by Sigbert Klinke. Last updated 3 years ago.
3.5 match 8 stars 7.63 score 300 scripts 7 dependents
psychmeta
psychmeta:Psychometric Meta-Analysis Toolkit
Tools for computing bare-bones and psychometric meta-analyses and for generating psychometric data for use in meta-analysis simulations. Supports bare-bones, individual-correction, and artifact-distribution methods for meta-analyzing correlations and d values. Includes tools for converting effect sizes, computing sporadic artifact corrections, reshaping meta-analytic databases, computing multivariate corrections for range variation, and more. Bugs can be reported to <> or <>.
Maintained by Jeffrey A. Dahlke. Last updated 9 months ago.
3.2 match 57 stars 8.25 score 151 scripts
jpmonteagudo28
despair:Motivational Quotes and Shakespearean Bard–bits for Personal Projects
Generate motivational quotes and Shakespearean word combinations (bard–bits) that a user can consider for their personal projects. Each of the package functions takes two arguments: cat, which defaults to any, and a numeric or character seed to ensure reproducible results.
Maintained by JP Monteagudo. Last updated 3 months ago.
7.0 match 3 stars 3.78 score 5 scripts
bwiernik
configural:Multivariate Profile Analysis
R functions for criterion profile analysis, Davison and Davenport (2002) <doi:10.1037/1082-989X.7.4.468> and meta-analytic criterion profile analysis, Wiernik, Wilmot, Davison, and Ones (2020) <doi:10.1037/met0000305>. Sensitivity analyses to aid in interpreting criterion profile analysis results are also included.
Maintained by Brenton M. Wiernik. Last updated 12 months ago.
6.6 match 4 stars 3.96 score 23 scripts
seankross
postcards:Create Beautiful, Simple Personal Websites
A collection of R Markdown templates for creating simple and easy to personalize single page websites.
Maintained by Sean Kross. Last updated 3 years ago.
3.5 match 559 stars 7.35 score 132 scripts
bayesiandemography
bage:Bayesian Estimation and Forecasting of Age-Specific Rates
Fast Bayesian estimation and forecasting of age-specific rates, probabilities, and means, based on 'Template Model Builder'.
Maintained by John Bryant. Last updated 2 months ago.
3.5 match 3 stars 7.30 score 39 scripts
stat-wangxg
SurvMA:Model Averaging Prediction of Personalized Survival Probabilities
Provide methods for model averaging prediction of personalized survival probabilities.
Maintained by Mengyu Li. Last updated 6 months ago.
8.5 match 2 stars 3.00 score
rundel
ghclass:Tools for Managing Classes on GitHub
Interface for the GitHub API that enables efficient management of courses on GitHub. It has a functionality for managing organizations, teams, repositories, and users on GitHub and helps automate most of the tedious and repetitive tasks around creating and distributing assignments.
Maintained by Colin Rundel. Last updated 1 months ago.
3.4 match 142 stars 7.32 score 70 scripts
economic
realtalk:Price index data for the US economy
Makes it easy to use US price index data like the CPI.
Maintained by Ben Zipperer. Last updated 4 days ago.
7.0 match 5 stars 3.51 score 10 scripts
ohdsi
FeatureExtraction:Generating Features for a Cohort
An R interface for generating features for a cohort using data in the Common Data Model. Features can be constructed using default or custom-made feature definitions. Furthermore, it's possible to aggregate features and get the summary statistics.
Maintained by Ger Inberg. Last updated 5 months ago.
2.4 match 62 stars 10.30 score 209 scripts 1 dependents
lightbluetitan
usdatasets:A Comprehensive Collection of U.S. Datasets
Provides a diverse collection of U.S. datasets encompassing various fields such as crime, economics, education, finance, energy, healthcare, and more. It serves as a valuable resource for researchers and analysts seeking to perform in-depth analyses and derive insights from U.S.-specific data.
Maintained by Renzo Caceres Rossi. Last updated 5 months ago.
4.0 match 7 stars 5.99 score 141 scripts
luffylouis
handyFunctions:Useful Functions for Handfully Manipulating and Analyzing Data with Data.frame Format
Some useful functions for simply manipulating and analyzing data with data.frame format. It mainly includes the following sections: ReformatDataframe (reformat dataframe with the modifiers), InteractDataframe, and Post-VCF (for downstream analysis of data generated from vcftools (Petr Danecek, 2011) (<>) or plink (Chang CC, 2015) <doi:10.1186/s13742-015-0047-8>).
Maintained by Hongfei Liu. Last updated 2 years ago.
7.3 match 2 stars 3.28 score 19 scripts
ropensci
git2r:Provides Access to Git Repositories
Interface to the 'libgit2' library, which is a pure C implementation of the 'Git' core methods. Provides access to 'Git' repositories to extract data and run some basic 'Git' commands.
Maintained by Stefan Widgren. Last updated 12 days ago.
1.7 match 218 stars 13.86 score 836 scripts 49 dependents
oscarkjell
text:Analyses of Text using Transformers Models from HuggingFace, Natural Language Processing and Machine Learning
Link R with Transformers from Hugging Face to transform text variables to word embeddings, where the word embeddings are used to statistically test the mean difference between sets of texts, compute semantic similarity scores between texts, predict numerical variables, and visualize statistically significant words according to various dimensions. For more information see <>.
Maintained by Oscar Kjell. Last updated 4 days ago.
1.8 match 146 stars 13.16 score 436 scripts 1 dependents
handcock
RDS:Respondent-Driven Sampling
Provides functionality for carrying out estimation with data collected using Respondent-Driven Sampling. This includes Heckathorn's RDS-I and RDS-II estimators as well as Gile's Sequential Sampling estimator. The package is part of the "RDS Analyst" suite of packages for the analysis of respondent-driven sampling data. See Gile and Handcock (2010) <doi:10.1111/j.1467-9531.2010.01223.x>, Gile and Handcock (2015) <doi:10.1111/rssa.12091> and Gile, Beaudry, Handcock and Ott (2018) <doi:10.1146/annurev-statistics-031017-100704>.
Maintained by Mark S. Handcock. Last updated 6 months ago.
6.0 match 1 stars 3.87 score 82 scripts 3 dependents
rstudio
renv:Project Environments
A dependency management toolkit for R. Using 'renv', you can create and manage project-local R libraries, save the state of these libraries to a 'lockfile', and later restore your library as required. Together, these tools can help make your projects more isolated, portable, and reproducible.
Maintained by Kevin Ushey. Last updated 3 days ago.
1.3 match 1.0k stars 18.55 score 1.5k scripts 113 dependents
myeomans
doc2concrete:Measuring Concreteness in Natural Language
Models for detecting concreteness in natural language. This package is built in support of Yeomans (2021) <doi:10.1016/j.obhdp.2020.10.008>, which reviews linguistic models of concreteness in several domains. Here, we provide an implementation of the best-performing domain-general model (from Brysbaert et al., (2014) <doi:10.3758/s13428-013-0403-5>) as well as two pre-trained models for the feedback and plan-making domains.
Maintained by Mike Yeomans. Last updated 1 years ago.
4.0 match 13 stars 5.59 score 20 scripts 1 dependents
r-lib
credentials:Tools for Managing SSH and Git Credentials
Setup and retrieve HTTPS and SSH credentials for use with 'git' and other services. For HTTPS remotes the package interfaces the 'git-credential' utility which 'git' uses to store HTTP usernames and passwords. For SSH remotes we provide convenient functions to find or generate appropriate SSH keys. The package both helps the user to setup a local git installation, and also provides a back-end for git/ssh client libraries to authenticate with existing user credentials.
Maintained by Jeroen Ooms. Last updated 5 months ago.
1.8 match 72 stars 12.40 score 91 scripts 380 dependents
justinmshea
neverhpfilter:An Alternative to the Hodrick-Prescott Filter
In the working paper titled "Why You Should Never Use the Hodrick-Prescott Filter", James D. Hamilton proposes a new alternative to economic time series filtering. The neverhpfilter package provides functions and data for reproducing his work. Hamilton (2017) <doi:10.3386/w23429>.
Maintained by Justin M. Shea. Last updated 2 years ago.
3.8 match 14 stars 5.93 score 61 scripts
andreaczhang
qtwAcademic:'Quarto' Website Templates for Academics
Provides three 'Quarto' website templates as an R project, which are commonly used by academics. Templates for personal websites and course/workshop websites are included, as well as a template with minimal content for customization.
Maintained by Chi Zhang. Last updated 2 years ago.
3.9 match 39 stars 5.77 score 1 scripts
bruce1edward
PDN:Personalized Disease Network
Builds patient-level networks for predicting medical outcomes and draws the network clusters. This package is based on the paper "Personalized disease networks for understanding and predicting cardiovascular diseases and other complex processes" (see Cabrera et al. <>).
Maintained by Zhenbang Wang. Last updated 7 years ago.
11.0 match 2.00 score 7 scripts
shixiangwang
tinyscholar:Get and Show Personal 'Google Scholar' Profile
Provides functions to get personal 'Google Scholar' profile data from web API and show it in table or figure format.
Maintained by Shixiang Wang. Last updated 1 years ago.
4.8 match 8 stars 4.60 score 7 scriptsshaelebrown
TDApplied:Machine Learning and Inference for Topological Data Analysis
Topological data analysis is a powerful tool for finding non-linear global structure in whole datasets. The main tool of topological data analysis is persistent homology, which computes a topological shape descriptor of a dataset called a persistence diagram. 'TDApplied' provides useful and efficient methods for analyzing groups of persistence diagrams with machine learning and statistical inference, and these functions can also interface with other data science packages to form flexible and integrated topological data analysis pipelines.
Maintained by Shael Brown. Last updated 5 months ago.
3.3 match 16 stars 6.60 score 8 scriptsrpruim
fastR2:Foundations and Applications of Statistics Using R (2nd Edition)
Data sets and utilities to accompany the second edition of "Foundations and Applications of Statistics: an Introduction using R" (R Pruim, published by AMS, 2017), a text covering topics from probability and mathematical statistics at an advanced undergraduate level. R is integrated throughout, and access to all the R code in the book is provided via the snippet() function.
Maintained by Randall Pruim. Last updated 1 years ago.
3.8 match 13 stars 5.85 score 108 scriptsjorgetendeiro
GGUM:Generalized Graded Unfolding Model
An implementation of the generalized graded unfolding model (GGUM) in R; see Roberts, Donoghue, and Laughlin (2000) <doi:10.1177/01466216000241001>. It allows simulating data sets based on the GGUM. It fits the GGUM and the GUM, and it retrieves item and person parameter estimates. Several plotting functions are available (item and test information functions; item and test characteristic curves; item category response curves). Additionally, there are some functions that facilitate the communication between R and 'GGUM2004'. Finally, a model-fit checking utility, MODFIT(), is also available.
Maintained by Jorge N. Tendeiro. Last updated 2 years ago.
4.3 match 6 stars 4.99 score 18 scripts 1 dependentsbioc
DiffLogo:DiffLogo: A comparative visualisation of biooligomer motifs
DiffLogo is an easy-to-use tool to visualize motif differences.
Maintained by Hendrik Treutler. Last updated 5 months ago.
3.2 match 8 stars 6.66 score 27 scriptsdhaine
episensr:Basic Sensitivity Analysis of Epidemiological Results
Basic sensitivity analysis of the observed relative risks adjusting for unmeasured confounding and misclassification of the exposure/outcome, or both. It follows the bias analysis methods and examples from the book by Lash T.L, Fox M.P, and Fink A.K. "Applying Quantitative Bias Analysis to Epidemiologic Data", ('Springer', 2021).
Maintained by Denis Haine. Last updated 1 years ago.
3.3 match 13 stars 6.48 score 39 scripts 1 dependentsdonaldrwilliams
GGMncv:Gaussian Graphical Models with Nonconvex Regularization
Estimate Gaussian graphical models with nonconvex penalties <doi:10.31234/>, including the atan Wang and Zhu (2016) <doi:10.1155/2016/6495417>, seamless L0 Dicker, Huang, and Lin (2013) <doi:10.5705/ss.2011.074>, exponential Wang, Fan, and Zhu <doi:10.1007/s10463-016-0588-3>, smooth integration of counting and absolute deviation Lv and Fan (2009) <doi:10.1214/09-AOS683>, logarithm Mazumder, Friedman, and Hastie (2011) <doi:10.1198/jasa.2011.tm09738>, Lq, smoothly clipped absolute deviation Fan and Li (2001) <doi:10.1198/016214501753382273>, and minimax concave penalty Zhang (2010) <doi:10.1214/09-AOS729>. There are also extensions for computing variable inclusion probabilities, multiple regression coefficients, and statistical inference <doi:10.1214/15-EJS1031>.
Maintained by Donald Williams. Last updated 3 years ago.
3.4 match 5 stars 6.22 score 22 scripts 2 dependentsdtkaplan
LSTbook:Data and Software for "Lessons in Statistical Thinking"
"Lessons in Statistical Thinking" D.T. Kaplan (2014) <> is a textbook for a first or second course in statistics that embraces data wrangling, causal reasoning, modeling, statistical adjustment, and simulation. 'LSTbook' supports the student-centered, tidy, pipeline-oriented computing style featured in the book.
Maintained by Daniel Kaplan. Last updated 1 days ago.
3.4 match 4 stars 6.29 score 27 scriptsjacobpstein
pii:Search Data Frames for Personally Identifiable Information
Check a data frame for personal information, including names, location, disability status, and geo-coordinates.
Maintained by Jacob Patterson-Stein. Last updated 2 months ago.
5.2 match 7 stars 4.02 score 4 scriptspsirusteam
TeachingSampling:Selection of Samples and Parameter Estimation in Finite Population
Allows the user to draw probabilistic samples and make inferences from a finite population based on several sampling designs.
Maintained by Hugo Andres Gutierrez Rojas. Last updated 5 years ago.
3.6 match 4 stars 5.80 score 217 scripts 4 dependentsdarwin-eu
CDMConnector:Connect to an OMOP Common Data Model
Provides tools for working with observational health data in the Observational Medical Outcomes Partnership (OMOP) Common Data Model format with a pipe friendly syntax. Common data model database table references are stored in a single compound object along with metadata.
Maintained by Adam Black. Last updated 18 days ago.
1.8 match 12 stars 11.39 score 502 scripts 12 dependentsazure
Microsoft365R:Interface to the 'Microsoft 365' Suite of Cloud Services
An interface to the 'Microsoft 365' (formerly known as 'Office 365') suite of cloud services, building on the framework supplied by the 'AzureGraph' package. Enables access from R to data stored in 'Teams', 'SharePoint Online' and 'OneDrive', including the ability to list drive folder contents, upload and download files, send messages, and retrieve data lists. Also provides a full-featured 'Outlook' email client, with the ability to send emails and manage emails and mail folders.
Maintained by Hong Ooi. Last updated 15 days ago.
1.8 match 325 stars 11.14 score 88 scripts 7 dependentstyee001
VGAMdata:Data Supporting the 'VGAM' Package
Mainly data sets to accompany the VGAM package and the book "Vector Generalized Linear and Additive Models: With an Implementation in R" (Yee, 2015) <DOI:10.1007/978-1-4939-2818-7>. These are used to illustrate vector generalized linear and additive models (VGLMs/VGAMs), and associated models (Reduced-Rank VGLMs, Quadratic RR-VGLMs, Row-Column Interaction Models, and constrained and unconstrained ordination models in ecology). This package now contains some old VGAM family functions which have been replaced by newer ones (often because they are now special cases).
Maintained by Thomas Yee. Last updated 1 months ago.
6.8 match 1 stars 2.94 score 95 scripts 1 dependentsfchamroukhi
samurais:Statistical Models for the Unsupervised Segmentation of Time-Series ('SaMUraiS')
Provides a variety of original and flexible user-friendly statistical latent variable models and unsupervised learning algorithms to segment and represent time-series data (univariate or multivariate), and more generally, longitudinal data, which include regime changes. 'samurais' is built upon the following packages, each of them is an autonomous time-series segmentation approach: Regression with Hidden Logistic Process ('RHLP'), Hidden Markov Model Regression ('HMMR'), Multivariate 'RHLP' ('MRHLP'), Multivariate 'HMMR' ('MHMMR'), Piece-Wise regression ('PWR'). For the advantages/differences of each of them, the user is referred to our mentioned paper references. These models are originally introduced and written in 'Matlab' by Faicel Chamroukhi <>.
Maintained by Florian Lecocq. Last updated 5 years ago.
3.2 match 12 stars 6.18 score 28 scriptsr-computing-lab
BGmisc:An R Package for Extended Behavior Genetics Analysis
Provides functions for behavior genetics analysis, including variance component model identification [Hunter et al. (2021) <doi:10.1007/s10519-021-10055-x>], calculation of relatedness coefficients using path-tracing methods [Wright (1922) <doi:10.1086/279872>; McArdle & McDonald (1984) <doi:10.1111/j.2044-8317.1984.tb00802.x>], inference of relatedness, pedigree conversion, and simulation of multi-generational family data [Lyu et al. (2024) <doi:10.1101/2024.12.19.629449>]. For a full overview, see Garrison et al. (2024) <doi:10.21105/joss.06203>.
Maintained by S. Mason Garrison. Last updated 24 days ago.
2.8 match 1 stars 6.83 score 35 scriptszhenkewu
baker:"Nested Partially Latent Class Models"
Provides functions to specify, fit and visualize nested partially-latent class models (Wu, Deloria-Knoll, Hammitt, and Zeger (2016) <doi:10.1111/rssc.12101>; Wu, Deloria-Knoll, and Zeger (2017) <doi:10.1093/biostatistics/kxw037>; Wu and Chen (2021) <doi:10.1002/sim.8804>) for inference of population disease etiology and individual diagnosis. In the motivating Pneumonia Etiology Research for Child Health (PERCH) study, because both quantities of interest sum to one hundred percent, the PERCH scientists frequently refer to them as population etiology pie and individual etiology pie, hence the name of the package.
Maintained by Zhenke Wu. Last updated 11 months ago.
3.2 match 8 stars 6.00 score 21 scriptsmarkheckmann
gridsampler:A Simulation Tool to Determine the Required Sample Size for Repertory Grid Studies
Simulation tool to facilitate determination of required sample size to achieve category saturation for studies using multiple repertory grids in conjunction with content analysis.
Maintained by Mark Heckmann. Last updated 5 years ago.
3.8 match 4 stars 5.10 score 21 scriptskapelner
PTE:Personalized Treatment Evaluator
We provide inference for personalized medicine models. Namely, we answer the questions: (1) how much better does a purported personalized recommendation engine for treatments do over a business-as-usual approach and (2) is that difference statistically significant?
Maintained by Adam Kapelner. Last updated 6 years ago.
7.9 match 2.37 score 26 scriptskainhofer
MortalityTables:A Framework for Various Types of Mortality / Life Tables
Classes to implement, analyze and plot cohort life tables for actuarial calculations. Birth-year dependent cohort mortality tables using a yearly trend to extrapolate from a base year are implemented, as well as period life table, cohort life tables using an age shift, and merged life tables. Additionally, several data sets from various countries are included to provide widely-used tables out of the box.
Maintained by Reinhold Kainhofer. Last updated 1 years ago.
3.2 match 1 stars 5.70 score 84 scripts 2 dependentsropensci
osfr:Interface to the 'Open Science Framework' ('OSF')
An interface for interacting with 'OSF' (<>). 'osfr' enables you to access open research materials and data, or create and manage your own private or public projects.
Maintained by Aaron Wolen. Last updated 8 months ago.
1.8 match 145 stars 10.18 score 588 scripts 3 dependentsjrosen48
prcr:Person-Centered Analysis
Provides an easy-to-use yet adaptable set of tools to conduct person-centered analysis using a two-step clustering procedure. As described in Bergman and El-Khouri (1999) <DOI:10.1002/(SICI)1521-4036(199910)41:6%3C753::AID-BIMJ753%3E3.0.CO;2-K>, hierarchical clustering is performed to determine the initial partition for the subsequent k-means clustering procedure.
Maintained by Joshua M Rosenberg. Last updated 5 years ago.
3.8 match 5 stars 4.65 score 18 scriptsbioc
DAPAR:Tools for the Differential Analysis of Proteins Abundance with R
The package DAPAR is a Bioconductor-distributed R package which provides all the necessary functions to analyze quantitative data from label-free proteomics experiments. Contrary to most other similar R packages, it is endowed with rich and user-friendly graphical interfaces, so that no programming skill is required (see `Prostar` package).
Maintained by Samuel Wieczorek. Last updated 5 months ago.
3.2 match 2 stars 5.42 score 22 scripts 1 dependentsfriendly
Guerry:Maps, Data and Methods Related to Guerry (1833) "Moral Statistics of France"
Contains maps of France in 1830 and multivariate datasets from A.-M. Guerry and others. Statistical and graphic methods related to Guerry's "Moral Statistics of France" are used to understand Guerry's data and illustrate methods. The goal is to facilitate the exploration and development of statistical and graphic methods for multivariate data in a geospatial context of historical interest.
Maintained by Michael Friendly. Last updated 2 months ago.
3.6 match 1 stars 4.72 score 53 scriptswenchao-ma
GDINA:The Generalized DINA Model Framework
A set of psychometric tools for cognitive diagnosis modeling based on the generalized deterministic inputs, noisy and gate (G-DINA) model by de la Torre (2011) <DOI:10.1007/s11336-011-9207-7> and its extensions, including the sequential G-DINA model by Ma and de la Torre (2016) <DOI:10.1111/bmsp.12070> for polytomous responses, and the polytomous G-DINA model by Chen and de la Torre <DOI:10.1177/0146621613479818> for polytomous attributes. Joint attribute distribution can be independent, saturated, higher-order, loglinear smoothed or structured. Q-matrix validation, item and model fit statistics, model comparison at test and item level and differential item functioning can also be conducted. A graphical user interface is also provided. For tutorials, please check Ma and de la Torre (2020) <DOI:10.18637/jss.v093.i14>, Ma and de la Torre (2019) <DOI:10.1111/emip.12262>, Ma (2019) <DOI:10.1007/978-3-030-05584-4_29> and de la Torre and Akbay (2019).
Maintained by Wenchao Ma. Last updated 1 months ago.
1.9 match 30 stars 8.92 score 94 scripts 6 dependentsnicebread
fSRM:Social Relations Analyses with Roles ("Family SRM")
Social Relations Analysis with roles ("Family SRM") are computed, using a structural equation modeling approach. Groups ranging from three members up to an unlimited number of members are supported and the mean structure can be computed. Means and variances can be compared between different groups of families and between roles.
Maintained by Felix Schönbrodt. Last updated 4 years ago.
15.8 match 1.04 score 11 scriptspedrosfig
BayesSampling:Bayes Linear Estimators for Finite Population
Allows the user to apply the Bayes Linear approach to finite population with the Simple Random Sampling - BLE_SRS() - and the Stratified Simple Random Sampling design - BLE_SSRS() - (both without replacement), to the Ratio estimator (using auxiliary information) - BLE_Ratio() - and to categorical data - BLE_Categorical(). The Bayes linear estimation approach is applied to a general linear regression model for finite population prediction in BLE_Reg() and it is also possible to achieve the design based estimators using vague prior distributions. Based on Gonçalves, K.C.M, Moura, F.A.S and Migon, H.S.(2014) <>.
Maintained by Pedro Soares Figueiredo. Last updated 4 years ago.
3.6 match 1 stars 4.56 score 12 scriptsstencila
stencilaschema:Bindings for Stencila Schema
Provides R bindings for the Stencila Schema <>. This package is primarily aimed at R developers wanting to programmatically generate, or modify, executable documents.
Maintained by Nokome Bentley. Last updated 3 years ago.
3.3 match 17 stars 4.93 score 2 scriptsirinagain
iglu:Interpreting Glucose Data from Continuous Glucose Monitors
Implements a wide range of metrics for measuring glucose control and glucose variability based on continuous glucose monitoring data. The list of implemented metrics is summarized in Rodbard (2009) <doi:10.1089/dia.2009.0015>. Additional visualization tools include time-series plots, lasagna plots and ambulatory glucose profile report.
Maintained by Irina Gaynanova. Last updated 10 days ago.
1.8 match 26 stars 9.00 score 39 scriptsrichardli
surveyPrev:Mapping the Prevalence of Binary Indicators using Survey Data in Small Areas
Provides a pipeline to perform small area estimation and prevalence mapping of binary indicators using health and demographic survey data, described in Fuglstad et al. (2022) <doi:10.48550/arXiv.2110.09576> and Wakefield et al. (2020) <doi:10.1111/insr.12400>.
Maintained by Qianyu Dong. Last updated 5 days ago.
2.8 match 1 stars 5.76 score 11 scriptstrinker
textshape:Tools for Reshaping Text
Tools that can be used to reshape and restructure text data.
Maintained by Tyler Rinker. Last updated 12 months ago.
1.8 match 50 stars 9.18 score 266 scripts 34 dependentsdarwin-eu
omopgenerics:Methods and Classes for the OMOP Common Data Model
Provides definitions of core classes and methods used by analytic pipelines that query the OMOP (Observational Medical Outcomes Partnership) common data model.
Maintained by Martí Català. Last updated 10 days ago.
1.6 match 9.97 score 193 scripts 16 dependentstbates
umx:Structural Equation Modeling and Twin Modeling in R
Quickly create, run, and report structural equation models, and twin models. See '?umx' for help, and umx_open_CRAN_page("umx") for NEWS. Timothy C. Bates, Michael C. Neale, Hermine H. Maes, (2019). umx: A library for Structural Equation and Twin Modelling in R. Twin Research and Human Genetics, 22, 27-41. <doi:10.1017/thg.2019.2>.
Maintained by Timothy C. Bates. Last updated 2 days ago.
1.7 match 44 stars 9.45 score 472 scriptsmikldk
disclapmix:Discrete Laplace Mixture Inference using the EM Algorithm
Make inference in a mixture of discrete Laplace distributions using the EM algorithm. This can e.g. be used for modelling the distribution of Y chromosomal haplotypes as described in [1, 2] (refer to the URL section).
Maintained by Mikkel Meyer Andersen. Last updated 2 years ago.
3.7 match 4.32 score 14 scriptswilcoxa
frequency:Easy Frequency Tables
Generate 'SPSS'/'SAS' styled frequency tables. Frequency tables are generated with variable and value label attributes where applicable with optional html output to quickly examine datasets.
Maintained by Alistair Wilcox. Last updated 4 years ago.
3.5 match 3 stars 4.51 score 36 scriptsadeverse
adegraphics:An S4 Lattice-Based Package for the Representation of Multivariate Data
Graphical functionalities for the representation of multivariate data. It is a complete re-implementation of the functions available in the 'ade4' package.
Maintained by Aurélie Siberchicot. Last updated 8 months ago.
1.5 match 9 stars 10.37 score 386 scripts 6 dependentsts404
WikidataR:Read-Write API Client Library for Wikidata
Read from, interrogate, and write to Wikidata <> - the multilingual, interdisciplinary, semantic knowledgebase. Includes functions to: read from Wikidata (single items, properties, or properties); query Wikidata (retrieving all items that match a set of criteria via Wikidata SPARQL query service); write to Wikidata (adding new items or statements via QuickStatements); and handle and manipulate Wikidata objects (as lists and tibbles). Uses the Wikidata and QuickStatements APIs.
Maintained by Thomas Shafee. Last updated 2 months ago.
1.8 match 22 stars 8.64 score 109 scripts 25 dependentshuanglabumn
oncoPredict:Drug Response Modeling and Biomarker Discovery
Allows for building drug response models using screening data between bulk RNA-Seq and a drug response metric and two additional tools for biomarker discovery that have been developed by the Huang Laboratory at University of Minnesota. There are 3 main functions within this package. (1) calcPhenotype is used to build drug response models on RNA-Seq data and impute them on any other RNA-Seq dataset given to the model. (2) GLDS is used to calculate the general level of drug sensitivity, which can improve biomarker discovery. (3) IDWAS can take the results from calcPhenotype and link the imputed response back to available genomic data (mutation and CNV alterations) to identify biomarkers. Each of these functions comes from a paper from the Huang research laboratory. The relevant paper for each function is given below. calcPhenotype - Geeleher et al, Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines. GLDS - Geeleher et al, Cancer biomarker discovery is improved by accounting for variability in general levels of drug sensitivity in pre-clinical models. IDWAS - Geeleher et al, Discovering novel pharmacogenomic biomarkers by imputing drug response in cancer patients from large genomics studies.
Maintained by Robert Gruener. Last updated 12 months ago.
2.4 match 18 stars 6.47 score 41 scriptsbioc
RImmPort:RImmPort: Enabling Ready-for-analysis Immunology Research Data
The RImmPort package simplifies access to ImmPort data for analysis in the R environment. It provides a standards-based interface to the ImmPort study data that is in a proprietary format.
Maintained by Zicheng Hu. Last updated 5 months ago.
3.5 match 4.33 score 27 scriptstmsalab
edmdata:Data Sets for Psychometric Modeling
Collection of data sets from various assessments that can be used to evaluate psychometric models. These data sets have been analyzed in the following papers that introduced new methodology as part of the application section: Jimenez, A., Balamuta, J. J., & Culpepper, S. A. (2023) <doi:10.1111/bmsp.12307>, Culpepper, S. A., & Balamuta, J. J. (2021) <doi:10.1080/00273171.2021.1985949>, Yinghan Chen et al. (2021) <doi:10.1007/s11336-021-09750-9>, Yinyin Chen et al. (2020) <doi:10.1007/s11336-019-09693-2>, Culpepper, S. A. (2019a) <doi:10.1007/s11336-019-09683-4>, Culpepper, S. A. (2019b) <doi:10.1007/s11336-018-9643-8>, Culpepper, S. A., & Chen, Y. (2019) <doi:10.3102/1076998618791306>, Culpepper, S. A., & Balamuta, J. J. (2017) <doi:10.1007/s11336-015-9484-7>, and Culpepper, S. A. (2015) <doi:10.3102/1076998615595403>.
Maintained by James Joseph Balamuta. Last updated 6 months ago.
3.6 match 5 stars 4.18 score 7 scripts 1 dependentscristoforosimonetto
msce:Hazard of Multi-Stage Clonal Expansion Models
Functions to calculate hazard and survival function of Multi-Stage Clonal Expansion Models used in cancer epidemiology. For the Two-Stage Clonal Expansion Model an exact solution is implemented assuming piecewise constant parameters. Numerical solutions are provided for its extensions.
Maintained by Cristoforo Simonetto. Last updated 4 years ago.
7.5 match 2.00 score 2 scriptstverbeke
SDaA:Sampling: Design and Analysis
Functions and Datasets from Lohr, S. (1999), Sampling: Design and Analysis, Duxbury.
Maintained by Tobias Verbeke. Last updated 3 years ago.
6.9 match 2.15 score 14 scriptslightbluetitan
educationR:A Comprehensive Collection of Educational Datasets
Provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics. This package is designed as a resource for educational researchers, data analysts, and statisticians to explore and analyze data in the field of education.
Maintained by Renzo Caceres Rossi. Last updated 3 months ago.
3.4 match 4 stars 4.30 score 3 scriptsluomus
finbif:Interface for the 'Finnish Biodiversity Information Facility' API
A programmatic interface to the 'Finnish Biodiversity Information Facility' ('FinBIF') API (<>). 'FinBIF' aggregates Finnish biodiversity data from multiple sources in a single open access portal for researchers, citizen scientists, industry and government. 'FinBIF' allows users of biodiversity information to find, access, combine and visualise data on Finnish plants, animals and microorganisms. The 'finbif' package makes the publicly available data in 'FinBIF' easily accessible to programmers. Biodiversity information is available on taxonomy and taxon occurrence. Occurrence data can be filtered by taxon, time, location and other variables. The data accessed are conveniently preformatted for subsequent analyses.
Maintained by William K. Morris. Last updated 6 days ago.
1.8 match 5 stars 8.15 score 42 scripts 3 dependentspaytonjjones
networktree:Recursive Partitioning of Network Models
Network trees recursively partition the data with respect to covariates. Two network tree algorithms are available: model-based trees based on a multivariate normal model and nonparametric trees based on covariance structures. After partitioning, correlation-based networks (psychometric networks) can be fit on the partitioned data. For details see Jones, Mair, Simon, & Zeileis (2020) <doi:10.1007/s11336-020-09731-4>.
Maintained by Payton Jones. Last updated 3 years ago.
3.8 match 13 stars 3.85 score 11 scriptsmagnusdv
ribd:Pedigree-based Relatedness Coefficients
Recursive algorithms for computing various relatedness coefficients, including pairwise kinship, kappa and identity coefficients. Both autosomal and X-linked coefficients are computed. Founders are allowed to be inbred, which enables construction of any given kappa coefficients, as described in Vigeland (2020) <doi:10.1007/s00285-020-01505-x>. In addition to the standard coefficients, 'ribd' also computes a range of lesser-known coefficients, including generalised kinship coefficients, multi-person coefficients and two-locus coefficients (Vigeland, 2023, <doi:10.1093/g3journal/jkac326>). Many features of 'ribd' are available through the online app 'QuickPed' at <>; see Vigeland (2022) <doi:10.1186/s12859-022-04759-y>.
Maintained by Magnus Dehli Vigeland. Last updated 1 months ago.
2.4 match 6 stars 5.95 score 10 scripts 11 dependentsekstroem
MESS:Miscellaneous Esoteric Statistical Scripts
A mixed collection of useful and semi-useful diverse statistical functions, some of which may even be referenced in The R Primer book. See Ekstrøm, C. T. (2016). The R Primer. 2nd edition. Chapman & Hall.
Maintained by Claus Thorn Ekstrøm. Last updated 29 days ago.
1.8 match 4 stars 7.76 score 328 scripts 13 dependentsmarkean
retel:Regularized Exponentially Tilted Empirical Likelihood
Implements the regularized exponentially tilted empirical likelihood method. Details of the method are given in Kim, MacEachern, and Peruggia (2023) <doi:10.48550/arXiv.2312.17015>. This work was supported by the U.S. National Science Foundation under Grants No. SES-1921523 and DMS-2015552.
Maintained by Eunseop Kim. Last updated 11 months ago.
3.5 match 2 stars 3.86 score 72 scriptsbioc
dittoSeq:User Friendly Single-Cell and Bulk RNA Sequencing Visualization
A universal, user friendly, single-cell and bulk RNA sequencing visualization toolkit that allows highly customizable creation of color blindness friendly, publication-quality figures. dittoSeq accepts both SingleCellExperiment (SCE) and Seurat objects, as well as the import and usage, via conversion to an SCE, of SummarizedExperiment or DGEList bulk data. Visualizations include dimensionality reduction plots, heatmaps, scatterplots, percent composition or expression across groups, and more. Customizations range from size and title adjustments to automatic generation of annotations for heatmaps, overlay of trajectory analysis onto any dimensionality reduction plot, hidden data overlay upon cursor hovering via ggplotly conversion, and many more. All with simple, discrete inputs. Color blindness friendliness is powered by legend adjustments (enlarged keys), and by allowing the use of shapes or letter-overlay in addition to the carefully selected dittoColors().
Maintained by Daniel Bunis. Last updated 5 months ago.
1.8 match 7.56 score 760 scripts 2 dependentsdarwin-eu
IncidencePrevalence:Estimate Incidence and Prevalence using the OMOP Common Data Model
Calculate incidence and prevalence using data mapped to the Observational Medical Outcomes Partnership (OMOP) common data model. Incidence and prevalence can be estimated for the total population in a database or for a stratification cohort.
Maintained by Edward Burn. Last updated 6 days ago.
1.7 match 9 stars 7.96 score 102 scripts 1 dependentscran
epiR:Tools for the Analysis of Epidemiological Data
Tools for the analysis of epidemiological and surveillance data. Contains functions for directly and indirectly adjusting measures of disease frequency, quantifying measures of association on the basis of single or multiple strata of count data presented in a contingency table, computation of confidence intervals around incidence risk and incidence rate estimates and sample size calculations for cross-sectional, case-control and cohort studies. Surveillance tools include functions to calculate an appropriate sample size for 1- and 2-stage representative freedom surveys, functions to estimate surveillance system sensitivity and functions to support scenario tree modelling analyses.
Maintained by Mark Stevenson. Last updated 2 months ago.
1.6 match 10 stars 8.18 score 10 dependentsinbo
checklist:A Thorough and Strict Set of Checks for R Packages and Source Code
An opinionated set of rules for R packages and R source code projects.
Maintained by Thierry Onkelinx. Last updated 27 days ago.
1.8 match 19 stars 7.24 score 21 scripts 2 dependentscran
multilevel:Multilevel Functions
Tools used by organizational researchers for the analysis of multilevel data. Includes four broad sets of tools. First, functions for estimating within-group agreement and reliability indices. Second, functions for manipulating multilevel and longitudinal (panel) data. Third, simulations for estimating power and generating multilevel data. Fourth, miscellaneous functions for estimating reliability and performing simple calculations and data transformations.
Maintained by Paul Bliese. Last updated 3 years ago.
3.4 match 3.79 score 4 dependentsbioc
ISLET:Individual-Specific ceLl typE referencing Tool
ISLET is a method to conduct signal deconvolution for general -omics data. It can estimate the individual-specific and cell-type-specific reference panels, when there are multiple samples observed from each subject. It takes the input of the observed mixture data (feature by sample matrix), and the cell type mixture proportions (sample by cell type matrix), and the sample-to-subject information. It can solve for the reference panel on the individual-basis and conduct test to identify cell-type-specific differential expression (csDE) genes. It also improves estimated cell type mixture proportions by integrating personalized reference panels.
Maintained by Hao Feng. Last updated 5 months ago.
3.2 match 4.00 score 9 scriptsropensci
dataspice:Create Lightweight Descriptions of Data
The goal of 'dataspice' is to make it easier for researchers to create basic, lightweight, and concise metadata files for their datasets. These basic files can then be used to make useful information available during analysis, create a helpful dataset "README" webpage, and produce more complex metadata formats to aid dataset discovery. Metadata fields are based on the '' and 'Ecological Metadata Language' standards.
Maintained by Bryce Mecum. Last updated 4 years ago.
1.7 match 162 stars 7.45 score 25 scriptscran
LARisk:Estimation of Lifetime Attributable Risk of Cancer from Radiation Exposure
Computes the lifetime attributable risk of radiation-induced cancer, with fast calculation and various options that add flexibility for research. Important reference papers include Berrington de Gonzalez et al. (2012) <doi:10.1088/0952-4746/32/3/205>, National Research Council (2006, ISBN:978-0-309-09156-5).
Maintained by Juhee Lee. Last updated 3 years ago.
6.3 match 2.00 scorevjilmari
multid:Multivariate Difference Between Two Groups
Estimation of multivariate differences between two groups (e.g., multivariate sex differences) with regularized regression methods and predictive approach. See Lönnqvist & Ilmarinen (2021) <doi:10.1007/s11109-021-09681-2> and Ilmarinen et al. (2023) <doi:10.1177/08902070221088155>. Includes tools that help in understanding difference score reliability, predictions of difference score variables, conditional intra-class correlations, and heterogeneity of variance estimates. Package development was supported by the Academy of Finland research grant 338891.
Maintained by Ville-Juhani Ilmarinen. Last updated 6 months ago.
2.8 match 4.48 score 6 scripts
ohdsi
omock:Creation of Mock Observational Medical Outcomes Partnership Common Data Model
Creates mock data for testing and package development for the Observational Medical Outcomes Partnership common data model. The package offers functions crafted with pipeline-friendly implementation, enabling users to effortlessly include only the necessary tables for their testing needs.
Maintained by Mike Du. Last updated 1 months ago.
1.7 match 2 stars 7.44 score 45 scripts 1 dependents
donaldrwilliams
GGMnonreg:Non-Regularized Gaussian Graphical Models
Estimate non-regularized Gaussian graphical models, Ising models, and mixed graphical models. The current methods consist of multiple regression, a non-parametric bootstrap <doi:10.1080/00273171.2019.1575716>, and Fisher z transformed partial correlations <doi:10.1111/bmsp.12173>. Parameter uncertainty, predictability, and network replicability <doi:10.31234/> are also implemented.
Maintained by Donald Williams. Last updated 3 years ago.
3.4 match 6 stars 3.48 score 4 scripts
cbiit
LDlinkR:Calculating Linkage Disequilibrium (LD) in Human Population Groups of Interest
Provides access to the 'LDlink' API (<>) using the R console. This programmatic access facilitates researchers who are interested in performing batch queries in 1000 Genomes Project (2015) <doi:10.1038/nature15393> data using 'LDlink'. 'LDlink' is an interactive and powerful suite of web-based tools for querying germline variants in human population groups of interest. For more details, please see Machiela et al. (2015) <doi:10.1093/bioinformatics/btv402>.
Maintained by Timothy A. Myers. Last updated 11 months ago.
1.3 match 58 stars 9.21 score 206 scripts 1 dependents
czopluoglu
EstCRM:Calibrating Parameters for the Samejima's Continuous IRT Model
Estimates item and person parameters for the Continuous Response Model (CRM; Samejima, 1973, <doi:10.1007/BF02291114>), computes item fit residual statistics, draws empirical 3D item category response curves, draws theoretical 3D item category response curves, and generates data under the CRM for simulation studies.
Maintained by Cengiz Zopluoglu. Last updated 2 years ago.
5.9 match 1 stars 1.95 score 6 scripts 3 dependents
ropensci
spiro:Manage Data from Cardiopulmonary Exercise Testing
Import, process, summarize and visualize raw data from metabolic carts. See Robergs, Dwyer, and Astorino (2010) <doi:10.2165/11319670-000000000-00000> for more details on data processing.
Maintained by Simon Nolte. Last updated 25 days ago.
1.8 match 13 stars 6.35 score 43 scripts
jojo-
mipfp:Multidimensional Iterative Proportional Fitting and Alternative Models
An implementation of the iterative proportional fitting (IPFP), maximum likelihood, minimum chi-square and weighted least squares procedures for updating an N-dimensional array with respect to given target marginal distributions (which, in turn, can be multidimensional). The package also provides an application of the IPFP to simulate multivariate Bernoulli distributions.
Maintained by Johan Barthelemy. Last updated 4 years ago.
1.7 match 24 stars 6.79 score 86 scripts 3 dependents
jwiley
brmsmargins:Bayesian Marginal Effects for 'brms' Models
Calculate Bayesian marginal effects, average marginal effects, and marginal coefficients (also called population averaged coefficients) for models fit using the 'brms' package including fixed effects, mixed effects, and location scale models. These are based on marginal predictions that integrate out random effects if necessary (see for example <doi:10.1186/s12874-015-0046-6> and <doi:10.1111/biom.12707>).
Maintained by Joshua F. Wiley. Last updated 2 months ago.
1.8 match 20 stars 6.22 score 42 scripts
jrosell
jrrosell:Personal R package for Jordi Rosell
Useful functions for personal usage.
Maintained by Jordi Rosell. Last updated 3 months ago.
3.6 match 2 stars 3.08 score 7 scripts
guangjianzhang
EFAutilities:Utility Functions for Exploratory Factor Analysis
A number of utility functions for exploratory factor analysis are included in this package. In particular, it computes standard errors for parameter estimates and factor correlations under a variety of conditions.
Maintained by Guangjian Zhang. Last updated 2 years ago.
3.4 match 3.26 score 12 scripts
evilgraham
flatr:Transforms Contingency Tables to Data Frames, and Analyses Them
Contingency tables are a pain to work with when you want to run regressions. This package takes them and flattens them into a long data frame, so you can more easily analyse them! You can also calculate other related statistics. All of this is done in a 'tidy' manner, so it should tie in nicely with the 'tidyverse' series of packages.
Maintained by Scott D. Graham. Last updated 7 years ago.
3.5 match 3 stars 3.18 score 6 scripts
okanbulut
eirm:Explanatory Item Response Modeling for Dichotomous and Polytomous Items
Analysis of dichotomous and polytomous response data using the explanatory item response modeling framework, as described in Bulut, Gorgun, & Yildirim-Erbasli (2021) <doi:10.3390/psych3030023>, Stanke & Bulut (2019) <doi:10.21449/ijate.515085>, and De Boeck & Wilson (2004) <doi:10.1007/978-1-4757-3990-9>. Generalized linear mixed modeling is used for estimating the effects of item-related and person-related variables on dichotomous and polytomous item responses.
Maintained by Okan Bulut. Last updated 2 years ago.
2.3 match 8 stars 4.90 score
repboxr
GithubActions:Functions to facilitate use of GitHub Actions via R
Work in progress. Not yet working well.
Maintained by Sebastian Kranz. Last updated 9 months ago.
3.4 match 2 stars 3.26 score 2 dependents
cran
BNPTSclust:A Bayesian Nonparametric Algorithm for Time Series Clustering
Performs the algorithm for time series clustering described in Nieto-Barajas and Contreras-Cristan (2014).
Maintained by David Alejandro Martell Juarez. Last updated 6 years ago.
6.9 match 4 stars 1.60 score
cran
metaumbrella:Umbrella Review Package for R
A comprehensive range of facilities to perform umbrella reviews with stratification of the evidence in R. The package accomplishes this aim by building on three core functions that: (i) automatically perform all required calculations in an umbrella review (including but not limited to meta-analyses), (ii) stratify evidence according to various classification criteria, and (iii) generate a visual representation of the results. Note that if you are not familiar with R, the core features of this package are available from a web browser (<>).
Maintained by Corentin J Gosling. Last updated 16 days ago.
2.4 match 9 stars 4.56 score
jhorzek
mxsem:Specify 'OpenMx' Models with a 'lavaan'-Style Syntax
Provides a 'lavaan'-like syntax for 'OpenMx' models. The syntax supports definition variables, bounds, and parameter transformations. This allows for latent growth curve models with person-specific measurement occasions, moderated nonlinear factor analysis and much more.
Maintained by Jannik H. Orzek. Last updated 4 months ago.
1.9 match 3 stars 5.93 score 47 scripts
jmcurran
relSim:Relative Simulator
A set of tools to explore the behaviour of statistics used for forensic DNA interpretation when close relatives are involved. The package also offers some useful tools for exploring other forensic DNA situations.
Maintained by James M. Curran. Last updated 1 years ago.
3.4 match 3.18 score 30 scripts
egeulgen
driveR:Prioritizing Cancer Driver Genes Using Genomics Data
Cancer genomes contain large numbers of somatic alterations but few genes drive tumor development. Identifying cancer driver genes is critical for precision oncology. Most current approaches identify driver genes either based on mutational recurrence or using estimated scores predicting the functional consequences of mutations. 'driveR' is a tool for personalized or batch analysis of genomic data for driver gene prioritization by combining genomic information and prior biological knowledge. As features, 'driveR' uses coding impact metaprediction scores, non-coding impact scores, somatic copy number alteration scores, hotspot gene/double-hit gene condition, 'phenolyzer' gene scores and memberships to cancer-related KEGG pathways. It uses these features to estimate a cancer-type-specific probability of each gene being a cancer driver using the related task of a multi-task learning classification model. The method is described in detail in Ulgen E, Sezerman OU. 2021. driveR: a novel method for prioritizing cancer driver genes using somatic genomics data. BMC Bioinformatics <doi:10.1186/s12859-021-04203-7>.
Maintained by Ege Ulgen. Last updated 2 years ago.
1.7 match 15 stars 6.29 score 260 scripts