Showing 68 of total 68 results (show query)
pbiecek
breakDown:Model Agnostic Explainers for Individual Predictions
Model agnostic tool for decomposition of predictions from black boxes. Break Down Table shows contributions of every variable to a final prediction. Break Down Plot presents variable contributions in a concise graphical way. This package work for binary classifiers and general regression models.
Maintained by Przemyslaw Biecek. Last updated 1 years ago.
data-scienceimlinterpretabilitymachine-learningvisual-explanationsxai
76.0 match 103 stars 8.90 score 91 scripts 2 dependentsbioc
amplican:Automated analysis of CRISPR experiments
`amplican` performs alignment of the amplicon reads, normalizes gathered data, calculates multiple statistics (e.g. cut rates, frameshifts) and presents results in form of aggregated reports. Data and statistics can be broken down by experiments, barcodes, user defined groups, guides and amplicons allowing for quick identification of potential problems.
Maintained by Eivind Valen. Last updated 5 months ago.
immunooncologytechnologyalignmentqpcrcrisprcpp
16.5 match 10 stars 7.54 score 41 scriptsmodeloriented
iBreakDown:Model Agnostic Instance Level Variable Attributions
Model agnostic tool for decomposition of predictions from black boxes. Supports additive attributions and attributions with interactions. The Break Down Table shows contributions of every variable to a final prediction. The Break Down Plot presents variable contributions in a concise graphical way. This package works for classification and regression models. It is an extension of the 'breakDown' package (Staniak and Biecek 2018) <doi:10.32614/RJ-2018-072>, with new and faster strategies for orderings. It supports interactions in explanations and has interactive visuals (implemented with 'D3.js' library). The methodology behind is described in the 'iBreakDown' article (Gosiewska and Biecek 2019) <arXiv:1903.11420> This package is a part of the 'DrWhy.AI' universe (Biecek 2018) <arXiv:1806.08915>.
Maintained by Przemyslaw Biecek. Last updated 1 years ago.
breakdownimlinterpretabilityshapleyxai
11.5 match 84 stars 10.07 score 56 scripts 22 dependentsdmurdoch
plotrix:Various Plotting Functions
Lots of plots, various labeling, axis and color scaling functions. The author/maintainer died in September 2023.
Maintained by Duncan Murdoch. Last updated 1 years ago.
5.4 match 5 stars 11.31 score 9.2k scripts 361 dependentshms-dbmi
UpSetR:A More Scalable Alternative to Venn and Euler Diagrams for Visualizing Intersecting Sets
Creates visualizations of intersecting sets using a novel matrix design, along with visualizations of several common set, element and attribute related tasks (Conway 2017) <doi:10.1093/bioinformatics/btx364>.
Maintained by Jake Conway. Last updated 4 years ago.
gehlenborglabggplot2upsetupsetrvisualization
3.8 match 781 stars 15.33 score 4.8k scripts 42 dependentskjhealy
gssrdoc:Document General Social Survey Variable
The General Social Survey (GSS) is a long-running, mostly annual survey of US households. It is administered by the National Opinion Research Center (NORC). This package contains the a tibble with information on the survey variables, together with every variable documented as an R help page. For more information on the GSS see \url{http://gss.norc.org}.
Maintained by Kieran Healy. Last updated 11 months ago.
25.6 match 2.28 score 38 scriptsthijsjanzen
GenomeAdmixR:Simulate Admixture of Genomes
Individual-based simulations forward in time, simulating how patterns in ancestry along the genome change after admixture. Full description can be found in Janzen (2021) <doi:10.1111/2041-210X.13612>.
Maintained by Thijs Janzen. Last updated 1 years ago.
8.4 match 5 stars 5.24 score 14 scriptsrobinhankin
hyper2:The Hyperdirichlet Distribution, Mark 2
A suite of routines for the hyperdirichlet distribution and reified Bradley-Terry; supersedes the 'hyperdirichlet' package; uses 'disordR' discipline <doi:10.48550/ARXIV.2210.03856>. To cite in publications please use Hankin 2017 <doi:10.32614/rj-2017-061>, and for Generalized Plackett-Luce likelihoods use Hankin 2024 <doi:10.18637/jss.v109.i08>.
Maintained by Robin K. S. Hankin. Last updated 3 days ago.
7.2 match 5 stars 6.01 score 38 scripts 1 dependentsspatstat
spatstat.data:Datasets for 'spatstat' Family
Contains all the datasets for the 'spatstat' family of packages.
Maintained by Adrian Baddeley. Last updated 22 hours ago.
kernel-densitypoint-processspatial-analysisspatial-dataspatial-data-analysisspatstatstatistical-analysisstatistical-methodsstatistical-testsstatistics
3.8 match 6 stars 11.02 score 186 scripts 228 dependentslaresbernardo
lares:Analytics & Machine Learning Sidekick
Auxiliary package for better/faster analytics, visualization, data mining, and machine learning tasks. With a wide variety of family functions, like Machine Learning, Data Wrangling, Marketing Mix Modeling (Robyn), Exploratory, API, and Scrapper, it helps the analyst or data scientist to get quick and robust results, without the need of repetitive coding or advanced R programming skills.
Maintained by Bernardo Lares. Last updated 23 days ago.
analyticsapiautomationautomldata-sciencedescriptive-statisticsh2omachine-learningmarketingmmmpredictive-modelingpuzzlerlanguagerobynvisualization
4.0 match 233 stars 9.84 score 185 scripts 1 dependentshumaniverse
asylum:Data on Asylum and Resettlement for the UK
Data on Asylum and Resettlement for the UK, provided by the Home Office <https://www.gov.uk/government/statistical-data-sets/immigration-system-statistics-data-tables>.
Maintained by Matthew Gwynfryn Thomas. Last updated 17 days ago.
7.8 match 3 stars 4.99 score 36 scriptsvalentint
rrcov:Scalable Robust Estimators with High Breakdown Point
Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point: principal component analysis (Filzmoser and Todorov (2013), <doi:10.1016/j.ins.2012.10.017>), linear and quadratic discriminant analysis (Todorov and Pires (2007)), multivariate tests (Todorov and Filzmoser (2010) <doi:10.1016/j.csda.2009.08.015>), outlier detection (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>). See also Todorov and Filzmoser (2009) <urn:isbn:978-3838108148>, Todorov and Filzmoser (2010) <doi:10.18637/jss.v032.i03> and Boudt et al. (2019) <doi:10.1007/s11222-019-09869-x>.
Maintained by Valentin Todorov. Last updated 7 months ago.
3.4 match 2 stars 10.51 score 484 scripts 96 dependentsschochastics
networkdata:Repository of Network Datasets
The package contains a large collection of network dataset with different context. This includes social networks, animal networks and movie networks. All datasets are in 'igraph' format.
Maintained by David Schoch. Last updated 12 months ago.
6.6 match 143 stars 5.01 score 143 scriptsmayer79
flashlight:Shed Light on Black Box Machine Learning Models
Shed light on black box machine learning models by the help of model performance, variable importance, global surrogate models, ICE profiles, partial dependence (Friedman J. H. (2001) <doi:10.1214/aos/1013203451>), accumulated local effects (Apley D. W. (2016) <arXiv:1612.08468>), further effects plots, interaction strength, and variable contribution breakdown (Gosiewska and Biecek (2019) <arxiv:1903.11420>). All tools are implemented to work with case weights and allow for stratified analysis. Furthermore, multiple flashlights can be combined and analyzed together.
Maintained by Michael Mayer. Last updated 8 months ago.
interpretabilityinterpretable-machine-learningmachine-learningxai
5.3 match 22 stars 6.25 score 54 scripts 1 dependentsbillpetti
baseballr:Acquiring and Analyzing Baseball Data
Provides numerous utilities for acquiring and analyzing baseball data from online sources such as 'Baseball Reference' <https://www.baseball-reference.com/>, 'FanGraphs' <https://www.fangraphs.com/>, and the 'MLB Stats' API <https://www.mlb.com/>.
Maintained by Saiem Gilani. Last updated 4 months ago.
baseballpitchfxsabermetricsstatcast
3.5 match 380 stars 8.98 score 582 scriptsalanarnholt
BSDA:Basic Statistics and Data Analysis
Data sets for book "Basic Statistics and Data Analysis" by Larry J. Kitchens.
Maintained by Alan T. Arnholt. Last updated 2 years ago.
3.4 match 7 stars 9.11 score 1.3k scripts 6 dependentsr-forge
robustbase:Basic Robust Statistics
"Essential" Robust Statistics. Tools allowing to analyze data with robust methods. This includes regression methodology including model selections and multivariate statistics where we strive to cover the book "Robust Statistics, Theory and Methods" by 'Maronna, Martin and Yohai'; Wiley 2006.
Maintained by Martin Maechler. Last updated 4 months ago.
1.7 match 13.33 score 1.7k scripts 480 dependentsr-forge
Sleuth3:Data Sets from Ramsey and Schafer's "Statistical Sleuth (3rd Ed)"
Data sets from Ramsey, F.L. and Schafer, D.W. (2013), "The Statistical Sleuth: A Course in Methods of Data Analysis (3rd ed)", Cengage Learning.
Maintained by Berwin A Turlach. Last updated 1 years ago.
3.5 match 6.38 score 522 scriptsr-forge
Sleuth2:Data Sets from Ramsey and Schafer's "Statistical Sleuth (2nd Ed)"
Data sets from Ramsey, F.L. and Schafer, D.W. (2002), "The Statistical Sleuth: A Course in Methods of Data Analysis (2nd ed)", Duxbury.
Maintained by Berwin A Turlach. Last updated 1 years ago.
3.5 match 5.70 score 191 scriptsappliedstat
rQCC:Robust Quality Control Chart
Constructs various robust quality control charts based on the median or Hodges-Lehmann estimator (location) and the median absolute deviation (MAD) or Shamos estimator (scale). The estimators used for the robust control charts are all unbiased with a sample of finite size. For more details, see Park, Kim and Wang (2022) <doi:10.1080/03610918.2019.1699114>. In addition, using this R package, the conventional quality control charts such as X-bar, S, R, p, np, u, c, g, h, and t charts are also easily constructed. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1A2C1091319).
Maintained by Chanseok Park. Last updated 1 years ago.
control-chartgoodness-of-fitr-languageweibull
4.1 match 2 stars 4.70 score 3 scriptshardin47
biwt:Functions to Compute the Biweight Mean Vector and Covariance and Correlation Matrices
The base functions compute multivariate location, scale, and correlation estimates based on Tukey's biweight M-estimator. Using the base function, the computations can be applied to a large number of observations to create either a matrix of biweight distances or biweight correlations.
Maintained by Johanna Hardin. Last updated 6 months ago.
3.4 match 5.58 score 16 scripts 2 dependentsjverzani
UsingR:Data Sets, Etc. for the Text "Using R for Introductory Statistics", Second Edition
A collection of data sets to accompany the textbook "Using R for Introductory Statistics," second edition.
Maintained by John Verzani. Last updated 3 years ago.
3.8 match 1 stars 4.97 score 1.4k scriptsjaseziv
worldfootballR:Extract and Clean World Football (Soccer) Data
Allow users to obtain clean and tidy football (soccer) game, team and player data. Data is collected from a number of popular sites, including 'FBref', transfer and valuations data from 'Transfermarkt'<https://www.transfermarkt.com/> and shooting location and other match stats data from 'Understat'<https://understat.com/>. It gives users the ability to access data more efficiently, rather than having to export data tables to files before being able to complete their analysis.
Maintained by Jason Zivkovic. Last updated 30 days ago.
fbreffootballfootball-datasoccer-datasports-datatransfermarktunderstat
1.8 match 499 stars 10.22 score 516 scripts 2 dependentsrsoc
soc.ca:Specific Correspondence Analysis for the Social Sciences
Specific and class specific multiple correspondence analysis on survey-like data. Soc.ca is optimized to the needs of the social scientist and presents easily interpretable results in near publication ready quality.
Maintained by Anton Grau Larsen. Last updated 1 years ago.
4.3 match 14 stars 4.15 score 50 scriptsstevenmmortimer
rdfp:An Implementation of the 'DoubleClick for Publishers' API
Functions to interact with the 'Google DoubleClick for Publishers (DFP)' API <https://developers.google.com/ad-manager/api/start> (recently renamed to 'Google Ad Manager'). This package is automatically compiled from the API WSDL (Web Service Description Language) files to dictate how the API is structured. Theoretically, all API actions are possible using this package; however, care must be taken to format the inputs correctly and parse the outputs correctly. Please see the 'Google Ad Manager' API reference <https://developers.google.com/ad-manager/api/rel_notes> and this package's website <https://stevenmmortimer.github.io/rdfp/> for more information, documentation, and examples.
Maintained by Steven M. Mortimer. Last updated 6 years ago.
api-clientapi-wrapperdfpdfp-apidoubleclickdoubleclick-for-publishersgoogle-dfp
2.5 match 16 stars 6.93 score 214 scriptsropensci
rredlist:'IUCN' Red List Client
'IUCN' Red List (<https://api.iucnredlist.org/>) client. The 'IUCN' Red List is a global list of threatened and endangered species. Functions cover all of the Red List 'API' routes. An 'API' key is required.
Maintained by William Gearty. Last updated 1 months ago.
iucnbiodiversityapiweb-servicestraitshabitatspeciesconservationapi-wrapperiucn-red-listtaxize
1.5 match 53 stars 11.49 score 195 scripts 24 dependentsimmunogenomics
harmony:Fast, Sensitive, and Accurate Integration of Single Cell Data
Implementation of the Harmony algorithm for single cell integration, described in Korsunsky et al <doi:10.1038/s41592-019-0619-0>. Package includes a standalone Harmony function and interfaces to external frameworks.
Maintained by Ilya Korsunsky. Last updated 4 months ago.
algorithmdata-integrationscrna-seqopenblascpp
1.3 match 554 stars 13.74 score 5.5k scripts 8 dependentsgluc
data.tree:General Purpose Hierarchical Data Structure
Create tree structures from hierarchical data, and traverse the tree in various orders. Aggregate, cumulate, print, plot, convert to and from data.frame and more. Useful for decision trees, machine learning, finance, conversion from and to JSON, and many other applications.
Maintained by Christoph Glur. Last updated 5 months ago.
1.3 match 209 stars 12.84 score 1.1k scripts 88 dependentsbusiness-science
timetk:A Tool Kit for Working with Time Series
Easy visualization, wrangling, and feature engineering of time series data for forecasting and machine learning prediction. Consolidates and extends time series functionality from packages including 'dplyr', 'stats', 'xts', 'forecast', 'slider', 'padr', 'recipes', and 'rsample'.
Maintained by Matt Dancho. Last updated 1 years ago.
coercioncoercion-functionsdata-miningdplyrforecastforecastingforecasting-modelsmachine-learningseries-decompositionseries-signaturetibbletidytidyquanttidyversetimetime-seriestimeseries
1.1 match 625 stars 14.15 score 4.0k scripts 16 dependentscraig-parylo
cvdprevent:Wrapper for the 'CVD Prevent' Application Programming Interface
Provides an R wrapper to the 'CVD Prevent' application programming interface (API). Users can make API requests through built-in R functions. The Cardiovascular Disease Prevention Audit (CVDPREVENT) is an England-wide primary care audit that automatically extracts routinely held GP health data. <https://bmchealthdocs.atlassian.net/wiki/spaces/CP/pages/317882369/CVDPREVENT+API+Documentation>.
Maintained by Craig Parylo. Last updated 1 months ago.
3.2 match 3 stars 5.02 score 4 scriptsbioc
GenomicDataCommons:NIH / NCI Genomic Data Commons Access
Programmatically access the NIH / NCI Genomic Data Commons RESTful service.
Maintained by Sean Davis. Last updated 1 months ago.
dataimportsequencingapi-clientbioconductorbioinformaticscancercore-servicesdata-sciencegenomicsncitcgavignette
1.2 match 87 stars 11.94 score 238 scripts 12 dependentsthijsjanzen
junctions:The Breakdown of Genomic Ancestry Blocks in Hybrid Lineages
Individual based simulations of hybridizing populations, where the accumulation of junctions is tracked. Furthermore, mathematical equations are provided to verify simulation outcomes. Both simulations and mathematical equations are based on Janzen (2018, <doi:10.1101/058107>) and Janzen (2022, <doi:10.1111/1755-0998.13519>).
Maintained by Thijs Janzen. Last updated 3 days ago.
2.9 match 2 stars 4.71 score 17 scriptsmountainmath
cmhc:Access, Retrieve, and Work with CMHC Data
Wrapper around the Canadian Mortgage and Housing Corporation (CMHC) web interface. It enables programmatic and reproducible access to a wide variety of housing data from CMHC.
Maintained by Jens von Bergmann. Last updated 1 months ago.
1.9 match 20 stars 7.09 score 68 scriptshafen
geofacet:'ggplot2' Faceting Utilities for Geographical Data
Provides geographical faceting functionality for 'ggplot2'. Geographical faceting arranges a sequence of plots of data for different geographical entities into a grid that preserves some of the geographical orientation.
Maintained by Ryan Hafen. Last updated 7 months ago.
1.3 match 340 stars 9.78 score 1.5k scripts 4 dependentsvalentint
robust:Port of the S+ "Robust Library"
Methods for robust statistics, a state of the art in the early 2000s, notably for robust regression and robust multivariate analysis.
Maintained by Valentin Todorov. Last updated 7 months ago.
1.7 match 7.52 score 572 scripts 8 dependentsgiocomai
plausibler:Access Plausible Analytics API
Access Plausible Analytics API.
Maintained by Giorgio Comai. Last updated 3 days ago.
api-wrapperplausible-analytics
5.3 match 5 stars 2.40 scorevalentint
rrcovNA:Scalable Robust Estimators with High Breakdown Point for Incomplete Data
Robust Location and Scatter Estimation and Robust Multivariate Analysis with High Breakdown Point for Incomplete Data (missing values) (Todorov et al. (2010) <doi:10.1007/s11634-010-0075-2>).
Maintained by Valentin Todorov. Last updated 3 months ago.
3.3 match 1 stars 3.77 score 59 scriptsuniprjrc
fsdaR:Robust Data Analysis Through Monitoring and Dynamic Visualization
Provides interface to the 'MATLAB' toolbox 'Flexible Statistical Data Analysis (FSDA)' which is comprehensive and computationally efficient software package for robust statistics in regression, multivariate and categorical data analysis. The current R version implements tools for regression: (forward search, S- and MM-estimation, least trimmed squares (LTS) and least median of squares (LMS)), for multivariate analysis (forward search, S- and MM-estimation), for cluster analysis and cluster-wise regression. The distinctive feature of our package is the possibility of monitoring the statistics of interest as a function of breakdown point, efficiency or subset size, depending on the estimator. This is accompanied by a rich set of graphical features, such as dynamic brushing, linking, particularly useful for exploratory data analysis.
Maintained by Valentin Todorov. Last updated 1 years ago.
2.1 match 5 stars 5.37 score 93 scriptsselesnow
rfacebookstat:Load Data from Facebook API Marketing
Load data by campaigns, ads, ad sets and insights, ad account and business manager from Facebook Marketing API into R. For more details see official documents by Facebook Marketing API <https://developers.facebook.com/docs/marketing-apis/>.
Maintained by Alexey Seleznev. Last updated 26 days ago.
1.5 match 31 stars 7.43 score 48 scripts 1 dependentsropensci
phylotaR:Automated Phylogenetic Sequence Cluster Identification from 'GenBank'
A pipeline for the identification, within taxonomic groups, of orthologous sequence clusters from 'GenBank' <https://www.ncbi.nlm.nih.gov/genbank/> as the first step in a phylogenetic analysis. The pipeline depends on a local alignment search tool and is, therefore, not dependent on differences in gene naming conventions and naming errors.
Maintained by Shixiang Wang. Last updated 8 months ago.
blastngenbankpeer-reviewedphylogeneticssequence-alignment
1.9 match 23 stars 5.86 score 156 scriptschristopherkenny
censable:Making Census Data More Usable
Creates a common framework for organizing, naming, and gathering population, age, race, and ethnicity data from the Census Bureau. Accesses the API <https://www.census.gov/data/developers/data-sets.html>. Provides tools for adding information to existing data to line up with Census data.
Maintained by Christopher T. Kenny. Last updated 10 months ago.
1.9 match 8 stars 5.78 score 42 scripts 4 dependentsr4epi
epikit:Miscellaneous Helper Tools for Epidemiologists
Contains tools for formatting inline code, renaming redundant columns, aggregating age categories, adding survey weights, finding the earliest date of an event, plotting z-curves, generating population counts and calculating proportions with confidence intervals. This is part of the 'R4Epis' project <https://r4epis.netlify.app/>.
Maintained by Zhian N. Kamvar. Last updated 1 months ago.
1.7 match 10 stars 6.32 score 22 scripts 2 dependentscran
NISTnls:Nonlinear least squares examples from NIST
Datasets for testing nonlinear regression routines.
Maintained by Douglas Bates. Last updated 13 years ago.
4.0 match 2.69 score 99 scriptspeterkdunn
GLMsData:Generalized Linear Model Data Sets
Data sets from the book Generalized Linear Models with Examples in R by Dunn and Smyth.
Maintained by Peter K. Dunn. Last updated 3 years ago.
4.0 match 2.61 score 220 scriptsodelmarcelle
sentopics:Tools for Joint Sentiment and Topic Analysis of Textual Data
A framework that joins topic modeling and sentiment analysis of textual data. The package implements a fast Gibbs sampling estimation of Latent Dirichlet Allocation (Griffiths and Steyvers (2004) <doi:10.1073/pnas.0307752101>) and Joint Sentiment/Topic Model (Lin, He, Everson and Ruger (2012) <doi:10.1109/TKDE.2011.48>). It offers a variety of helpers and visualizations to analyze the result of topic modeling. The framework also allows enriching topic models with dates and externally computed sentiment measures. A flexible aggregation scheme enables the creation of time series of sentiment or topical proportions from the enriched topic models. Moreover, a novel method jointly aggregates topic proportions and sentiment measures to derive time series of topical sentiment.
Maintained by Olivier Delmarcelle. Last updated 2 months ago.
1.9 match 8 stars 5.38 score 5 scriptsjohnchaston
MAGNAMWAR:A Pipeline for Meta-Genome Wide Association
Correlates variation within the meta-genome to target species phenotype variations in meta-genome with association studies. Follows the pipeline described in Chaston, J.M. et al. (2014) <doi:10.1128/mBio.01631-14>.
Maintained by John Chaston. Last updated 7 years ago.
1.9 match 3.90 score 16 scriptsgirelaignacio
mpitbR:Calculate Alkire-Foster Multidimensional Poverty Measures
Estimate Multidimensional Poverty Indices disaggregated by population subgroups based on the Alkire and Foster method (2011) <doi:10.1016/j.jpubeco.2010.11.006>. This includes the calculation of standard errors and confidence intervals. Other partial indices such as incidence, intensity and indicator-specific measures as well as intertemporal changes analysis can also be estimated. The standard errors and confidence intervals are calculated considering the complex survey design.
Maintained by Ignacio Girela. Last updated 1 months ago.
1.5 match 4.85 score 1 scriptstmaturi
NPIstats:Nonparametric Predictive Inference
An implementation of the Nonparametric Predictive Inference approach in R. It provides tools for quantifying uncertainty via lower and upper probabilities. It includes useful functions for pairwise and multiple comparisons: comparing two groups with and without terminated tails, selecting the best group, selecting the subset of best groups, selecting the subset including the best group.
Maintained by Tahani Coolen-Maturi. Last updated 4 years ago.
3.6 match 2.00 score 4 scriptsfifis
pnd:Parallel Numerical Derivatives, Gradients, Jacobians, and Hessians of Arbitrary Accuracy Order
Numerical derivatives through finite-difference approximations can be calculated using the 'pnd' package with parallel capabilities and optimal step-size selection to improve accuracy. These functions facilitate efficient computation of derivatives, gradients, Jacobians, and Hessians, allowing for more evaluations to reduce the mathematical and machine errors. Designed for compatibility with the 'numDeriv' package, which has not received updates in several years, it introduces advanced features such as computing derivatives of arbitrary order, improving the accuracy of Hessian approximations by avoiding repeated differencing, and parallelising slow functions on Windows, Mac, and Linux.
Maintained by Andreï Victorovitch Kostyrka. Last updated 5 days ago.
1.3 match 1 stars 5.22 score 5 scriptsc-monaghan
lwc2022:Langa-Weir Classification of Cognitive Function for 2022 HRS Data
Generates the Langa-Weir classification of cognitive function for the 2022 Health and Retirement Study (HRS) cognition data. It is particularly useful for researchers studying cognitive aging who wish to work with the most recent release of HRS data. The package provides user-friendly functions for data preprocessing, scoring, and classification allowing users to easily apply the Langa-Weir classification system. For details regarding the; HRS <https://hrsdata.isr.umich.edu/> and Langa-Weir classifications <https://hrsdata.isr.umich.edu/data-products/langa-weir-classification-cognitive-function-1995-2020>.
Maintained by Cormac Monaghan. Last updated 4 months ago.
1.5 match 4.48 score 4 scriptsnicholasjcooper
NCmisc:Miscellaneous Functions for Creating Adaptive Functions and Scripts
A set of handy functions. Includes a versatile one line progress bar, one line function timer with detailed output, time delay function, text histogram, object preview, CRAN package search, simpler package installer, Linux command install check, a flexible Mode function, top function, simulation of correlated data, and more.
Maintained by Nicholas Cooper. Last updated 2 years ago.
1.7 match 3.86 score 172 scripts 5 dependentssakoehler7
eesim:Simulate and Evaluate Time Series for Environmental Epidemiology
Provides functions to create simulated time series of environmental exposures (e.g., temperature, air pollution) and health outcomes for use in power analysis and simulation studies in environmental epidemiology. This package also provides functions to evaluate the results of simulation studies based on these simulated time series. This work was supported by a grant from the National Institute of Environmental Health Sciences (R00ES022631) and a fellowship from the Colorado State University Programs for Research and Scholarly Excellence.
Maintained by Brooke Anderson. Last updated 8 years ago.
1.2 match 8 stars 5.23 score 42 scriptsselesnow
racademyocean:Client for 'AcademyOcean API'
Provide function for work with 'AcademyOcean API' <https://academyocean.com/api>.
Maintained by Alexey Seleznev. Last updated 7 months ago.
1.9 match 3.00 score 3 scriptstonyfischetti
libbib:Various Utilities for Library Science/Assessment and Cataloging
Provides functions for validating and normalizing bibliographic codes such as ISBN, ISSN, and LCCN. Also includes functions to communicate with the WorldCat API, translate Call numbers (Library of Congress and Dewey Decimal) to their subject classifications or subclassifications, and provides various loadable data files such call number / subject crosswalks and code tables.
Maintained by Tony Fischetti. Last updated 2 years ago.
1.7 match 3.20 score 32 scriptsjimlemon
prettyR:Pretty Descriptive Stats
Functions for conventionally formatting descriptive stats, reshaping data frames and formatting R output as HTML.
Maintained by Jim Lemon. Last updated 6 years ago.
1.8 match 3.13 score 207 scripts 1 dependentschristopherkenny
ppmf:Read Census Privacy Protected Microdata Files
Implements data processing described in <doi:10.1126/sciadv.abk3283> to align modern differentially private data with formatting of older US Census data releases. The primary goal is to read in Census Privacy Protected Microdata Files data in a reproducible way. This includes tools for aggregating to relevant levels of geography by creating geographic identifiers which match the US Census Bureau's numbering. Additionally, there are tools for grouping race numeric identifiers into categories, consistent with OMB (Office of Management and Budget) classifications. Functions exist for downloading and linking to existing sources of privacy protected microdata.
Maintained by Christopher T. Kenny. Last updated 2 years ago.
2.0 match 1 stars 2.70 score 3 scriptsinventionate
TimeSpaceAnalysis:Statistical tools for time-space analysis
Use Geometric Data Analysis approaches (e.g. MCA or MFA), time pattern analysis (see "time sequence clustering") and places chronologies (see "time geography") analysis.
Maintained by Fabian Mundt. Last updated 6 days ago.
1.8 match 2.48 score 2 scriptsgowerc
diffdf:Dataframe Difference Tool
Functions for comparing two data.frames against each other. The core functionality is to provide a detailed breakdown of any differences between two data.frames as well as providing utility functions to help narrow down the source of problems and differences.
Maintained by Craig Gower-Page. Last updated 6 months ago.
0.5 match 48 stars 7.55 score 135 scriptsimranshakoor
DataSetsUni:A Collection of Univariate Data Sets
A collection of widely used univariate data sets of various applied domains on applications of distribution theory. The functions allow researchers and practitioners to quickly, easily, and efficiently access and use these data sets. The data are related to different applied domains and as follows: Bio-medical, survival analysis, medicine, reliability analysis, hydrology, actuarial science, operational research, meteorology, extreme values, quality control, engineering, finance, sports and economics. The total 100 data sets are documented along with associated references for further details and uses.
Maintained by Muhammad Imran. Last updated 2 years ago.
3.6 match 1.00 score 1 scriptscran
rpnf:Point and Figure Package
A set of functions to analyze and print the development of a commodity using the Point and Figure (P&F) approach. A P&F processor can be used to calculate daily statistics for the time series. These statistics can be used for deeper investigations as well as to create plots. Plots can be generated as well known X/O Plots in plain text format, and additionally in a more graphical format.
Maintained by Sascha Herrmann. Last updated 9 years ago.
1.6 match 3 stars 1.95 score 30 scriptspbiecek
ceterisParibus:Ceteris Paribus Profiles
Ceteris Paribus Profiles (What-If Plots) are designed to present model responses around selected points in a feature space. For example around a single prediction for an interesting observation. Plots are designed to work in a model-agnostic fashion, they are working for any predictive Machine Learning model and allow for model comparisons. Ceteris Paribus Plots supplement the Break Down Plots from 'breakDown' package.
Maintained by Przemyslaw Biecek. Last updated 5 years ago.
0.5 match 42 stars 5.48 score 36 scriptsmacmillancontentscience
morphemepiece:Morpheme Tokenization
Tokenize text into morphemes. The morphemepiece algorithm uses a lookup table to determine the morpheme breakdown of words, and falls back on a modified wordpiece tokenization algorithm for words not found in the lookup table.
Maintained by Jonathan Bratt. Last updated 3 years ago.
0.5 match 11 stars 5.04 score 8 scriptsmuschellij2
glassdoor:Interface to 'Glassdoor' API
Interacts with the 'Glassdoor' API <https://www.glassdoor.com/developer/index.htm>. Allows the user to search job statistics, employer statistics, and job progression, where 'Glassdoor' provides a breakdown of other jobs a person did after their current one.
Maintained by John Muschelli. Last updated 6 years ago.
0.5 match 6 stars 4.59 score 13 scriptsluqqe
mblm:Median-Based Linear Models
Provides linear models based on Theil-Sen single median and Siegel repeated medians. They are very robust (29 or 50 percent breakdown point, respectively), and if no outliers are present, the estimators are very similar to OLS.
Maintained by Lukasz Komsta. Last updated 6 years ago.
0.5 match 4.38 score 150 scripts 5 dependentsgcohenfr
riv:Robust Instrumental Variables Estimator
Finds a robust instrumental variables estimator using a high breakdown point S-estimator of multivariate location and scatter matrix.
Maintained by Gabriela Cohen-Freue. Last updated 7 years ago.
0.5 match 1.00 score 7 scripts