Showing 77 of 77 results
data-cleaning
errorlocate:Locate Errors with Validation Rules
Errors in data can be located and removed using validation rules from package 'validate'. See also Van der Loo and De Jonge (2018) <doi:10.1002/9781118897126>, chapter 7.
Maintained by Edwin de Jonge. Last updated 9 months ago.
data-cleaning errors invalidation
10.0 match 22 stars 6.11 score 59 scripts
rstudio
shiny:Web Application Framework for R
Makes it incredibly easy to build interactive web applications with R. Automatic "reactive" binding between inputs and outputs and extensive prebuilt widgets make it possible to build beautiful, responsive, and powerful applications with minimal effort.
Maintained by Winston Chang. Last updated 13 days ago.
reactive rstudio shiny web-app web-development
2.3 match 5.4k stars 21.28 score 108k scripts 1.8k dependents
ropensci
drake:A Pipeline Toolkit for Reproducible Computation at Scale
A general-purpose computational engine for data analysis, drake rebuilds intermediate data objects when their dependencies change, and it skips work when the results are already up to date. Not every execution starts from scratch; there is native support for parallel and distributed computing, and completed projects have tangible evidence that they are reproducible. Extensive documentation, from beginner-friendly tutorials to practical examples and more, is available at the reference website <https://docs.ropensci.org/drake/> and the online manual <https://books.ropensci.org/drake/>.
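The rebuild-or-skip behaviour described in this entry can be illustrated with a minimal sketch. It is written in Python rather than R and is not drake's API: the `Pipeline` class, its methods, and the hashing scheme are hypothetical, and the sketch only tracks input values and target redefinitions, not changes to function code.

```python
import hashlib
import json


def fingerprint(value):
    """Return a stable hash of a JSON-serializable value (an assumption of this sketch)."""
    return hashlib.sha256(json.dumps(value, sort_keys=True).encode()).hexdigest()


class Pipeline:
    """Hypothetical toy pipeline: targets are rebuilt only when their inputs change."""

    def __init__(self):
        self.targets = {}  # name -> (function, dependency names)
        self.cache = {}    # name -> (fingerprint of inputs, cached value)
        self.built = []    # names that were actually (re)built, in order

    def add(self, name, func, deps=()):
        # Redefining a target drops its cached value so it is rebuilt next time.
        self.targets[name] = (func, tuple(deps))
        self.cache.pop(name, None)

    def make(self, name):
        func, deps = self.targets[name]
        inputs = [self.make(dep) for dep in deps]  # bring dependencies up to date first
        key = fingerprint(inputs)
        cached = self.cache.get(name)
        if cached is not None and cached[0] == key:
            return cached[1]                       # up to date: skip the work
        value = func(*inputs)
        self.cache[name] = (key, value)
        self.built.append(name)
        return value
```

Calling `make()` twice in a row leaves `built` unchanged the second time; redefining an upstream target with `add()` causes it, and any downstream target whose inputs actually changed, to be rebuilt.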
Maintained by William Michael Landau. Last updated 3 months ago.
data-science drake high-performance-computing makefile peer-reviewed pipeline reproducibility reproducible-research ropensci workflow
3.6 match 1.3k stars 11.49 score 1.7k scripts 1 dependents
r-spatial
sf:Simple Features for R
Support for simple feature access, a standardized way to encode and analyze spatial vector data. Binds to 'GDAL' <doi: 10.5281/zenodo.5884351> for reading and writing data, to 'GEOS' <doi: 10.5281/zenodo.11396894> for geometrical operations, and to 'PROJ' <doi: 10.5281/zenodo.5884394> for projection conversions and datum transformations. Uses by default the 's2' package for geometry operations on geodetic (long/lat degree) coordinates.
Maintained by Edzer Pebesma. Last updated 16 days ago.
1.8 match 1.4k stars 22.42 score 117k scripts 1.2k dependents
dipterix
dipsaus:A Dipping Sauce for Data Analysis and Visualizations
Works as an "add-on" to packages such as 'shiny', 'future', and 'rlang', and provides utility functions. Just like dipping sauce adding flavors to potato chips or pita bread, 'dipsaus' for data analysis and visualizations adds handy functions and enhancements to popular packages. The goal is to provide simple solutions that are frequently asked for online, such as how to synchronize 'shiny' inputs without freezing the app, or how to get memory size on a 'Linux' or 'MacOS' system. The enhancements roughly fall into these four categories: 1. 'shiny' input widgets; 2. high-performance computing using the 'future' package; 3. modifying R calls and converting among numbers, strings, and other objects; 4. utility functions to get system information such as CPU chip-set, memory limit, etc.
Maintained by Zhengjia Wang. Last updated 5 days ago.
4.5 match 13 stars 7.90 score 85 scripts 3 dependents
r-lib
fs:Cross-Platform File System Operations Based on 'libuv'
A cross-platform interface to file system operations, built on top of the 'libuv' C library.
Maintained by Gábor Csárdi. Last updated 4 months ago.
1.7 match 370 stars 20.26 score 8.1k scripts 5.2k dependents
r-lib
clock:Date-Time Types and Tools
Provides a comprehensive library for date-time manipulations using a new family of orthogonal date-time classes (durations, time points, zoned-times, and calendars) that partition responsibilities so that the complexities of time zones are only considered when they are really needed. Capabilities include: date-time parsing, formatting, arithmetic, extraction and updating of components, and rounding.
Maintained by Davis Vaughan. Last updated 2 days ago.
2.0 match 106 stars 14.48 score 296 scripts 407 dependents
nealrichardson
httpcache:Query Cache for HTTP Clients
In order to improve performance for HTTP API clients, 'httpcache' provides simple tools for caching and invalidating cache. It includes the HTTP verb functions GET, PUT, PATCH, POST, and DELETE, which are drop-in replacements for those in the 'httr' package. These functions are cache-aware and provide default settings for cache invalidation suitable for RESTful APIs; the package also enables custom cache-management strategies. Finally, 'httpcache' includes a basic logging framework to facilitate the measurement of HTTP request time and cache performance.
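The cache-aware verbs described in this entry can be sketched as follows. This is a Python illustration of the general idea, not the package's R interface: `CachingClient` and the `send` callable are hypothetical, and the invalidation rule used here (a write drops the cached copy of the same URL) is a simplification that does not reproduce the package's per-verb defaults.

```python
class CachingClient:
    """Hypothetical wrapper: GET responses are cached, write requests invalidate them."""

    WRITE_VERBS = {"PUT", "PATCH", "POST", "DELETE"}

    def __init__(self, send):
        self.send = send   # assumed transport: send(verb, url) -> response
        self.cache = {}    # url -> cached GET response
        self.hits = 0      # simple counters standing in for cache logging
        self.misses = 0

    def request(self, verb, url):
        verb = verb.upper()
        if verb == "GET":
            if url in self.cache:
                self.hits += 1
                return self.cache[url]
            self.misses += 1
            response = self.send(verb, url)
            self.cache[url] = response
            return response
        if verb in self.WRITE_VERBS:
            # The resource may have changed, so its cached copy is dropped.
            self.cache.pop(url, None)
        return self.send(verb, url)
```

A second `GET` for the same URL is served from the cache; a `PUT` to that URL forces the next `GET` to go back to the server.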
Maintained by Neal Richardson. Last updated 2 years ago.
4.5 match 13 stars 6.35 score 12 scripts 8 dependents
moore-institute-4-plastic-pollution-res
One4All:Validate, Share, and Download Data
Designed to enhance data validation and management processes by employing a set of functions that read a set of rules from a 'CSV' or 'Excel' file and apply them to a dataset. Funded by the National Renewable Energy Laboratory and Possibility Lab, maintained by the Moore Institute for Plastic Pollution Research.
Maintained by Hannah Sherrod. Last updated 8 months ago.
4.0 match 3 stars 6.33 score 15 scripts
ropensci
CoordinateCleaner:Automated Cleaning of Occurrence Records from Biological Collections
Automated flagging of common spatial and temporal errors in biological and paleontological collection data, for use in conservation, ecology and paleontology. Includes automated tests to easily flag (and exclude) records assigned to country or province centroids, the open ocean, the headquarters of the Global Biodiversity Information Facility, urban areas or the location of biodiversity institutions (museums, zoos, botanical gardens, universities). It also identifies per-species outlier coordinates, zero coordinates, identical latitude/longitude and invalid coordinates, and implements an algorithm to identify data sets with a significant proportion of rounded coordinates. Especially suited for large data sets. The reference for the methodology is: Zizka et al. (2019) <doi:10.1111/2041-210X.13152>.
Maintained by Alexander Zizka. Last updated 1 year ago.
2.3 match 82 stars 10.93 score 306 scripts 3 dependents
r-spatial
lwgeom:Bindings to Selected 'liblwgeom' Functions for Simple Features
Access to selected functions found in 'liblwgeom' <https://github.com/postgis/postgis/tree/master/liblwgeom>, the light-weight geometry library used by 'PostGIS' <http://postgis.net/>.
Maintained by Edzer Pebesma. Last updated 1 month ago.
1.9 match 61 stars 12.95 score 1.7k scripts 66 dependents
r-lib
styler:Non-Invasive Pretty Printing of R Code
Pretty-prints R code without changing the user's formatting intent.
Maintained by Lorenz Walthert. Last updated 1 month ago.
1.5 match 754 stars 16.15 score 940 scripts 62 dependents
rstudio
config:Manage Environment Specific Configuration Values
Manage configuration values across multiple environments (e.g. development, test, production). Read values using a function that determines the current environment and returns the appropriate value.
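The lookup described in this entry, where one function determines the active environment and returns the matching value, can be sketched in a few lines. This is a Python illustration, not the package's R interface: the `SETTINGS` table, `get_value()`, and the `APP_CONFIG_ACTIVE` variable name are hypothetical, and the fallback to a `default` section is an assumption of this sketch.

```python
import os

# Hypothetical settings: every environment inherits from "default".
SETTINGS = {
    "default": {"db_host": "localhost", "debug": True},
    "production": {"db_host": "db.internal.example", "debug": False},
    "test": {"db_host": "localhost"},
}


def get_value(key, settings=SETTINGS, env_var="APP_CONFIG_ACTIVE"):
    """Return the value of `key` for the environment named in `env_var`."""
    active = os.environ.get(env_var, "default")
    # Values from the active environment override the defaults.
    merged = {**settings["default"], **settings.get(active, {})}
    return merged[key]
```

With `APP_CONFIG_ACTIVE` unset, `get_value("db_host")` returns the default; with it set to `production`, the production value is returned, and keys the active environment does not define fall back to the default.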
Maintained by Andrie de Vries. Last updated 2 months ago.
1.5 match 261 stars 15.69 score 1.8k scripts 202 dependents
bioc
iSEE:Interactive SummarizedExperiment Explorer
Create an interactive Shiny-based graphical user interface for exploring data stored in SummarizedExperiment objects, including row- and column-level metadata. The interface supports transmission of selections between plots and tables, code tracking, interactive tours, interactive or programmatic initialization, preservation of app state, and extensibility to new panel types via S4 classes. Special attention is given to single-cell data in a SingleCellExperiment object with visualization of dimensionality reduction results.
Maintained by Kevin Rue-Albrecht. Last updated 10 days ago.
cellbasedassays clustering dimensionreduction featureextraction geneexpression gui immunooncology shinyapps singlecell transcription transcriptomics visualization dimension-reduction feature-extraction gene-expression hacktoberfest human-cell-atlas shiny single-cell
1.8 match 225 stars 12.86 score 380 scripts 9 dependents
irworkshop
campfin:Wrangle Campaign Finance Data
Explore and normalize American campaign finance data. Created by the Investigative Reporting Workshop to facilitate work on The Accountability Project, an effort to collect public data into a central, standard database that is more easily searched: <https://publicaccountability.org/>.
Maintained by Kiernan Nicholls. Last updated 1 year ago.
campaign-finance data-journalism
4.0 match 17 stars 5.66 score 54 scripts
bioc
AnnotationHub:Client to access AnnotationHub resources
This package provides a client for the Bioconductor AnnotationHub web resource. The AnnotationHub web resource provides a central location where genomic files (e.g., VCF, bed, wig) and other resources from standard locations (e.g., UCSC, Ensembl) can be discovered. The resource includes metadata about each resource, e.g., a textual description, tags, and date of modification. The client creates and manages a local cache of files retrieved by the user, helping with quick and reproducible access.
Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.
infrastructure dataimport gui thirdpartyclient core-package u24ca289073
1.5 match 17 stars 13.89 score 2.7k scripts 102 dependents
r-spatial
spdep:Spatial Dependence: Weighting Schemes, Statistics
A collection of functions to create spatial weights matrix objects from polygon 'contiguities', from point patterns by distance and tessellations, for summarizing these objects, and for permitting their use in spatial data analysis, including regional aggregation by minimum spanning tree; a collection of tests for spatial 'autocorrelation', including global 'Morans I' and 'Gearys C' proposed by 'Cliff' and 'Ord' (1973, ISBN: 0850860369) and (1981, ISBN: 0850860814), 'Hubert/Mantel' general cross product statistic, Empirical Bayes estimates and 'Assunção/Reis' (1999) <doi:10.1002/(SICI)1097-0258(19990830)18:16%3C2147::AID-SIM179%3E3.0.CO;2-I> Index, 'Getis/Ord' G ('Getis' and 'Ord' 1992) <doi:10.1111/j.1538-4632.1992.tb00261.x> and multicoloured join count statistics, 'APLE' ('Li et al.') <doi:10.1111/j.1538-4632.2007.00708.x>, local 'Moran's I', 'Gearys C' ('Anselin' 1995) <doi:10.1111/j.1538-4632.1995.tb00338.x> and 'Getis/Ord' G ('Ord' and 'Getis' 1995) <doi:10.1111/j.1538-4632.1995.tb00912.x>, 'saddlepoint' approximations ('Tiefelsdorf' 2002) <doi:10.1111/j.1538-4632.2002.tb01084.x> and exact tests for global and local 'Moran's I' ('Bivand et al.' 2009) <doi:10.1016/j.csda.2008.07.021> and 'LOSH' local indicators of spatial heteroscedasticity ('Ord' and 'Getis') <doi:10.1007/s00168-011-0492-y>. The implementation of most of these measures is described in 'Bivand' and 'Wong' (2018) <doi:10.1007/s11749-018-0599-x>, with further extensions in 'Bivand' (2022) <doi:10.1111/gean.12319>. 'Lagrange' multiplier tests for spatial dependence in linear models are provided ('Anselin et al.' 1996) <doi:10.1016/0166-0462(95)02111-6>, as are 'Rao' score tests for hypothesised spatial 'Durbin' models based on linear models ('Koley' and 'Bera' 2023) <doi:10.1080/17421772.2023.2256810>. A local indicators for categorical data (LICD) implementation based on 'Carrer et al.' (2021) <doi:10.1016/j.jas.2020.105306> and 'Bivand et al.' (2017) <doi:10.1016/j.spasta.2017.03.003> was added in 1.3-7. From 'spdep' and 'spatialreg' versions >= 1.2-1, the model fitting functions previously present in this package are defunct in 'spdep' and may be found in 'spatialreg'.
Maintained by Roger Bivand. Last updated 18 days ago.
spatial-autocorrelation spatial-dependence spatial-weights
1.3 match 131 stars 16.62 score 6.0k scripts 107 dependents
sigbertklinke
smvgraph:Visualization and Clustering of Data in a Shiny App
Various visualisations of univariate and multivariate graphs (e.g. mosaic diagram, scatterplot matrix, Andrews curves, parallel coordinate diagram, radar diagram and Chernoff plots) as well as clustering methods (e.g. k-means, agglomerative, EM clustering and DBSCAN) are implemented as a Shiny app. The app allows interactive changes, e.g. of the order of variables. It is intended for use in teaching.
Maintained by Sigbert Klinke. Last updated 2 years ago.
5.6 match 3.70 score
voisinneg
queryup:Query the 'UniProtKB' REST API
Retrieve protein information from the 'UniProtKB' REST API (see <https://www.uniprot.org/help/api_queries>).
Maintained by Guillaume Voisinne. Last updated 2 years ago.
4.8 match 4 stars 4.30 score 7 scripts
markedmondson1234
googleAuthR:Authenticate and Create Google APIs
Create R functions that interact with OAuth2 Google APIs <https://developers.google.com/apis-explorer/> easily, with auto-refresh and Shiny compatibility.
Maintained by Erik Grönroos. Last updated 10 months ago.
api authentication google googleauthr oauth2-flow shiny
1.5 match 178 stars 12.84 score 804 scripts 13 dependents
vubiostat
redcapAPI:Interface to 'REDCap'
Access data stored in 'REDCap' databases using the Application Programming Interface (API). 'REDCap' (Research Electronic Data CAPture; <https://projectredcap.org>, Harris, et al. (2009) <doi:10.1016/j.jbi.2008.08.010>, Harris, et al. (2019) <doi:10.1016/j.jbi.2019.103208>) is a web application for building and managing online surveys and databases developed at Vanderbilt University. The API allows users to access data and project meta data (such as the data dictionary) from the web programmatically. The 'redcapAPI' package facilitates the process of accessing data with options to prepare an analysis-ready data set consistent with the definitions in a database's data dictionary.
Maintained by Shawn Garbett. Last updated 9 days ago.
1.8 match 22 stars 10.47 score 134 scripts 2 dependents
r-lum
Luminescence:Comprehensive Luminescence Dating Data Analysis
A collection of various R functions for the purpose of Luminescence dating data analysis. This includes, amongst others, data import, export, application of age models, curve deconvolution, sequence analysis and plotting of equivalent dose distributions.
Maintained by Sebastian Kreutzer. Last updated 16 hours ago.
bayesian-statistics data-science geochronology luminescence luminescence-dating open-science osl plotting radiofluorescence tl xsyg cpp
1.6 match 15 stars 10.77 score 178 scripts 8 dependents
rwetherall
alfr:Connectivity to 'Alfresco' Content Management Repositories
Allows you to connect to an 'Alfresco' content management repository and interact with its contents using simple and intuitive functions. You will be able to establish a connection session to the 'Alfresco' repository, read and upload content and manage folder hierarchies. For more details on the 'Alfresco' content management repository see <https://www.alfresco.com/ecm-software/document-management>.
Maintained by Roy Wetherall. Last updated 6 years ago.
4.5 match 3.70 score 3 scripts
ipums
ipumsr:An R Interface for Downloading, Reading, and Handling IPUMS Data
An easy way to work with census, survey, and geographic data provided by IPUMS in R. Generate and download data through the IPUMS API and load IPUMS files into R with their associated metadata to make analysis easier. IPUMS data describing 1.4 billion individuals drawn from over 750 censuses and surveys is available free of charge from the IPUMS website <https://www.ipums.org>.
Maintained by Derek Burk. Last updated 18 days ago.
1.5 match 28 stars 11.07 score 720 scripts 2 dependents
zijguo
controlfunctionIV:Control Function Methods with Possibly Invalid Instrumental Variables
Inference with control function methods for nonlinear outcome models when the model is known ('Guo and Small' (2016) <arXiv:1602.01051>) and when unknown but semiparametric ('Li and Guo' (2021) <arXiv:2010.09922>).
Maintained by Zijian Guo. Last updated 3 years ago.
4.5 match 4 stars 3.30 score 1 scripts
ddisab01
quest:Prepare Questionnaire Data for Analysis
Offers a suite of functions to prepare questionnaire data for analysis (perhaps other types of data as well). By data preparation, I mean data analytic tasks to get your raw data ready for statistical modeling (e.g., regression). There are functions to investigate missing data, reshape data, validate responses, recode variables, score questionnaires, center variables, aggregate by groups, shift scores (i.e., leads or lags), etc. It provides functions for both single level and multilevel (i.e., grouped) data. With a few exceptions (e.g., ncases()), functions without an "s" at the end of their primary word (e.g., center_by()) act on atomic vectors, while functions with an "s" at the end of their primary word (e.g., centers_by()) act on multiple columns of a data.frame.
Maintained by David Disabato. Last updated 1 year ago.
7.5 match 1.98 score 12 scripts
ipeagit
gtfs2gps:Converting Transport Data from GTFS Format to GPS-Like Records
Convert general transit feed specification (GTFS) data to global positioning system (GPS) records in 'data.table' format. It also has some functions to subset GTFS data in time and space and to convert both representations to simple feature format.
Maintained by Pedro R. Andrade. Last updated 2 months ago.
gps-format gtfs gtfs-format public-transport rspatial transport cpp
1.8 match 73 stars 8.22 score 101 scripts 1 dependents
bioc
gDRutils:A package with helper functions for processing drug response data
This package contains utility functions used throughout the gDR platform to fit data, manipulate data, and convert and validate data structures. It also provides the default constants required by the gDR platform. Many of the functions are used by the gDRcore package.
Maintained by Arkadiusz Gladki. Last updated 4 days ago.
1.8 match 2 stars 7.40 score 3 scripts 3 dependents
cran
tidytidbits:A Collection of Tools and Helpers Extending the Tidyverse
A selection of various tools to extend a data analysis workflow based on the 'tidyverse' packages. This includes high-level data frame editing methods (in the style of 'mutate'/'mutate_at'), some methods in the style of 'purrr' and 'forcats', 'lookup' methods for dict-like lists, a generic method for lumping a data frame by a given count, various low-level methods for special treatment of 'NA' values, 'python'-style tuple-assignment and 'truthy'/'falsy' checks, saving to PDF and PNG from a pipe and various small utilities.
Maintained by Marcel Wiesweg. Last updated 3 years ago.
5.3 match 2.48 score 2 dependents
hyunseungkang
sisVIVE:Some Invalid Some Valid Instrumental Variables Estimator
Selects invalid instruments from a set of candidate instruments that may be invalid. The algorithm selects potentially invalid instruments and provides an estimate of the causal effect between exposure and outcome.
Maintained by Hyunseung Kang. Last updated 8 years ago.
7.3 match 1 stars 1.78 score 15 scripts
mazamascience
AirMonitor:Air Quality Data Analysis
Utilities for working with hourly air quality monitoring data with a focus on small particulates (PM2.5). A compact data model is structured as a list with two dataframes. A 'meta' dataframe contains spatial and measuring device metadata associated with deployments at known locations. A 'data' dataframe contains a 'datetime' column followed by columns of measurements associated with each "device-deployment". Algorithms to calculate NowCast and the associated Air Quality Index (AQI) are defined at the US Environmental Protection Agency AirNow program: <https://document.airnow.gov/technical-assistance-document-for-the-reporting-of-daily-air-quailty.pdf>.
Maintained by Jonathan Callahan. Last updated 6 months ago.
1.9 match 7 stars 6.57 score 178 scripts
pharmaverse
sdtmchecks:Data Quality Checks for Study Data Tabulation Model (SDTM) Datasets
A series of checks to identify common issues in Study Data Tabulation Model (SDTM) datasets. These checks are intended to be generalizable, actionable, and meaningful for analysis.
Maintained by Will Harris. Last updated 3 months ago.
1.6 match 21 stars 7.66 score 15 scripts
mandymejia
BayesfMRI:Spatial Bayesian Methods for Task Functional MRI Studies
Performs a spatial Bayesian general linear model (GLM) for task functional magnetic resonance imaging (fMRI) data on the cortical surface. Additional models include group analysis and inference to detect thresholded areas of activation. Includes direct support for the 'CIFTI' neuroimaging file format. For more information see A. F. Mejia, Y. R. Yue, D. Bolin, F. Lindgren, M. A. Lindquist (2020) <doi:10.1080/01621459.2019.1611582> and D. Spencer, Y. R. Yue, D. Bolin, S. Ryan, A. F. Mejia (2022) <doi:10.1016/j.neuroimage.2022.118908>.
Maintained by Amanda Mejia. Last updated 8 days ago.
2.0 match 26 stars 5.77 score 19 scripts
dlcarl
TSCI:Tools for Causal Inference with Possibly Invalid Instrumental Variables
Two stage curvature identification with machine learning for causal inference in settings when instrumental variable regression is not suitable because of potentially invalid instrumental variables. Based on Guo and Buehlmann (2022) "Two Stage Curvature Identification with Machine Learning: Causal Inference with Possibly Invalid Instrumental Variables" <arXiv:2203.12808>. The vignette is available in Carl, Emmenegger, Bühlmann and Guo (2023) "TSCI: two stage curvature identification for causal inference with invalid instruments" <arXiv:2304.00513>.
Maintained by David Carl. Last updated 1 year ago.
3.8 match 1 stars 3.00 score 3 scripts
dcgerard
seqgendiff:RNA-Seq Generation/Modification for Simulation
Generates/modifies RNA-seq data for use in simulations. We provide a suite of functions that will add a known amount of signal to a real RNA-seq dataset. The advantage of using this approach over simulating under a theoretical distribution is that common/annoying aspects of the data are better preserved, giving a more realistic evaluation of your method. The main functions are select_counts(), thin_diff(), thin_lib(), thin_gene(), thin_2group(), thin_all(), and effective_cor(). See Gerard (2020) <doi:10.1186/s12859-020-3450-9> for details on the implemented methods.
Maintained by David Gerard. Last updated 10 months ago.
1.9 match 10 stars 5.86 score 72 scripts
cthombor
SafeVote:Election Vote Counting with Safety Features
Fork of 'vote_2.3-2', Raftery et al. (2021) <DOI:10.32614/RJ-2021-086>, with additional support for stochastic experimentation.
Maintained by Clark Thomborson. Last updated 5 months ago.
4.0 match 2.70 score 5 scripts
trevorld
datetimeoffset:Datetimes with Optional UTC Offsets and/or Heterogeneous Time Zones
Supports import/export for a number of datetime string standards and R datetime classes often including lossless re-export of any original reduced precision including 'ISO 8601' <https://en.wikipedia.org/wiki/ISO_8601> and 'pdfmark' <https://opensource.adobe.com/dc-acrobat-sdk-docs/library/pdfmark/> datetime strings. Supports local/global datetimes with optional UTC offsets and/or (possibly heterogeneous) time zones with up to nanosecond precision.
Maintained by Trevor L. Davis. Last updated 2 months ago.
2.0 match 6 stars 5.26 score 1 scripts 2 dependents
snystrom
cmdfun:Framework for Building Interfaces to Shell Commands
Writing interfaces to command line software is cumbersome. 'cmdfun' provides a framework for building function calls to seamlessly interface with shell commands by allowing lazy evaluation of command line arguments. 'cmdfun' also provides methods for handling user-specific paths to tool installs or secrets like API keys. Its focus is to equally serve package builders who wish to wrap command line software, and to help analysts stay inside R when they might usually leave to execute non-R software.
Maintained by Spencer Nystrom. Last updated 4 years ago.
1.7 match 15 stars 6.05 score 7 scripts 5 dependents
hanase
vote:Election Vote Counting
Counting election votes and determining election results by different methods, including the single transferable vote or ranked choice, approval, score, plurality, Condorcet and two-round runoff methods (Raftery et al., 2021 <doi:10.32614/RJ-2021-086>).
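Two of the simpler counting methods named in this entry, plurality and approval, can be sketched as follows. This is a Python illustration only, not the package's R interface: the function names and ballot formats are hypothetical, and tie-breaking is not handled.

```python
from collections import Counter


def plurality(ballots):
    """Each ballot names exactly one candidate; return (candidate, votes) pairs, highest first."""
    return Counter(ballots).most_common()


def approval(ballots):
    """Each ballot is a collection of approved candidates; return (candidate, approvals) pairs."""
    tally = Counter()
    for approved in ballots:
        tally.update(set(approved))  # a candidate counts at most once per ballot
    return tally.most_common()
```

For example, `plurality(["A", "B", "A", "C"])` puts `("A", 2)` first, and `approval([{"A", "B"}, {"B"}, {"B", "C"}])` puts `("B", 3)` first.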
Maintained by Hana Sevcikova. Last updated 13 days ago.
2.3 match 4 stars 4.38 score 30 scripts
bluegreen-labs
appeears:Interface to 'AppEEARS' NASA Web Services
Programmatic interface to the NASA Application for Extracting and Exploring Analysis Ready Samples services (AppEEARS; <https://appeears.earthdatacloud.nasa.gov/>). The package provides easy access to analysis ready earth observation data in R.
Maintained by Koen Hufkens. Last updated 19 days ago.
1.9 match 11 stars 5.22 score 15 scripts
davisvaughan
almanac:Tools for Working with Recurrence Rules
Provides tools for defining recurrence rules and recurrence sets. Recurrence rules are a programmatic way to define a recurring event, like the first Monday of December. Multiple recurrence rules can be combined into larger recurrence sets. A full holiday and calendar interface is also provided that can generate holidays within a particular year, can detect if a date is a holiday, can respect holiday observance rules, and allows for custom holidays.
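The example rule mentioned in this entry, "the first Monday of December", can be worked through with the Python standard library. This is a sketch of the idea of a recurrence rule, not the package's R interface; `nth_weekday()` and `first_mondays_of_december()` are hypothetical helper names.

```python
from datetime import date, timedelta


def nth_weekday(year, month, weekday, n=1):
    """Return the n-th given weekday (Monday=0 ... Sunday=6) of a month."""
    first = date(year, month, 1)
    offset = (weekday - first.weekday()) % 7  # days until the first such weekday
    result = first + timedelta(days=offset + 7 * (n - 1))
    if result.month != month:
        raise ValueError("the month has no such occurrence")
    return result


def first_mondays_of_december(start_year, end_year):
    """Expand the rule 'first Monday of December' over a range of years (inclusive)."""
    return [nth_weekday(year, 12, 0) for year in range(start_year, end_year + 1)]
```

For 2023 to 2025 the rule yields 4 December 2023, 2 December 2024 and 1 December 2025.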
Maintained by Davis Vaughan. Last updated 2 years ago.
calendars holidays recurrence-rules
1.1 match 73 stars 8.40 score 65 scripts 1 dependents
mandymejia
hrf:Hemodynamic Response Function
Computes the hemodynamic response function (HRF) for task functional magnetic resonance imaging (fMRI) data. Also includes functions for constructing a design matrix from task fMRI event timings, and for comparing multiple design matrices in a general linear model (GLM). A wrapper function is provided for GLM analysis of CIFTI-format data. Lastly, there are supporting functions which provide visual summaries of the HRFs and design matrices.
Maintained by Amanda Mejia. Last updated 1 month ago.
2.0 match 1 stars 4.18 score
ropensci
dwctaxon:Edit and Validate Darwin Core Taxon Data
Edit and validate taxonomic data in compliance with Darwin Core standards (Darwin Core 'Taxon' class <https://dwc.tdwg.org/terms/#taxon>).
Maintained by Joel H. Nitta. Last updated 8 months ago.
1.3 match 6 stars 6.13 score 28 scripts
zijguo
RobustIV:Robust Instrumental Variable Methods in Linear Models
Inference for the treatment effect with possibly invalid instrumental variables via TSHT ('Guo et al.' (2016) <arXiv:1603.05224>) and SearchingSampling ('Guo' (2021) <arXiv:2104.06911>), which are effective for both low- and high-dimensional covariates and instrumental variables; test of endogeneity in high dimensions ('Guo et al.' (2016) <arXiv:1609.06713>).
Maintained by Zijian Guo. Last updated 3 years ago.
2.2 match 3 stars 3.65 score 3 scripts
s3alfisc
fwildclusterboot:Fast Wild Cluster Bootstrap Inference for Linear Models
Implementation of fast algorithms for wild cluster bootstrap inference developed in 'Roodman et al' (2019, 'STATA' Journal, <doi:10.1177/1536867X19830877>) and 'MacKinnon et al' (2022), which makes it feasible to quickly calculate bootstrap test statistics based on a large number of bootstrap draws even for large samples. Multiple bootstrap types as described in 'MacKinnon, Nielsen & Webb' (2022) are supported. Further, 'multiway' clustering, regression weights, bootstrap weights, fixed effects and 'subcluster' bootstrapping are supported. Further, both restricted ('WCR') and unrestricted ('WCU') bootstrap are supported. Methods are provided for a variety of fitted models, including 'lm()', 'feols()' (from package 'fixest') and 'felm()' (from package 'lfe'). Additionally implements a 'heteroskedasticity-robust' ('HC1') wild bootstrap. Last, the package provides an R binding to 'WildBootTests.jl', which provides additional speed gains and functionality, including the 'WRE' bootstrap for instrumental variable models (based on models of type 'ivreg()' from package 'ivreg') and hypotheses with q > 1.
Maintained by Alexander Fischer. Last updated 2 years ago.
clustered-standard-errors linear-regression-models wild-bootstrap wild-cluster-bootstrap openblas cpp openmp
1.2 match 24 stars 6.67 score 109 scripts 2 dependents
liao961120
linguisticsdown:Easy Linguistics Document Writing with R Markdown
Provides 'Shiny gadgets' to search, type, and insert IPA symbols into documents or scripts, requiring only knowledge about phonetics or 'X-SAMPA'. Also provides functions to facilitate the rendering of IPA symbols in 'LaTeX' and PDF format, making IPA symbols properly rendered in all output formats. A minimal R Markdown template for authoring Linguistics related documents is also bundled with the package. Some helper functions to facilitate authoring with R Markdown are also provided.
Maintained by Yongfu Liao. Last updated 6 years ago.
linguistics rmarkdown rmarkdown-template
1.7 match 26 stars 4.59 score 30 scripts
wa-department-of-agriculture
soils:Visualize and Report Soil Health Data
Collection of soil health data visualization and reporting tools, including an RStudio project template with everything you need to generate custom HTML and Microsoft Word reports for each participant in your soil health sampling project.
Maintained by Jadey N Ryan. Last updated 1 month ago.
1.3 match 11 stars 5.74 score 9 scripts
myominnoo
mStats:Medical Statistics & Epidemiological Analysis
A set of tidyverse-friendly functions for data management, calculation of epidemiological measures, statistical analysis, and table creation.
Maintained by Myo Minn Oo. Last updated 1 year ago.
data-management epidemiological-calculations medical-statistics
1.5 match 4.98 score 16 scripts 1 dependents
joeroe
era:Year-Based Time Scales
Provides a consistent representation of year-based time scales as a numeric vector with an associated 'era'. There are built-in era definitions for many year numbering systems used in contemporary and historic calendars (e.g. Common Era, Islamic 'Hijri' years); year-based time scales used in archaeology, astronomy, geology, and other palaeosciences (e.g. Before Present, SI-prefixed 'annus'); and support for arbitrary user-defined eras. Years can be converted from any one era to another using a generalised transformation function. Methods are also provided for robust casting and coercion between years and other numeric types, type-stable arithmetic with years, and pretty-printing in tables.
Maintained by Joe Roe. Last updated 4 months ago.
archaeology geology paleoclimate paleontology vctrs
1.5 match 15 stars 4.88 score 4 scripts
dfsp-spirit
fsbrain:Managing and Visualizing Brain Surface Data
Provides high-level access to neuroimaging data from standard software packages like 'FreeSurfer' <http://freesurfer.net/> on the level of subjects and groups. Load morphometry data, surfaces and brain parcellations based on atlases. Mask data using labels, load data for specific atlas regions only, and visualize data and statistical results directly in 'R'.
Maintained by Tim Schäfer. Last updated 4 months ago.
3d brain dti freesurfer mesh mri neuroimaging research surface visualization voxel
1.1 match 66 stars 6.47 score 15 scripts
pedropark99
figma:Web Client/Wrapper to the 'Figma API'
An easy-to-use web client/wrapper for the 'Figma API' <https://www.figma.com/developers/api>. It allows you to bring all data from a 'Figma' file to your 'R' session. This includes the data of all objects that you have drawn in this file, and their respective canvas/page metadata.
Maintained by Pedro Faria. Last updated 2 years ago.
1.3 match 4 stars 5.30 score 33 scripts
bioc
mosbi:Molecular Signature identification using Biclustering
This package is an implementation of the biclustering ensemble method MoSBi (Molecular signature Identification from Biclustering). MoSBi provides standardized interfaces for biclustering results and can combine their results with a multi-algorithm ensemble approach to compute robust ensemble biclusters on molecular omics data. This is done by computing similarity networks of biclusters and filtering for overlaps using a custom error model. After that, the Louvain modularity is used to extract bicluster communities from the similarity network, which can then be converted to ensemble biclusters. Additionally, MoSBi includes several network visualization methods to give an intuitive and scalable overview of the results. MoSBi comes with several biclustering algorithms, but can be easily extended to new biclustering algorithms.
Maintained by Tim Daniel Rose. Last updated 5 months ago.
software statisticalmethod clustering network cpp
1.6 match 4.30 score 8 scripts
dgkf
parttime:Partial Datetime Handling
Datetimes and timestamps are invariably an imprecise notation, with any partial representation implying some amount of uncertainty. To handle this, 'parttime' provides classes for embedding partial missingness as a central part of its datetime classes. This central feature allows for more ergonomic use of datetimes for challenging datetime computation, including calculations of overlapping date ranges, imputations, and more thoughtful handling of ambiguity that arises from uncertain time zones. This package was developed first and foremost with pharmaceutical applications in mind, but aims to be agnostic to application to accommodate general use cases just as conveniently.
Maintained by Doug Kelkhoff. Last updated 1 year ago.
1.7 match 17 stars 3.93 score 3 scripts
digitalnature-ie
igr:Irish Grid Reference Utilities
Convert between Irish grid references and Irish Grid coordinates. Irish grid references can also be converted to or from an 'sf' object in any coordinate reference system. Precisions from 1 m to 100 km including 2 km (tetrads) are supported, as are datasets with mixed precision. Conversion to 'sf' polygons is precision-aware.
Maintained by John Kennedy. Last updated 27 days ago.
1.2 match 4 stars 5.41 score 9 scripts
usaid-oha-si
Wavelength:Wavelength
USAID OHA Office. Munging of mission weekly HFR data.
Maintained by Aaron Chafetz. Last updated 2 years ago.
1.9 match 3 stars 3.39 score 55 scripts
bioc
Basic4Cseq:Basic4Cseq: an R/Bioconductor package for analyzing 4C-seq data
Basic4Cseq is an R/Bioconductor package for basic filtering, analysis and subsequent visualization of 4C-seq data. Virtual fragment libraries can be created for any BSGenome package, and filter functions for both reads and fragments and basic quality controls are included. Fragment data in the vicinity of the experiment's viewpoint can be visualized as a coverage plot based on a running median approach and a multi-scale contact profile.
Maintained by Carolin Walter. Last updated 5 months ago.
immunooncology visualization qualitycontrol sequencing coverage alignment rnaseq sequencematching dataimport
1.7 match 3.30 score 5 scripts
colinfay
gargoyle:An Event-Based Mechanism for 'Shiny'
An event-based framework for building 'Shiny' apps. Instead of relying on standard 'Shiny' reactive objects, this package allows relying on a lighter set of triggers, so that reactive contexts can be invalidated with more control.
Maintained by Colin Fay. Last updated 7 months ago.
0.5 match 56 stars 7.37 score 28 scripts 1 dependents
canmod
LBoM.tools:Tools for Curating London Bills of Mortality and Registrar General Data
Part of an open toolchain for processing infectious disease datasets available through the IIDDA data repository.
Maintained by Steve Walker. Last updated 6 months ago.
2.3 match 1.70 score
openjusticeok
ojoutils:A Collection of Nifty Functions and Objects for OJO Analysts
We find ourselves repeating the same simple tasks or running a series of R commands over and over again. No more!
Maintained by Brancen Gregory. Last updated 1 month ago.
1.2 match 2.90 score 2 scripts
converseg
ML2Pvae:Variational Autoencoder Models for IRT Parameter Estimation
Based on the work of Curi, Converse, Hajewski, and Oliveira (2019) <doi:10.1109/IJCNN.2019.8852333>. This package provides easy-to-use functions which create a variational autoencoder (VAE) to be used for parameter estimation in Item Response Theory (IRT), namely the Multidimensional Logistic 2-Parameter (ML2P) model. To use a neural network as such, nontrivial modifications to the architecture must be made, such as restricting the nonzero weights in the decoder according to some binary matrix Q. The functions in this package allow for straightforward construction, training, and evaluation so that minimal knowledge of 'tensorflow' or 'keras' is required.
Maintained by Geoffrey Converse. Last updated 3 years ago.
1.7 match 2.00 score 4 scripts
xinyongtian
ModelMatrixModel:Create Model Matrix and Save the Transforming Parameters
The model.matrix() function in R is convenient for transforming a training dataset for modeling. But it does not save any of the parameters used in the transformation, so it is hard to apply the same transformation to a test dataset or a new dataset. This package was created to solve that problem.
Maintained by Xinyong Tian. Last updated 4 years ago.
1.3 match 2.70 score 7 scripts
datalowe
synr:Explore and Process Synesthesia Consistency Test Data
Explore synesthesia consistency test data, calculate consistency scores, and classify participant data as valid or invalid.
Maintained by Lowe Wilsson. Last updated 1 year ago.
0.5 match 5.32 score 139 scripts
xtimbeau
sourcoise:Source a Script and Cache
Provides a function that behaves nearly like base::source() but implements a project-based caching mechanism on disk. It allows quasi-sourcing R scripts that gather data but can fail or take too much time to respond even if nothing new is expected. It comes with tools to check the script and execute it on demand or when the cache is invalid.
Maintained by Xavier Timbeau. Last updated 11 hours ago.
0.5 match 4.65 score
bioc
basecallQC:Working with Illumina Basecalling and Demultiplexing input and output files
The basecallQC package provides tools to work with Illumina bcl2Fastq (versions >= 2.1.7) software. Prior to basecalling and demultiplexing using the bcl2Fastq software, basecallQC functions allow the user to update Illumina sample sheets from versions <= 1.8.9 to >= 2.1.7 standards, clean sample sheets of common problems such as invalid sample names and IDs, and create read and index basemasks and the bcl2Fastq command. Following the generation of basecalled and demultiplexed data, the basecallQC package allows the user to generate HTML tables, plots and a self-contained report of summary metrics from Illumina XML output files.
Maintained by Thomas Carroll. Last updated 5 months ago.
sequencing infrastructure dataimport qualitycontrol
0.5 match 4.32 score 21 scripts
cran
RSpincalc:Conversion Between Attitude Representations of DCM, Euler Angles, Quaternions, and Euler Vectors
Conversion between attitude representations: DCM, Euler angles, Quaternions, and Euler vectors. Plus conversion between 2 Euler angle set types (xyx, yzy, zxz, xzx, yxy, zyz, xyz, yzx, zxy, xzy, yxz, zyx). Fully vectorized code, with warnings/errors for Euler angles (singularity, out of range, invalid angle order), DCM (orthogonality, not proper, exceeded tolerance to unity determinant) and Euler vectors (not unity). Also quaternion and other useful functions. Based on SpinCalc by John Fuller and SpinConv by Paolo de Leva.
Maintained by Jose Gama. Last updated 10 years ago.
0.5 match 1.48 score 1 dependents
cran
BCBCSF:Bias-Corrected Bayesian Classification with Selected Features
Fully Bayesian classification with a subset of high-dimensional features, such as expression levels of genes. The data are modeled with a hierarchical Bayesian model using heavy-tailed t distributions as priors. When a large number of features are available, one may like to select only a subset of features to use, typically those features strongly correlated with the response in training cases. Such a feature selection procedure is however invalid, since the relationship between the response and the features has been exaggerated by feature selection. This package provides a way to avoid this bias and yield better-calibrated predictions for future cases when one uses the F-statistic to select features.
Maintained by Longhai Li. Last updated 9 years ago.
0.5 match 1.48 score 1 dependents
xinweima
ramchoice:Revealed Preference and Attention Analysis in Random Limited Attention Models
It is widely documented in psychology, economics and other disciplines that socio-economic agents may not pay full attention to all available alternatives, rendering standard revealed preference theory invalid. This package implements the estimation and inference procedures of Cattaneo, Ma, Masatlioglu and Suleymanov (2020) <arXiv:1712.03448> and Cattaneo, Cheung, Ma, and Masatlioglu (2022) <arXiv:2110.10650>, which utilize standard choice data to partially identify and estimate a decision maker's preference and attention. For inference, several simulation-based critical values are provided.
Maintained by Xinwei Ma. Last updated 1 year ago.
0.5 match 1.00 score 3 scripts