R-universe search: resource

obiba

resourcer:Resource Resolver

A resource represents some data or a computation unit. It is described by a URL and credentials. This package proposes a Resource model with "resolver" and "client" classes to facilitate access to and use of resources.

Maintained by Yannick Marcon. Last updated 2 years ago.

160.8 match 2 stars 5.10 score 42 scripts 1 dependent

polar-fhir

fhircrackr:Handling HL7 FHIR® Resources in R

Useful tools for conveniently downloading FHIR resources in xml format and converting them to R data.frames. The package uses FHIR-search to download bundles from a FHIR server, provides functions to save and read xml-files containing such bundles and allows flattening the bundles to data.frames using XPath expressions. FHIR® is the registered trademark of HL7 and is used with the permission of HL7. Use of the FHIR trademark does not constitute endorsement of this product by HL7.

Maintained by Julia Palm. Last updated 12 days ago.

fhir fhir-client

56.9 match 33 stars 7.63 score 46 scripts

ropensci

targets:Dynamic Function-Oriented 'Make'-Like Declarative Pipelines

Pipeline tools coordinate the pieces of computationally demanding analysis projects. The 'targets' package is a 'Make'-like pipeline tool for statistics and data science in R. The package skips costly runtime for tasks that are already up to date, orchestrates the necessary computation with implicit parallel computing, and abstracts files as R objects. If all the current output matches the current upstream code and data, then the whole pipeline is up to date, and the results are more trustworthy than otherwise. The methodology in this package borrows from GNU 'Make' (2015, ISBN:978-9881443519) and 'drake' (2018, <doi:10.21105/joss.00550>).

Maintained by William Michael Landau. Last updated 3 days ago.

data-science high-performance-computing make peer-reviewed pipeline r-targetopia reproducibility reproducible-research targets workflow

25.2 match 973 stars 15.20 score 4.6k scripts 22 dependents

obiba

opalr:'Opal' Data Repository Client and 'DataSHIELD' Utils

Data integration Web application for biobanks by 'OBiBa'. 'Opal' is the core database application for biobanks. Participant data, once collected from any data source, must be integrated and stored in a central data repository under a uniform model. 'Opal' is such a central repository. It can import, process, validate, query, analyze, report, and export data. 'Opal' is typically used in a research center to analyze the data acquired at assessment centres. Its ultimate purpose is to achieve seamless data-sharing among biobanks. This 'Opal' client lets you interact with 'Opal' web services and perform operations on the R server side. 'DataSHIELD' administration tools are also provided.

Maintained by Yannick Marcon. Last updated 2 months ago.

46.8 match 3 stars 7.76 score 179 scripts 2 dependents

bioc

OmnipathR:OmniPath web service client and more

A client for the OmniPath web service (https://www.omnipathdb.org) and many other resources. It also includes functions to transform and pretty print some of the downloaded data, functions to access a number of other resources such as BioPlex, ConsensusPathDB, EVEX, Gene Ontology, Guide to Pharmacology (IUPHAR/BPS), Harmonizome, HTRIdb, Human Phenotype Ontology, InWeb InBioMap, KEGG Pathway, Pathway Commons, Ramilowski et al. 2015, RegNetwork, ReMap, TF census, TRRUST and Vinayagam et al. 2011. Furthermore, OmnipathR features a close integration with the NicheNet method for ligand activity prediction from transcriptomics data, and its R implementation `nichenetr` (available only on github).

Maintained by Denes Turei. Last updated 19 days ago.

graphandnetwork network pathways software thirdpartyclient dataimport datarepresentation genesignaling generegulation systemsbiology transcriptomics singlecell annotation kegg complexes enzyme-ptm networks networks-biology omnipath proteins quarto

32.8 match 126 stars 9.90 score 226 scripts 2 dependents

cloudyr

googleComputeEngineR:R Interface with Google Compute Engine

Interact with the 'Google Compute Engine' API in R. Lets you create, start and stop instances in the 'Google Cloud'. Support for preconfigured instances, with templates for common R needs.

Maintained by Mark Edmondson. Last updated 1 day ago.

api cloud-computing cloudyr google-cloud googleauthr launching-virtual-machines

25.1 match 152 stars 9.73 score 235 scripts

azure

AzureRMR:Interface to 'Azure Resource Manager'

A lightweight but powerful R interface to the 'Azure Resource Manager' REST API. The package exposes a comprehensive class framework and related tools for creating, updating and deleting 'Azure' resource groups, resources and templates. While 'AzureRMR' can be used to manage any 'Azure' service, it can also be extended by other packages to provide extra functionality for specific services. Part of the 'AzureR' family of packages.

Maintained by Hong Ooi. Last updated 1 year ago.

azure azure-resource-manager azure-sdk-r cloud

24.3 match 20 stars 9.94 score 51 scripts 12 dependents

usepa

ctxR:Utilities for Interacting with the 'CTX' APIs

Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <https://www.epa.gov/comptox-tools/computational-toxicology-and-exposure-apis>. 'ctxR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <https://comptox.epa.gov/dashboard/> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <https://www.epa.gov/comptox-tools>.

Maintained by Paul Kruse. Last updated 2 months ago.

ccte comptox ord

29.8 match 10 stars 8.02 score 13 scripts 1 dependent

alisonlanski

IPEDSuploadables:Transforms Institutional Data into Text Files for IPEDS Automated Import/Upload

Starting from user-supplied institutional data, these scripts transform, aggregate, and reshape the information to produce key-value pair data files that are able to be uploaded to IPEDS (Integrated Postsecondary Education Data System) through their submission portal <https://surveys.nces.ed.gov/ipeds/>. Starting data specifications can be found in the vignettes. Final files are saved locally to a location of the user's choice. User-friendly readable files can also be produced for purposes of data review and validation.

Maintained by Alison Lanski. Last updated 3 months ago.

30.5 match 8 stars 7.05 score 39 scripts

r-simmer

simmer:Discrete-Event Simulation for R

A process-oriented and trajectory-based Discrete-Event Simulation (DES) package for R. It is designed as a generic yet powerful framework. The architecture encloses a robust and fast simulation core written in 'C++' with automatic monitoring capabilities. It provides a rich and flexible R API that revolves around the concept of trajectory, a common path in the simulation model for entities of the same type. Documentation about 'simmer' is provided by several vignettes included in this package, via the paper by Ucar, Smeets & Azcorra (2019, <doi:10.18637/jss.v090.i02>), and the paper by Ucar, Hernández, Serrano & Azcorra (2018, <doi:10.1109/MCOM.2018.1700960>); see 'citation("simmer")' for details.

Maintained by Iñaki Ucar. Last updated 6 months ago.

discrete-event simulation cpp

18.7 match 223 stars 11.47 score 440 scripts 6 dependents

djvanderlaan

datapackage:Creating and Reading Data Packages

Open, read data from and modify Data Packages. Data Packages are an open standard for bundling and describing data sets (<https://datapackage.org>). When data is read from a Data Package, care is taken to convert the data as much as possible to appropriate R data types. The package can be extended with plugins for additional data types.

Maintained by Jan van der Laan. Last updated 7 days ago.

datapackage frictionless

36.3 match 2 stars 5.62 score

emf-creaf

indicspecies:Relationship Between Species and Groups of Sites

Functions to assess the strength and statistical significance of the relationship between species occurrence/abundance and groups of sites [De Caceres & Legendre (2009) <doi:10.1890/08-1823.1>]. Also includes functions to measure species niche breadth using resource categories [De Caceres et al. (2011) <doi:10.1111/J.1600-0706.2011.19679.x>].

Maintained by Miquel De Cáceres. Last updated 25 days ago.

19.9 match 10 stars 9.49 score 386 scripts 4 dependents

bupaverse

edeaR:Exploratory and Descriptive Event-Based Data Analysis

Exploratory and descriptive analysis of event-based data. Provides methods for describing and selecting process data, and for preparing event log data for process mining. Builds on the S3-class for event logs implemented in the package 'bupaR'.

Maintained by Gert Janssenswillen. Last updated 4 months ago.

20.3 match 12 stars 9.17 score 149 scripts 8 dependents

usepa

ccdR:Utilities for Interacting with the 'CTX' APIs

Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <https://api-ccte.epa.gov/docs/>. 'ccdR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <https://comptox.epa.gov/dashboard/> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <https://www.epa.gov/comptox-tools>.

Maintained by Paul Kruse. Last updated 8 months ago.

28.4 match 2 stars 6.38 score 7 scripts

ropensci

frictionless:Read and Write Frictionless Data Packages

Read and write Frictionless Data Packages. A 'Data Package' (<https://specs.frictionlessdata.io/data-package/>) is a simple container format and standard to describe and package a collection of (tabular) data. It is typically used to publish FAIR (<https://www.go-fair.org/fair-principles/>) and open datasets.

Maintained by Peter Desmet. Last updated 6 months ago.

frictionlessdata oscibio

18.3 match 30 stars 9.79 score 55 scripts 6 dependents

bupaverse

bupaR:Business Process Analysis in R

Comprehensive Business Process Analysis toolkit. Creates S3-class for event log objects, and related handler functions. Imports related packages for filtering event data, computation of descriptive statistics, handling of 'Petri Net' objects and visualization of process maps. See also packages 'edeaR', 'processmapR', 'eventdataR' and 'processmonitR'.

Maintained by Gert Janssenswillen. Last updated 2 years ago.

19.6 match 55 stars 9.07 score 389 scripts 11 dependents

hzambran

hydroTSM:Time Series Management and Analysis for Hydrological Modelling

S3 functions for management, analysis, interpolation and plotting of time series used in hydrology and related environmental sciences. In particular, this package is highly oriented to hydrological modelling tasks. The focus of this package is on providing a collection of tools useful for the daily work of hydrologists (although an effort was made to optimise each function as much as possible, functionality has had priority over speed). Bugs / comments / questions / collaboration of any kind are very welcome, in particular datasets that can be included in this package for academic purposes.

Maintained by Mauricio Zambrano-Bigiarini. Last updated 1 month ago.

hydrology hydrology-modeling hydrology-statistical resource water-resources

17.5 match 45 stars 10.14 score 340 scripts 10 dependents

bioc

AnnotationHub:Client to access AnnotationHub resources

This package provides a client for the Bioconductor AnnotationHub web resource. The AnnotationHub web resource provides a central location where genomic files (e.g., VCF, bed, wig) and other resources from standard locations (e.g., UCSC, Ensembl) can be discovered. The resource includes metadata about each resource, e.g., a textual description, tags, and date of modification. The client creates and manages a local cache of files retrieved by the user, helping with quick and reproducible access.

Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.

infrastructure dataimport gui thirdpartyclient core-package u24ca289073

11.7 match 17 stars 13.89 score 2.7k scripts 102 dependents

bioc

BiocFileCache:Manage Files Across Sessions

This package creates a persistent on-disk cache of files that the user can add, update, and retrieve. It is useful for managing resources (such as custom Txdb objects) that are costly or difficult to create, web resources, and data files used across sessions.

Maintained by Lori Shepherd. Last updated 2 months ago.

dataimport core-package u24ca289073

11.5 match 13 stars 13.76 score 486 scripts 429 dependents

ropensci

ckanr:Client for the Comprehensive Knowledge Archive Network ('CKAN') API

Client for 'CKAN' API (<https://ckan.org/>). Includes interface to 'CKAN' 'APIs' for search, list, show for packages, organizations, and resources. In addition, provides an interface to the 'datastore' API.

Maintained by Francisco Alves. Last updated 2 years ago.

database open-data ckan api data dataset api-wrapper ckan-api

17.4 match 100 stars 8.67 score 448 scripts 4 dependents

molgenis

MolgenisArmadillo:Armadillo Client for the Armadillo Service

A set of functions to manage data shared on a 'MOLGENIS Armadillo' server.

Maintained by Mariska Slofstra. Last updated 17 days ago.

hacktoberfest

19.2 match 3 stars 7.51 score 28 scripts

sizespectrum

mizer:Dynamic Multi-Species Size Spectrum Modelling

A set of classes and methods to set up and run multi-species, trait based and community size spectrum ecological models, focused on the marine environment.

Maintained by Gustav Delius. Last updated 2 months ago.

ecosystem-model fish-population-dynamics fisheries fisheries-management marine-ecosystem population-dynamics simulation size-structure species-interactions transport-equation cpp

15.1 match 38 stars 9.43 score 207 scripts

tiledb-inc

tiledb:Modern Database Engine for Complex Data Based on Multi-Dimensional Arrays

The modern database 'TileDB' introduces a powerful on-disk format for storing and accessing any complex data based on multi-dimensional arrays. It supports dense and sparse arrays, dataframes and key-values stores, cloud storage ('S3', 'GCS', 'Azure'), chunked arrays, multiple compression, encryption and checksum filters, uses a fully multi-threaded implementation, supports parallel I/O, data versioning ('time travel'), metadata and groups. It is implemented as an embeddable cross-platform C++ library with APIs from several languages, and integrations. This package provides the R support.

Maintained by Isaiah Norton. Last updated 5 days ago.

array hdfs s3 storage-manager tiledb cpp

10.7 match 107 stars 11.96 score 306 scripts 4 dependents

kenaho1

asbio:A Collection of Statistical Tools for Biologists

Contains functions from: Aho, K. (2014) Foundational and Applied Statistics for Biologists using R. CRC/Taylor and Francis, Boca Raton, FL, ISBN: 978-1-4398-7338-0.

Maintained by Ken Aho. Last updated 2 months ago.

16.9 match 5 stars 7.32 score 310 scripts 3 dependents

dataoneorg

dataone:R Interface to the DataONE REST API

Provides read and write access to data and metadata from the DataONE network <https://www.dataone.org> of data repositories. Each DataONE repository implements a consistent repository application programming interface. Users call methods in R to access these remote repository functions, such as methods to query the metadata catalog, get access to metadata for particular data packages, and read the data objects from the data repository. Users can also insert and update data objects on repositories that support these methods.

Maintained by Matthew B. Jones. Last updated 3 years ago.

12.4 match 36 stars 9.93 score 472 scripts 3 dependents

obiba

s3.resourcer:S3 Resource Resolver

An S3 resource is provided by Amazon Web Services S3 or an S3-compatible object store (such as Minio). The resource can be a tidy file to be downloaded from the object store, or a data lake (such as Delta Lake) Parquet file to be read by Apache Spark.

Maintained by Yannick Marcon. Last updated 2 months ago.

45.2 match 2.70 score 3 scripts

datashield

DSI:'DataSHIELD' Interface

'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package defines the API that is to be implemented by 'DataSHIELD' compliant data repositories.

Maintained by Yannick Marcon. Last updated 4 months ago.

16.9 match 2 stars 7.01 score 106 scripts 4 dependents

zizroc

villager:A Framework for Designing and Running Agent Based Models

This is a package for creating and running Agent Based Models (ABM). It provides a set of base classes with core functionality to allow bootstrapped models. For more intensive modeling, the supplied classes can be extended to fit researcher needs.

Maintained by Thomas Thelen. Last updated 9 months ago.

abm agent-based-modeling simulation

16.2 match 57 stars 6.79 score 18 scripts

modeloriented

DALEX:moDel Agnostic Language for Exploration and eXplanation

Any unverified black box model is the path to failure. Opaqueness leads to distrust. Distrust leads to ignoration. Ignoration leads to rejection. DALEX package xrays any model and helps to explore and explain its behaviour. Machine Learning (ML) models are widely used and have various applications in classification or regression. Models created with boosting, bagging, stacking or similar techniques are often used due to their high performance. But such black-box models usually lack direct interpretability. DALEX package contains various methods that help to understand the link between input variables and model output. Implemented methods help to explore the model on the level of a single instance as well as a level of the whole dataset. All model explainers are model agnostic and can be compared across different models. DALEX package is the cornerstone for 'DrWhy.AI' universe of packages for visual model exploration. Find more details in (Biecek 2018) <https://jmlr.org/papers/v19/18-416.html>.

Maintained by Przemyslaw Biecek. Last updated 1 month ago.

black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai

8.0 match 1.4k stars 13.40 score 876 scripts 21 dependents

ropensci

redland:RDF Library Bindings in R

Provides methods to parse, query and serialize information stored in the Resource Description Framework (RDF). RDF is described at <https://www.w3.org/TR/rdf-primer/>. This package supports RDF by implementing an R interface to the Redland RDF C library, described at <https://librdf.org/docs/api/index.html>. In brief, RDF provides a structured graph consisting of Statements composed of Subject, Predicate, and Object Nodes.

Maintained by Matthew B. Jones. Last updated 1 year ago.

redland

13.0 match 17 stars 7.85 score 98 scripts 13 dependents

bioc

ontoProc:processing of ontologies of anatomy, cell lines, and so on

Support harvesting of diverse bioinformatic ontologies, making particular use of the ontologyIndex package on CRAN. We provide snapshots of key ontologies for terms about cells, cell lines, chemical compounds, and anatomy, to help analyze genome-scale experiments, particularly cell x compound screens. Another purpose is to strengthen development of compelling use cases for richer interfaces to emerging ontologies.

Maintained by Vincent Carey. Last updated 4 days ago.

infrastructure go bioinformatics genomics ontology

15.0 match 3 stars 6.37 score 75 scripts 2 dependents

civisanalytics

civis:R Client for the 'Civis Platform API'

A convenient interface for making requests directly to the 'Civis Platform API' <https://www.civisanalytics.com/platform/>. Full documentation available 'here' <https://civisanalytics.github.io/civis-r/>.

Maintained by Peter Cooman. Last updated 2 months ago.

11.1 match 16 stars 7.84 score 144 scripts

paws-r

paws:Amazon Web Services Software Development Kit

Interface to Amazon Web Services <https://aws.amazon.com>, including storage, database, and compute services, such as 'Simple Storage Service' ('S3'), 'DynamoDB' 'NoSQL' database, and 'Lambda' functions-as-a-service.

Maintained by Dyfan Jones. Last updated 4 days ago.

aws aws-sdk

7.7 match 332 stars 11.25 score 177 scripts 12 dependents

bioc

RITAN:Rapid Integration of Term Annotation and Network resources

Tools for comprehensive gene set enrichment and extraction of multi-resource high confidence subnetworks. RITAN facilitates bioinformatic tasks for enabling network biology research.

Maintained by Michael Zimmermann. Last updated 5 months ago.

qualitycontrol network networkenrichment networkinference genesetenrichment functionalgenomics graphandnetwork

15.9 match 5.40 score 9 scripts

mrc-ide

naomi.resources:Data dependencies for Naomi output generation

Provides the data needed for Naomi output generation as an R package.

Maintained by Rachel Esra. Last updated 1 year ago.

39.4 match 2.18 score 1 script

r-dbi

DBI:R Database Interface

A database interface definition for communication between R and relational database management systems. All classes in this package are virtual and need to be extended by the various R/DBMS implementations.

Maintained by Kirill Müller. Last updated 3 months ago.

database interface

4.0 match 302 stars 20.88 score 19k scripts 2.9k dependents

shevandrin

rqti:Create Tests According to QTI 2.1 Standard

Create tests and tasks compliant with the Question & Test Interoperability (QTI) information model version 2.1. Input sources are Rmd/md description files or S4-class objects. Output formats include standalone zip or xml files. Supports the generation of basic task types (single and multiple choice, order, pair association, matching tables, filling gaps and essay) and provides a comprehensive set of attributes for customizing tests.

Maintained by Andrey Shevandrin. Last updated 4 days ago.

14.0 match 5 stars 5.89 score 26 scripts

bioc

ExperimentHub:Client to access ExperimentHub resources

This package provides a client for the Bioconductor ExperimentHub web resource. ExperimentHub provides a central location where curated data from experiments, publications or training courses can be accessed. Each resource has associated metadata, tags and date of modification. The client creates and manages a local cache of files retrieved enabling quick and reproducible access.

Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.

infrastructure dataimport gui thirdpartyclient core-package u24ca289073

6.9 match 9 stars 11.98 score 764 scripts 55 dependents

azure

azuremlsdk:Interface to the 'Azure Machine Learning' 'SDK'

Interface to the 'Azure Machine Learning' Software Development Kit ('SDK'). Data scientists can use the 'SDK' to train, deploy, automate, and manage machine learning models on the 'Azure Machine Learning' service. To learn more about 'Azure Machine Learning' visit the website: <https://docs.microsoft.com/en-us/azure/machine-learning/service/overview-what-is-azure-ml>.

Maintained by Diondra Peck. Last updated 3 years ago.

amlcompute azure azure-machine-learning azureml dsi machine-learning rstudio sdk-r

9.0 match 106 stars 8.91 score 221 scripts

rstudio

rmarkdown:Dynamic Documents for R

Convert R Markdown documents into a variety of formats.

Maintained by Yihui Xie. Last updated 4 months ago.

literate-programming markdown pandoc rmarkdown

3.7 match 2.9k stars 21.79 score 14k scripts 3.7k dependents

mikejohnson51

climateR:climateR

Find, subset, and retrieve geospatial data by AOI.

Maintained by Mike Johnson. Last updated 3 months ago.

aoi climate dataset geospatial gridded-climate-data weather

9.1 match 187 stars 8.74 score 156 scripts 1 dependent

confoobio

GMSE:Generalised Management Strategy Evaluation Simulator

Integrates game theory and ecological theory to construct social-ecological models that simulate the management of populations and stakeholder actions. These models build off of a previously developed management strategy evaluation (MSE) framework to simulate all aspects of management: population dynamics, manager observation of populations, manager decision making, and stakeholder responses to management decisions. The newly developed generalised management strategy evaluation (GMSE) framework uses genetic algorithms to mimic the decision-making process of managers and stakeholders under conditions of change, uncertainty, and conflict. Simulations can be run using gmse(), gmse_apply(), and gmse_gui() functions.

Maintained by A. Bradley Duthie. Last updated 3 years ago.

adaptive-management agricultural-modelling conflict conflict-resolution conservation ecological-modelling ecological-models ecology food-security game-theory genetic-algorithm genetic-algorithms management-decisions management-strategy-evaluation population-model simulation wildlife-management

14.6 match 10 stars 5.43 score 178 scripts

bioc

decoupleR:decoupleR: Ensemble of computational methods to infer biological activities from omics data

Many methods allow us to extract biological activities from omics data using information from prior knowledge resources, reducing the dimensionality for increased statistical power and better interpretability. Here, we present decoupleR, a Bioconductor package containing different statistical methods to extract these signatures within a unified framework. decoupleR allows the user to flexibly test any method with any resource. It incorporates methods that take into account the sign and weight of network interactions. decoupleR can be used with any omic, as long as its features can be linked to a biological process based on prior knowledge. For example, in transcriptomics gene sets regulated by a transcription factor, or in phospho-proteomics phosphosites that are targeted by a kinase.

Maintained by Pau Badia-i-Mompel. Last updated 5 months ago.

differentialexpression functionalgenomics geneexpression generegulation network software statisticalmethod transcription

6.9 match 230 stars 11.27 score 316 scripts 3 dependents

ropensci

SymbiotaR2:Downloading Data from Symbiota2 Portals into R

Download data from Symbiota2 portals using Symbiota's API. Covers the Checklists, Collections, Crowdsource, Exsiccati, Glossary, ImageProcessor, Key, Media, Occurrence, Reference, Taxa, Traits, and UserRoles API families. Each Symbiota2 portal owner can load their own plugins (and modified code), and so this package may not cover every possible API endpoint from a given Symbiota2 instance.

Maintained by Austin Koontz. Last updated 3 years ago.

database library specimen-records symbiota symbiota2 symbiota2-portal

23.4 match 2 stars 3.30 score 4 scripts

hneth

riskyr:Rendering Risk Literacy more Transparent

Risk-related information (like the prevalence of conditions, the sensitivity and specificity of diagnostic tests, or the effectiveness of interventions or treatments) can be expressed in terms of frequencies or probabilities. By providing a toolbox of corresponding metrics and representations, 'riskyr' computes, translates, and visualizes risk-related information in a variety of ways. Adopting multiple complementary perspectives provides insights into the interplay between key parameters and renders teaching and training programs on risk literacy more transparent.

Maintained by Hansjoerg Neth. Last updated 10 months ago.

2x2-matrix bayesian-inference contingency-table representation risk risk-literacy visualization

10.0 match 19 stars 7.36 score 80 scripts

ropensci

datapack:A Flexible Container to Transport and Manipulate Data and Associated Resources

Provides a flexible container to transport and manipulate complex sets of data. These data may consist of multiple data files and associated meta data and ancillary files. Individual data objects have associated system level meta data, and data files are linked together using the OAI-ORE standard resource map which describes the relationships between the files. The OAI-ORE standard is described at <https://www.openarchives.org/ore/>. Data packages can be serialized and transported as structured files that have been created following the BagIt specification. The BagIt specification is described at <https://tools.ietf.org/html/draft-kunze-bagit-08>.

Maintained by Matthew B. Jones. Last updated 3 years ago.

8.5 match 44 stars 8.56 score 195 scripts 4 dependents

bioc

SeqArray:Data Management of Large-Scale Whole-Genome Sequence Variant Calls

Data management of large-scale whole-genome sequencing variant calls with thousands of individuals: genotypic data (e.g., SNVs, indels and structural variation calls) and annotations in SeqArray GDS files are stored in an array-oriented and compressed manner, with efficient data access using the R programming language.

Maintained by Xiuwen Zheng. Last updated 10 days ago.

infrastructure datarepresentation sequencing genetics bioinformatics gds-format snp snv wes wgs cpp

6.0 match 45 stars 12.08 score 1.1k scripts 9 dependents

eblondel

geosapi:GeoServer REST API R Interface

Provides an R interface to the GeoServer REST API, allowing you to upload and publish data in a GeoServer web-application and expose data to OGC Web-Services. The package currently supports all CRUD (Create, Read, Update, Delete) operations on GeoServer workspaces, namespaces, datastores (stores of vector data), featuretypes, layers, styles, as well as vector data upload operations. For more information about the GeoServer REST API, see <https://docs.geoserver.org/stable/en/user/rest/>.

Maintained by Emmanuel Blondel. Last updated 15 days ago.

api geoserver gis publication rest spatial

11.6 match 34 stars 6.23 score 33 scripts

ropensci

tidyhydat:Extract and Tidy Canadian 'Hydrometric' Data

Provides functions to access historical and real-time national 'hydrometric' data from Water Survey of Canada data sources (<https://dd.weather.gc.ca/hydrometric/csv/> and <https://collaboration.cmc.ec.gc.ca/cmc/hydrometrics/www/>) and then applies tidy data principles.

Maintained by Sam Albers. Last updated 5 days ago.

citz government-data hydrology hydrometrics tidy-data water-resources

7.5 match 71 stars 9.59 score 202 scripts 3 dependents

thinkr-open

golem:A Framework for Robust Shiny Applications

An opinionated framework for building a production-ready 'Shiny' application. This package contains a series of tools for building a robust 'Shiny' application from start to finish.

Maintained by Colin Fay. Last updated 7 months ago.

golemverse hacktoberfest shiny shiny-apps shiny-r shinyapps

5.0 match 921 stars 14.23 score 167 scripts 62 dependents

hneth

unikn:Graphical Elements of the University of Konstanz's Corporate Design

Define and use graphical elements of corporate design manuals in R. The 'unikn' package provides color functions (by defining dedicated colors and color palettes, and commands for finding, changing, viewing, and using them) and styled text elements (e.g., for marking, underlining, or plotting colored titles). The pre-defined range of colors and text decoration functions is based on the corporate design of the University of Konstanz <https://www.uni-konstanz.de/>, but can be adapted and extended for other purposes or institutions.

Maintained by Hansjoerg Neth. Last updated 3 months ago.

branding color color-palette colorscheme corporate-design palette text-decoration university-colors visual-identity

8.0 match 39 stars 8.82 score 156 scripts 2 dependents

dalekube

hR:Better Data Engineering in Human Resources

Methods for data engineering in the human resources (HR) corporate domain. Designed for HR analytics practitioners and workforce-oriented data sets.

Maintained by Dale Kube. Last updated 14 hours ago.

analytics data data-engineering data-science human-resources

13.9 match 21 stars 5.02 score 8 scripts

bioc

BiocIO:Standard Input and Output for Bioconductor Packages

The `BiocIO` package contains high-level abstract classes and generics used by developers to build IO functionality within the Bioconductor suite of packages. Implements `import()` and `export()` standard generics for importing and exporting biological data formats. `import()` supports whole-file as well as chunk-wise iterative import. The `import()` interface optionally provides a standard mechanism for 'lazy' access via `filter()` (on row or element-like components of the file resource), `select()` (on column-like components of the file resource) and `collect()`. The `import()` interface optionally provides transparent access to remote (e.g. via https) as well as local access. Developers can register a file extension, e.g., `.loom` for dispatch from character-based URIs to specific `import()` / `export()` methods based on classes representing file types, e.g., `LoomFile()`.

Maintained by Marcel Ramos. Last updated 4 months ago.

annotation dataimport bioconductor-package core-package

6.8 match 1 stars 10.20 score 19 scripts 487 dependents

projectmosaic

mosaic:Project MOSAIC Statistics and Mathematics Teaching Utilities

Data sets and utilities from Project MOSAIC (<http://www.mosaic-web.org>) used to teach mathematics, statistics, computation and modeling. Funded by the NSF, Project MOSAIC is a community of educators working to tie together aspects of quantitative work that students in science, technology, engineering and mathematics will need in their professional lives, but which are usually taught in isolation, if at all.

Maintained by Randall Pruim. Last updated 1 years ago.

5.1 match 93 stars 13.32 score 7.2k scripts 7 dependents

bcgov

bcdata:Search and Retrieve Data from the BC Data Catalogue

Search, query, and download tabular and 'geospatial' data from the British Columbia Data Catalogue (<https://catalogue.data.gov.bc.ca/>). Search catalogue data records based on keywords, data licence, sector, data format, and B.C. government organization. View metadata directly in R, download many data formats, and query 'geospatial' data available via the B.C. government Web Feature Service ('WFS') using 'dplyr' syntax.

Maintained by Andy Teucher. Last updated 1 months ago.

bcdc citz data-science env

6.6 match 83 stars 10.29 score 186 scripts 4 dependents

rstudio

rstudioapi:Safely Access the RStudio API

Access the RStudio API (if available) and provide informative error messages when it's not.

Maintained by Kevin Ushey. Last updated 4 months ago.

3.6 match 172 stars 18.81 score 3.6k scripts 2.1k dependents

ikosmidis

brglm2:Bias Reduction in Generalized Linear Models

Estimation and inference from generalized linear models based on various methods for bias reduction and maximum penalized likelihood with powers of the Jeffreys prior as penalty. The 'brglmFit' fitting method can achieve reduction of estimation bias by solving either the mean bias-reducing adjusted score equations in Firth (1993) <doi:10.1093/biomet/80.1.27> and Kosmidis and Firth (2009) <doi:10.1093/biomet/asp055>, or the median bias-reduction adjusted score equations in Kenne et al. (2017) <doi:10.1093/biomet/asx046>, or through the direct subtraction of an estimate of the bias of the maximum likelihood estimator from the maximum likelihood estimates as in Cordeiro and McCullagh (1991) <https://www.jstor.org/stable/2345592>. See Kosmidis et al (2020) <doi:10.1007/s11222-019-09860-6> for more details. Estimation in all cases takes place via a quasi Fisher scoring algorithm, and S3 methods for the construction of confidence intervals for the reduced-bias estimates are provided. In the special case of generalized linear models for binomial and multinomial responses (both ordinal and nominal), the adjusted score approaches to mean and median bias reduction have been found to return estimates with improved frequentist properties, that are also always finite, even in cases where the maximum likelihood estimates are infinite (e.g. complete and quasi-complete separation; see Kosmidis and Firth, 2020 <doi:10.1093/biomet/asaa052>, for a proof for mean bias reduction in logistic regression).

Maintained by Ioannis Kosmidis. Last updated 6 months ago.

adjusted-score-equations algorithms bias-reducing-adjustments bias-reduction estimation glm logistic-regression nominal-responses ordinal-responses regression regression-algorithms statistics

6.5 match 32 stars 10.41 score 106 scripts 10 dependents

bioc

AnnotationFilter:Facilities for Filtering Bioconductor Annotation Resources

This package provides class and other infrastructure to implement filters for manipulating Bioconductor annotation resources. The filters will be used by ensembldb, Organism.dplyr, and other packages.

Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.

annotation infrastructure software bioconductor-package core-package

6.5 match 5 stars 10.20 score 45 scripts 162 dependents

bioc

cbpManager:Generate, manage, and edit data and metadata files suitable for the import in cBioPortal for Cancer Genomics

This R package provides an R Shiny application that enables the user to generate, manage, and edit data and metadata files suitable for import into cBioPortal for Cancer Genomics. Create cancer studies and edit their metadata. Upload mutation data of a patient that will be concatenated to the data_mutation_extended.txt file of the study. Create and edit clinical patient data, sample data, and timeline data. Create custom timeline tracks for patients.

Maintained by Arsenij Ustjanzew. Last updated 5 months ago.

immunooncology dataimport datarepresentation gui thirdpartyclient preprocessing visualization cancer-genomics cbioportal clinical-data filegenerator mutation-data patient-data

11.7 match 8 stars 5.51 score 1 scripts

bioc

BiocFHIR:Illustration of FHIR ingestion and transformation using R

FHIR R4 bundles in JSON format are derived from https://synthea.mitre.org/downloads. Transformation inspired by a kaggle notebook published by Dr Alexander Scarlat, https://www.kaggle.com/code/drscarlat/fhir-starter-parse-healthcare-bundles-into-tables. This is a very limited illustration of some basic parsing and reorganization processes. Additional tooling will be required to move beyond the Synthea data illustrations.

Maintained by Vincent Carey. Last updated 5 months ago.

infrastructure dataimport datarepresentation fhir

11.0 match 4 stars 5.78 score 15 scripts

bioc

UCell:Rank-based signature enrichment analysis for single-cell data

UCell is a package for evaluating gene signatures in single-cell datasets. UCell signature scores, based on the Mann-Whitney U statistic, are robust to dataset size and heterogeneity, and their calculation demands less computing time and memory than other available methods, enabling the processing of large datasets in a few minutes even on machines with limited computing power. UCell can be applied to any single-cell data matrix, and includes functions to directly interact with SingleCellExperiment and Seurat objects.

Maintained by Massimo Andreatta. Last updated 5 months ago.

singlecell genesetenrichment transcriptomics geneexpression cellbasedassays

6.0 match 143 stars 10.43 score 454 scripts 2 dependents

ocha-dap

ripc:Download and Tidy IPC and CH Data

Utilities to access Integrated Food Security Phase Classification (IPC) and Cadre Harmonisé (CH) food security data. Wrapper functions are available for all of the 'IPC-CH' Public API (<https://docs.api.ipcinfo.org>) simplified and advanced endpoints to easily download the data in a clean and tidy format.

Maintained by Seth Caldwell. Last updated 9 months ago.

12.6 match 2 stars 4.70 score 4 scripts

psolymos

ResourceSelection:Resource Selection (Probability) Functions for Use-Availability Data

Resource Selection (Probability) Functions for use-availability wildlife data based on weighted distributions as described in Lele and Keim (2006) <doi:10.1890/0012-9658(2006)87%5B3021:WDAEOR%5D2.0.CO;2>, Lele (2009) <doi:10.2193/2007-535>, and Solymos & Lele (2016) <doi:10.1111/2041-210X.12432>.

Maintained by Peter Solymos. Last updated 10 months ago.

ecology estimation lele rsf rspf solymos weighted-distributions

6.8 match 8 stars 8.37 score 752 scripts 3 dependents

bioc

HubPub:Utilities to create and use Bioconductor Hubs

HubPub provides users with functionality to help with the Bioconductor Hub structures. The package provides the ability to create a skeleton of a Hub style package that the user can then populate with the necessary information. There are also functions to help add resources to the Hub package metadata files as well as publish data to the Bioconductor S3 bucket.

Maintained by Kayla Interdonato. Last updated 3 days ago.

dataimport infrastructure software thirdpartyclient bioconductor-package

11.0 match 3 stars 5.18 score 4 scripts

sebkrantz

collapse:Advanced and Fast Data Transformation

A C/C++ based package for advanced data transformation and statistical computing in R that is extremely fast, class-agnostic, robust and programmer friendly. Core functionality includes a rich set of S3 generic grouped and weighted statistical functions for vectors, matrices and data frames, which provide efficient low-level vectorizations, OpenMP multithreading, and skip missing values by default. These are integrated with fast grouping and ordering algorithms (also callable from C), and efficient data manipulation functions. The package also provides a flexible and rigorous approach to time series and panel data in R. It further includes fast functions for common statistical procedures, detailed (grouped, weighted) summary statistics, powerful tools to work with nested data, fast data object conversions, functions for memory efficient R programming, and helpers to effectively deal with variable labels, attributes, and missing data. It is well integrated with base R classes, 'dplyr'/'tibble', 'data.table', 'sf', 'units', 'plm' (panel-series and data frames), and 'xts'/'zoo'.

Maintained by Sebastian Krantz. Last updated 6 days ago.

data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data scientific-computing statistics time-series weighted weights cpp openmp

3.3 match 672 stars 16.63 score 708 scripts 97 dependents

program--

HSClientR:A HydroShare API client for R

A RESTful API wrapper for accessing <https://hydroshare.org> data in R.

Maintained by Justin Singh-Mohudpur. Last updated 4 years ago.

api-wrapper cuashi hydrology hydroshare water-resources

23.2 match 4 stars 2.30 score 2 scripts

jchrom

trelloR:Access the Trello API

An R client for the Trello API. Supports free-tier features such as access to private boards, creating and updating cards and other resources, and downloading data in a structured way.

Maintained by Jakub Chromec. Last updated 2 years ago.

api trello

8.6 match 42 stars 6.18 score 24 scripts

rchlumsk

RavenR:Raven Hydrological Modelling Framework R Support and Analysis

Utilities for processing input and output files associated with the Raven Hydrological Modelling Framework. Includes various plotting functions, model diagnostics, reading output files into extensible time series format, and support for writing Raven input files. The 'RavenR' package is also archived at Chlumsky et al. (2020) <doi:10.5281/zenodo.4248183>. The Raven Hydrologic Modelling Framework method can be referenced with Craig et al. (2020) <doi:10.1016/j.envsoft.2020.104728>.

Maintained by Robert Chlumsky. Last updated 4 months ago.

diagnostics hydrology modeling modelling visualization water water-resources watershed cpp

7.5 match 36 stars 7.06 score 20 scripts

schnorr

starvz:R-Based Visualization Techniques for Task-Based Applications

Performance analysis workflow that combines the power of the R language (and the tidyverse realm) and many auxiliary tools to provide a consistent, flexible, extensible, fast, and versatile framework for the performance analysis of task-based applications that run on top of the StarPU runtime (with its MPI (Message Passing Interface) layer for multi-node support). Its goal is to provide a fruitful prototypical environment to conduct performance analysis hypothesis-checking for task-based applications that run on heterogeneous (multi-GPU, multi-core) multi-node HPC (High-performance computing) platforms.

Maintained by Lucas Leandro Nesi. Last updated 5 months ago.

cpp

10.7 match 13 stars 4.94 score 27 scripts

bioc

CompoundDb:Creating and Using (Chemical) Compound Annotation Databases

CompoundDb provides functionality to create and use (chemical) compound annotation databases from a variety of different sources such as LipidMaps, HMDB, ChEBI or MassBank. The database format also allows MS/MS spectra to be stored along with compound information. The package also provides a backend for Bioconductor's Spectra package and thus allows matching experimental MS/MS spectra against MS/MS spectra in the database. Databases can be stored in SQLite format and are thus portable.

Maintained by Johannes Rainer. Last updated 2 months ago.

massspectrometry metabolomics annotation databases mass-spectrometry

6.1 match 17 stars 8.40 score 69 scripts 1 dependents

bioc

AlphaMissenseR:Accessing AlphaMissense Data Resources in R

The AlphaMissense publication <https://www.science.org/doi/epdf/10.1126/science.adg7492> outlines how a variant of AlphaFold / DeepMind was used to predict missense variant pathogenicity. Supporting data on Zenodo <https://zenodo.org/record/10813168> include, for instance, 71M variants across hg19 and hg38 genome builds. The 'AlphaMissenseR' package allows ready access to the data, downloading individual files to DuckDB databases for exploration and integration into *R* and *Bioconductor* workflows.

Maintained by Martin Morgan. Last updated 5 months ago.

snp annotation functionalgenomics structuralprediction transcriptomics variantannotation geneprediction immunooncology

7.5 match 8 stars 6.86 score 10 scripts

pandora-isomemo

Pandora:Retrieve Data using the API of the 'Pandora' Data Platform

API wrapper that contains functions to retrieve data from the 'Pandora' databases. Web services for API: <https://pandora.earth/>.

Maintained by Jan Abel. Last updated 1 months ago.

12.6 match 4.00 score 2 scripts

bcgov

bcmaps:Map Layers and Spatial Utilities for British Columbia

Various layers of B.C., including administrative boundaries, natural resource management boundaries, census boundaries etc. All layers are available in BC Albers (<https://spatialreference.org/ref/epsg/3005/>) equal-area projection, which is the B.C. government standard. The layers are sourced from the British Columbia and Canadian government under open licenses, including B.C. Data Catalogue (<https://data.gov.bc.ca>), the Government of Canada Open Data Portal (<https://open.canada.ca/en/using-open-data>), and Statistics Canada (<https://www.statcan.gc.ca/en/reference/licence>).

Maintained by Andy Teucher. Last updated 3 months ago.

data-science env

5.8 match 73 stars 8.65 score 254 scripts

cran

bigmemory.sri:A Shared Resource Interface for Bigmemory Project Packages

A shared resource interface for the bigmemory and synchronicity packages.

Maintained by Michael J. Kane. Last updated 1 years ago.

9.4 match 5.21 score 66 dependents

jsta

nhdR:Tools for Working with the National Hydrography Dataset

Tools for working with the National Hydrography Dataset, with functions for querying, downloading, and networking both the NHD <https://www.usgs.gov/national-hydrography> and NHDPlus <https://www.epa.gov/waterdata/nhdplus-national-hydrography-dataset-plus> datasets.

Maintained by Jemma Stachelek. Last updated 2 years ago.

geospatial national-hydrography-dataset nhd water-quality water-resources

7.5 match 38 stars 6.48 score 53 scripts

rstudio

shiny:Web Application Framework for R

Makes it incredibly easy to build interactive web applications with R. Automatic "reactive" binding between inputs and outputs and extensive prebuilt widgets make it possible to build beautiful, responsive, and powerful applications with minimal effort.

Maintained by Winston Chang. Last updated 14 days ago.

reactive rstudio shiny web-app web-development

2.3 match 5.4k stars 21.28 score 108k scripts 1.8k dependents

warwick-stats-resources

warwickplots:Palettes and Themes Consistent with The University of Warwick's Brand

Colour palettes and a 'ggplot2' theme that are consistent with The University of Warwick's branding. Built using the 'palettes' package, which provides methods for printing, formatting, casting and coercion, extraction and updating of components, plotting, colour mixing arithmetic, and colour interpolation.

Maintained by Ella Kaye. Last updated 10 months ago.

13.3 match 1 stars 3.56 score 12 scripts

yihui

knitr:A General-Purpose Package for Dynamic Report Generation in R

Provides a general-purpose tool for dynamic report generation in R using Literate Programming techniques.

Maintained by Yihui Xie. Last updated 9 hours ago.

dynamic-documents knitr literate-programming rmarkdown sweave

2.0 match 2.4k stars 23.61 score 116k scripts 4.2k dependents

r-causal

ggdag:Analyze and Create Elegant Directed Acyclic Graphs

Tidy, analyze, and plot directed acyclic graphs (DAGs). 'ggdag' is built on top of 'dagitty', an R package that uses the 'DAGitty' web tool (<https://dagitty.net/>) for creating and analyzing DAGs. 'ggdag' makes it easy to tidy and plot 'dagitty' objects using 'ggplot2' and 'ggraph', as well as common analytic and graphical functions, such as determining adjustment sets and node relationships.

Maintained by Malcolm Barrett. Last updated 8 months ago.

causal-inference dag ggplot-extension

4.0 match 443 stars 11.78 score 1.8k scripts 5 dependents

cole-brokamp

fr:Frictionless Standards

A "tabular-data-resource" (<https://specs.frictionlessdata.io/tabular-data-resource/>) is a simple format to describe a singular tabular data resource such as a CSV file. It includes support both for metadata such as author and title and a schema to describe the data, for example the types of the fields/columns in the data. Create a tabular-data-resource by providing a data.frame and specifying metadata. Write and read tabular-data-resources to and from disk.

Maintained by Cole Brokamp. Last updated 4 months ago.

8.9 match 3 stars 5.28 score 63 scripts

ropensci

EDIutils:An API Client for the Environmental Data Initiative Repository

A client for the Environmental Data Initiative repository REST API. The 'EDI' data repository <https://portal.edirepository.org/nis/home.jsp> is for publication and reuse of ecological data with emphasis on metadata accuracy and completeness. It is built upon the 'PASTA+' software stack <https://pastaplus-core.readthedocs.io/en/latest/index.html#> and was developed in collaboration with the US 'LTER' Network <https://lternet.edu/>. 'EDIutils' includes functions to search and access existing data, evaluate and upload new data, and assist other data management tasks common to repository users.

Maintained by Colin Smith. Last updated 1 years ago.

ecology eml-metadata open-access open-data research-data-management research-data-repository

7.2 match 10 stars 6.47 score 117 scripts

tudo-r

BatchJobs:Batch Computing with R

Provides Map, Reduce and Filter variants to generate jobs on batch computing systems like PBS/Torque, LSF, SLURM and Sun Grid Engine. Multicore and SSH systems are also supported. For further details see the project web page.

Maintained by Bernd Bischl. Last updated 3 years ago.

5.3 match 85 stars 8.57 score 616 scripts 3 dependents

r-simmer

simmer.bricks:Helper Methods for 'simmer' Trajectories

Provides wrappers for common activity patterns in 'simmer' trajectories.

Maintained by Iñaki Ucar. Last updated 2 years ago.

discrete-event simulation

8.0 match 6 stars 5.64 score 49 scripts 1 dependents

ropensci

fireexposuR:Compute and Visualize Wildfire Exposure

This package computes and visualizes wildfire exposure using the methods documented in a series of scientific publications.

Maintained by Air Forbes. Last updated 20 days ago.

8.5 match 5 stars 5.23 score 4 scripts

rstudio

pins:Pin, Discover, and Share Resources

Publish data sets, models, and other R objects, making it easy to share them across projects and with your colleagues. You can pin objects to a variety of "boards", including local folders (to share on a networked drive or with 'DropBox'), 'Posit Connect', 'AWS S3', and more.

Maintained by Julia Silge. Last updated 1 months ago.

azure gcloud rpins rsconnect s3 storage

3.1 match 321 stars 14.17 score 1.9k scripts 17 dependents

ncss-tech

soilDB:Soil Database Interface

A collection of functions for reading soil data from U.S. Department of Agriculture Natural Resources Conservation Service (USDA-NRCS) and National Cooperative Soil Survey (NCSS) databases.

Maintained by Andrew Brown. Last updated 7 days ago.

kssl nasis nrcs soil soil-data-access soil-survey soilweb sql usda

3.9 match 87 stars 11.34 score 1.0k scripts 1 dependents

mrc-ide

orderly2:Orderly Next Generation

Distributed reproducible computing framework, adopting ideas from git, docker and other software. By defining a lightweight interface around the inputs and outputs of an analysis, a lot of the repetitive work for reproducible research can be automated. We define a simple format for organising and describing work that facilitates collaborative reproducible research and acknowledges that all analyses are run multiple times over their lifespans.

Maintained by Rich FitzJohn. Last updated 2 months ago.

5.3 match 8 stars 8.30 score 49 scripts 2 dependents

vimc

orderly:Lightweight Reproducible Reporting

Order, create and store reports from R. By defining a lightweight interface around the inputs and outputs of an analysis, a lot of the repetitive work for reproducible research can be automated. We define a simple format for organising and describing work that facilitates collaborative reproducible research and acknowledges that all analyses are run multiple times over their lifespans.

Maintained by Rich FitzJohn. Last updated 2 years ago.

4.5 match 117 stars 9.63 score 94 scripts 4 dependents

ouhscbbmc

REDCapR:Interaction Between R and REDCap

Encapsulates functions to streamline calls from R to the REDCap API. REDCap (Research Electronic Data CAPture) is a web application for building and managing online surveys and databases developed at Vanderbilt University. The Application Programming Interface (API) offers an avenue to access and modify data programmatically, improving the capacity for literate and reproducible programming.

Maintained by Will Beasley. Last updated 2 months ago.

redcap redcap-api

3.5 match 118 stars 12.36 score 438 scripts 6 dependents

wlandau

crew:A Distributed Worker Launcher Framework

In computationally demanding analysis projects, statisticians and data scientists asynchronously deploy long-running tasks to distributed systems, ranging from traditional clusters to cloud services. The 'NNG'-powered 'mirai' R package by Gao (2023) <doi:10.5281/zenodo.7912722> is a sleek and sophisticated scheduler that efficiently processes these intense workloads. The 'crew' package extends 'mirai' with a unifying interface for third-party worker launchers. Inspiration also comes from the packages 'future' by Bengtsson (2021) <doi:10.32614/RJ-2021-048>, 'rrq' by FitzJohn and Ashton (2023) <https://github.com/mrc-ide/rrq>, 'clustermq' by Schubert (2019) <doi:10.1093/bioinformatics/btz284>, and 'batchtools' by Lang, Bischl, and Surmann (2017) <doi:10.21105/joss.00135>.

Maintained by William Michael Landau. Last updated 2 days ago.

high-performance-computing

3.8 match 136 stars 11.19 score 243 scripts 2 dependents

sharlagelfand

opendatatoronto:Access the City of Toronto Open Data Portal

Access data from the "City of Toronto Open Data Portal" (<https://open.toronto.ca>) directly from R.

Maintained by Sharla Gelfand. Last updated 3 years ago.

5.7 match 63 stars 7.49 score 486 scripts

cb4ds

periscope:Enterprise Streamlined 'Shiny' Application Framework

An enterprise-targeted scalable and UI-standardized 'shiny' framework including a variety of developer convenience functions, with the goal of both streamlining robust application development and helping to create a consistent user experience regardless of application or developer.

Maintained by Constance Brett. Last updated 2 months ago.

6.0 match 18 stars 7.02 score 73 scripts

tidyverse

lubridate:Make Dealing with Dates a Little Easier

Functions to work with date-times and time-spans: fast and user friendly parsing of date-time data, extraction and updating of components of a date-time (years, months, days, hours, minutes, and seconds), algebraic manipulation on date-time and time-span objects. The 'lubridate' package has a consistent and memorable syntax that makes working with dates easy and fun.

Maintained by Vitalie Spinu. Last updated 3 months ago.

date date-time

2.0 match 757 stars 20.95 score 135k scripts 1.9k dependents

aravind-j

PGRdup:Discover Probable Duplicates in Plant Genetic Resources Collections

Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising information records of each constituent sample. These include methods for cleaning the data, creation of a searchable Key Word in Context (KWIC) index of keywords associated with sample records and the identification of nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.

Maintained by J. Aravind. Last updated 2 years ago.

double-metaphone double-metaphone-algorithm natural-language-processing pgr plant-genetic-resources record-linkage

10.0 match 1 stars 4.06 score 23 scripts

abbvie-external

OmicNavigator:Open-Source Software for 'Omic' Data Analysis and Visualization

A tool for interactive exploration of the results from 'omics' experiments to facilitate novel discoveries from high-throughput biology. The software includes R functions for the 'bioinformatician' to deposit study metadata and the outputs from statistical analyses (e.g. differential expression, enrichment). These results are then exported to an interactive JavaScript dashboard that can be interrogated on the user's local machine or deployed online to be explored by collaborators. The dashboard includes 'sortable' tables, interactive plots including network visualization, and fine-grained filtering based on statistical significance.

Maintained by John Blischak. Last updated 4 days ago.

bioinformatics genomics omics opencpu

5.3 match 34 stars 7.68 score 31 scripts

bioc

BiocHubsShiny:View AnnotationHub and ExperimentHub Resources Interactively

A package that allows interactive exploration of AnnotationHub and ExperimentHub resources. It uses DT / DataTable to display resources for multiple organisms. It provides template code for reproducibility and for downloading resources via the indicated Hub package.

Maintained by Marcel Ramos. Last updated 12 days ago.

software shinyapps

10.2 match 3.90 score 1 scripts

molgenis

DSMolgenisArmadillo:'DataSHIELD' Client for 'MOLGENIS Armadillo'

'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package is the 'DataSHIELD' interface implementation to analyze data shared on a 'MOLGENIS Armadillo' server. 'MOLGENIS Armadillo' is a light-weight 'DataSHIELD' server using a file store and an 'RServe' server.

Maintained by Mariska Slofstra. Last updated 8 months ago.

hacktoberfest

6.0 match 6.54 score 48 scripts

paws-r

paws.management:'Amazon Web Services' Management & Governance Services

Interface to 'Amazon Web Services' management and governance services, including 'CloudWatch' application and infrastructure monitoring, 'Auto Scaling' for automatically scaling resources, and more <https://aws.amazon.com/>.

Maintained by Dyfan Jones. Last updated 4 days ago.

aws aws-sdk

4.3 match 332 stars 9.09 score 1 scripts 15 dependents

pvanlaake

ncdfCF:Easy Access to NetCDF Files with CF Metadata Conventions

Network Common Data Form ('netCDF') files are widely used for scientific data. Library-level access in R is provided through packages 'RNetCDF' and 'ncdf4'. Package 'ncdfCF' is built on top of 'RNetCDF' and makes the data and its attributes available as a set of R6 classes that are informed by the Climate and Forecasting Metadata Conventions. Access to the data uses standard R subsetting operators and common function forms.

Maintained by Patrick Van Laake. Last updated 3 days ago.

7.3 match 5.41 score 4 scripts

robitalec

distanceto:Calculate Distance to Features

Calculates distances from point locations to features. The usual approach for, e.g., resource selection function analyses is to generate a complete distance-to-features surface and then sample it with your observed and random points. Since these raster based approaches can be pretty costly with large areas, and often lead to memory issues in R, the distanceto package opts to compute these distances using efficient, vector based approaches. As a helper, there's a decidedly low-res raster based approach for visually inspecting your region's distance surface. But the workhorse is distance_to.

Maintained by Alec L. Robitaille. Last updated 2 years ago.

animal distance-to ecology resource-selection rsf spatial

8.0 match 5 stars 4.88 score 10 scripts 1 dependents

aggregate-genius

periscope2:Enterprise Streamlined 'shiny' Application Framework Using 'bs4Dash'

A framework for building enterprise, scalable and UI-standardized 'shiny' applications. It brings enhanced features such as 'bootstrap' v4 <https://getbootstrap.com/docs/4.0/getting-started/introduction/>, additional and enhanced 'shiny' modules, customizable UI features, as well as an enhanced application file organization paradigm. This update allows developers to harness the ability to build powerful applications and enriches the 'shiny' developers' experience when building and maintaining applications.

Maintained by Mohammed Ali. Last updated 2 months ago.

periscope shiny

6.0 match 9 stars 6.49 score 34 scripts

epiforecasts

epinowcast:Flexible Hierarchical Nowcasting

Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.

Maintained by Sam Abbott. Last updated 11 months ago.

cmdstanr effective-reproduction-number-estimation epidemiology infectious-disease-surveillance nowcasting outbreak-analysis pandemic-preparedness real-time-infectious-disease-modelling stan

4.9 match 61 stars 7.88 score 65 scripts

epinowcast

epinowcast:Flexible Hierarchical Nowcasting

Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.

Maintained by Sam Abbott. Last updated 11 months ago.

cmdstanr effective-reproduction-number-estimation epidemiology infectious-disease-surveillance nowcasting outbreak-analysis pandemic-preparedness real-time-infectious-disease-modelling stan

4.9 match 61 stars 7.79 score 71 scripts

richfitz

storr:Simple Key Value Stores

Creates and manages simple key-value stores. These can use a variety of approaches for storing the data. This package implements the base methods and support for file system, in-memory and DBI-based database stores.

Maintained by Rich FitzJohn. Last updated 4 years ago.

3.8 match 117 stars 10.21 score 57 scripts 33 dependents

murrayefford

secr:Spatially Explicit Capture-Recapture

Functions to estimate the density and size of a spatially distributed animal population sampled with an array of passive detectors, such as traps, or by searching polygons or transects. Models incorporating distance-dependent detection are fitted by maximizing the likelihood. Tools are included for data manipulation and model selection.

Maintained by Murray Efford. Last updated 3 hours ago.

cpp

3.8 match 3 stars 10.16 score 410 scripts 5 dependents

bioc

rtracklayer:R interface to genome annotation files and the UCSC genome browser

Extensible framework for interacting with multiple genome browsers (currently UCSC built-in) and manipulating annotation tracks in various formats (currently GFF, BED, bedGraph, BED15, WIG, BigWig and 2bit built-in). The user may export/import tracks to/from the supported browsers, as well as query and modify the browser state, such as the current viewport.

Maintained by Michael Lawrence. Last updated 9 days ago.

annotation visualization dataimport zlib openssl curl

3.0 match 12.66 score 6.7k scripts 481 dependents

azure

AzureVM:Virtual Machines in 'Azure'

Functionality for working with virtual machines (VMs) in Microsoft's 'Azure' cloud: <https://azure.microsoft.com/en-us/services/virtual-machines/>. Includes facilities to deploy, startup, shutdown, and cleanly delete VMs and VM clusters. Deployment configurations can be highly customised, and can make use of existing resources as well as creating new ones. A selection of predefined configurations is provided to allow easy deployment of commonly used Linux and Windows images, including Data Science Virtual Machines. With a running VM, execute scripts and install optional extensions. Part of the 'AzureR' family of packages.

Maintained by Hong Ooi. Last updated 2 years ago.

azure azure-sdk-r azure-virtual-machine data-science-virtual-machine

7.4 match 14 stars 5.05 score 16 scripts

ropensci

hydroscoper:Interface to the Greek National Data Bank for Hydrometeorological Information

R interface to the Greek National Data Bank for Hydrological and Meteorological Information. It covers Hydroscope's data sources and provides functions to transliterate, translate and download them into tidy dataframes.

Maintained by Konstantinos Vantas. Last updated 8 months ago.

climate greece hydrology hydrometeorology hydroscope meteorological-data meteorological-stations peer-reviewed tidy-data time-series water-resources

7.5 match 14 stars 4.97 score 33 scripts

canmod

iidda:Processing Infectious Disease Datasets in IIDDA.

Part of an open toolchain for processing infectious disease datasets available through the IIDDA data repository.

Maintained by Steve Walker. Last updated 4 months ago.

6.0 match 6.07 score 133 scripts 3 dependents

cran

istacr:Obtaining Open Data from Instituto Canario De Estadistica (ISTAC) API

Access open data published through the Instituto Canario De Estadistica (ISTAC) APIs at <https://datos.canarias.es/api/estadisticas/>.

Maintained by Alberto Gonzalez. Last updated 2 years ago.

35.5 match 1.00 score

ctmm-initiative

ctmm:Continuous-Time Movement Modeling

Functions for identifying, fitting, and applying continuous-space, continuous-time stochastic-process movement models to animal tracking data. The package is described in Calabrese et al (2016) <doi:10.1111/2041-210X.12559>, with models and methods based on those introduced and detailed in Fleming & Calabrese et al (2014) <doi:10.1086/675504>, Fleming et al (2014) <doi:10.1111/2041-210X.12176>, Fleming et al (2015) <doi:10.1103/PhysRevE.91.032107>, Fleming et al (2015) <doi:10.1890/14-2010.1>, Fleming et al (2016) <doi:10.1890/15-1607>, Péron & Fleming et al (2016) <doi:10.1186/s40462-016-0084-7>, Fleming & Calabrese (2017) <doi:10.1111/2041-210X.12673>, Péron et al (2017) <doi:10.1002/ecm.1260>, Fleming et al (2017) <doi:10.1016/j.ecoinf.2017.04.008>, Fleming et al (2018) <doi:10.1002/eap.1704>, Winner & Noonan et al (2018) <doi:10.1111/2041-210X.13027>, Fleming et al (2019) <doi:10.1111/2041-210X.13270>, Noonan & Fleming et al (2019) <doi:10.1186/s40462-019-0177-1>, Fleming et al (2020) <doi:10.1101/2020.06.12.130195>, Noonan et al (2021) <doi:10.1111/2041-210X.13597>, Fleming et al (2022) <doi:10.1111/2041-210X.13815>, Silva et al (2022) <doi:10.1111/2041-210X.13786>, Alston & Fleming et al (2023) <doi:10.1111/2041-210X.14025>.

Maintained by Christen H. Fleming. Last updated 2 months ago.

3.3 match 49 stars 10.57 score 534 scripts 4 dependents

bupaverse

processmapR:Construct Process Maps Using Event Data

Visualize event logs using directed graphs, i.e. process maps. Part of the 'bupaR' framework.

Maintained by Gert Janssenswillen. Last updated 7 months ago.

cpp

4.5 match 9 stars 7.70 score 169 scripts 3 dependents

bioc

iSEEindex:iSEE extension for a landing page to a custom collection of data sets

This package provides an interface to any collection of data sets within a single iSEE web-application. The main functionality of this package is to define a custom landing page allowing app maintainers to list a custom collection of data sets that users can select from and directly load objects into an iSEE web-application.

Maintained by Kevin Rue-Albrecht. Last updated 5 months ago.

software infrastructure bioconductor hacktoberfest

6.1 match 2 stars 5.65 score 8 scripts

apache

arrow:Integration to 'Apache' 'Arrow'

'Apache' 'Arrow' <https://arrow.apache.org/> is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. This package provides an interface to the 'Arrow C++' library.

Maintained by Jonathan Keane. Last updated 1 month ago.

arrow curl openssl cpp

1.8 match 15k stars 19.22 score 10k scripts 81 dependents

pachadotdev

analogsea:Interface to 'DigitalOcean'

Provides a set of functions for interacting with the 'DigitalOcean' API <https://www.digitalocean.com/>, including creating images, destroying them, rebooting, getting details on regions, and available images.

Maintained by Mauricio Vargas. Last updated 2 years ago.

cloud-computing droplet ssh

4.5 match 159 stars 7.56 score 100 scripts 1 dependents

jamiemkass

ENMeval:Automated Tuning and Evaluations of Ecological Niche Models

Runs ecological niche models over all combinations of user-defined settings (i.e., tuning), performs cross validation to evaluate models, and returns data tables to aid in selection of optimal model settings that balance goodness-of-fit and model complexity. Also has functions to partition data spatially (or not) for cross validation, to plot multiple visualizations of results, to run null models to estimate significance and effect sizes of performance metrics, and to calculate range overlap between model predictions, among others. The package was originally built for Maxent models (Phillips et al. 2006, Phillips et al. 2017), but the current version allows possible extensions for any modeling algorithm. The extensive vignette, which guides users through most package functionality but unfortunately has a file size too big for CRAN, can be found here on the package's Github Pages website: <https://jamiemkass.github.io/ENMeval/articles/ENMeval-2.0-vignette.html>.

Maintained by Jamie M. Kass. Last updated 2 months ago.

3.0 match 49 stars 11.25 score 332 scripts 2 dependents

learnitr

learnitdown:R Markdown, Bookdown and Learnr Additions for Learning Material

Extension to R Markdown, Bookdown and Learnr for building better learning and e-learning material: H5P integration, course-contextual divs, deferred loading of Shiny and learnr applications, and much more ...

Maintained by Philippe Grosjean. Last updated 6 months ago.

bookdown learning-resources r-markdown teaching-materials

7.5 match 13 stars 4.49 score 16 scripts

ipeagit

gtfs2emis:Estimating Public Transport Emissions from General Transit Feed Specification (GTFS) Data

A bottom up model to estimate the emission levels of public transport systems based on General Transit Feed Specification (GTFS) data. The package requires two main inputs: i) Public transport data in the GTFS standard format; and ii) Some basic information on fleet characteristics such as fleet age, technology, fuel and Euro stage. As it stands, the package estimates several pollutants at high spatial and temporal resolutions. Pollution levels can be calculated for specific transport routes, trips, time of the day or for the transport system as a whole. The output with emission estimates can be extracted in different formats, supporting analysis on how emission levels vary across space, time and by fleet characteristics. A full description of the methods used in the 'gtfs2emis' model is presented in Vieira, J. P. B.; Pereira, R. H. M.; Andrade, P. R. (2022) <doi:10.31219/osf.io/8m2cy>.

Maintained by Joao Bazzo. Last updated 2 months ago.

emissions environmental-modelling gtfs public-transport rspatial transport

4.5 match 28 stars 7.47 score 29 scripts

bioc

HIBAG:HLA Genotype Imputation with Attribute Bagging

Imputes HLA classical alleles using GWAS SNP data, and it relies on a training set of HLA and SNP genotypes. HIBAG can be used by researchers with published parameter estimates instead of requiring access to large training sample datasets. It combines the concepts of attribute bagging, an ensemble classifier method, with haplotype inference for SNPs and HLA types. Attribute bagging is a technique which improves the accuracy and stability of classifier ensembles using bootstrap aggregating and random variable selection.

Maintained by Xiuwen Zheng. Last updated 4 months ago.

genetics statisticalmethod bioinformatics gpu hla imputation mhc snp cpp

4.0 match 30 stars 8.24 score 48 scripts

helenkettle

microPop:Process-Based Modelling of Microbial Populations

Modelling interacting microbial populations - example applications include human gut microbiota, rumen microbiota and phytoplankton. Solves a system of ordinary differential equations to simulate microbial growth and resource uptake over time. This version contains network visualisation functions.

Maintained by Helen Kettle. Last updated 3 years ago.

12.5 match 2.64 score 11 scripts

mrcieu

TwoSampleMR:Two Sample MR Functions and Interface to MRC Integrative Epidemiology Unit OpenGWAS Database

A package for performing Mendelian randomization using GWAS summary data. It uses the IEU OpenGWAS database <https://gwas.mrcieu.ac.uk/> to automatically obtain data, and a wide range of methods to run the analysis.

Maintained by Gibran Hemani. Last updated 11 days ago.

2.9 match 467 stars 11.23 score 1.7k scripts 1 dependents

mrc-ide

hipercow:High Performance Computing

Set up cluster environments and jobs. Moo.

Maintained by Rich FitzJohn. Last updated 12 days ago.

5.0 match 1 stars 6.53 score 45 scripts 1 dependents

traitecoevo

APCalign:Resolving Plant Taxon Names Using the Australian Plant Census

The process of resolving taxon names is necessary when working with biodiversity data. 'APCalign' uses the Australian Plant Census (APC) and the Australian Plant Name Index (APNI) to align and update plant taxon names to current, accepted standards. 'APCalign' also supplies information about the established status of plant taxa across different states/territories.

Maintained by Daniel Falster. Last updated 1 month ago.

4.4 match 4 stars 7.30 score 23 scripts 1 dependents

predictiveecology

SpaDES.core:Core Utilities for Developing and Running Spatially Explicit Discrete Event Models

Provides the core framework for a discrete event system to implement a complete data-to-decisions, reproducible workflow. The core components facilitate the development of modular pieces, and enable the user to include additional functionality by running user-built modules. Includes conditional scheduling, restart after interruption, packaging of reusable modules, tools for developing arbitrary automated workflows, automated interweaving of modules of different temporal resolution, and tools for visualizing and understanding the within-project dependencies. The suggested package 'NLMR' can be installed from the repository (<https://PredictiveEcology.r-universe.dev>).

Maintained by Eliot J B McIntire. Last updated 19 days ago.

discrete-events-simulations simulation-framework simulation-modeling

3.0 match 10 stars 10.61 score 142 scripts 6 dependents

zhanxw

seqminer:Efficiently Read Sequence Data (VCF Format, BCF Format, METAL Format and BGEN Format) into R

Integrate sequencing data (Variant call format, e.g. VCF or BCF) or meta-analysis results in R. This package can help you (1) read VCF/BCF/BGEN files by chromosomal ranges (e.g. 1:100-200); (2) read RareMETAL summary statistics files; (3) read tables from tabix-indexed files; (4) annotate VCF/BCF files; (5) create a customized workflow based on a Makefile.

Maintained by Xiaowei Zhan. Last updated 6 months ago.

annotation bcf bgen meta-analysis next-generation-sequencing plink sequencing tabix vcf workflow zlib bzip2 libzstd sqlite3 cpp

3.9 match 30 stars 8.29 score 111 scripts 6 dependents

stibu81

ibawds:Functions and Datasets for the Data Science Course at IBAW

A collection of useful functions and datasets for the Data Science Course at IBAW.

Maintained by Stefan Lanz. Last updated 10 days ago.

data-science-learning educational-resources

7.5 match 2 stars 4.26 score 8 scripts

rstudio

pagedown:Paginate the HTML Output of R Markdown with CSS for Print

Use the paged media properties in CSS and the JavaScript library 'paged.js' to split the content of an HTML document into discrete pages. Each page can have its page size, page numbers, margin boxes, and running headers, etc. Applications of this package include books, letters, reports, papers, business cards, resumes, and posters.

Maintained by Yihui Xie. Last updated 2 months ago.

css html paged-media pdf printing typesetting

2.7 match 909 stars 11.73 score 350 scripts 19 dependents

bioc

AnnotationHubData:Transform public data resources into Bioconductor Data Structures

These recipes convert a wide variety and a growing number of public bioinformatic data sets into easily-used standard Bioconductor data structures.

Maintained by Bioconductor Package Maintainer. Last updated 6 days ago.

dataimport

6.3 match 5.02 score 22 scripts 4 dependents

jmsigner

amt:Animal Movement Tools

Manage and analyze animal movement data. The functionality of 'amt' includes methods to calculate home ranges, track statistics (e.g. step lengths, speed, or turning angles), prepare data for fitting habitat selection analyses, and simulation of space-use from fitted step-selection functions.

Maintained by Johannes Signer. Last updated 4 months ago.

3.0 match 41 stars 10.54 score 418 scripts

r-lib

cleancall:C Resource Cleanup via Exit Handlers

Wrapper of .Call() that runs exit handlers to clean up C resources. Helps managing C (non-R) resources while using the R API.

Maintained by Gábor Csárdi. Last updated 4 months ago.

5.5 match 19 stars 5.53 score 1 scripts 2 dependents

datashield

DSLite:'DataSHIELD' Implementation on Local Datasets

'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This 'DataSHIELD Interface' implementation is for analyzing datasets living in the current R session. The purpose of this is primarily for lightweight 'DataSHIELD' analysis package development.

Maintained by Yannick Marcon. Last updated 2 years ago.

6.0 match 4 stars 5.03 score 53 scripts

bioc

ReUseData:Reusable and reproducible Data Management

ReUseData is an _R/Bioconductor_ software tool to provide a systematic and versatile approach for standardized and reproducible data management. ReUseData facilitates transformation of shell or other ad hoc scripts for data preprocessing into workflow-based data recipes. Evaluation of data recipes generates curated data files in their generic formats (e.g., VCF, bed). Both recipes and data are cached using database infrastructure for easy data management and reuse. Prebuilt data recipes are available through the ReUseData portal ("https://rcwl.org/dataRecipes/") with full annotation and user instructions. Pregenerated data are available through the ReUseData cloud bucket and are directly downloadable through "getCloudData()".

Maintained by Qian Liu. Last updated 5 months ago.

software infrastructure dataimport preprocessing immunooncology

5.6 match 4 stars 5.38 score 7 scripts

critical-infrastructure-systems-lab

reservoir:Tools for Analysis, Design, and Operation of Water Supply Storages

Measure single-storage water supply system performance using resilience, reliability, and vulnerability metrics; assess storage-yield-reliability relationships; determine no-fail storage with sequent peak analysis; optimize release decisions for water supply, hydropower, and multi-objective reservoirs using deterministic and stochastic dynamic programming; generate inflow replicates using parametric and non-parametric models; evaluate inflow persistence using the Hurst coefficient.

Maintained by Sean Turner. Last updated 4 years ago.

hydrology reservoir simulation water-resources

7.5 match 28 stars 4.00 score 18 scripts

bioc

AnVILWorkflow:Run workflows implemented in Terra/AnVIL workspace

The AnVIL is a cloud computing resource developed in part by the National Human Genome Research Institute. The main cloud-based genomics platform supported by the AnVIL project is Terra. The AnVILWorkflow package allows remote access to Terra-implemented workflows, enabling end users to use Terra/AnVIL-provided resources - such as data, workflows, and flexible/scalable computing resources - through conventional R functions.

Maintained by Sehyun Oh. Last updated 28 days ago.

infrastructure software anvil gcp terra workflows

5.0 match 6 stars 6.03 score 1 scripts

tidyverse

googledrive:An Interface to Google Drive

Manage Google Drive files from R.

Maintained by Jennifer Bryan. Last updated 7 months ago.

google-drive

2.0 match 329 stars 14.97 score 2.1k scripts 164 dependents

bioc

XINA:Multiplexes Isobaric Mass Tagged-based Kinetics Data for Network Analysis

The aim of XINA is to determine which proteins exhibit similar patterns within and across experimental conditions, since proteins with co-abundance patterns may have common molecular functions. XINA imports multiple datasets, tags the datasets in silico, and combines the data for subsequent subgrouping into multiple clusters. The result is a single output depicting the variation across all conditions. XINA not only extracts co-abundance profiles within and across experiments, but also incorporates protein-protein interaction databases and integrative resources such as KEGG to infer interactors and molecular functions, respectively, and produces intuitive graphical outputs.

Maintained by Lang Ho Lee. Last updated 5 months ago.

systemsbiology proteomics rnaseq network

6.9 match 4.30 score 3 scripts

bioc

GenomeInfoDb:Utilities for manipulating chromosome names, including modifying them to follow a particular naming style

Contains data and functions that define and allow translation between different chromosome sequence naming conventions (e.g., "chr1" versus "1"), including a function that attempts to place sequence names in their natural, rather than lexicographic, order.

Maintained by Hervé Pagès. Last updated 2 months ago.

genetics datarepresentation annotation genomeannotation bioconductor-package core-package

1.8 match 32 stars 16.46 score 1.3k scripts 1.7k dependents

bioc

cBioPortalData:Exposes and Makes Available Data from the cBioPortal Web Resources

The cBioPortalData R package accesses study datasets from the cBio Cancer Genomics Portal. It accesses the data either from the pre-packaged zip / tar files or from the API interface that was recently implemented by the cBioPortal Data Team. The package can provide data in either tabular format or with MultiAssayExperiment object that uses familiar Bioconductor data representations.

Maintained by Marcel Ramos. Last updated 10 days ago.

software infrastructure thirdpartyclient bioconductor-package nci-itcr u24ca289073

2.9 match 33 stars 10.15 score 147 scripts 4 dependents

mapme-initiative

mapme.biodiversity:Efficient Monitoring of Global Biodiversity Portfolios

Biodiversity areas, especially primary forest, serve a multitude of functions for the local economy, the regional functionality of ecosystems, and the global health of our planet. Recently, adverse changes in human land use practices and climatic responses to increased greenhouse gas emissions have put these biodiversity areas under a variety of threats. The present package helps to analyse a number of biodiversity indicators based on freely available geographical datasets. It supports computationally efficient routines that allow the analysis of potentially global biodiversity portfolios. The primary use case of the package is to support evidence-based reporting of an organization's effort to protect biodiversity areas under threat and to identify regions where intervention is most needed.

Maintained by Darius A. Görgen. Last updated 3 months ago.

environment eo gis mapme spatial sustainability

3.1 match 35 stars 9.24 score 287 scripts

bioc

rsbml:R support for SBML, using libsbml

Links R to libsbml for SBML parsing, validating output, provides an S4 SBML DOM, converts SBML to R graph objects. Optionally links to the SBML ODE Solver Library (SOSLib) for simulating models.

Maintained by Michael Lawrence. Last updated 18 days ago.

graphandnetwork pathways network libsbml cpp

6.0 match 4.71 score 19 scripts 1 dependents

tidymodels

corrr:Correlations in R

A tool for exploring correlations. It makes it possible to easily perform routine tasks when exploring correlation matrices such as ignoring the diagonal, focusing on the correlations of certain variables against others, or rearranging and visualizing the matrix in terms of the strength of the correlations.

Maintained by Max Kuhn. Last updated 1 year ago.

2.0 match 593 stars 13.82 score 2.9k scripts 7 dependents

quicklizard99

cheddar:Analysis and Visualisation of Ecological Communities

Provides a flexible, extendable representation of an ecological community and a range of functions for analysis and visualisation, focusing on food web, body mass and numerical abundance data. Allows inter-web comparisons such as examining changes in community structure over environmental, temporal or spatial gradients.

Maintained by Lawrence Hudson. Last updated 8 months ago.

cpp

4.0 match 15 stars 6.86 score 195 scripts

bioc

GenomicScores:Infrastructure to work with genomewide position-specific scores

Provide infrastructure to store and access genomewide position-specific scores within R and Bioconductor.

Maintained by Robert Castelo. Last updated 1 month ago.

infrastructure genetics annotation sequencing coverage annotationhubsoftware

3.1 match 8 stars 8.71 score 83 scripts 6 dependents

jiefei-wang

aws.ecx:Communicating with AWS EC2 and ECS using AWS REST APIs

Provides functions for communicating with Amazon Web Services (AWS) Elastic Compute Cloud (EC2) and Elastic Container Service (ECS). The functions have the prefix 'ecs_' or 'ec2_' depending on the class of the API. Requests are sent via the REST API, with parameters given by the function arguments. The credentials can be set via 'aws_set_credentials'. The EC2 documentation can be found at <https://docs.aws.amazon.com/AWSEC2/latest/APIReference/Welcome.html> and the ECS documentation at <https://docs.aws.amazon.com/AmazonECS/latest/APIReference/Welcome.html>.

Maintained by Jiefei Wang. Last updated 3 years ago.

ec2 ecs ecs-functions

6.5 match 1 stars 4.18 score 2 scripts

frbcesab

rcompendium:Create a Package or Research Compendium Structure

Eases the creation of an R package or research compendium (i.e. a predefined files/folders structure) so that users can focus on the code/analysis instead of wasting time organizing files. A full ready-to-work structure is set up with some additional features: version control, remote repository creation, CI/CD configuration (check package integrity under several OS, test code with 'testthat', and build and deploy website using 'pkgdown'). This package heavily relies on the R packages 'devtools' and 'usethis' and follows recommendations made by Wickham H. (2015) <ISBN:9781491910597> and Marwick B. et al. (2018) <doi:10.7287/peerj.preprints.3192v2>.

Maintained by Nicolas Casajus. Last updated 1 month ago.

reproducible-research research-compendium

4.0 match 40 stars 6.72 score 22 scripts

bioc

CAGEr:Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining

The _CAGEr_ package identifies transcription start sites (TSS) and their usage frequency from CAGE (Cap Analysis Gene Expression) sequencing data. It normalises raw CAGE tag count, clusters TSSs into tag clusters (TC) and aggregates them across multiple CAGE experiments to construct consensus clusters (CC) representing the promoterome. CAGEr provides functions to profile expression levels of these clusters by cumulative expression and rarefaction analysis, and outputs the plots in ggplot2 format for further facetting and customisation. After clustering, CAGEr performs analyses of promoter width and detects differential usage of TSSs (promoter shifting) between samples. CAGEr also exports its data as genome browser tracks, and as R objects for downstream expression analysis by other Bioconductor packages such as DESeq2, CAGEfightR, or seqArchR.

Maintained by Charles Plessy. Last updated 5 months ago.

preprocessing sequencing normalization functionalgenomics transcription geneexpression clustering visualization

4.4 match 6.12 score 73 scripts

kaneplusplus

bigmemory:Manage Massive Matrices with Shared Memory and Memory-Mapped Files

Create, store, access, and manipulate massive matrices. Matrices are allocated to shared memory and may use memory-mapped files. Packages 'biganalytics', 'bigtabulate', 'synchronicity', and 'bigalgebra' provide advanced functionality.

Maintained by Michael J. Kane. Last updated 1 year ago.

cpp

2.3 match 127 stars 11.87 score 920 scripts 64 dependents

wadpac

GGIR:Raw Accelerometer Data Analysis

A tool to process and analyse data collected with wearable raw acceleration sensors as described in Migueles and colleagues (JMPB 2019), and van Hees and colleagues (JApplPhysiol 2014; PLoSONE 2015). The package has been developed and tested for binary data from 'GENEActiv' <https://activinsights.com/>, binary (.gt3x) and .csv-export data from 'Actigraph' <https://theactigraph.com> devices, and binary (.cwa) and .csv-export data from 'Axivity' <https://axivity.com>. These devices are currently widely used in research on human daily physical activity. Further, the package can handle accelerometer data files from any other sensor brand, provided that the data are stored in csv format. The package also allows for external function embedding.

Maintained by Vincent T van Hees. Last updated 3 days ago.

accelerometer activity-recognition circadian-rhythm movement-sensor sleep

2.0 match 109 stars 13.20 score 342 scripts 3 dependents

rstudio

bookdown:Authoring Books and Technical Documents with R Markdown

Output formats and utilities for authoring books and technical documents with R Markdown.

Maintained by Yihui Xie. Last updated 2 days ago.

book bookdown epub gitbook html latex rmarkdown

1.5 match 3.9k stars 17.51 score 1.7k scripts 136 dependents

carpentries

sandpaper:Create and Curate Carpentries Lessons

We provide tools to build a Carpentries-themed lesson repository into an accessible standalone static website. These include local tools and those designed to be used in a continuous integration context so that all the lesson author needs to focus on is writing the content of the actual lesson.

Maintained by Robert Davey. Last updated 2 months ago.

carpentries carpentries-infrastructure carpentries-workbench lesson-template lessons markdown static-site-generator

3.3 match 44 stars 7.72 score 8 scripts

rstudio

promises:Abstractions for Promise-Based Asynchronous Programming

Provides fundamental abstractions for doing asynchronous programming in R using promises. Asynchronous programming is useful for allowing a single R process to orchestrate multiple tasks in the background while also attending to something else. Semantics are similar to 'JavaScript' promises, but with a syntax that is idiomatic R.

Maintained by Joe Cheng. Last updated 1 month ago.

cpp

1.5 match 204 stars 17.10 score 688 scripts 2.6k dependents

lumenlearning

rise:Conduct RISE Analysis

Implements techniques for educational resource inspection, selection, and evaluation (RISE) described in Bodily, Nyland, and Wiley (2017) <doi:10.19173/irrodl.v18i2.2952>. Automates the process of identifying learning materials that are not effectively supporting student learning in technology-mediated courses by synthesizing information about access to course content and performance on assessments.

Maintained by David Wiley. Last updated 6 years ago.

continuous-improvement learning-analytics open-educational-resources

7.2 match 7 stars 3.54 score 7 scripts

bioc

SNPRelate:Parallel Computing Toolset for Relatedness and Principal Component Analysis of SNP Data

Genome-wide association studies (GWAS) are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed an R package SNPRelate to provide a binary format for single-nucleotide polymorphism (SNP) data in GWAS utilizing CoreArray Genomic Data Structure (GDS) data files. The GDS format offers the efficient operations specifically designed for integers with two bits, since a SNP could occupy only two bits. SNPRelate is also designed to accelerate two key computations on SNP data using parallel computing for multi-core symmetric multiprocessing computer architectures: Principal Component Analysis (PCA) and relatedness analysis using Identity-By-Descent measures. The SNP GDS format is also used by the GWASTools package with the support of S4 classes and generic functions. The extended GDS format is implemented in the SeqArray package to support the storage of single nucleotide variations (SNVs), insertion/deletion polymorphism (indel) and structural variation calls in whole-genome and whole-exome variant data.

Maintained by Xiuwen Zheng. Last updated 5 months ago.

infrastructure genetics statisticalmethod principalcomponent bioinformatics gds-format pca simd snp openblas cpp

2.0 match 104 stars 12.69 score 1.6k scripts 18 dependents

aravind-j

EvaluateCore:Quality Evaluation of Core Collections

Implements various quality evaluation statistics to assess the value of plant germplasm core collections using qualitative and quantitative phenotypic trait data according to Odong et al. (2015) <doi:10.1007/s00122-012-1971-y>.

Maintained by J. Aravind. Last updated 7 days ago.

core-collections core-evaluation genebank germplasm pgr plant-genetic-resources

6.7 match 1 star 3.80 score 21 scripts

famuvie

breedR:Statistical Methods for Forest Genetic Resources Analysts

Statistical tools to build predictive models for the breeders community. It aims to assess the genetic value of individuals under a number of situations, including spatial autocorrelation, genetic/environment interaction and competition. It is under active development as part of the Trees4Future project and was developed with forest genetic trials in mind, but it can be used for animals or other situations as well.

Maintained by Facundo Muñoz. Last updated 8 months ago.

4.6 match 33 stars 5.44 score 24 scripts

roelandkindt

BiodiversityR:Package for Community Ecology and Suitability Analysis

Graphical User Interface (via the R-Commander) and utility functions (often based on the vegan package) for statistical analysis of biodiversity and ecological communities, including species accumulation curves, diversity indices, Renyi profiles, GLMs for analysis of species abundance and presence-absence, distance matrices, Mantel tests, and cluster, constrained and unconstrained ordination analysis. A book on biodiversity and community ecology analysis is available for free download from the website. In 2012, methods for (ensemble) suitability modelling and mapping were expanded in the package.

Maintained by Roeland Kindt. Last updated 2 months ago.

3.3 match 16 stars 7.42 score 390 scripts 2 dependents

ropensci

stplanr:Sustainable Transport Planning

Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. The package was originally developed to support the 'Propensity to Cycle Tool', a publicly available strategic cycle network planning tool (Lovelace et al. 2017) <doi:10.5198/jtlu.2016.862>, but has since been extended to support public transport routing and accessibility analysis (Moreno-Monroy et al. 2017) <doi:10.1016/j.jtrangeo.2017.08.012> and routing with locally hosted routing engines such as 'OSRM' (Lowans et al. 2023) <doi:10.1016/j.enconman.2023.117337>. The main functions are for creating and manipulating geographic "desire lines" from origin-destination (OD) data (building on the 'od' package); calculating routes on the transport network locally and via interfaces to routing services such as <https://cyclestreets.net/> (Desjardins et al. 2021) <doi:10.1007/s11116-021-10197-1>; and calculating route segment attributes such as bearing. The package implements the 'travel flow aggregation' method described in Morgan and Lovelace (2020) <doi:10.1177/2399808320942779> and the 'OD jittering' method described in Lovelace et al. (2022) <doi:10.32866/001c.33873>. Further information on the package's aim and scope can be found in the vignettes and in a paper in the R Journal (Lovelace and Ellison 2018) <doi:10.32614/RJ-2018-053>, and in a paper outlining the landscape of open source software for geographic methods in transport planning (Lovelace, 2021) <doi:10.1007/s10109-020-00342-2>.

Maintained by Robin Lovelace. Last updated 7 months ago.

cycle cycling desire-lines origin-destination peer-reviewed pubic-transport route-network routes routing spatial transport transport-planning transportation walking

2.0 match 427 stars 12.31 score 684 scripts 3 dependents

ropensci

weatherOz:An API Client for Australian Weather and Climate Data Resources

Provides automated downloading, parsing and formatting of weather data for Australia through API endpoints provided by the Department of Primary Industries and Regional Development ('DPIRD') of Western Australia and by the Science and Technology Division of the Queensland Government's Department of Environment and Science ('DES'). It also retrieves precis and coastal forecasts from the Bureau of Meteorology ('BOM') of the Australian government, and downloads and imports radar and satellite imagery files. 'DPIRD' weather data are accessed through public 'APIs' provided by 'DPIRD', <https://www.agric.wa.gov.au/weather-api-20>, providing access to weather station data from the 'DPIRD' weather station network. Australia-wide weather data are based on data from the Australian Bureau of Meteorology ('BOM') and accessed through 'SILO' (Scientific Information for Land Owners) Jeffrey et al. (2001) <doi:10.1016/S1364-8152(01)00008-1>. 'DPIRD' data are made available under a Creative Commons Attribution 3.0 Licence (CC BY 3.0 AU) <https://creativecommons.org/licenses/by/3.0/au/deed.en>. SILO data are released under a Creative Commons Attribution 4.0 International licence (CC BY 4.0) <https://creativecommons.org/licenses/by/4.0/>. 'BOM' data are (c) Australian Government Bureau of Meteorology and released under a Creative Commons (CC) Attribution 3.0 licence or Public Access Licence ('PAL') as appropriate, see <http://www.bom.gov.au/other/copyright.shtml> for further details.

Maintained by Rodrigo Pires. Last updated 20 days ago.

dpird bom meteorological-data weather-forecast australia weather weather-data meteorology western-australia australia-bureau-of-meteorology western-australia-agriculture australia-agriculture australia-climate australia-weather api-client climate data rainfall weather-api

2.9 match 32 stars 8.54 score 40 scripts

r-forge

tm:Text Mining Package

A framework for text mining applications within R.

Maintained by Kurt Hornik. Last updated 26 days ago.

cpp

1.9 match 12.96 score 14k scripts 101 dependents

rstudio

packrat:A Dependency Management System for Projects and their R Package Dependencies

Manage the R packages your project depends on in an isolated, portable, and reproducible way.

Maintained by Aron Atkins. Last updated 2 months ago.

2.0 match 406 stars 12.15 score 256 scripts 9 dependents

datashield

DSOpal:'DataSHIELD' Implementation for 'Opal'

'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package is the 'DataSHIELD' interface implementation for 'Opal', which is the data integration application for biobanks by 'OBiBa'. Participant data, once collected from any data source, must be integrated and stored in a central data repository under a uniform model. 'Opal' is such a central repository. It can import, process, validate, query, analyze, report, and export data. 'Opal' is the reference implementation of the 'DataSHIELD' infrastructure.

Maintained by Yannick Marcon. Last updated 2 years ago.

6.3 match 3.85 score 141 scripts

nmfs-ost

asar:Build NOAA Stock Assessment Report

Build a full or update stock assessment report for any stock assessment model. Parameterization allows the user to call a template based on their regional science center, species, area, etc.

Maintained by Samantha Schiano. Last updated 7 days ago.

latex quarto stock-assessment-reports

3.5 match 21 stars 6.87 score 3 scripts

cran

sas7bdat:sas7bdat Reverse Engineering Documentation

Documentation and prototypes for the earliest (circa 2010) open-source effort to reverse engineer the sas7bdat file format. The package includes a prototype reader for sas7bdat files. However, newer packages may contain more robust readers for sas7bdat files.

Maintained by Matt Shotwell. Last updated 7 months ago.

3.8 match 4 stars 6.29 score 500 scripts 4 dependents

kylegrealis

froggeR:Enhance 'Quarto' Project Workflows and Standards

Streamlines 'Quarto' workflows by providing tools for consistent project setup and documentation. Enables portability through reusable metadata, automated project structure creation, and standardized templates. Features include enhanced project initialization, pre-formatted 'Quarto' documents, comprehensive data protection settings, custom styling, and structured documentation generation. Designed to improve efficiency and collaboration in R data science projects by reducing repetitive setup tasks while maintaining consistent formatting across multiple documents. There are many valuable resources providing in-depth explanations of customizing 'Quarto' templates and theme styling by the Posit team: <https://quarto.org/docs/output-formats/html-themes.html#customizing-themes> & <https://quarto.org/docs/output-formats/html-themes-more.html>, and at the Bootstrap community's GitHub at <https://github.com/twbs/bootstrap/blob/main/scss/_variables.scss>.

Maintained by Kyle Grealis. Last updated 12 hours ago.

data-science project-management quarto

3.5 match 26 stars 6.67 score 6 scripts

bioc

Organism.dplyr:dplyr-based Access to Bioconductor Annotation Resources

This package provides an alternative interface to Bioconductor 'annotation' resources, in particular the gene identifier mapping functionality of the 'org' packages (e.g., org.Hs.eg.db) and the genome coordinate functionality of the 'TxDb' packages (e.g., TxDb.Hsapiens.UCSC.hg38.knownGene).

Maintained by Martin Morgan. Last updated 5 months ago.

annotation sequencing genomeannotation bioconductor-package core-package

3.4 match 3 stars 6.77 score 63 scripts 1 dependents

selesnow

rgoogleads:Loading Data from 'Google Ads API'

Interface for loading data from 'Google Ads API', see <https://developers.google.com/google-ads/api/docs/start>. The package provides functions for authorization and loading reports.

Maintained by Alexey Seleznev. Last updated 2 months ago.

3.6 match 14 stars 6.40 score 15 scripts 1 dependents

robjhyndman

tsfeatures:Time Series Feature Extraction

Methods for extracting various features from time series data. The features provided are those from Hyndman, Wang and Laptev (2013) <doi:10.1109/ICDMW.2015.104>, Kang, Hyndman and Smith-Miles (2017) <doi:10.1016/j.ijforecast.2016.09.004> and from Fulcher, Little and Jones (2013) <doi:10.1098/rsif.2013.0048>. Features include spectral entropy, autocorrelations, measures of the strength of seasonality and trend, and so on. Users can also define their own feature functions.

Maintained by Rob Hyndman. Last updated 8 months ago.

feature-extraction time-series

2.0 match 254 stars 11.47 score 268 scripts 22 dependents

bioc

TFutils:TFutils

This package helps users to work with TF metadata from various sources. Significant catalogs of TFs and classifications thereof are made available. Tools for working with motif scans are also provided.

Maintained by Vincent Carey. Last updated 4 months ago.

transcriptomics

4.7 match 4.80 score 21 scripts

thibautjombart

adegenet:Exploratory Analysis of Genetic and Genomic Data

Toolset for the exploration of genetic and genomic data. Adegenet provides formal (S4) classes for storing and handling various genetic data, including genetic markers with varying ploidy and hierarchical population structure ('genind' class), alleles counts by populations ('genpop'), and genome-wide SNP data ('genlight'). It also implements original multivariate methods (DAPC, sPCA), graphics, statistical tests, simulation tools, distance and similarity measures, and several spatial methods. A range of both empirical and simulated datasets is also provided to illustrate various methods.

Maintained by Zhian N. Kamvar. Last updated 1 month ago.

1.8 match 182 stars 12.60 score 1.9k scripts 29 dependents

bioc

OrganismDbi:Software to enable the smooth interfacing of different database packages

The package enables a simple unified interface to several annotation packages, each of which has its own schema, by taking advantage of the fact that each of these packages implements a select method.

Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.

annotation infrastructure

3.0 match 7.45 score 34 scripts 35 dependents

bioc

MGnifyR:R interface to EBI MGnify metagenomics resource

Utility package to facilitate integration and analysis of EBI MGnify data in R. The package can be used, for instance, to import microbial data into TreeSummarizedExperiment (TreeSE). In TreeSE format, the data is directly compatible with the miaverse framework.

Maintained by Tuomas Borman. Last updated 5 months ago.

infrastructure dataimport metagenomics

2.9 match 21 stars 7.61 score 32 scripts

ipums

ipumsr:An R Interface for Downloading, Reading, and Handling IPUMS Data

An easy way to work with census, survey, and geographic data provided by IPUMS in R. Generate and download data through the IPUMS API and load IPUMS files into R with their associated metadata to make analysis easier. IPUMS data describing 1.4 billion individuals drawn from over 750 censuses and surveys is available free of charge from the IPUMS website <https://www.ipums.org>.

Maintained by Derek Burk. Last updated 19 days ago.

2.0 match 28 stars 11.07 score 720 scripts 2 dependents

bioc

SpliceWiz:interactive analysis and visualization of alternative splicing in R

The analysis and visualization of alternative splicing (AS) events from RNA sequencing data remains challenging. SpliceWiz is a user-friendly and performance-optimized R package for AS analysis that processes alignment BAM files to quantify read counts across splice junctions, performs IRFinder-based intron retention quantitation, and supports novel splicing event identification. We introduce a novel visualization for AS using normalized coverage, thereby allowing visualization of differential AS across conditions. SpliceWiz features a shiny-based GUI facilitating interactive data exploration of results including gene ontology enrichment. It is performance optimized with multi-threaded processing of BAM files and a new COV file format for fast recall of sequencing coverage. Overall, SpliceWiz streamlines AS analysis, enabling reliable identification of functionally relevant AS events for further characterization.

Maintained by Alex Chit Hei Wong. Last updated 4 days ago.

software transcriptomics rnaseq alternativesplicing coverage differentialsplicing differentialexpression gui sequencing cpp openmp

3.4 match 16 stars 6.41 score 8 scripts

paws-r

paws.security.identity:'Amazon Web Services' Security, Identity, & Compliance Services

Interface to 'Amazon Web Services' security, identity, and compliance services, including the 'Identity & Access Management' ('IAM') service for managing access to services and resources, and more <https://aws.amazon.com/>.

Maintained by Dyfan Jones. Last updated 4 days ago.

aws aws-sdk

2.4 match 332 stars 9.17 score 15 dependents

shikokuchuo

mirai:Minimalist Async Evaluation Framework for R

Designed for simplicity, a 'mirai' evaluates an R expression asynchronously in a parallel process, locally or distributed over the network. The result is automatically available upon completion. Modern networking and concurrency, built on 'nanonext' and 'NNG' (Nanomsg Next Gen), ensures reliable and efficient scheduling over fast inter-process communications or TCP/IP secured by TLS. Distributed computing can launch remote resources via SSH or cluster managers. An inherently queued architecture handles many more tasks than available processes, and requires no storage on the file system. Innovative features include support for otherwise non-exportable reference objects, event-driven promises, and asynchronous parallel map.

Maintained by Charlie Gao. Last updated 3 days ago.

async asynchronous-tasks concurrency distributed-computing high-performance-computing parallel-computing

1.8 match 217 stars 11.94 score 130 scripts 7 dependents

imangr

frostr:R API to MET Norway's 'Frost' API

An R API to MET Norway's 'Frost' API <https://frost.met.no/index.html> to retrieve data as data frames. The 'Frost' API, and the underlying data, is made available by the Norwegian Meteorological Institute (MET Norway). The data and products are distributed under the Norwegian License for Open Data 2.0 (NLOD) <https://data.norge.no/nlod/en/2.0> and Creative Commons 4.0 <https://creativecommons.org/licenses/by/4.0/>.

Maintained by Iman Ghayoornia. Last updated 5 years ago.

frost frost-api norway norwegian-data norwegian-weather-data weather-api weather-data

6.9 match 3 stars 3.18 score 4 scripts

chgrl

bReeze:Functions for Wind Resource Assessment

A collection of functions to analyse, visualize and interpret wind data and to calculate the potential energy production of wind turbines.

Maintained by Christian Graul. Last updated 1 year ago.

5.0 match 20 stars 4.34 score 22 scripts

jhudsl

ottrpal:Companion Tools for Open-Source Tools for Training Resources (OTTR)

Tools for converting Open-Source Tools for Training Resources (OTTR) courses into Leanpub or Coursera courses. 'ottrpal' is for use with the OTTR Template repository to create courses.

Maintained by Candace Savonen. Last updated 14 days ago.

edtech-software

3.3 match 3 stars 6.50 score 10 scripts 1 dependents

azure

AzureVision:Interface to Azure Computer Vision Services

An interface to 'Azure Computer Vision' <https://docs.microsoft.com/azure/cognitive-services/Computer-vision/Home> and 'Azure Custom Vision' <https://docs.microsoft.com/azure/cognitive-services/custom-vision-service/home>, building on the low-level functionality provided by the 'AzureCognitive' package. These services allow users to leverage the cloud to carry out visual recognition tasks using advanced image processing models, without needing powerful hardware of their own. Part of the 'AzureR' family of packages.

Maintained by Hong Ooi. Last updated 4 years ago.

azure-cognitive-services azure-sdk-r computer-vision custom-vision

4.3 match 5 stars 5.00 score 8 scripts

azure

AzureCognitive:Interface to Azure Cognitive Services

An interface to Azure Cognitive Services <https://docs.microsoft.com/en-us/azure/cognitive-services/>. Both an 'Azure Resource Manager' interface, for deploying Cognitive Services resources, and a client framework are supplied. While 'AzureCognitive' can be called by the end-user, it is meant to provide a foundation for other packages that will support specific services, like Computer Vision, Custom Vision, language translation, and so on. Part of the 'AzureR' family of packages.

Maintained by Hong Ooi. Last updated 4 years ago.

azure-cognitive-services azure-sdk-r

3.9 match 11 stars 5.52 score 4 scripts 1 dependents

jimbrig

rtraining:R Training Resources, Guides, Tips, and Knowledge Base

Houses various material related to teaching R.

Maintained by Jimmy Briggs. Last updated 2 years ago.

best-practices curation developer-tools development development-environment guide knowledge package-development setup shiny-apps tips-and-tricks training training-materials walkthrough

6.0 match 4 stars 3.60 score 6 scripts

ivaughan

econullnetr:Null Model Analysis for Ecological Networks

Tools for using null models to analyse ecological networks (e.g. food webs, flower-visitation networks, seed-dispersal networks) and detect resource preferences or non-random interactions among network nodes. Tools are provided to run null models, test for and plot preferences, plot and analyse bipartite networks, and export null model results in a form compatible with other network analysis packages. The underlying null model was developed by Agusti et al. (2003) Molecular Ecology <doi:10.1046/j.1365-294X.2003.02014.x> and the full application to ecological networks by Vaughan et al. (2018) econullnetr: an R package using null models to analyse the structure of ecological networks and identify resource selection. Methods in Ecology & Evolution, <doi:10.1111/2041-210X.12907>.

Maintained by Ian Vaughan. Last updated 4 years ago.

4.3 match 7 stars 5.04 score 31 scripts

ropensci

bowerbird:Keep a Collection of Sparkly Data Resources

Tools to get and maintain a data repository from third-party data providers.

Maintained by Ben Raymond. Last updated 5 days ago.

ropensci antarctic southern ocean data environmental satellite climate peer-reviewed

3.0 match 50 stars 7.16 score 16 scripts 1 dependents

bioc

miaSim:Microbiome Data Simulation

Microbiome time series simulation with generalized Lotka-Volterra model, Self-Organized Instability (SOI), and other models. Hubbell's Neutral model is used to determine the abundance matrix. The resulting abundance matrix is applied to (Tree)SummarizedExperiment objects.

Maintained by Yagmur Simsek. Last updated 5 months ago.

microbiome software sequencing dnaseq atacseq coverage network

3.2 match 21 stars 6.64 score 23 scripts

ropensci

webchem:Chemical Information from the Web

Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.

Maintained by Tamás Stirling. Last updated 3 months ago.

cas-number chemical-information chemspider identifier ropensci webscraping

2.0 match 165 stars 10.31 score 173 scripts 10 dependents

azure

AzureStor:Storage Management in 'Azure'

Manage storage in Microsoft's 'Azure' cloud: <https://azure.microsoft.com/en-us/product-categories/storage/>. On the admin side, 'AzureStor' includes features to create, modify and delete storage accounts. On the client side, it includes an interface to blob storage, file storage, and 'Azure Data Lake Storage Gen2': upload and download files and blobs; list containers and files/blobs; create containers; and so on. Authenticated access to storage is supported, via either a shared access key or a shared access signature (SAS). Part of the 'AzureR' family of packages.

Maintained by Hong Ooi. Last updated 2 years ago.

azure-data-lake azure-sdk-r azure-storage azure-storage-blob azure-storage-file

1.9 match 65 stars 10.74 score 298 scripts 4 dependents

mages

ChainLadder:Statistical Methods and Models for Claims Reserving in General Insurance

Various statistical methods and models which are typically used for the estimation of outstanding claims reserves in general insurance, including those to estimate the claims development result as required under Solvency II.

Maintained by Markus Gesmann. Last updated 1 month ago.

2.0 match 82 stars 10.04 score 196 scripts 2 dependents

miferreiro

bdpar:Big Data Preprocessing Architecture

Provides a tool to easily build customized data flows to pre-process large volumes of information from different sources. To this end, 'bdpar' allows users to (i) easily use and create new functionalities and (ii) develop new data source extractors according to their needs. Additionally, the package provides by default a predefined data flow to extract and pre-process the most relevant information (tokens, dates, ... ) from some textual sources (SMS, Email, YouTube comments).

Maintained by Miguel Ferreiro-Díaz. Last updated 1 year ago.

custom-flow custom-pipes preprocessing r6

3.8 match 8 stars 5.23 score 14 scripts

statmanrobin

Stat2Data:Datasets for Stat2

Datasets for the textbook Stat2: Modeling with Regression and ANOVA (second edition). The package also includes data for the first edition, Stat2: Building Models for a World of Data and a few functions for plotting diagnostics.

Maintained by Robin Lock. Last updated 6 years ago.

4.0 match 5 stars 4.94 score 544 scripts

bioc

igvR:igvR: integrative genomics viewer

Access to igv.js, the Integrative Genomics Viewer running in a web browser.

Maintained by Arkadiusz Gladki. Last updated 5 months ago.

visualization thirdpartyclient genomebrowsers

2.4 match 43 stars 8.31 score 118 scripts

stan-dev

rstantools:Tools for Developing R Packages Interfacing with 'Stan'

Provides various tools for developers of R packages interfacing with 'Stan' <https://mc-stan.org>, including functions to set up the required package structure, S3 generics and default methods to unify function naming across 'Stan'-based R packages, and vignettes with recommendations for developers.

Maintained by Jonah Gabry. Last updated 2 months ago.

bayesian-data-analysis bayesian-statistics developer-tools stan

1.5 match 50 stars 13.09 score 134 scripts 222 dependents

scholaempirica

reschola:The Schola Empirica Package

A collection of utilities, themes and templates for data analysis at Schola Empirica.

Maintained by Jan Netík. Last updated 5 months ago.

4.0 match 4 stars 4.83 score 14 scripts

yihui

xaringan:Presentation Ninja

Create HTML5 slides with R Markdown and the JavaScript library 'remark.js' (<https://remarkjs.com>).

Maintained by Yihui Xie. Last updated 12 months ago.

markdown naruto ninja presentation presentation-ninja remarkjs rmarkdown rstudio slideshow

1.5 match 1.5k stars 12.78 score 948 scripts 11 dependents

markedmondson1234

googleAuthR:Authenticate and Create Google APIs

Create R functions that interact with OAuth2 Google APIs <https://developers.google.com/apis-explorer/> easily, with auto-refresh and Shiny compatibility.

Maintained by Erik Grönroos. Last updated 10 months ago.

api authentication google googleauthr oauth2-flow shiny

1.5 match 178 stars 12.84 score 804 scripts 13 dependents

ncss-tech

SoilTaxonomy:A System of Soil Classification for Making and Interpreting Soil Surveys

Taxonomic dictionaries, formative element lists, and functions related to the maintenance, development and application of U.S. Soil Taxonomy. Data and functionality are based on official U.S. Department of Agriculture sources including the latest edition of the Keys to Soil Taxonomy. Descriptions and metadata are obtained from the National Soil Information System or Soil Survey Geographic databases. Other sources are referenced in the data documentation. Provides tools for understanding and interacting with concepts in the U.S. Soil Taxonomic System. Most of the current utilities are for working with taxonomic concepts at the "higher" taxonomic levels: Order, Suborder, Great Group, and Subgroup.

Maintained by Andrew Brown. Last updated 6 months ago.

great-group ncss-tech soil soil-survey soil-taxonomy subgroup suborder usda

3.4 match 15 stars 5.65 score

bioc

xcms:LC-MS and GC-MS Data Analysis

Framework for processing and visualization of chromatographically separated and single-spectra mass spectral data. Imports from AIA/ANDI NetCDF, mzXML, mzData and mzML files. Preprocesses data for high-throughput, untargeted analyte profiling.

Maintained by Steffen Neumann. Last updated 3 days ago.

immunooncology massspectrometry metabolomics bioconductor feature-detection mass-spectrometry peak-detection cpp

1.3 match 196 stars 14.31 score 984 scripts 11 dependents

inrae

airGRiwrm:'airGR' Integrated Water Resource Management

Semi-distributed Precipitation-Runoff Modeling based on 'airGR' package models integrating human infrastructures and their managements.

Maintained by David Dorchies. Last updated 6 months ago.

3.0 match 6.34 score 45 scripts

christopherkenny

bskyr:Interact with 'Bluesky' Social

Collect data from and make posts on 'Bluesky' Social via the Hypertext Transfer Protocol (HTTP) Application Programming Interface (API), as documented at <https://atproto.com/specs/xrpc>. This further supports broader queries to the Authenticated Transfer (AT) Protocol <https://atproto.com/> which 'Bluesky' Social relies on. Data is returned in a tidy format and posts can be made using a simple interface.

Maintained by Christopher T. Kenny. Last updated 1 month ago.

atproto bluesky

3.3 match 20 stars 5.66 score 23 scripts

jbengler

tidyplots:Tidy Plots for Scientific Papers

The goal of 'tidyplots' is to streamline the creation of publication-ready plots for scientific papers. It allows users to gradually add, remove and adjust plot components using a consistent and intuitive syntax.

Maintained by Jan Broder Engler. Last updated 4 days ago.

2.0 match 482 stars 9.40 score 85 scripts

bioc

EnrichmentBrowser:Seamless navigation through combined results of set-based and network-based enrichment analysis

The EnrichmentBrowser package implements essential functionality for the enrichment analysis of gene expression data. The analysis combines the advantages of set-based and network-based enrichment analysis in order to derive high-confidence gene sets and biological pathways that are differentially regulated in the expression data under investigation. Besides, the package facilitates the visualization and exploration of such sets and pathways.

Maintained by Ludwig Geistlinger. Last updated 5 months ago.

immunooncology microarray rnaseq geneexpression differentialexpression pathways graphandnetwork network genesetenrichment networkenrichment visualization reportwriting

2.0 match 20 stars 9.37 score 164 scripts 3 dependents