Showing 200 of total 610 results (show query)
obiba
resourcer:Resource Resolver
A resource represents some data or a computation unit. It is described by a URL and credentials. This package proposes a Resource model with "resolver" and "client" classes to facilitate the access and the usage of the resources.
Maintained by Yannick Marcon. Last updated 2 years ago.
160.8 match 2 stars 5.10 score 42 scripts 1 dependentspolar-fhir
fhircrackr:Handling HL7 FHIR® Resources in R
Useful tools for conveniently downloading FHIR resources in xml format and converting them to R data.frames. The package uses FHIR-search to download bundles from a FHIR server, provides functions to save and read xml-files containing such bundles and allows flattening the bundles to data.frames using XPath expressions. FHIR® is the registered trademark of HL7 and is used with the permission of HL7. Use of the FHIR trademark does not constitute endorsement of this product by HL7.
Maintained by Julia Palm. Last updated 12 days ago.
56.9 match 33 stars 7.63 score 46 scriptsropensci
targets:Dynamic Function-Oriented 'Make'-Like Declarative Pipelines
Pipeline tools coordinate the pieces of computationally demanding analysis projects. The 'targets' package is a 'Make'-like pipeline tool for statistics and data science in R. The package skips costly runtime for tasks that are already up to date, orchestrates the necessary computation with implicit parallel computing, and abstracts files as R objects. If all the current output matches the current upstream code and data, then the whole pipeline is up to date, and the results are more trustworthy than otherwise. The methodology in this package borrows from GNU 'Make' (2015, ISBN:978-9881443519) and 'drake' (2018, <doi:10.21105/joss.00550>).
Maintained by William Michael Landau. Last updated 3 days ago.
data-sciencehigh-performance-computingmakepeer-reviewedpipeliner-targetopiareproducibilityreproducible-researchtargetsworkflow
25.2 match 973 stars 15.20 score 4.6k scripts 22 dependentsobiba
opalr:'Opal' Data Repository Client and 'DataSHIELD' Utils
Data integration Web application for biobanks by 'OBiBa'. 'Opal' is the core database application for biobanks. Participant data, once collected from any data source, must be integrated and stored in a central data repository under a uniform model. 'Opal' is such a central repository. It can import, process, validate, query, analyze, report, and export data. 'Opal' is typically used in a research center to analyze the data acquired at assessment centres. Its ultimate purpose is to achieve seamless data-sharing among biobanks. This 'Opal' client allows to interact with 'Opal' web services and to perform operations on the R server side. 'DataSHIELD' administration tools are also provided.
Maintained by Yannick Marcon. Last updated 2 months ago.
46.8 match 3 stars 7.76 score 179 scripts 2 dependentsbioc
OmnipathR:OmniPath web service client and more
A client for the OmniPath web service (https://www.omnipathdb.org) and many other resources. It also includes functions to transform and pretty print some of the downloaded data, functions to access a number of other resources such as BioPlex, ConsensusPathDB, EVEX, Gene Ontology, Guide to Pharmacology (IUPHAR/BPS), Harmonizome, HTRIdb, Human Phenotype Ontology, InWeb InBioMap, KEGG Pathway, Pathway Commons, Ramilowski et al. 2015, RegNetwork, ReMap, TF census, TRRUST and Vinayagam et al. 2011. Furthermore, OmnipathR features a close integration with the NicheNet method for ligand activity prediction from transcriptomics data, and its R implementation `nichenetr` (available only on github).
Maintained by Denes Turei. Last updated 19 days ago.
graphandnetworknetworkpathwayssoftwarethirdpartyclientdataimportdatarepresentationgenesignalinggeneregulationsystemsbiologytranscriptomicssinglecellannotationkeggcomplexesenzyme-ptmnetworksnetworks-biologyomnipathproteinsquarto
32.8 match 126 stars 9.90 score 226 scripts 2 dependentscloudyr
googleComputeEngineR:R Interface with Google Compute Engine
Interact with the 'Google Compute Engine' API in R. Lets you create, start and stop instances in the 'Google Cloud'. Support for preconfigured instances, with templates for common R needs.
Maintained by Mark Edmondson. Last updated 1 days ago.
apicloud-computingcloudyrgoogle-cloudgoogleauthrlaunching-virtual-machines
25.1 match 152 stars 9.73 score 235 scriptsazure
AzureRMR:Interface to 'Azure Resource Manager'
A lightweight but powerful R interface to the 'Azure Resource Manager' REST API. The package exposes a comprehensive class framework and related tools for creating, updating and deleting 'Azure' resource groups, resources and templates. While 'AzureRMR' can be used to manage any 'Azure' service, it can also be extended by other packages to provide extra functionality for specific services. Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 1 years ago.
azureazure-resource-managerazure-sdk-rcloud
24.3 match 20 stars 9.94 score 51 scripts 12 dependentsusepa
ctxR:Utilities for Interacting with the 'CTX' APIs
Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <https://www.epa.gov/comptox-tools/computational-toxicology-and-exposure-apis>. 'ctxR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <https://comptox.epa.gov/dashboard/> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <https://www.epa.gov/comptox-tools>.
Maintained by Paul Kruse. Last updated 2 months ago.
29.8 match 10 stars 8.02 score 13 scripts 1 dependentsalisonlanski
IPEDSuploadables:Transforms Institutional Data into Text Files for IPEDS Automated Import/Upload
Starting from user-supplied institutional data, these scripts transform, aggregate, and reshape the information to produce key-value pair data files that are able to be uploaded to IPEDS (Integrated Postsecondary Education Data System) through their submission portal <https://surveys.nces.ed.gov/ipeds/>. Starting data specifications can be found in the vignettes. Final files are saved locally to a location of the user's choice. User-friendly readable files can also be produced for purposes of data review and validation.
Maintained by Alison Lanski. Last updated 3 months ago.
30.5 match 8 stars 7.05 score 39 scriptsr-simmer
simmer:Discrete-Event Simulation for R
A process-oriented and trajectory-based Discrete-Event Simulation (DES) package for R. It is designed as a generic yet powerful framework. The architecture encloses a robust and fast simulation core written in 'C++' with automatic monitoring capabilities. It provides a rich and flexible R API that revolves around the concept of trajectory, a common path in the simulation model for entities of the same type. Documentation about 'simmer' is provided by several vignettes included in this package, via the paper by Ucar, Smeets & Azcorra (2019, <doi:10.18637/jss.v090.i02>), and the paper by Ucar, Hernández, Serrano & Azcorra (2018, <doi:10.1109/MCOM.2018.1700960>); see 'citation("simmer")' for details.
Maintained by Iñaki Ucar. Last updated 6 months ago.
18.7 match 223 stars 11.47 score 440 scripts 6 dependentsdjvanderlaan
datapackage:Creating and Reading Data Packages
Open, read data from and modify Data Packages. Data Packages are an open standard for bundling and describing data sets (<https://datapackage.org>). When data is read from a Data Package care is taken to convert the data as much a possible to R appropriate data types. The package can be extended with plugins for additional data types.
Maintained by Jan van der Laan. Last updated 7 days ago.
36.3 match 2 stars 5.62 scoreemf-creaf
indicspecies:Relationship Between Species and Groups of Sites
Functions to assess the strength and statistical significance of the relationship between species occurrence/abundance and groups of sites [De Caceres & Legendre (2009) <doi:10.1890/08-1823.1>]. Also includes functions to measure species niche breadth using resource categories [De Caceres et al. (2011) <doi:10.1111/J.1600-0706.2011.19679.x>].
Maintained by Miquel De Cáceres. Last updated 25 days ago.
19.9 match 10 stars 9.49 score 386 scripts 4 dependentsbupaverse
edeaR:Exploratory and Descriptive Event-Based Data Analysis
Exploratory and descriptive analysis of event based data. Provides methods for describing and selecting process data, and for preparing event log data for process mining. Builds on the S3-class for event logs implemented in the package 'bupaR'.
Maintained by Gert Janssenswillen. Last updated 4 months ago.
20.3 match 12 stars 9.17 score 149 scripts 8 dependentsusepa
ccdR:Utilities for Interacting with the 'CTX' APIs
Access chemical, hazard, bioactivity, and exposure data from the Computational Toxicology and Exposure ('CTX') APIs <https://api-ccte.epa.gov/docs/>. 'ccdR' was developed to streamline the process of accessing the information available through the 'CTX' APIs without requiring prior knowledge of how to use APIs. Most data is also available on the CompTox Chemical Dashboard ('CCD') <https://comptox.epa.gov/dashboard/> and other resources found at the EPA Computational Toxicology and Exposure Online Resources <https://www.epa.gov/comptox-tools>.
Maintained by Paul Kruse. Last updated 8 months ago.
28.4 match 2 stars 6.38 score 7 scriptsropensci
frictionless:Read and Write Frictionless Data Packages
Read and write Frictionless Data Packages. A 'Data Package' (<https://specs.frictionlessdata.io/data-package/>) is a simple container format and standard to describe and package a collection of (tabular) data. It is typically used to publish FAIR (<https://www.go-fair.org/fair-principles/>) and open datasets.
Maintained by Peter Desmet. Last updated 6 months ago.
18.3 match 30 stars 9.79 score 55 scripts 6 dependentsbupaverse
bupaR:Business Process Analysis in R
Comprehensive Business Process Analysis toolkit. Creates S3-class for event log objects, and related handler functions. Imports related packages for filtering event data, computation of descriptive statistics, handling of 'Petri Net' objects and visualization of process maps. See also packages 'edeaR','processmapR', 'eventdataR' and 'processmonitR'.
Maintained by Gert Janssenswillen. Last updated 2 years ago.
19.6 match 55 stars 9.07 score 389 scripts 11 dependentshzambran
hydroTSM:Time Series Management and Analysis for Hydrological Modelling
S3 functions for management, analysis, interpolation and plotting of time series used in hydrology and related environmental sciences. In particular, this package is highly oriented to hydrological modelling tasks. The focus of this package has been put in providing a collection of tools useful for the daily work of hydrologists (although an effort was made to optimise each function as much as possible, functionality has had priority over speed). Bugs / comments / questions / collaboration of any kind are very welcomed, and in particular, datasets that can be included in this package for academic purposes.
Maintained by Mauricio Zambrano-Bigiarini. Last updated 1 months ago.
hydrologyhydrology-modelinghydrology-statisticalresourcewater-resources
17.5 match 45 stars 10.14 score 340 scripts 10 dependentsbioc
AnnotationHub:Client to access AnnotationHub resources
This package provides a client for the Bioconductor AnnotationHub web resource. The AnnotationHub web resource provides a central location where genomic files (e.g., VCF, bed, wig) and other resources from standard locations (e.g., UCSC, Ensembl) can be discovered. The resource includes metadata about each resource, e.g., a textual description, tags, and date of modification. The client creates and manages a local cache of files retrieved by the user, helping with quick and reproducible access.
Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.
infrastructuredataimportguithirdpartyclientcore-packageu24ca289073
11.7 match 17 stars 13.89 score 2.7k scripts 102 dependentsbioc
BiocFileCache:Manage Files Across Sessions
This package creates a persistent on-disk cache of files that the user can add, update, and retrieve. It is useful for managing resources (such as custom Txdb objects) that are costly or difficult to create, web resources, and data files used across sessions.
Maintained by Lori Shepherd. Last updated 2 months ago.
dataimportcore-packageu24ca289073
11.5 match 13 stars 13.76 score 486 scripts 429 dependentsropensci
ckanr:Client for the Comprehensive Knowledge Archive Network ('CKAN') API
Client for 'CKAN' API (<https://ckan.org/>). Includes interface to 'CKAN' 'APIs' for search, list, show for packages, organizations, and resources. In addition, provides an interface to the 'datastore' API.
Maintained by Francisco Alves. Last updated 2 years ago.
databaseopen-datackanapidatadatasetapi-wrapperckan-api
17.4 match 100 stars 8.67 score 448 scripts 4 dependentsmolgenis
MolgenisArmadillo:Armadillo Client for the Armadillo Service
A set of functions to manage data shared on a 'MOLGENIS Armadillo' server.
Maintained by Mariska Slofstra. Last updated 17 days ago.
19.2 match 3 stars 7.51 score 28 scriptssizespectrum
mizer:Dynamic Multi-Species Size Spectrum Modelling
A set of classes and methods to set up and run multi-species, trait based and community size spectrum ecological models, focused on the marine environment.
Maintained by Gustav Delius. Last updated 2 months ago.
ecosystem-modelfish-population-dynamicsfisheriesfisheries-managementmarine-ecosystempopulation-dynamicssimulationsize-structurespecies-interactionstransport-equationcpp
15.1 match 38 stars 9.43 score 207 scriptstiledb-inc
tiledb:Modern Database Engine for Complex Data Based on Multi-Dimensional Arrays
The modern database 'TileDB' introduces a powerful on-disk format for storing and accessing any complex data based on multi-dimensional arrays. It supports dense and sparse arrays, dataframes and key-values stores, cloud storage ('S3', 'GCS', 'Azure'), chunked arrays, multiple compression, encryption and checksum filters, uses a fully multi-threaded implementation, supports parallel I/O, data versioning ('time travel'), metadata and groups. It is implemented as an embeddable cross-platform C++ library with APIs from several languages, and integrations. This package provides the R support.
Maintained by Isaiah Norton. Last updated 5 days ago.
arrayhdfss3storage-managertiledbcpp
10.7 match 107 stars 11.96 score 306 scripts 4 dependentskenaho1
asbio:A Collection of Statistical Tools for Biologists
Contains functions from: Aho, K. (2014) Foundational and Applied Statistics for Biologists using R. CRC/Taylor and Francis, Boca Raton, FL, ISBN: 978-1-4398-7338-0.
Maintained by Ken Aho. Last updated 2 months ago.
16.9 match 5 stars 7.32 score 310 scripts 3 dependentsdataoneorg
dataone:R Interface to the DataONE REST API
Provides read and write access to data and metadata from the DataONE network <https://www.dataone.org> of data repositories. Each DataONE repository implements a consistent repository application programming interface. Users call methods in R to access these remote repository functions, such as methods to query the metadata catalog, get access to metadata for particular data packages, and read the data objects from the data repository. Users can also insert and update data objects on repositories that support these methods.
Maintained by Matthew B. Jones. Last updated 3 years ago.
12.4 match 36 stars 9.93 score 472 scripts 3 dependentsobiba
s3.resourcer:S3 Resource Resolver
A S3 resource is provided by Amazon Web Services S3 or a S3-compatible object store (such as Minio). The resource can be a tidy file to be downloaded from the object store, or a data lake (such as Delta Lake) Parquet file to be read by Apache Spark.
Maintained by Yannick Marcon. Last updated 2 months ago.
45.2 match 2.70 score 3 scriptsdatashield
DSI:'DataSHIELD' Interface
'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package defines the API that is to be implemented by 'DataSHIELD' compliant data repositories.
Maintained by Yannick Marcon. Last updated 4 months ago.
16.9 match 2 stars 7.01 score 106 scripts 4 dependentszizroc
villager:A Framework for Designing and Running Agent Based Models
This is a package for creating and running Agent Based Models (ABM). It provides a set of base classes with core functionality to allow bootstrapped models. For more intensive modeling, the supplied classes can be extended to fit researcher needs.
Maintained by Thomas Thelen. Last updated 9 months ago.
abmagent-based-modelingsimulation
16.2 match 57 stars 6.79 score 18 scriptsmodeloriented
DALEX:moDel Agnostic Language for Exploration and eXplanation
Any unverified black box model is the path to failure. Opaqueness leads to distrust. Distrust leads to ignoration. Ignoration leads to rejection. DALEX package xrays any model and helps to explore and explain its behaviour. Machine Learning (ML) models are widely used and have various applications in classification or regression. Models created with boosting, bagging, stacking or similar techniques are often used due to their high performance. But such black-box models usually lack direct interpretability. DALEX package contains various methods that help to understand the link between input variables and model output. Implemented methods help to explore the model on the level of a single instance as well as a level of the whole dataset. All model explainers are model agnostic and can be compared across different models. DALEX package is the cornerstone for 'DrWhy.AI' universe of packages for visual model exploration. Find more details in (Biecek 2018) <https://jmlr.org/papers/v19/18-416.html>.
Maintained by Przemyslaw Biecek. Last updated 1 months ago.
black-boxdalexdata-scienceexplainable-aiexplainable-artificial-intelligenceexplainable-mlexplanationsexplanatory-model-analysisfairnessimlinterpretabilityinterpretable-machine-learningmachine-learningmodel-visualizationpredictive-modelingresponsible-airesponsible-mlxai
8.0 match 1.4k stars 13.40 score 876 scripts 21 dependentsropensci
redland:RDF Library Bindings in R
Provides methods to parse, query and serialize information stored in the Resource Description Framework (RDF). RDF is described at <https://www.w3.org/TR/rdf-primer/>. This package supports RDF by implementing an R interface to the Redland RDF C library, described at <https://librdf.org/docs/api/index.html>. In brief, RDF provides a structured graph consisting of Statements composed of Subject, Predicate, and Object Nodes.
Maintained by Matthew B. Jones. Last updated 1 years ago.
13.0 match 17 stars 7.85 score 98 scripts 13 dependentsbioc
ontoProc:processing of ontologies of anatomy, cell lines, and so on
Support harvesting of diverse bioinformatic ontologies, making particular use of the ontologyIndex package on CRAN. We provide snapshots of key ontologies for terms about cells, cell lines, chemical compounds, and anatomy, to help analyze genome-scale experiments, particularly cell x compound screens. Another purpose is to strengthen development of compelling use cases for richer interfaces to emerging ontologies.
Maintained by Vincent Carey. Last updated 4 days ago.
infrastructuregobioinformaticsgenomicsontology
15.0 match 3 stars 6.37 score 75 scripts 2 dependentscivisanalytics
civis:R Client for the 'Civis Platform API'
A convenient interface for making requests directly to the 'Civis Platform API' <https://www.civisanalytics.com/platform/>. Full documentation available 'here' <https://civisanalytics.github.io/civis-r/>.
Maintained by Peter Cooman. Last updated 2 months ago.
11.1 match 16 stars 7.84 score 144 scriptspaws-r
paws:Amazon Web Services Software Development Kit
Interface to Amazon Web Services <https://aws.amazon.com>, including storage, database, and compute services, such as 'Simple Storage Service' ('S3'), 'DynamoDB' 'NoSQL' database, and 'Lambda' functions-as-a-service.
Maintained by Dyfan Jones. Last updated 4 days ago.
7.7 match 332 stars 11.25 score 177 scripts 12 dependentsbioc
RITAN:Rapid Integration of Term Annotation and Network resources
Tools for comprehensive gene set enrichment and extraction of multi-resource high confidence subnetworks. RITAN facilitates bioinformatic tasks for enabling network biology research.
Maintained by Michael Zimmermann. Last updated 5 months ago.
qualitycontrolnetworknetworkenrichmentnetworkinferencegenesetenrichmentfunctionalgenomicsgraphandnetwork
15.9 match 5.40 score 9 scriptsmrc-ide
naomi.resources:Data dependencies for Naomi output generation
Makes data for Naomi output generation as an R package.
Maintained by Rachel Esra. Last updated 1 years ago.
39.4 match 2.18 score 1 scriptsr-dbi
DBI:R Database Interface
A database interface definition for communication between R and relational database management systems. All classes in this package are virtual and need to be extended by the various R/DBMS implementations.
Maintained by Kirill Müller. Last updated 3 months ago.
4.0 match 302 stars 20.88 score 19k scripts 2.9k dependentsshevandrin
rqti:Create Tests According to QTI 2.1 Standard
Create tests and tasks compliant with the Question & Test Interoperability (QTI) information model version 2.1. Input sources are Rmd/md description files or S4-class objects. Output formats include standalone zip or xml files. Supports the generation of basic task types (single and multiple choice, order, pair association, matching tables, filling gaps and essay) and provides a comprehensive set of attributes for customizing tests.
Maintained by Andrey Shevandrin. Last updated 4 days ago.
14.0 match 5 stars 5.89 score 26 scriptsbioc
ExperimentHub:Client to access ExperimentHub resources
This package provides a client for the Bioconductor ExperimentHub web resource. ExperimentHub provides a central location where curated data from experiments, publications or training courses can be accessed. Each resource has associated metadata, tags and date of modification. The client creates and manages a local cache of files retrieved enabling quick and reproducible access.
Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.
infrastructuredataimportguithirdpartyclientcore-packageu24ca289073
6.9 match 9 stars 11.98 score 764 scripts 55 dependentsazure
azuremlsdk:Interface to the 'Azure Machine Learning' 'SDK'
Interface to the 'Azure Machine Learning' Software Development Kit ('SDK'). Data scientists can use the 'SDK' to train, deploy, automate, and manage machine learning models on the 'Azure Machine Learning' service. To learn more about 'Azure Machine Learning' visit the website: <https://docs.microsoft.com/en-us/azure/machine-learning/service/overview-what-is-azure-ml>.
Maintained by Diondra Peck. Last updated 3 years ago.
amlcomputeazureazure-machine-learningazuremldsimachine-learningrstudiosdk-r
9.0 match 106 stars 8.91 score 221 scriptsrstudio
rmarkdown:Dynamic Documents for R
Convert R Markdown documents into a variety of formats.
Maintained by Yihui Xie. Last updated 4 months ago.
literate-programmingmarkdownpandocrmarkdown
3.7 match 2.9k stars 21.79 score 14k scripts 3.7k dependentsmikejohnson51
climateR:climateR
Find, subset, and retrive geospatial data by AOI.
Maintained by Mike Johnson. Last updated 3 months ago.
aoiclimatedatasetgeospatialgridded-climate-dataweather
9.1 match 187 stars 8.74 score 156 scripts 1 dependentsconfoobio
GMSE:Generalised Management Strategy Evaluation Simulator
Integrates game theory and ecological theory to construct social-ecological models that simulate the management of populations and stakeholder actions. These models build off of a previously developed management strategy evaluation (MSE) framework to simulate all aspects of management: population dynamics, manager observation of populations, manager decision making, and stakeholder responses to management decisions. The newly developed generalised management strategy evaluation (GMSE) framework uses genetic algorithms to mimic the decision-making process of managers and stakeholders under conditions of change, uncertainty, and conflict. Simulations can be run using gmse(), gmse_apply(), and gmse_gui() functions.
Maintained by A. Bradley Duthie. Last updated 3 years ago.
adaptive-managementagricultural-modellingconflictconflict-resolutionconservationecological-modellingecological-modelsecologyfood-securitygame-theorygenetic-algorithmgenetic-algorithmsmanagement-decisionsmanagement-strategy-evaluationpopulation-modelsimulationwildlife-management
14.6 match 10 stars 5.43 score 178 scriptsbioc
decoupleR:decoupleR: Ensemble of computational methods to infer biological activities from omics data
Many methods allow us to extract biological activities from omics data using information from prior knowledge resources, reducing the dimensionality for increased statistical power and better interpretability. Here, we present decoupleR, a Bioconductor package containing different statistical methods to extract these signatures within a unified framework. decoupleR allows the user to flexibly test any method with any resource. It incorporates methods that take into account the sign and weight of network interactions. decoupleR can be used with any omic, as long as its features can be linked to a biological process based on prior knowledge. For example, in transcriptomics gene sets regulated by a transcription factor, or in phospho-proteomics phosphosites that are targeted by a kinase.
Maintained by Pau Badia-i-Mompel. Last updated 5 months ago.
differentialexpressionfunctionalgenomicsgeneexpressiongeneregulationnetworksoftwarestatisticalmethodtranscription
6.9 match 230 stars 11.27 score 316 scripts 3 dependentsropensci
SymbiotaR2:Downloading Data from Symbiota2 Portals into R
Download data from Symbiota2 portals using Symbiota's API. Covers the Checklists, Collections, Crowdsource, Exsiccati, Glossary, ImageProcessor, Key, Media, Occurrence, Reference, Taxa, Traits, and UserRoles API families. Each Symbiota2 portal owner can load their own plugins (and modified code), and so this package may not cover every possible API endpoint from a given Symbiota2 instance.
Maintained by Austin Koontz. Last updated 3 years ago.
databaselibraryspecimen-recordssymbiotasymbiota2symbiota2-portal
23.4 match 2 stars 3.30 score 4 scriptshneth
riskyr:Rendering Risk Literacy more Transparent
Risk-related information (like the prevalence of conditions, the sensitivity and specificity of diagnostic tests, or the effectiveness of interventions or treatments) can be expressed in terms of frequencies or probabilities. By providing a toolbox of corresponding metrics and representations, 'riskyr' computes, translates, and visualizes risk-related information in a variety of ways. Adopting multiple complementary perspectives provides insights into the interplay between key parameters and renders teaching and training programs on risk literacy more transparent.
Maintained by Hansjoerg Neth. Last updated 10 months ago.
2x2-matrixbayesian-inferencecontingency-tablerepresentationriskrisk-literacyvisualization
10.0 match 19 stars 7.36 score 80 scriptsropensci
datapack:A Flexible Container to Transport and Manipulate Data and Associated Resources
Provides a flexible container to transport and manipulate complex sets of data. These data may consist of multiple data files and associated meta data and ancillary files. Individual data objects have associated system level meta data, and data files are linked together using the OAI-ORE standard resource map which describes the relationships between the files. The OAI- ORE standard is described at <https://www.openarchives.org/ore/>. Data packages can be serialized and transported as structured files that have been created following the BagIt specification. The BagIt specification is described at <https://tools.ietf.org/html/draft-kunze-bagit-08>.
Maintained by Matthew B. Jones. Last updated 3 years ago.
8.5 match 44 stars 8.56 score 195 scripts 4 dependentsbioc
SeqArray:Data Management of Large-Scale Whole-Genome Sequence Variant Calls
Data management of large-scale whole-genome sequencing variant calls with thousands of individuals: genotypic data (e.g., SNVs, indels and structural variation calls) and annotations in SeqArray GDS files are stored in an array-oriented and compressed manner, with efficient data access using the R programming language.
Maintained by Xiuwen Zheng. Last updated 10 days ago.
infrastructuredatarepresentationsequencinggeneticsbioinformaticsgds-formatsnpsnvweswgscpp
6.0 match 45 stars 12.08 score 1.1k scripts 9 dependentseblondel
geosapi:GeoServer REST API R Interface
Provides an R interface to the GeoServer REST API, allowing to upload and publish data in a GeoServer web-application and expose data to OGC Web-Services. The package currently supports all CRUD (Create,Read,Update,Delete) operations on GeoServer workspaces, namespaces, datastores (stores of vector data), featuretypes, layers, styles, as well as vector data upload operations. For more information about the GeoServer REST API, see <https://docs.geoserver.org/stable/en/user/rest/>.
Maintained by Emmanuel Blondel. Last updated 15 days ago.
apigeoservergispublicationrestspatial
11.6 match 34 stars 6.23 score 33 scriptsropensci
tidyhydat:Extract and Tidy Canadian 'Hydrometric' Data
Provides functions to access historical and real-time national 'hydrometric' data from Water Survey of Canada data sources (<https://dd.weather.gc.ca/hydrometric/csv/> and <https://collaboration.cmc.ec.gc.ca/cmc/hydrometrics/www/>) and then applies tidy data principles.
Maintained by Sam Albers. Last updated 5 days ago.
citzgovernment-datahydrologyhydrometricstidy-datawater-resources
7.5 match 71 stars 9.59 score 202 scripts 3 dependentsthinkr-open
golem:A Framework for Robust Shiny Applications
An opinionated framework for building a production-ready 'Shiny' application. This package contains a series of tools for building a robust 'Shiny' application from start to finish.
Maintained by Colin Fay. Last updated 7 months ago.
golemversehacktoberfestshinyshiny-appsshiny-rshinyapps
5.0 match 921 stars 14.23 score 167 scripts 62 dependentshneth
unikn:Graphical Elements of the University of Konstanz's Corporate Design
Define and use graphical elements of corporate design manuals in R. The 'unikn' package provides color functions (by defining dedicated colors and color palettes, and commands for finding, changing, viewing, and using them) and styled text elements (e.g., for marking, underlining, or plotting colored titles). The pre-defined range of colors and text decoration functions is based on the corporate design of the University of Konstanz <https://www.uni-konstanz.de/>, but can be adapted and extended for other purposes or institutions.
Maintained by Hansjoerg Neth. Last updated 3 months ago.
brandingcolorcolor-palettecolorschemecorporate-designpalettetext-decorationuniversity-colorsvisual-identity
8.0 match 39 stars 8.82 score 156 scripts 2 dependentsdalekube
hR:Better Data Engineering in Human Resources
Methods for data engineering in the human resources (HR) corporate domain. Designed for HR analytics practitioners and workforce-oriented data sets.
Maintained by Dale Kube. Last updated 14 hours ago.
analyticsdatadata-engineeringdata-sciencehuman-resources
13.9 match 21 stars 5.02 score 8 scriptsbioc
BiocIO:Standard Input and Output for Bioconductor Packages
The `BiocIO` package contains high-level abstract classes and generics used by developers to build IO funcionality within the Bioconductor suite of packages. Implements `import()` and `export()` standard generics for importing and exporting biological data formats. `import()` supports whole-file as well as chunk-wise iterative import. The `import()` interface optionally provides a standard mechanism for 'lazy' access via `filter()` (on row or element-like components of the file resource), `select()` (on column-like components of the file resource) and `collect()`. The `import()` interface optionally provides transparent access to remote (e.g. via https) as well as local access. Developers can register a file extension, e.g., `.loom` for dispatch from character-based URIs to specific `import()` / `export()` methods based on classes representing file types, e.g., `LoomFile()`.
Maintained by Marcel Ramos. Last updated 4 months ago.
annotationdataimportbioconductor-packagecore-package
6.8 match 1 stars 10.20 score 19 scripts 487 dependentsprojectmosaic
mosaic:Project MOSAIC Statistics and Mathematics Teaching Utilities
Data sets and utilities from Project MOSAIC (<http://www.mosaic-web.org>) used to teach mathematics, statistics, computation and modeling. Funded by the NSF, Project MOSAIC is a community of educators working to tie together aspects of quantitative work that students in science, technology, engineering and mathematics will need in their professional lives, but which are usually taught in isolation, if at all.
Maintained by Randall Pruim. Last updated 1 years ago.
5.1 match 93 stars 13.32 score 7.2k scripts 7 dependentsbcgov
bcdata:Search and Retrieve Data from the BC Data Catalogue
Search, query, and download tabular and 'geospatial' data from the British Columbia Data Catalogue (<https://catalogue.data.gov.bc.ca/>). Search catalogue data records based on keywords, data licence, sector, data format, and B.C. government organization. View metadata directly in R, download many data formats, and query 'geospatial' data available via the B.C. government Web Feature Service ('WFS') using 'dplyr' syntax.
Maintained by Andy Teucher. Last updated 1 months ago.
6.6 match 83 stars 10.29 score 186 scripts 4 dependentsrstudio
rstudioapi:Safely Access the RStudio API
Access the RStudio API (if available) and provide informative error messages when it's not.
Maintained by Kevin Ushey. Last updated 4 months ago.
3.6 match 172 stars 18.81 score 3.6k scripts 2.1k dependentsikosmidis
brglm2:Bias Reduction in Generalized Linear Models
Estimation and inference from generalized linear models based on various methods for bias reduction and maximum penalized likelihood with powers of the Jeffreys prior as penalty. The 'brglmFit' fitting method can achieve reduction of estimation bias by solving either the mean bias-reducing adjusted score equations in Firth (1993) <doi:10.1093/biomet/80.1.27> and Kosmidis and Firth (2009) <doi:10.1093/biomet/asp055>, or the median bias-reduction adjusted score equations in Kenne et al. (2017) <doi:10.1093/biomet/asx046>, or through the direct subtraction of an estimate of the bias of the maximum likelihood estimator from the maximum likelihood estimates as in Cordeiro and McCullagh (1991) <https://www.jstor.org/stable/2345592>. See Kosmidis et al (2020) <doi:10.1007/s11222-019-09860-6> for more details. Estimation in all cases takes place via a quasi Fisher scoring algorithm, and S3 methods for the construction of of confidence intervals for the reduced-bias estimates are provided. In the special case of generalized linear models for binomial and multinomial responses (both ordinal and nominal), the adjusted score approaches to mean and media bias reduction have been found to return estimates with improved frequentist properties, that are also always finite, even in cases where the maximum likelihood estimates are infinite (e.g. complete and quasi-complete separation; see Kosmidis and Firth, 2020 <doi:10.1093/biomet/asaa052>, for a proof for mean bias reduction in logistic regression).
Maintained by Ioannis Kosmidis. Last updated 6 months ago.
adjusted-score-equationsalgorithmsbias-reducing-adjustmentsbias-reductionestimationglmlogistic-regressionnominal-responsesordinal-responsesregressionregression-algorithmsstatistics
6.5 match 32 stars 10.41 score 106 scripts 10 dependentsbioc
AnnotationFilter:Facilities for Filtering Bioconductor Annotation Resources
This package provides class and other infrastructure to implement filters for manipulating Bioconductor annotation resources. The filters will be used by ensembldb, Organism.dplyr, and other packages.
Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.
annotationinfrastructuresoftwarebioconductor-packagecore-package
6.5 match 5 stars 10.20 score 45 scripts 162 dependentsbioc
cbpManager:Generate, manage, and edit data and metadata files suitable for the import in cBioPortal for Cancer Genomics
This R package provides an R Shiny application that enables the user to generate, manage, and edit data and metadata files suitable for the import in cBioPortal for Cancer Genomics. Create cancer studies and edit its metadata. Upload mutation data of a patient that will be concatenated to the data_mutation_extended.txt file of the study. Create and edit clinical patient data, sample data, and timeline data. Create custom timeline tracks for patients.
Maintained by Arsenij Ustjanzew. Last updated 5 months ago.
immunooncologydataimportdatarepresentationguithirdpartyclientpreprocessingvisualizationcancer-genomicscbioportalclinical-datafilegeneratormutation-datapatient-data
11.7 match 8 stars 5.51 score 1 scriptsbioc
BiocFHIR:Illustration of FHIR ingestion and transformation using R
FHIR R4 bundles in JSON format are derived from https://synthea.mitre.org/downloads. Transformation inspired by a kaggle notebook published by Dr Alexander Scarlat, https://www.kaggle.com/code/drscarlat/fhir-starter-parse-healthcare-bundles-into-tables. This is a very limited illustration of some basic parsing and reorganization processes. Additional tooling will be required to move beyond the Synthea data illustrations.
Maintained by Vincent Carey. Last updated 5 months ago.
infrastructuredataimportdatarepresentationfhir
11.0 match 4 stars 5.78 score 15 scriptsbioc
UCell:Rank-based signature enrichment analysis for single-cell data
UCell is a package for evaluating gene signatures in single-cell datasets. UCell signature scores, based on the Mann-Whitney U statistic, are robust to dataset size and heterogeneity, and their calculation demands less computing time and memory than other available methods, enabling the processing of large datasets in a few minutes even on machines with limited computing power. UCell can be applied to any single-cell data matrix, and includes functions to directly interact with SingleCellExperiment and Seurat objects.
Maintained by Massimo Andreatta. Last updated 5 months ago.
singlecellgenesetenrichmenttranscriptomicsgeneexpressioncellbasedassays
6.0 match 143 stars 10.43 score 454 scripts 2 dependentsocha-dap
ripc:Download and Tidy IPC and CH Data
Utilities to access Integrated Food Security Phase Classification (IPC) and Cadre Harmonisé (CH) food security data. Wrapper functions are available for all of the 'IPC-CH' Public API (<https://docs.api.ipcinfo.org>) simplified and advanced endpoints to easily download the data in a clean and tidy format.
Maintained by Seth Caldwell. Last updated 9 months ago.
12.6 match 2 stars 4.70 score 4 scriptspsolymos
ResourceSelection:Resource Selection (Probability) Functions for Use-Availability Data
Resource Selection (Probability) Functions for use-availability wildlife data based on weighted distributions as described in Lele and Keim (2006) <doi:10.1890/0012-9658(2006)87%5B3021:WDAEOR%5D2.0.CO;2>, Lele (2009) <doi:10.2193/2007-535>, and Solymos & Lele (2016) <doi:10.1111/2041-210X.12432>.
Maintained by Peter Solymos. Last updated 10 months ago.
ecologyestimationlelersfrspfsolymosweighted-distributions
6.8 match 8 stars 8.37 score 752 scripts 3 dependentsbioc
HubPub:Utilities to create and use Bioconductor Hubs
HubPub provides users with functionality to help with the Bioconductor Hub structures. The package provides the ability to create a skeleton of a Hub style package that the user can then populate with the necessary information. There are also functions to help add resources to the Hub package metadata files as well as publish data to the Bioconductor S3 bucket.
Maintained by Kayla Interdonato. Last updated 3 days ago.
dataimportinfrastructuresoftwarethirdpartyclientbioconductor-package
11.0 match 3 stars 5.18 score 4 scriptssebkrantz
collapse:Advanced and Fast Data Transformation
A C/C++ based package for advanced data transformation and statistical computing in R that is extremely fast, class-agnostic, robust and programmer friendly. Core functionality includes a rich set of S3 generic grouped and weighted statistical functions for vectors, matrices and data frames, which provide efficient low-level vectorizations, OpenMP multithreading, and skip missing values by default. These are integrated with fast grouping and ordering algorithms (also callable from C), and efficient data manipulation functions. The package also provides a flexible and rigorous approach to time series and panel data in R. It further includes fast functions for common statistical procedures, detailed (grouped, weighted) summary statistics, powerful tools to work with nested data, fast data object conversions, functions for memory efficient R programming, and helpers to effectively deal with variable labels, attributes, and missing data. It is well integrated with base R classes, 'dplyr'/'tibble', 'data.table', 'sf', 'units', 'plm' (panel-series and data frames), and 'xts'/'zoo'.
Maintained by Sebastian Krantz. Last updated 6 days ago.
data-aggregationdata-analysisdata-manipulationdata-processingdata-sciencedata-transformationeconometricshigh-performancepanel-datascientific-computingstatisticstime-seriesweightedweightscppopenmp
3.3 match 672 stars 16.63 score 708 scripts 97 dependentsprogram--
HSClientR:A HydroShare API client for R
A RESTful API wrapper for accessing <https://hydroshare.org> data in R.
Maintained by Justin Singh-Mohudpur. Last updated 4 years ago.
api-wrappercuashihydrologyhydrosharewater-resources
23.2 match 4 stars 2.30 score 2 scriptsjchrom
trelloR:Access the Trello API
An R client for the Trello API. Supports free-tier features such as access to private boards, creating and updating cards and other resources, and downloading data in a structured way.
Maintained by Jakub Chromec. Last updated 2 years ago.
8.6 match 42 stars 6.18 score 24 scriptsrchlumsk
RavenR:Raven Hydrological Modelling Framework R Support and Analysis
Utilities for processing input and output files associated with the Raven Hydrological Modelling Framework. Includes various plotting functions, model diagnostics, reading output files into extensible time series format, and support for writing Raven input files. The 'RavenR' package is also archived at Chlumsky et al. (2020) <doi:10.5281/zenodo.4248183>. The Raven Hydrologic Modelling Framework method can be referenced with Craig et al. (2020) <doi:10.1016/j.envsoft.2020.104728>.
Maintained by Robert Chlumsky. Last updated 4 months ago.
diagnosticshydrologymodelingmodellingvisualizationwaterwater-resourceswatershedcpp
7.5 match 36 stars 7.06 score 20 scriptsschnorr
starvz:R-Based Visualization Techniques for Task-Based Applications
Performance analysis workflow that combines the power of the R language (and the tidyverse realm) and many auxiliary tools to provide a consistent, flexible, extensible, fast, and versatile framework for the performance analysis of task-based applications that run on top of the StarPU runtime (with its MPI (Message Passing Interface) layer for multi-node support). Its goal is to provide a fruitful prototypical environment to conduct performance analysis hypothesis-checking for task-based applications that run on heterogeneous (multi-GPU, multi-core) multi-node HPC (High-performance computing) platforms.
Maintained by Lucas Leandro Nesi. Last updated 5 months ago.
10.7 match 13 stars 4.94 score 27 scriptsbioc
CompoundDb:Creating and Using (Chemical) Compound Annotation Databases
CompoundDb provides functionality to create and use (chemical) compound annotation databases from a variety of different sources such as LipidMaps, HMDB, ChEBI or MassBank. The database format allows to store in addition MS/MS spectra along with compound information. The package provides also a backend for Bioconductor's Spectra package and allows thus to match experimetal MS/MS spectra against MS/MS spectra in the database. Databases can be stored in SQLite format and are thus portable.
Maintained by Johannes Rainer. Last updated 2 months ago.
massspectrometrymetabolomicsannotationdatabasesmass-spectrometry
6.1 match 17 stars 8.40 score 69 scripts 1 dependentsbioc
AlphaMissenseR:Accessing AlphaMissense Data Resources in R
The AlphaMissense publication <https://www.science.org/doi/epdf/10.1126/science.adg7492> outlines how a variant of AlphaFold / DeepMind was used to predict missense variant pathogenicity. Supporting data on Zenodo <https://zenodo.org/record/10813168> include, for instance, 71M variants across hg19 and hg38 genome builds. The 'AlphaMissenseR' package allows ready access to the data, downloading individual files to DuckDB databases for exploration and integration into *R* and *Bioconductor* workflows.
Maintained by Martin Morgan. Last updated 5 months ago.
snpannotationfunctionalgenomicsstructuralpredictiontranscriptomicsvariantannotationgenepredictionimmunooncology
7.5 match 8 stars 6.86 score 10 scriptspandora-isomemo
Pandora:Retrieve Data using the API of the 'Pandora' Data Platform
API wrapper that contains functions to retrieve data from the 'Pandora' databases. Web services for API: <https://pandora.earth/>.
Maintained by Jan Abel. Last updated 1 months ago.
12.6 match 4.00 score 2 scriptsbcgov
bcmaps:Map Layers and Spatial Utilities for British Columbia
Various layers of B.C., including administrative boundaries, natural resource management boundaries, census boundaries etc. All layers are available in BC Albers (<https://spatialreference.org/ref/epsg/3005/>) equal-area projection, which is the B.C. government standard. The layers are sourced from the British Columbia and Canadian government under open licenses, including B.C. Data Catalogue (<https://data.gov.bc.ca>), the Government of Canada Open Data Portal (<https://open.canada.ca/en/using-open-data>), and Statistics Canada (<https://www.statcan.gc.ca/en/reference/licence>).
Maintained by Andy Teucher. Last updated 3 months ago.
5.8 match 73 stars 8.65 score 254 scriptscran
bigmemory.sri:A Shared Resource Interface for Bigmemory Project Packages
A shared resource interface for the bigmemory and synchronicity packages.
Maintained by Michael J. Kane. Last updated 1 years ago.
9.4 match 5.21 score 66 dependentsjsta
nhdR:Tools for Working with the National Hydrography Dataset
Tools for working with the National Hydrography Dataset, with functions for querying, downloading, and networking both the NHD <https://www.usgs.gov/national-hydrography> and NHDPlus <https://www.epa.gov/waterdata/nhdplus-national-hydrography-dataset-plus> datasets.
Maintained by Jemma Stachelek. Last updated 2 years ago.
geospatialnational-hydrography-datasetnhdwater-qualitywater-resources
7.5 match 38 stars 6.48 score 53 scriptsrstudio
shiny:Web Application Framework for R
Makes it incredibly easy to build interactive web applications with R. Automatic "reactive" binding between inputs and outputs and extensive prebuilt widgets make it possible to build beautiful, responsive, and powerful applications with minimal effort.
Maintained by Winston Chang. Last updated 14 days ago.
reactiverstudioshinyweb-appweb-development
2.3 match 5.4k stars 21.28 score 108k scripts 1.8k dependentswarwick-stats-resources
warwickplots:Palettes and Themes Consistent with The University of Warwick's Brand
Colour palettes and a 'ggplot2' theme that are consistent with The University of Warwick' branding. Built using the 'palettes' package, which provides methods for printing, formatting, casting and coercion, extraction and updating of components, plotting, colour mixing arithmetic, and colour interpolation.
Maintained by Ella Kaye. Last updated 10 months ago.
13.3 match 1 stars 3.56 score 12 scriptsyihui
knitr:A General-Purpose Package for Dynamic Report Generation in R
Provides a general-purpose tool for dynamic report generation in R using Literate Programming techniques.
Maintained by Yihui Xie. Last updated 9 hours ago.
dynamic-documentsknitrliterate-programmingrmarkdownsweave
2.0 match 2.4k stars 23.61 score 116k scripts 4.2k dependentsr-causal
ggdag:Analyze and Create Elegant Directed Acyclic Graphs
Tidy, analyze, and plot directed acyclic graphs (DAGs). 'ggdag' is built on top of 'dagitty', an R package that uses the 'DAGitty' web tool (<https://dagitty.net/>) for creating and analyzing DAGs. 'ggdag' makes it easy to tidy and plot 'dagitty' objects using 'ggplot2' and 'ggraph', as well as common analytic and graphical functions, such as determining adjustment sets and node relationships.
Maintained by Malcolm Barrett. Last updated 8 months ago.
causal-inferencedagggplot-extension
4.0 match 443 stars 11.78 score 1.8k scripts 5 dependentscole-brokamp
fr:Frictionless Standards
A "tabular-data-resource" (<https://specs.frictionlessdata.io/tabular-data-resource/>) is a simple format to describe a singular tabular data resource such as a CSV file. It includes support both for metadata such as author and title and a schema to describe the data, for example the types of the fields/columns in the data. Create a tabular-data-resource by providing a data.frame and specifying metadata. Write and read tabular-data-resources to and from disk.
Maintained by Cole Brokamp. Last updated 4 months ago.
8.9 match 3 stars 5.28 score 63 scriptsropensci
EDIutils:An API Client for the Environmental Data Initiative Repository
A client for the Environmental Data Initiative repository REST API. The 'EDI' data repository <https://portal.edirepository.org/nis/home.jsp> is for publication and reuse of ecological data with emphasis on metadata accuracy and completeness. It is built upon the 'PASTA+' software stack <https://pastaplus-core.readthedocs.io/en/latest/index.html#> and was developed in collaboration with the US 'LTER' Network <https://lternet.edu/>. 'EDIutils' includes functions to search and access existing data, evaluate and upload new data, and assist other data management tasks common to repository users.
Maintained by Colin Smith. Last updated 1 years ago.
ecologyeml-metadataopen-accessopen-dataresearch-data-managementresearch-data-repository
7.2 match 10 stars 6.47 score 117 scriptstudo-r
BatchJobs:Batch Computing with R
Provides Map, Reduce and Filter variants to generate jobs on batch computing systems like PBS/Torque, LSF, SLURM and Sun Grid Engine. Multicore and SSH systems are also supported. For further details see the project web page.
Maintained by Bernd Bischl. Last updated 3 years ago.
5.3 match 85 stars 8.57 score 616 scripts 3 dependentsr-simmer
simmer.bricks:Helper Methods for 'simmer' Trajectories
Provides wrappers for common activity patterns in 'simmer' trajectories.
Maintained by Iñaki Ucar. Last updated 2 years ago.
8.0 match 6 stars 5.64 score 49 scripts 1 dependentsropensci
fireexposuR:Compute and Visualize Wildfire Exposure
This package computes and visualizes wildfire exposure using the methods documented in a series of scientific publications.
Maintained by Air Forbes. Last updated 20 days ago.
8.5 match 5 stars 5.23 score 4 scriptsrstudio
pins:Pin, Discover, and Share Resources
Publish data sets, models, and other R objects, making it easy to share them across projects and with your colleagues. You can pin objects to a variety of "boards", including local folders (to share on a networked drive or with 'DropBox'), 'Posit Connect', 'AWS S3', and more.
Maintained by Julia Silge. Last updated 1 months ago.
azuregcloudrpinsrsconnects3storage
3.1 match 321 stars 14.17 score 1.9k scripts 17 dependentsncss-tech
soilDB:Soil Database Interface
A collection of functions for reading soil data from U.S. Department of Agriculture Natural Resources Conservation Service (USDA-NRCS) and National Cooperative Soil Survey (NCSS) databases.
Maintained by Andrew Brown. Last updated 7 days ago.
ksslnasisnrcssoilsoil-data-accesssoil-surveysoilwebsqlusda
3.9 match 87 stars 11.34 score 1.0k scripts 1 dependentsmrc-ide
orderly2:Orderly Next Generation
Distributed reproducible computing framework, adopting ideas from git, docker and other software. By defining a lightweight interface around the inputs and outputs of an analysis, a lot of the repetitive work for reproducible research can be automated. We define a simple format for organising and describing work that facilitates collaborative reproducible research and acknowledges that all analyses are run multiple times over their lifespans.
Maintained by Rich FitzJohn. Last updated 2 months ago.
5.3 match 8 stars 8.30 score 49 scripts 2 dependentsvimc
orderly:Lightweight Reproducible Reporting
Order, create and store reports from R. By defining a lightweight interface around the inputs and outputs of an analysis, a lot of the repetitive work for reproducible research can be automated. We define a simple format for organising and describing work that facilitates collaborative reproducible research and acknowledges that all analyses are run multiple times over their lifespans.
Maintained by Rich FitzJohn. Last updated 2 years ago.
4.5 match 117 stars 9.63 score 94 scripts 4 dependentsouhscbbmc
REDCapR:Interaction Between R and REDCap
Encapsulates functions to streamline calls from R to the REDCap API. REDCap (Research Electronic Data CAPture) is a web application for building and managing online surveys and databases developed at Vanderbilt University. The Application Programming Interface (API) offers an avenue to access and modify data programmatically, improving the capacity for literate and reproducible programming.
Maintained by Will Beasley. Last updated 2 months ago.
3.5 match 118 stars 12.36 score 438 scripts 6 dependentswlandau
crew:A Distributed Worker Launcher Framework
In computationally demanding analysis projects, statisticians and data scientists asynchronously deploy long-running tasks to distributed systems, ranging from traditional clusters to cloud services. The 'NNG'-powered 'mirai' R package by Gao (2023) <doi:10.5281/zenodo.7912722> is a sleek and sophisticated scheduler that efficiently processes these intense workloads. The 'crew' package extends 'mirai' with a unifying interface for third-party worker launchers. Inspiration also comes from packages. 'future' by Bengtsson (2021) <doi:10.32614/RJ-2021-048>, 'rrq' by FitzJohn and Ashton (2023) <https://github.com/mrc-ide/rrq>, 'clustermq' by Schubert (2019) <doi:10.1093/bioinformatics/btz284>), and 'batchtools' by Lang, Bischel, and Surmann (2017) <doi:10.21105/joss.00135>.
Maintained by William Michael Landau. Last updated 2 days ago.
3.8 match 136 stars 11.19 score 243 scripts 2 dependentssharlagelfand
opendatatoronto:Access the City of Toronto Open Data Portal
Access data from the "City of Toronto Open Data Portal" (<https://open.toronto.ca>) directly from R.
Maintained by Sharla Gelfand. Last updated 3 years ago.
5.7 match 63 stars 7.49 score 486 scriptscb4ds
periscope:Enterprise Streamlined 'Shiny' Application Framework
An enterprise-targeted scalable and UI-standardized 'shiny' framework including a variety of developer convenience functions with the goal of both streamlining robust application development while assisting with creating a consistent user experience regardless of application or developer.
Maintained by Constance Brett. Last updated 2 months ago.
6.0 match 18 stars 7.02 score 73 scriptstidyverse
lubridate:Make Dealing with Dates a Little Easier
Functions to work with date-times and time-spans: fast and user friendly parsing of date-time data, extraction and updating of components of a date-time (years, months, days, hours, minutes, and seconds), algebraic manipulation on date-time and time-span objects. The 'lubridate' package has a consistent and memorable syntax that makes working with dates easy and fun.
Maintained by Vitalie Spinu. Last updated 3 months ago.
2.0 match 757 stars 20.95 score 135k scripts 1.9k dependentsaravind-j
PGRdup:Discover Probable Duplicates in Plant Genetic Resources Collections
Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising of information records of each constituent sample. These include methods for cleaning the data, creation of a searchable Key Word in Context (KWIC) index of keywords associated with sample records and the identification of nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.
Maintained by J. Aravind. Last updated 2 years ago.
double-metaphonedouble-metaphone-algorithmnatural-language-processingpgrplant-genetic-resourcesrecord-linkage
10.0 match 1 stars 4.06 score 23 scriptsabbvie-external
OmicNavigator:Open-Source Software for 'Omic' Data Analysis and Visualization
A tool for interactive exploration of the results from 'omics' experiments to facilitate novel discoveries from high-throughput biology. The software includes R functions for the 'bioinformatician' to deposit study metadata and the outputs from statistical analyses (e.g. differential expression, enrichment). These results are then exported to an interactive JavaScript dashboard that can be interrogated on the user's local machine or deployed online to be explored by collaborators. The dashboard includes 'sortable' tables, interactive plots including network visualization, and fine-grained filtering based on statistical significance.
Maintained by John Blischak. Last updated 4 days ago.
bioinformaticsgenomicsomicsopencpu
5.3 match 34 stars 7.68 score 31 scriptsbioc
BiocHubsShiny:View AnnotationHub and ExperimentHub Resources Interactively
A package that allows interactive exploration of AnnotationHub and ExperimentHub resources. It uses DT / DataTable to display resources for multiple organisms. It provides template code for reproducibility and for downloading resources via the indicated Hub package.
Maintained by Marcel Ramos. Last updated 12 days ago.
10.2 match 3.90 score 1 scriptsmolgenis
DSMolgenisArmadillo:'DataSHIELD' Client for 'MOLGENIS Armadillo'
'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package is the 'DataSHIELD' interface implementation to analyze data shared on a 'MOLGENIS Armadillo' server. 'MOLGENIS Armadillo' is a light-weight 'DataSHIELD' server using a file store and an 'RServe' server.
Maintained by Mariska Slofstra. Last updated 8 months ago.
6.0 match 6.54 score 48 scriptspaws-r
paws.management:'Amazon Web Services' Management & Governance Services
Interface to 'Amazon Web Services' management and governance services, including 'CloudWatch' application and infrastructure monitoring, 'Auto Scaling' for automatically scaling resources, and more <https://aws.amazon.com/>.
Maintained by Dyfan Jones. Last updated 4 days ago.
4.3 match 332 stars 9.09 score 1 scripts 15 dependentspvanlaake
ncdfCF:Easy Access to NetCDF Files with CF Metadata Conventions
Network Common Data Form ('netCDF') files are widely used for scientific data. Library-level access in R is provided through packages 'RNetCDF' and 'ncdf4'. Package 'ncdfCF' is built on top of 'RNetCDF' and makes the data and its attributes available as a set of R6 classes that are informed by the Climate and Forecasting Metadata Conventions. Access to the data uses standard R subsetting operators and common function forms.
Maintained by Patrick Van Laake. Last updated 3 days ago.
7.3 match 5.41 score 4 scriptsrobitalec
distanceto:Calculate Distance to Features
Calculates distances from point locations to features. The usual approach for eg. resource selection function analyses is to generate a complete distance to features surface then sample it with your observed and random points. Since these raster based approaches can be pretty costly with large areas, and often lead to memory issues in R, the distanceto package opts to compute these distances using efficient, vector based approaches. As a helper, there's a decidedly low-res raster based approach for visually inspecting your region's distance surface. But the workhorse is distance_to.
Maintained by Alec L. Robitaille. Last updated 2 years ago.
animaldistance-toecologyresource-selectionrsfspatial
8.0 match 5 stars 4.88 score 10 scripts 1 dependentsaggregate-genius
periscope2:Enterprise Streamlined 'shiny' Application Framework Using 'bs4Dash'
A framework for building enterprise, scalable and UI-standardized 'shiny' applications. It brings enhanced features such as 'bootstrap' v4 <https://getbootstrap.com/docs/4.0/getting-started/introduction/>, additional and enhanced 'shiny' modules, customizable UI features, as well as an enhanced application file organization paradigm. This update allows developers to harness the ability to build powerful applications and enriches the 'shiny' developers' experience when building and maintaining applications.
Maintained by Mohammed Ali. Last updated 2 months ago.
6.0 match 9 stars 6.49 score 34 scriptsepiforecasts
epinowcast:Flexible Hierarchical Nowcasting
Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.
Maintained by Sam Abbott. Last updated 11 months ago.
cmdstanreffective-reproduction-number-estimationepidemiologyinfectious-disease-surveillancenowcastingoutbreak-analysispandemic-preparednessreal-time-infectious-disease-modellingstan
4.9 match 61 stars 7.88 score 65 scriptsepinowcast
epinowcast:Flexible Hierarchical Nowcasting
Tools to enable flexible and efficient hierarchical nowcasting of right-truncated epidemiological time-series using a semi-mechanistic Bayesian model with support for a range of reporting and generative processes. Nowcasting, in this context, is gaining situational awareness using currently available observations and the reporting patterns of historical observations. This can be useful when tracking the spread of infectious disease in real-time: without nowcasting, changes in trends can be obfuscated by partial reporting or their detection may be delayed due to the use of simpler methods like truncation. While the package has been designed with epidemiological applications in mind, it could be applied to any set of right-truncated time-series count data.
Maintained by Sam Abbott. Last updated 11 months ago.
cmdstanreffective-reproduction-number-estimationepidemiologyinfectious-disease-surveillancenowcastingoutbreak-analysispandemic-preparednessreal-time-infectious-disease-modellingstan
4.9 match 61 stars 7.79 score 71 scriptsrichfitz
storr:Simple Key Value Stores
Creates and manages simple key-value stores. These can use a variety of approaches for storing the data. This package implements the base methods and support for file system, in-memory and DBI-based database stores.
Maintained by Rich FitzJohn. Last updated 4 years ago.
3.8 match 117 stars 10.21 score 57 scripts 33 dependentsmurrayefford
secr:Spatially Explicit Capture-Recapture
Functions to estimate the density and size of a spatially distributed animal population sampled with an array of passive detectors, such as traps, or by searching polygons or transects. Models incorporating distance-dependent detection are fitted by maximizing the likelihood. Tools are included for data manipulation and model selection.
Maintained by Murray Efford. Last updated 3 hours ago.
3.8 match 3 stars 10.16 score 410 scripts 5 dependentsbioc
rtracklayer:R interface to genome annotation files and the UCSC genome browser
Extensible framework for interacting with multiple genome browsers (currently UCSC built-in) and manipulating annotation tracks in various formats (currently GFF, BED, bedGraph, BED15, WIG, BigWig and 2bit built-in). The user may export/import tracks to/from the supported browsers, as well as query and modify the browser state, such as the current viewport.
Maintained by Michael Lawrence. Last updated 9 days ago.
annotationvisualizationdataimportzlibopensslcurl
3.0 match 12.66 score 6.7k scripts 481 dependentsazure
AzureVM:Virtual Machines in 'Azure'
Functionality for working with virtual machines (VMs) in Microsoft's 'Azure' cloud: <https://azure.microsoft.com/en-us/services/virtual-machines/>. Includes facilities to deploy, startup, shutdown, and cleanly delete VMs and VM clusters. Deployment configurations can be highly customised, and can make use of existing resources as well as creating new ones. A selection of predefined configurations is provided to allow easy deployment of commonly used Linux and Windows images, including Data Science Virtual Machines. With a running VM, execute scripts and install optional extensions. Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 2 years ago.
azureazure-sdk-razure-virtual-machinedata-science-virtual-machine
7.4 match 14 stars 5.05 score 16 scriptsropensci
hydroscoper:Interface to the Greek National Data Bank for Hydrometeorological Information
R interface to the Greek National Data Bank for Hydrological and Meteorological Information. It covers Hydroscope's data sources and provides functions to transliterate, translate and download them into tidy dataframes.
Maintained by Konstantinos Vantas. Last updated 8 months ago.
climategreecehydrologyhydrometeorologyhydroscopemeteorological-datameteorological-stationspeer-reviewedtidy-datatime-serieswater-resources
7.5 match 14 stars 4.97 score 33 scriptscanmod
iidda:Processing Infectious Disease Datasets in IIDDA.
Part of an open toolchain for processing infectious disease datasets available through the IIDDA data repository.
Maintained by Steve Walker. Last updated 4 months ago.
6.0 match 6.07 score 133 scripts 3 dependentscran
istacr:Obtaining Open Data from Instituto Canario De Estadistica (ISTAC) API
You can access to open data published in Instituto Canario De Estadistica (ISTAC) APIs at <https://datos.canarias.es/api/estadisticas/>.
Maintained by Alberto Gonzalez. Last updated 2 years ago.
35.5 match 1.00 scorebupaverse
processmapR:Construct Process Maps Using Event Data
Visualize event logs using directed graphs, i.e. process maps. Part of the 'bupaR' framework.
Maintained by Gert Janssenswillen. Last updated 7 months ago.
4.5 match 9 stars 7.70 score 169 scripts 3 dependentsbioc
iSEEindex:iSEE extension for a landing page to a custom collection of data sets
This package provides an interface to any collection of data sets within a single iSEE web-application. The main functionality of this package is to define a custom landing page allowing app maintainers to list a custom collection of data sets that users can selected from and directly load objects into an iSEE web-application.
Maintained by Kevin Rue-Albrecht. Last updated 5 months ago.
softwareinfrastructurebioconductorhacktoberfest
6.1 match 2 stars 5.65 score 8 scriptsapache
arrow:Integration to 'Apache' 'Arrow'
'Apache' 'Arrow' <https://arrow.apache.org/> is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. This package provides an interface to the 'Arrow C++' library.
Maintained by Jonathan Keane. Last updated 1 months ago.
1.8 match 15k stars 19.22 score 10k scripts 81 dependentspachadotdev
analogsea:Interface to 'DigitalOcean'
Provides a set of functions for interacting with the 'DigitalOcean' API <https://www.digitalocean.com/>, including creating images, destroying them, rebooting, getting details on regions, and available images.
Maintained by Mauricio Vargas. Last updated 2 years ago.
4.5 match 159 stars 7.56 score 100 scripts 1 dependentsjamiemkass
ENMeval:Automated Tuning and Evaluations of Ecological Niche Models
Runs ecological niche models over all combinations of user-defined settings (i.e., tuning), performs cross validation to evaluate models, and returns data tables to aid in selection of optimal model settings that balance goodness-of-fit and model complexity. Also has functions to partition data spatially (or not) for cross validation, to plot multiple visualizations of results, to run null models to estimate significance and effect sizes of performance metrics, and to calculate range overlap between model predictions, among others. The package was originally built for Maxent models (Phillips et al. 2006, Phillips et al. 2017), but the current version allows possible extensions for any modeling algorithm. The extensive vignette, which guides users through most package functionality but unfortunately has a file size too big for CRAN, can be found here on the package's Github Pages website: <https://jamiemkass.github.io/ENMeval/articles/ENMeval-2.0-vignette.html>.
Maintained by Jamie M. Kass. Last updated 2 months ago.
3.0 match 49 stars 11.25 score 332 scripts 2 dependentslearnitr
learnitdown:R Markdown, Bookdown and Learnr Additions for Learning Material
Extension to R Markdown, Bookdown and Learnr for building better learning and e-learning material: H5P integration, course-contextual divs, differed loading of Shiny and learnr applications, and much more ...
Maintained by Philippe Grosjean. Last updated 6 months ago.
bookdownlearning-resourcesr-markdownteaching-materials
7.5 match 13 stars 4.49 score 16 scriptsipeagit
gtfs2emis:Estimating Public Transport Emissions from General Transit Feed Specification (GTFS) Data
A bottom up model to estimate the emission levels of public transport systems based on General Transit Feed Specification (GTFS) data. The package requires two main inputs: i) Public transport data in the GTFS standard format; and ii) Some basic information on fleet characteristics such as fleet age, technology, fuel and Euro stage. As it stands, the package estimates several pollutants at high spatial and temporal resolutions. Pollution levels can be calculated for specific transport routes, trips, time of the day or for the transport system as a whole. The output with emission estimates can be extracted in different formats, supporting analysis on how emission levels vary across space, time and by fleet characteristics. A full description of the methods used in the 'gtfs2emis' model is presented in Vieira, J. P. B.; Pereira, R. H. M.; Andrade, P. R. (2022) <doi:10.31219/osf.io/8m2cy>.
Maintained by Joao Bazzo. Last updated 2 months ago.
emissionsenvironmental-modellinggtfspublic-transportrspatialtransport
4.5 match 28 stars 7.47 score 29 scriptsbioc
HIBAG:HLA Genotype Imputation with Attribute Bagging
Imputes HLA classical alleles using GWAS SNP data, and it relies on a training set of HLA and SNP genotypes. HIBAG can be used by researchers with published parameter estimates instead of requiring access to large training sample datasets. It combines the concepts of attribute bagging, an ensemble classifier method, with haplotype inference for SNPs and HLA types. Attribute bagging is a technique which improves the accuracy and stability of classifier ensembles using bootstrap aggregating and random variable selection.
Maintained by Xiuwen Zheng. Last updated 4 months ago.
geneticsstatisticalmethodbioinformaticsgpuhlaimputationmhcsnpcpp
4.0 match 30 stars 8.24 score 48 scriptshelenkettle
microPop:Process-Based Modelling of Microbial Populations
Modelling interacting microbial populations - example applications include human gut microbiota, rumen microbiota and phytoplankton. Solves a system of ordinary differential equations to simulate microbial growth and resource uptake over time. This version contains network visualisation functions.
Maintained by Helen Kettle. Last updated 3 years ago.
12.5 match 2.64 score 11 scriptsmrcieu
TwoSampleMR:Two Sample MR Functions and Interface to MRC Integrative Epidemiology Unit OpenGWAS Database
A package for performing Mendelian randomization using GWAS summary data. It uses the IEU OpenGWAS database <https://gwas.mrcieu.ac.uk/> to automatically obtain data, and a wide range of methods to run the analysis.
Maintained by Gibran Hemani. Last updated 11 days ago.
2.9 match 467 stars 11.23 score 1.7k scripts 1 dependentsmrc-ide
hipercow:High Performance Computing
Set up cluster environments and jobs. Moo.
Maintained by Rich FitzJohn. Last updated 12 days ago.
5.0 match 1 stars 6.53 score 45 scripts 1 dependentstraitecoevo
APCalign:Resolving Plant Taxon Names Using the Australian Plant Census
The process of resolving taxon names is necessary when working with biodiversity data. 'APCalign' uses the Australian Plant Census (APC) and the Australian Plant Name Index (APNI) to align and update plant taxon names to current, accepted standards. 'APCalign' also supplies information about the established status of plant taxa across different states/territories.
Maintained by Daniel Falster. Last updated 1 months ago.
4.4 match 4 stars 7.30 score 23 scripts 1 dependentspredictiveecology
SpaDES.core:Core Utilities for Developing and Running Spatially Explicit Discrete Event Models
Provides the core framework for a discrete event system to implement a complete data-to-decisions, reproducible workflow. The core components facilitate the development of modular pieces, and enable the user to include additional functionality by running user-built modules. Includes conditional scheduling, restart after interruption, packaging of reusable modules, tools for developing arbitrary automated workflows, automated interweaving of modules of different temporal resolution, and tools for visualizing and understanding the within-project dependencies. The suggested package 'NLMR' can be installed from the repository (<https://PredictiveEcology.r-universe.dev>).
Maintained by Eliot J B McIntire. Last updated 19 days ago.
discrete-events-simulationssimulation-frameworksimulation-modeling
3.0 match 10 stars 10.61 score 142 scripts 6 dependentszhanxw
seqminer:Efficiently Read Sequence Data (VCF Format, BCF Format, METAL Format and BGEN Format) into R
Integrate sequencing data (Variant call format, e.g. VCF or BCF) or meta-analysis results in R. This package can help you (1) read VCF/BCF/BGEN files by chromosomal ranges (e.g. 1:100-200); (2) read RareMETAL summary statistics files; (3) read tables from a tabix-indexed files; (4) annotate VCF/BCF files; (5) create customized workflow based on Makefile.
Maintained by Xiaowei Zhan. Last updated 6 months ago.
annotationbcfbgenmeta-analysisnext-generation-sequencingplinksequencingtabixvcfworkflowzlibbzip2libzstdsqlite3cpp
3.9 match 30 stars 8.29 score 111 scripts 6 dependentsstibu81
ibawds:Functions and Datasets for the Data Science Course at IBAW
A collection of useful functions and datasets for the Data Science Course at IBAW.
Maintained by Stefan Lanz. Last updated 10 days ago.
data-science-learningeducational-resources
7.5 match 2 stars 4.26 score 8 scriptsrstudio
pagedown:Paginate the HTML Output of R Markdown with CSS for Print
Use the paged media properties in CSS and the JavaScript library 'paged.js' to split the content of an HTML document into discrete pages. Each page can have its page size, page numbers, margin boxes, and running headers, etc. Applications of this package include books, letters, reports, papers, business cards, resumes, and posters.
Maintained by Yihui Xie. Last updated 2 months ago.
csshtmlpaged-mediapdfprintingtypesetting
2.7 match 909 stars 11.73 score 350 scripts 19 dependentsbioc
AnnotationHubData:Transform public data resources into Bioconductor Data Structures
These recipes convert a wide variety and a growing number of public bioinformatic data sets into easily-used standard Bioconductor data structures.
Maintained by Bioconductor Package Maintainer. Last updated 6 days ago.
6.3 match 5.02 score 22 scripts 4 dependentsjmsigner
amt:Animal Movement Tools
Manage and analyze animal movement data. The functionality of 'amt' includes methods to calculate home ranges, track statistics (e.g. step lengths, speed, or turning angles), prepare data for fitting habitat selection analyses, and simulation of space-use from fitted step-selection functions.
Maintained by Johannes Signer. Last updated 4 months ago.
3.0 match 41 stars 10.54 score 418 scriptsr-lib
cleancall:C Resource Cleanup via Exit Handlers
Wrapper of .Call() that runs exit handlers to clean up C resources. Helps managing C (non-R) resources while using the R API.
Maintained by Gábor Csárdi. Last updated 4 months ago.
5.5 match 19 stars 5.53 score 1 scripts 2 dependentsdatashield
DSLite:'DataSHIELD' Implementation on Local Datasets
'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This 'DataSHIELD Interface' implementation is for analyzing datasets living in the current R session. The purpose of this is primarily for lightweight 'DataSHIELD' analysis package development.
Maintained by Yannick Marcon. Last updated 2 years ago.
6.0 match 4 stars 5.03 score 53 scriptsbioc
ReUseData:Reusable and reproducible Data Management
ReUseData is an _R/Bioconductor_ software tool to provide a systematic and versatile approach for standardized and reproducible data management. ReUseData facilitates transformation of shell or other ad hoc scripts for data preprocessing into workflow-based data recipes. Evaluation of data recipes generate curated data files in their generic formats (e.g., VCF, bed). Both recipes and data are cached using database infrastructure for easy data management and reuse. Prebuilt data recipes are available through ReUseData portal ("https://rcwl.org/dataRecipes/") with full annotation and user instructions. Pregenerated data are available through ReUseData cloud bucket that is directly downloadable through "getCloudData()".
Maintained by Qian Liu. Last updated 5 months ago.
softwareinfrastructuredataimportpreprocessingimmunooncology
5.6 match 4 stars 5.38 score 7 scriptscritical-infrastructure-systems-lab
reservoir:Tools for Analysis, Design, and Operation of Water Supply Storages
Measure single-storage water supply system performance using resilience, reliability, and vulnerability metrics; assess storage-yield- reliability relationships; determine no-fail storage with sequent peak analysis; optimize release decisions for water supply, hydropower, and multi-objective reservoirs using deterministic and stochastic dynamic programming; generate inflow replicates using parametric and non-parametric models; evaluate inflow persistence using the Hurst coefficient.
Maintained by Sean Turner. Last updated 4 years ago.
hydrologyreservoirsimulationwater-resources
7.5 match 28 stars 4.00 score 18 scriptsbioc
AnVILWorkflow:Run workflows implemented in Terra/AnVIL workspace
The AnVIL is a cloud computing resource developed in part by the National Human Genome Research Institute. The main cloud-based genomics platform deported by the AnVIL project is Terra. The AnVILWorkflow package allows remote access to Terra implemented workflows, enabling end-user to utilize Terra/ AnVIL provided resources - such as data, workflows, and flexible/scalble computing resources - through the conventional R functions.
Maintained by Sehyun Oh. Last updated 28 days ago.
infrastructuresoftwareanvilgcpterraworkflows
5.0 match 6 stars 6.03 score 1 scriptstidyverse
googledrive:An Interface to Google Drive
Manage Google Drive files from R.
Maintained by Jennifer Bryan. Last updated 7 months ago.
2.0 match 329 stars 14.97 score 2.1k scripts 164 dependentsbioc
XINA:Multiplexes Isobaric Mass Tagged-based Kinetics Data for Network Analysis
The aim of XINA is to determine which proteins exhibit similar patterns within and across experimental conditions, since proteins with co-abundance patterns may have common molecular functions. XINA imports multiple datasets, tags dataset in silico, and combines the data for subsequent subgrouping into multiple clusters. The result is a single output depicting the variation across all conditions. XINA, not only extracts coabundance profiles within and across experiments, but also incorporates protein-protein interaction databases and integrative resources such as KEGG to infer interactors and molecular functions, respectively, and produces intuitive graphical outputs.
Maintained by Lang Ho Lee. Last updated 5 months ago.
systemsbiologyproteomicsrnaseqnetwork
6.9 match 4.30 score 3 scriptsbioc
GenomeInfoDb:Utilities for manipulating chromosome names, including modifying them to follow a particular naming style
Contains data and functions that define and allow translation between different chromosome sequence naming conventions (e.g., "chr1" versus "1"), including a function that attempts to place sequence names in their natural, rather than lexicographic, order.
Maintained by Hervé Pagès. Last updated 2 months ago.
geneticsdatarepresentationannotationgenomeannotationbioconductor-packagecore-package
1.8 match 32 stars 16.46 score 1.3k scripts 1.7k dependentsbioc
cBioPortalData:Exposes and Makes Available Data from the cBioPortal Web Resources
The cBioPortalData R package accesses study datasets from the cBio Cancer Genomics Portal. It accesses the data either from the pre-packaged zip / tar files or from the API interface that was recently implemented by the cBioPortal Data Team. The package can provide data in either tabular format or with MultiAssayExperiment object that uses familiar Bioconductor data representations.
Maintained by Marcel Ramos. Last updated 10 days ago.
softwareinfrastructurethirdpartyclientbioconductor-packagenci-itcru24ca289073
2.9 match 33 stars 10.15 score 147 scripts 4 dependentsmapme-initiative
mapme.biodiversity:Efficient Monitoring of Global Biodiversity Portfolios
Biodiversity areas, especially primary forest, serve a multitude of functions for local economy, regional functionality of the ecosystems as well as the global health of our planet. Recently, adverse changes in human land use practices and climatic responses to increased greenhouse gas emissions, put these biodiversity areas under a variety of different threats. The present package helps to analyse a number of biodiversity indicators based on freely available geographical datasets. It supports computational efficient routines that allow the analysis of potentially global biodiversity portfolios. The primary use case of the package is to support evidence based reporting of an organization's effort to protect biodiversity areas under threat and to identify regions were intervention is most duly needed.
Maintained by Darius A. Görgen. Last updated 3 months ago.
environmenteogismapmespatialsustainability
3.1 match 35 stars 9.24 score 287 scriptsbioc
rsbml:R support for SBML, using libsbml
Links R to libsbml for SBML parsing, validating output, provides an S4 SBML DOM, converts SBML to R graph objects. Optionally links to the SBML ODE Solver Library (SOSLib) for simulating models.
Maintained by Michael Lawrence. Last updated 18 days ago.
graphandnetworkpathwaysnetworklibsbmlcpp
6.0 match 4.71 score 19 scripts 1 dependentstidymodels
corrr:Correlations in R
A tool for exploring correlations. It makes it possible to easily perform routine tasks when exploring correlation matrices such as ignoring the diagonal, focusing on the correlations of certain variables against others, or rearranging and visualizing the matrix in terms of the strength of the correlations.
Maintained by Max Kuhn. Last updated 1 years ago.
2.0 match 593 stars 13.82 score 2.9k scripts 7 dependentsquicklizard99
cheddar:Analysis and Visualisation of Ecological Communities
Provides a flexible, extendable representation of an ecological community and a range of functions for analysis and visualisation, focusing on food web, body mass and numerical abundance data. Allows inter-web comparisons such as examining changes in community structure over environmental, temporal or spatial gradients.
Maintained by Lawrence Hudson. Last updated 8 months ago.
4.0 match 15 stars 6.86 score 195 scriptsbioc
GenomicScores:Infrastructure to work with genomewide position-specific scores
Provide infrastructure to store and access genomewide position-specific scores within R and Bioconductor.
Maintained by Robert Castelo. Last updated 1 months ago.
infrastructuregeneticsannotationsequencingcoverageannotationhubsoftware
3.1 match 8 stars 8.71 score 83 scripts 6 dependentsjiefei-wang
aws.ecx:Communicating with AWS EC2 and ECS using AWS REST APIs
Providing the functions for communicating with Amazon Web Services(AWS) Elastic Compute Cloud(EC2) and Elastic Container Service(ECS). The functions will have the prefix 'ecs_' or 'ec2_' depending on the class of the API. The request will be sent via the REST API and the parameters are given by the function argument. The credentials can be set via 'aws_set_credentials'. The EC2 documentation can be found at <https://docs.aws.amazon.com/AWSEC2/latest/APIReference/Welcome.html> and ECS can be found at <https://docs.aws.amazon.com/AmazonECS/latest/APIReference/Welcome.html>.
Maintained by Jiefei Wang. Last updated 3 years ago.
6.5 match 1 stars 4.18 score 2 scriptsfrbcesab
rcompendium:Create a Package or Research Compendium Structure
Makes easier the creation of R package or research compendium (i.e. a predefined files/folders structure) so that users can focus on the code/analysis instead of wasting time organizing files. A full ready-to-work structure is set up with some additional features: version control, remote repository creation, CI/CD configuration (check package integrity under several OS, test code with 'testthat', and build and deploy website using 'pkgdown'). This package heavily relies on the R packages 'devtools' and 'usethis' and follows recommendations made by Wickham H. (2015) <ISBN:9781491910597> and Marwick B. et al. (2018) <doi:10.7287/peerj.preprints.3192v2>.
Maintained by Nicolas Casajus. Last updated 1 months ago.
reproducible-researchresearch-compendium
4.0 match 40 stars 6.72 score 22 scriptsbioc
CAGEr:Analysis of CAGE (Cap Analysis of Gene Expression) sequencing data for precise mapping of transcription start sites and promoterome mining
The _CAGEr_ package identifies transcription start sites (TSS) and their usage frequency from CAGE (Cap Analysis Gene Expression) sequencing data. It normalises raw CAGE tag count, clusters TSSs into tag clusters (TC) and aggregates them across multiple CAGE experiments to construct consensus clusters (CC) representing the promoterome. CAGEr provides functions to profile expression levels of these clusters by cumulative expression and rarefaction analysis, and outputs the plots in ggplot2 format for further facetting and customisation. After clustering, CAGEr performs analyses of promoter width and detects differential usage of TSSs (promoter shifting) between samples. CAGEr also exports its data as genome browser tracks, and as R objects for downsteam expression analysis by other Bioconductor packages such as DESeq2, CAGEfightR, or seqArchR.
Maintained by Charles Plessy. Last updated 5 months ago.
preprocessingsequencingnormalizationfunctionalgenomicstranscriptiongeneexpressionclusteringvisualization
4.4 match 6.12 score 73 scriptskaneplusplus
bigmemory:Manage Massive Matrices with Shared Memory and Memory-Mapped Files
Create, store, access, and manipulate massive matrices. Matrices are allocated to shared memory and may use memory-mapped files. Packages 'biganalytics', 'bigtabulate', 'synchronicity', and 'bigalgebra' provide advanced functionality.
Maintained by Michael J. Kane. Last updated 1 years ago.
2.3 match 127 stars 11.87 score 920 scripts 64 dependentswadpac
GGIR:Raw Accelerometer Data Analysis
A tool to process and analyse data collected with wearable raw acceleration sensors as described in Migueles and colleagues (JMPB 2019), and van Hees and colleagues (JApplPhysiol 2014; PLoSONE 2015). The package has been developed and tested for binary data from 'GENEActiv' <https://activinsights.com/>, binary (.gt3x) and .csv-export data from 'Actigraph' <https://theactigraph.com> devices, and binary (.cwa) and .csv-export data from 'Axivity' <https://axivity.com>. These devices are currently widely used in research on human daily physical activity. Further, the package can handle accelerometer data file from any other sensor brand providing that the data is stored in csv format. Also the package allows for external function embedding.
Maintained by Vincent T van Hees. Last updated 3 days ago.
accelerometeractivity-recognitioncircadian-rhythmmovement-sensorsleep
2.0 match 109 stars 13.20 score 342 scripts 3 dependentsrstudio
bookdown:Authoring Books and Technical Documents with R Markdown
Output formats and utilities for authoring books and technical documents with R Markdown.
Maintained by Yihui Xie. Last updated 2 days ago.
bookbookdownepubgitbookhtmllatexrmarkdown
1.5 match 3.9k stars 17.51 score 1.7k scripts 136 dependentscarpentries
sandpaper:Create and Curate Carpentries Lessons
We provide tools to build a Carpentries-themed lesson repository into an accessible standalone static website. These include local tools and those designed to be used in a continuous integration context so that all the lesson author needs to focus on is writing the content of the actual lesson.
Maintained by Robert Davey. Last updated 2 months ago.
carpentriescarpentries-infrastructurecarpentries-workbenchlesson-templatelessonsmarkdownstatic-site-generator
3.3 match 44 stars 7.72 score 8 scriptsrstudio
promises:Abstractions for Promise-Based Asynchronous Programming
Provides fundamental abstractions for doing asynchronous programming in R using promises. Asynchronous programming is useful for allowing a single R process to orchestrate multiple tasks in the background while also attending to something else. Semantics are similar to 'JavaScript' promises, but with a syntax that is idiomatic R.
Maintained by Joe Cheng. Last updated 1 months ago.
1.5 match 204 stars 17.10 score 688 scripts 2.6k dependentslumenlearning
rise:Conduct RISE Analysis
Implements techniques for educational resource inspection, selection, and evaluation (RISE) described in Bodily, Nyland, and Wiley (2017) <doi:10.19173/irrodl.v18i2.2952>. Automates the process of identifying learning materials that are not effectively supporting student learning in technology-mediated courses by synthesizing information about access to course content and performance on assessments.
Maintained by David Wiley. Last updated 6 years ago.
continuous-improvementlearning-analyticsopen-educational-resources
7.2 match 7 stars 3.54 score 7 scriptsbioc
SNPRelate:Parallel Computing Toolset for Relatedness and Principal Component Analysis of SNP Data
Genome-wide association studies (GWAS) are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed an R package SNPRelate to provide a binary format for single-nucleotide polymorphism (SNP) data in GWAS utilizing CoreArray Genomic Data Structure (GDS) data files. The GDS format offers the efficient operations specifically designed for integers with two bits, since a SNP could occupy only two bits. SNPRelate is also designed to accelerate two key computations on SNP data using parallel computing for multi-core symmetric multiprocessing computer architectures: Principal Component Analysis (PCA) and relatedness analysis using Identity-By-Descent measures. The SNP GDS format is also used by the GWASTools package with the support of S4 classes and generic functions. The extended GDS format is implemented in the SeqArray package to support the storage of single nucleotide variations (SNVs), insertion/deletion polymorphism (indel) and structural variation calls in whole-genome and whole-exome variant data.
Maintained by Xiuwen Zheng. Last updated 5 months ago.
infrastructuregeneticsstatisticalmethodprincipalcomponentbioinformaticsgds-formatpcasimdsnpopenblascpp
2.0 match 104 stars 12.69 score 1.6k scripts 18 dependentsaravind-j
EvaluateCore:Quality Evaluation of Core Collections
Implements various quality evaluation statistics to assess the value of plant germplasm core collections using qualitative and quantitative phenotypic trait data according to Odong et al. (2015) <doi:10.1007/s00122-012-1971-y>.
Maintained by J. Aravind. Last updated 7 days ago.
core-collectionscore-evaluationgenebankgermplasmpgrplant-genetic-resources
6.7 match 1 stars 3.80 score 21 scriptsfamuvie
breedR:Statistical Methods for Forest Genetic Resources Analysts
Statistical tools to build predictive models for the breeders community. It aims to assess the genetic value of individuals under a number of situations, including spatial autocorrelation, genetic/environment interaction and competition. It is under active development as part of the Trees4Future project, particularly developed having forest genetic trials in mind. But can be used for animals or other situations as well.
Maintained by Facundo Muñoz. Last updated 8 months ago.
4.6 match 33 stars 5.44 score 24 scriptsroelandkindt
BiodiversityR:Package for Community Ecology and Suitability Analysis
Graphical User Interface (via the R-Commander) and utility functions (often based on the vegan package) for statistical analysis of biodiversity and ecological communities, including species accumulation curves, diversity indices, Renyi profiles, GLMs for analysis of species abundance and presence-absence, distance matrices, Mantel tests, and cluster, constrained and unconstrained ordination analysis. A book on biodiversity and community ecology analysis is available for free download from the website. In 2012, methods for (ensemble) suitability modelling and mapping were expanded in the package.
Maintained by Roeland Kindt. Last updated 2 months ago.
3.3 match 16 stars 7.42 score 390 scripts 2 dependentsropensci
stplanr:Sustainable Transport Planning
Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. The package was originally developed to support the 'Propensity to Cycle Tool', a publicly available strategic cycle network planning tool (Lovelace et al. 2017) <doi:10.5198/jtlu.2016.862>, but has since been extended to support public transport routing and accessibility analysis (Moreno-Monroy et al. 2017) <doi:10.1016/j.jtrangeo.2017.08.012> and routing with locally hosted routing engines such as 'OSRM' (Lowans et al. 2023) <doi:10.1016/j.enconman.2023.117337>. The main functions are for creating and manipulating geographic "desire lines" from origin-destination (OD) data (building on the 'od' package); calculating routes on the transport network locally and via interfaces to routing services such as <https://cyclestreets.net/> (Desjardins et al. 2021) <doi:10.1007/s11116-021-10197-1>; and calculating route segment attributes such as bearing. The package implements the 'travel flow aggregration' method described in Morgan and Lovelace (2020) <doi:10.1177/2399808320942779> and the 'OD jittering' method described in Lovelace et al. (2022) <doi:10.32866/001c.33873>. Further information on the package's aim and scope can be found in the vignettes and in a paper in the R Journal (Lovelace and Ellison 2018) <doi:10.32614/RJ-2018-053>, and in a paper outlining the landscape of open source software for geographic methods in transport planning (Lovelace, 2021) <doi:10.1007/s10109-020-00342-2>.
Maintained by Robin Lovelace. Last updated 7 months ago.
cyclecyclingdesire-linesorigin-destinationpeer-reviewedpubic-transportroute-networkroutesroutingspatialtransporttransport-planningtransportationwalking
2.0 match 427 stars 12.31 score 684 scripts 3 dependentsropensci
weatherOz:An API Client for Australian Weather and Climate Data Resources
Provides automated downloading, parsing and formatting of weather data for Australia through API endpoints provided by the Department of Primary Industries and Regional Development ('DPIRD') of Western Australia and by the Science and Technology Division of the Queensland Government's Department of Environment and Science ('DES'). As well as the Bureau of Meteorology ('BOM') of the Australian government precis and coastal forecasts, and downloading and importing radar and satellite imagery files. 'DPIRD' weather data are accessed through public 'APIs' provided by 'DPIRD', <https://www.agric.wa.gov.au/weather-api-20>, providing access to weather station data from the 'DPIRD' weather station network. Australia-wide weather data are based on data from the Australian Bureau of Meteorology ('BOM') data and accessed through 'SILO' (Scientific Information for Land Owners) Jeffrey et al. (2001) <doi:10.1016/S1364-8152(01)00008-1>. 'DPIRD' data are made available under a Creative Commons Attribution 3.0 Licence (CC BY 3.0 AU) license <https://creativecommons.org/licenses/by/3.0/au/deed.en>. SILO data are released under a Creative Commons Attribution 4.0 International licence (CC BY 4.0) <https://creativecommons.org/licenses/by/4.0/>. 'BOM' data are (c) Australian Government Bureau of Meteorology and released under a Creative Commons (CC) Attribution 3.0 licence or Public Access Licence ('PAL') as appropriate, see <http://www.bom.gov.au/other/copyright.shtml> for further details.
Maintained by Rodrigo Pires. Last updated 20 days ago.
dpirdbommeteorological-dataweather-forecastaustraliaweatherweather-datameteorologywestern-australiaaustralia-bureau-of-meteorologywestern-australia-agricultureaustralia-agricultureaustralia-climateaustralia-weatherapi-clientclimatedatarainfallweather-api
2.9 match 32 stars 8.54 score 40 scriptsr-forge
tm:Text Mining Package
A framework for text mining applications within R.
Maintained by Kurt Hornik. Last updated 26 days ago.
1.9 match 12.96 score 14k scripts 101 dependentsrstudio
packrat:A Dependency Management System for Projects and their R Package Dependencies
Manage the R packages your project depends on in an isolated, portable, and reproducible way.
Maintained by Aron Atkins. Last updated 2 months ago.
2.0 match 406 stars 12.15 score 256 scripts 9 dependentsdatashield
DSOpal:'DataSHIELD' Implementation for 'Opal'
'DataSHIELD' is an infrastructure and series of R packages that enables the remote and 'non-disclosive' analysis of sensitive research data. This package is the 'DataSHIELD' interface implementation for 'Opal', which is the data integration application for biobanks by 'OBiBa'. Participant data, once collected from any data source, must be integrated and stored in a central data repository under a uniform model. 'Opal' is such a central repository. It can import, process, validate, query, analyze, report, and export data. 'Opal' is the reference implementation of the 'DataSHIELD' infrastructure.
Maintained by Yannick Marcon. Last updated 2 years ago.
6.3 match 3.85 score 141 scriptsnmfs-ost
asar:Build NOAA Stock Assessment Report
Build a full or update stock assessment report for any stock assessment model. Parameterization allows the user to call a template based on their regional science center, species, area, ect.
Maintained by Samantha Schiano. Last updated 7 days ago.
latexquartostock-assessment-reports
3.5 match 21 stars 6.87 score 3 scriptscran
sas7bdat:sas7bdat Reverse Engineering Documentation
Documentation and prototypes for the earliest (circa 2010) open-source effort to reverse engineer the sas7bdat file format. The package includes a prototype reader for sas7bdat files. However, newer packages may contain more robust readers for sas7bdat files.
Maintained by Matt Shotwell. Last updated 7 months ago.
3.8 match 4 stars 6.29 score 500 scripts 4 dependentskylegrealis
froggeR:Enhance 'Quarto' Project Workflows and Standards
Streamlines 'Quarto' workflows by providing tools for consistent project setup and documentation. Enables portability through reusable metadata, automated project structure creation, and standardized templates. Features include enhanced project initialization, pre-formatted 'Quarto' documents, comprehensive data protection settings, custom styling, and structured documentation generation. Designed to improve efficiency and collaboration in R data science projects by reducing repetitive setup tasks while maintaining consistent formatting across multiple documents. There are many valuable resources providing in-depth explanations of customizing 'Quarto' templates and theme styling by the Posit team: <https://quarto.org/docs/output-formats/html-themes.html#customizing-themes> & <https://quarto.org/docs/output-formats/html-themes-more.html>, and at the Bootstrap community's GitHub at <https://github.com/twbs/bootstrap/blob/main/scss/_variables.scss>.
Maintained by Kyle Grealis. Last updated 12 hours ago.
data-scienceproject-managementquarto
3.5 match 26 stars 6.67 score 6 scriptsbioc
Organism.dplyr:dplyr-based Access to Bioconductor Annotation Resources
This package provides an alternative interface to Bioconductor 'annotation' resources, in particular the gene identifier mapping functionality of the 'org' packages (e.g., org.Hs.eg.db) and the genome coordinate functionality of the 'TxDb' packages (e.g., TxDb.Hsapiens.UCSC.hg38.knownGene).
Maintained by Martin Morgan. Last updated 5 months ago.
annotationsequencinggenomeannotationbioconductor-packagecore-package
3.4 match 3 stars 6.77 score 63 scripts 1 dependentsselesnow
rgoogleads:Loading Data from 'Google Ads API'
Interface for loading data from 'Google Ads API', see <https://developers.google.com/google-ads/api/docs/start>. Package provide function for authorization and loading reports.
Maintained by Alexey Seleznev. Last updated 2 months ago.
3.6 match 14 stars 6.40 score 15 scripts 1 dependentsrobjhyndman
tsfeatures:Time Series Feature Extraction
Methods for extracting various features from time series data. The features provided are those from Hyndman, Wang and Laptev (2013) <doi:10.1109/ICDMW.2015.104>, Kang, Hyndman and Smith-Miles (2017) <doi:10.1016/j.ijforecast.2016.09.004> and from Fulcher, Little and Jones (2013) <doi:10.1098/rsif.2013.0048>. Features include spectral entropy, autocorrelations, measures of the strength of seasonality and trend, and so on. Users can also define their own feature functions.
Maintained by Rob Hyndman. Last updated 8 months ago.
2.0 match 254 stars 11.47 score 268 scripts 22 dependentsbioc
TFutils:TFutils
This package helps users to work with TF metadata from various sources. Significant catalogs of TFs and classifications thereof are made available. Tools for working with motif scans are also provided.
Maintained by Vincent Carey. Last updated 4 months ago.
4.7 match 4.80 score 21 scriptsthibautjombart
adegenet:Exploratory Analysis of Genetic and Genomic Data
Toolset for the exploration of genetic and genomic data. Adegenet provides formal (S4) classes for storing and handling various genetic data, including genetic markers with varying ploidy and hierarchical population structure ('genind' class), alleles counts by populations ('genpop'), and genome-wide SNP data ('genlight'). It also implements original multivariate methods (DAPC, sPCA), graphics, statistical tests, simulation tools, distance and similarity measures, and several spatial methods. A range of both empirical and simulated datasets is also provided to illustrate various methods.
Maintained by Zhian N. Kamvar. Last updated 1 months ago.
1.8 match 182 stars 12.60 score 1.9k scripts 29 dependentsbioc
OrganismDbi:Software to enable the smooth interfacing of different database packages
The package enables a simple unified interface to several annotation packages each of which has its own schema by taking advantage of the fact that each of these packages implements a select methods.
Maintained by Bioconductor Package Maintainer. Last updated 5 months ago.
3.0 match 7.45 score 34 scripts 35 dependentsbioc
MGnifyR:R interface to EBI MGnify metagenomics resource
Utility package to facilitate integration and analysis of EBI MGnify data in R. The package can be used to import microbial data for instance into TreeSummarizedExperiment (TreeSE). In TreeSE format, the data is directly compatible with miaverse framework.
Maintained by Tuomas Borman. Last updated 5 months ago.
infrastructuredataimportmetagenomics
2.9 match 21 stars 7.61 score 32 scriptsipums
ipumsr:An R Interface for Downloading, Reading, and Handling IPUMS Data
An easy way to work with census, survey, and geographic data provided by IPUMS in R. Generate and download data through the IPUMS API and load IPUMS files into R with their associated metadata to make analysis easier. IPUMS data describing 1.4 billion individuals drawn from over 750 censuses and surveys is available free of charge from the IPUMS website <https://www.ipums.org>.
Maintained by Derek Burk. Last updated 19 days ago.
2.0 match 28 stars 11.07 score 720 scripts 2 dependentsbioc
SpliceWiz:interactive analysis and visualization of alternative splicing in R
The analysis and visualization of alternative splicing (AS) events from RNA sequencing data remains challenging. SpliceWiz is a user-friendly and performance-optimized R package for AS analysis, by processing alignment BAM files to quantify read counts across splice junctions, IRFinder-based intron retention quantitation, and supports novel splicing event identification. We introduce a novel visualization for AS using normalized coverage, thereby allowing visualization of differential AS across conditions. SpliceWiz features a shiny-based GUI facilitating interactive data exploration of results including gene ontology enrichment. It is performance optimized with multi-threaded processing of BAM files and a new COV file format for fast recall of sequencing coverage. Overall, SpliceWiz streamlines AS analysis, enabling reliable identification of functionally relevant AS events for further characterization.
Maintained by Alex Chit Hei Wong. Last updated 4 days ago.
softwaretranscriptomicsrnaseqalternativesplicingcoveragedifferentialsplicingdifferentialexpressionguisequencingcppopenmp
3.4 match 16 stars 6.41 score 8 scriptspaws-r
paws.security.identity:'Amazon Web Services' Security, Identity, & Compliance Services
Interface to 'Amazon Web Services' security, identity, and compliance services, including the 'Identity & Access Management' ('IAM') service for managing access to services and resources, and more <https://aws.amazon.com/>.
Maintained by Dyfan Jones. Last updated 4 days ago.
2.4 match 332 stars 9.17 score 15 dependentsshikokuchuo
mirai:Minimalist Async Evaluation Framework for R
Designed for simplicity, a 'mirai' evaluates an R expression asynchronously in a parallel process, locally or distributed over the network. The result is automatically available upon completion. Modern networking and concurrency, built on 'nanonext' and 'NNG' (Nanomsg Next Gen), ensures reliable and efficient scheduling over fast inter-process communications or TCP/IP secured by TLS. Distributed computing can launch remote resources via SSH or cluster managers. An inherently queued architecture handles many more tasks than available processes, and requires no storage on the file system. Innovative features include support for otherwise non-exportable reference objects, event-driven promises, and asynchronous parallel map.
Maintained by Charlie Gao. Last updated 3 days ago.
asyncasynchronous-tasksconcurrencydistributed-computinghigh-performance-computingparallel-computing
1.8 match 217 stars 11.94 score 130 scripts 7 dependentsimangr
frostr:R API to MET Norway's 'Frost' API
An R API to MET Norway's 'Frost' API <https://frost.met.no/index.html> to retrieve data as data frames. The 'Frost' API, and the underlying data, is made available by the Norwegian Meteorological Institute (MET Norway). The data and products are distributed under the Norwegian License for Open Data 2.0 (NLOD) <https://data.norge.no/nlod/en/2.0> and Creative Commons 4.0 <https://creativecommons.org/licenses/by/4.0/>.
Maintained by Iman Ghayoornia. Last updated 5 years ago.
frostfrost-apinorwaynorwegian-datanorwegian-weather-dataweather-apiweather-data
6.9 match 3 stars 3.18 score 4 scriptschgrl
bReeze:Functions for Wind Resource Assessment
A collection of functions to analyse, visualize and interpret wind data and to calculate the potential energy production of wind turbines.
Maintained by Christian Graul. Last updated 1 years ago.
5.0 match 20 stars 4.34 score 22 scriptsjhudsl
ottrpal:Companion Tools for Open-Source Tools for Training Resources (OTTR)
Tools for converting Open-Source Tools for Training Resources (OTTR) courses into Leanpub or Coursera courses. 'ottrpal' is for use with the OTTR Template repository to create courses.
Maintained by Candace Savonen. Last updated 14 days ago.
3.3 match 3 stars 6.50 score 10 scripts 1 dependentsazure
AzureVision:Interface to Azure Computer Vision Services
An interface to 'Azure Computer Vision' <https://docs.microsoft.com/azure/cognitive-services/Computer-vision/Home> and 'Azure Custom Vision' <https://docs.microsoft.com/azure/cognitive-services/custom-vision-service/home>, building on the low-level functionality provided by the 'AzureCognitive' package. These services allow users to leverage the cloud to carry out visual recognition tasks using advanced image processing models, without needing powerful hardware of their own. Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 4 years ago.
azure-cognitive-servicesazure-sdk-rcomputer-visioncustom-vision
4.3 match 5 stars 5.00 score 8 scriptsazure
AzureCognitive:Interface to Azure Cognitive Services
An interface to Azure Cognitive Services <https://docs.microsoft.com/en-us/azure/cognitive-services/>. Both an 'Azure Resource Manager' interface, for deploying Cognitive Services resources, and a client framework are supplied. While 'AzureCognitive' can be called by the end-user, it is meant to provide a foundation for other packages that will support specific services, like Computer Vision, Custom Vision, language translation, and so on. Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 4 years ago.
azure-cognitive-servicesazure-sdk-r
3.9 match 11 stars 5.52 score 4 scripts 1 dependentsjimbrig
rtraining:R Training Resources, Guides, Tips, and Knowledge Base
Houses variouse material realted to teaching R.
Maintained by Jimmy Briggs. Last updated 2 years ago.
best-practicescurationdeveloper-toolsdevelopmentdevelopment-environmentguideknowledgepackage-developmentsetupshiny-appstips-and-trickstrainingtraining-materialswalkthrough
6.0 match 4 stars 3.60 score 6 scriptsivaughan
econullnetr:Null Model Analysis for Ecological Networks
Tools for using null models to analyse ecological networks (e.g. food webs, flower-visitation networks, seed-dispersal networks) and detect resource preferences or non-random interactions among network nodes. Tools are provided to run null models, test for and plot preferences, plot and analyse bipartite networks, and export null model results in a form compatible with other network analysis packages. The underlying null model was developed by Agusti et al. (2003) Molecular Ecology <doi:10.1046/j.1365-294X.2003.02014.x> and the full application to ecological networks by Vaughan et al. (2018) econullnetr: an R package using null models to analyse the structure of ecological networks and identify resource selection. Methods in Ecology & Evolution, <doi:10.1111/2041-210X.12907>.
Maintained by Ian Vaughan. Last updated 4 years ago.
4.3 match 7 stars 5.04 score 31 scriptsropensci
bowerbird:Keep a Collection of Sparkly Data Resources
Tools to get and maintain a data repository from third-party data providers.
Maintained by Ben Raymond. Last updated 5 days ago.
ropensciantarcticsouthern oceandataenvironmentalsatelliteclimatepeer-reviewed
3.0 match 50 stars 7.16 score 16 scripts 1 dependentsbioc
miaSim:Microbiome Data Simulation
Microbiome time series simulation with generalized Lotka-Volterra model, Self-Organized Instability (SOI), and other models. Hubbell's Neutral model is used to determine the abundance matrix. The resulting abundance matrix is applied to (Tree)SummarizedExperiment objects.
Maintained by Yagmur Simsek. Last updated 5 months ago.
microbiomesoftwaresequencingdnaseqatacseqcoveragenetwork
3.2 match 21 stars 6.64 score 23 scriptsropensci
webchem:Chemical Information from the Web
Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.
Maintained by Tamás Stirling. Last updated 3 months ago.
cas-numberchemical-informationchemspideridentifierropensciwebscraping
2.0 match 165 stars 10.31 score 173 scripts 10 dependentsazure
AzureStor:Storage Management in 'Azure'
Manage storage in Microsoft's 'Azure' cloud: <https://azure.microsoft.com/en-us/product-categories/storage/>. On the admin side, 'AzureStor' includes features to create, modify and delete storage accounts. On the client side, it includes an interface to blob storage, file storage, and 'Azure Data Lake Storage Gen2': upload and download files and blobs; list containers and files/blobs; create containers; and so on. Authenticated access to storage is supported, via either a shared access key or a shared access signature (SAS). Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 2 years ago.
azure-data-lakeazure-sdk-razure-storageazure-storage-blobazure-storage-file
1.9 match 65 stars 10.74 score 298 scripts 4 dependentsmages
ChainLadder:Statistical Methods and Models for Claims Reserving in General Insurance
Various statistical methods and models which are typically used for the estimation of outstanding claims reserves in general insurance, including those to estimate the claims development result as required under Solvency II.
Maintained by Markus Gesmann. Last updated 1 months ago.
2.0 match 82 stars 10.04 score 196 scripts 2 dependentsmiferreiro
bdpar:Big Data Preprocessing Architecture
Provide a tool to easily build customized data flows to pre-process large volumes of information from different sources. To this end, 'bdpar' allows to (i) easily use and create new functionalities and (ii) develop new data source extractors according to the user needs. Additionally, the package provides by default a predefined data flow to extract and pre-process the most relevant information (tokens, dates, ... ) from some textual sources (SMS, Email, YouTube comments).
Maintained by Miguel Ferreiro-Díaz. Last updated 1 years ago.
custom-flowcustom-pipespreprocessingr6
3.8 match 8 stars 5.23 score 14 scriptsstatmanrobin
Stat2Data:Datasets for Stat2
Datasets for the textbook Stat2: Modeling with Regression and ANOVA (second edition). The package also includes data for the first edition, Stat2: Building Models for a World of Data and a few functions for plotting diagnostics.
Maintained by Robin Lock. Last updated 6 years ago.
4.0 match 5 stars 4.94 score 544 scriptsbioc
igvR:igvR: integrative genomics viewer
Access to igv.js, the Integrative Genomics Viewer running in a web browser.
Maintained by Arkadiusz Gladki. Last updated 5 months ago.
visualizationthirdpartyclientgenomebrowsers
2.4 match 43 stars 8.31 score 118 scriptsstan-dev
rstantools:Tools for Developing R Packages Interfacing with 'Stan'
Provides various tools for developers of R packages interfacing with 'Stan' <https://mc-stan.org>, including functions to set up the required package structure, S3 generics and default methods to unify function naming across 'Stan'-based R packages, and vignettes with recommendations for developers.
Maintained by Jonah Gabry. Last updated 2 months ago.
bayesian-data-analysisbayesian-statisticsdeveloper-toolsstan
1.5 match 50 stars 13.09 score 134 scripts 222 dependentsscholaempirica
reschola:The Schola Empirica Package
A collection of utilies, themes and templates for data analysis at Schola Empirica.
Maintained by Jan Netík. Last updated 5 months ago.
4.0 match 4 stars 4.83 score 14 scriptsyihui
xaringan:Presentation Ninja
Create HTML5 slides with R Markdown and the JavaScript library 'remark.js' (<https://remarkjs.com>).
Maintained by Yihui Xie. Last updated 12 months ago.
markdownnarutoninjapresentationpresentation-ninjaremarkjsrmarkdownrstudioslideshow
1.5 match 1.5k stars 12.78 score 948 scripts 11 dependentsmarkedmondson1234
googleAuthR:Authenticate and Create Google APIs
Create R functions that interact with OAuth2 Google APIs <https://developers.google.com/apis-explorer/> easily, with auto-refresh and Shiny compatibility.
Maintained by Erik Grönroos. Last updated 10 months ago.
apiauthenticationgooglegoogleauthroauth2-flowshiny
1.5 match 178 stars 12.84 score 804 scripts 13 dependentsncss-tech
SoilTaxonomy:A System of Soil Classification for Making and Interpreting Soil Surveys
Taxonomic dictionaries, formative element lists, and functions related to the maintenance, development and application of U.S. Soil Taxonomy. Data and functionality are based on official U.S. Department of Agriculture sources including the latest edition of the Keys to Soil Taxonomy. Descriptions and metadata are obtained from the National Soil Information System or Soil Survey Geographic databases. Other sources are referenced in the data documentation. Provides tools for understanding and interacting with concepts in the U.S. Soil Taxonomic System. Most of the current utilities are for working with taxonomic concepts at the "higher" taxonomic levels: Order, Suborder, Great Group, and Subgroup.
Maintained by Andrew Brown. Last updated 6 months ago.
great-groupncss-techsoilsoil-surveysoil-taxonomysubgroupsuborderusda
3.4 match 15 stars 5.65 scorebioc
xcms:LC-MS and GC-MS Data Analysis
Framework for processing and visualization of chromatographically separated and single-spectra mass spectral data. Imports from AIA/ANDI NetCDF, mzXML, mzData and mzML files. Preprocesses data for high-throughput, untargeted analyte profiling.
Maintained by Steffen Neumann. Last updated 3 days ago.
immunooncologymassspectrometrymetabolomicsbioconductorfeature-detectionmass-spectrometrypeak-detectioncpp
1.3 match 196 stars 14.31 score 984 scripts 11 dependentsinrae
airGRiwrm:'airGR' Integrated Water Resource Management
Semi-distributed Precipitation-Runoff Modeling based on 'airGR' package models integrating human infrastructures and their managements.
Maintained by David Dorchies. Last updated 6 months ago.
3.0 match 6.34 score 45 scriptschristopherkenny
bskyr:Interact with 'Bluesky' Social
Collect data from and make posts on 'Bluesky' Social via the Hypertext Transfer Protocol (HTTP) Application Programming Interface (API), as documented at <https://atproto.com/specs/xrpc>. This further supports broader queries to the Authenticated Transfer (AT) Protocol <https://atproto.com/> which 'Bluesky' Social relies on. Data is returned in a tidy format and posts can be made using a simple interface.
Maintained by Christopher T. Kenny. Last updated 1 months ago.
3.3 match 20 stars 5.66 score 23 scriptsjbengler
tidyplots:Tidy Plots for Scientific Papers
The goal of 'tidyplots' is to streamline the creation of publication-ready plots for scientific papers. It allows to gradually add, remove and adjust plot components using a consistent and intuitive syntax.
Maintained by Jan Broder Engler. Last updated 4 days ago.
2.0 match 482 stars 9.40 score 85 scriptsbioc
EnrichmentBrowser:Seamless navigation through combined results of set-based and network-based enrichment analysis
The EnrichmentBrowser package implements essential functionality for the enrichment analysis of gene expression data. The analysis combines the advantages of set-based and network-based enrichment analysis in order to derive high-confidence gene sets and biological pathways that are differentially regulated in the expression data under investigation. Besides, the package facilitates the visualization and exploration of such sets and pathways.
Maintained by Ludwig Geistlinger. Last updated 5 months ago.
immunooncologymicroarrayrnaseqgeneexpressiondifferentialexpressionpathwaysgraphandnetworknetworkgenesetenrichmentnetworkenrichmentvisualizationreportwriting
2.0 match 20 stars 9.37 score 164 scripts 3 dependents