Showing 200 of 357 total results
davidchall
ipaddress:Data Analysis for IP Addresses and Networks
Classes and functions for working with IP (Internet Protocol) addresses and networks, inspired by the Python 'ipaddress' module. Offers full support for both IPv4 and IPv6 (Internet Protocol versions 4 and 6) address spaces. It is specifically designed to work well with the 'tidyverse'.
Maintained by David Hall. Last updated 1 year ago.
cyber, data-analysis, ip-address, ipv4, ipv6, vctrs, cpp
55.5 match 32 stars 7.02 score 27 scripts 2 dependents
datawookie
emayili:Send Email Messages
A light, simple tool for sending emails with minimal dependencies.
Maintained by Andrew B. Collier. Last updated 1 month ago.
35.3 match 180 stars 9.59 score 95 scripts 3 dependents
dieghernan
nominatimlite:Interface with 'Nominatim' API Service
Lite interface for getting data from 'OSM' service 'Nominatim' <https://nominatim.org/release-docs/latest/>. Extract coordinates from addresses, find places near a set of coordinates and return spatial objects in 'sf' format.
Maintained by Diego Hernangómez. Last updated 1 month ago.
geocoding, openstreetmap, address, nominatim, reverse-geocoding, shapefile, spatial, api-wrapper, api, gis
22.3 match 20 stars 8.08 score 41 scripts 1 dependent
jessecambon
tidygeocoder:Geocoding Made Easy
An intuitive interface for getting data from geocoding services.
Maintained by Jesse Cambon. Last updated 4 months ago.
12.0 match 287 stars 11.35 score 1.0k scripts 9 dependents
jiefei-wang
aws.ecx:Communicating with AWS EC2 and ECS using AWS REST APIs
Provides functions for communicating with Amazon Web Services (AWS) Elastic Compute Cloud (EC2) and Elastic Container Service (ECS). The functions have the prefix 'ecs_' or 'ec2_' depending on the class of the API. Requests are sent via the REST API, and the parameters are given by the function arguments. The credentials can be set via 'aws_set_credentials'. The EC2 documentation can be found at <https://docs.aws.amazon.com/AWSEC2/latest/APIReference/Welcome.html> and the ECS documentation at <https://docs.aws.amazon.com/AmazonECS/latest/APIReference/Welcome.html>.
Maintained by Jiefei Wang. Last updated 3 years ago.
31.0 match 1 star 4.18 score 2 scripts
lindbrook
cholera:Amend, Augment and Aid Analysis of John Snow's Cholera Map
Amends errors, augments data and aids analysis of John Snow's map of the 1854 London cholera outbreak.
Maintained by lindbrook. Last updated 1 day ago.
cholera, data-visualization, datasets, epidemiology, john-snow, public-health, triangulation-delaunay, voronoi, voronoi-polygons
13.7 match 136 stars 9.33 score 95 scripts
rdatatable
data.table:Extension of `data.frame`
Fast aggregation of large data (e.g. 100GB in RAM), fast ordered joins, fast add/modify/delete of columns by group using no copies at all, list columns, friendly and fast character-separated-value read/write. Offers a natural and flexible syntax, for faster development.
Maintained by Tyson Barrett. Last updated 2 days ago.
5.3 match 3.7k stars 23.53 score 230k scripts 4.6k dependents
cran
podcleaner:Legacy Scottish Post Office Directories Cleaner
Attempts to clean optical character recognition (OCR) errors in legacy Scottish Post Office Directories. Further attempts to match records from trades and general directories.
Maintained by Olivier Bautheac. Last updated 3 years ago.
61.0 match 1.70 score
walkerke
tigris:Load Census TIGER/Line Shapefiles
Download TIGER/Line shapefiles from the United States Census Bureau (<https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html>) and load into R as 'sf' objects.
Maintained by Kyle Walker. Last updated 4 months ago.
7.0 match 331 stars 12.87 score 5.3k scripts 16 dependents
bnosac
udpipe:Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.
Maintained by Jan Wijffels. Last updated 2 years ago.
conll, dependency-parser, lemmatization, natural-language-processing, nlp, pos-tagging, r-pkg, rcpp, text-mining, tokenizer, udpipe, cpp
6.7 match 215 stars 11.83 score 1.2k scripts 9 dependents
davidchall
ggip:Data Visualization for IP Addresses and Networks
A 'ggplot2' extension that enables visualization of IP (Internet Protocol) addresses and networks. The address space is mapped onto the Cartesian coordinate system using a space-filling curve. Offers full support for both IPv4 and IPv6 (Internet Protocol versions 4 and 6) address spaces.
Maintained by David Hall. Last updated 2 years ago.
cyber, data-visualization, ggplot-extension, ggplot2, hilbert-curve, ip-address, ipv4, ipv6, space-filling-curves, cpp
13.3 match 27 stars 5.51 score 12 scripts
geomarker-io
addr:Clean, Parse, Harmonize, Match, and Geocode Messy Real-World Addresses
Addresses that were not validated at the time of collection are often heterogeneously formatted, making them difficult to compare or link to other sets of addresses. The addr package is designed to clean character strings of addresses, use the `usaddress` library to tag address components, and paste together select components to create a normalized address. Normalized addresses can be hashed to create hashdresses that can be used to merge with other sets of addresses.
Maintained by Cole Brokamp. Last updated 5 months ago.
15.4 match 2 stars 4.70 score 388 scripts
dieghernan
arcgeocoder:Geocoding with the 'ArcGIS' REST API Service
Lite interface for finding locations of addresses or businesses around the world using the 'ArcGIS' REST API service <https://developers.arcgis.com/rest/geocode/api-reference/overview-world-geocoding-service.htm>. Address text can be converted to location candidates and a location can be converted into an address. No API key required.
Maintained by Diego Hernangómez. Last updated 1 month ago.
geocoding, arcgis, address, reverse-geocoding, api-wrapper, api-rest, arcgis-api, gis
11.9 match 2 stars 5.56 score 15 scripts
degauss-org
dht:A Collection of Functions to Assist Building DeGAUSS Containers
degauss helper tools are used to develop and run DeGAUSS containers.
Maintained by Erika Rasnick. Last updated 1 year ago.
14.2 match 5 stars 4.51 score 16 scripts
ropensci
excluder:Checks for Exclusion Criteria in Online Data
Data that are collected through online sources such as Mechanical Turk may require excluding rows because of IP address duplication, geolocation, or completion duration. This package facilitates exclusion of these data for Qualtrics datasets.
Maintained by Jeffrey R. Stevens. Last updated 11 days ago.
datacleaning, exclusion, mturk, qualtrics
11.1 match 9 stars 5.51 score 18 scripts
ropensci
refsplitr:author name disambiguation, author georeferencing, and mapping of coauthorship networks with 'Web of Science' data
Tools to parse and organize reference records downloaded from the 'Web of Science' citation database into an R-friendly format, disambiguate the names of authors, geocode their locations, and generate/visualize coauthorship networks. This package has been peer-reviewed by rOpenSci (v. 1.0).
Maintained by Emilio Bruna. Last updated 7 months ago.
name disambiguation, bibliometrics, coauthorship, collaboration, georeferencing, metascience, references, scientometrics, science of science, web of science
10.8 match 55 stars 5.64 score 16 scripts
quanteda
quanteda:Quantitative Analysis of Textual Data
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
Maintained by Kenneth Benoit. Last updated 2 months ago.
corpus, natural-language-processing, quanteda, text-analytics, onetbb, cpp
3.6 match 851 stars 16.68 score 5.4k scripts 51 dependents
hughparsonage
healthyAddress:Convert Addresses to Standard Inputs
Efficient tools for parsing and standardizing Australian addresses from textual data. It utilizes optimized algorithms to accurately identify and extract components of addresses, such as street names, types, and postcodes, especially for large batched data in contexts where sending addresses to internet services may be slow or inappropriate. The core functionality is built on fast string processing techniques to handle variations in address formats and abbreviations commonly found in Australian address data. Designed for data scientists, urban planners, and logistics analysts, the package facilitates the cleaning and normalization of address information, supporting better data integration and analysis in urban studies, geography, and related fields.
Maintained by Hugh Parsonage. Last updated 2 months ago.
13.5 match 6 stars 4.42 score 11 scripts
snoweye
pbdZMQ:Programming with Big Data -- Interface to 'ZeroMQ'
'ZeroMQ' is a well-known library for high-performance asynchronous messaging in scalable, distributed applications. This package provides high level R wrapper functions to easily utilize 'ZeroMQ'. We mainly focus on interactive client/server programming frameworks. For convenience, a minimal 'ZeroMQ' library (4.2.2) is shipped with 'pbdZMQ', which can be used if no system installation of 'ZeroMQ' is available. A few wrapper functions compatible with 'rzmq' are also provided.
Maintained by Wei-Chen Chen. Last updated 6 months ago.
5.3 match 17 stars 9.92 score 46 scripts 26 dependents
themains
virustotal:R Client for the VirusTotal API
Use VirusTotal, a Google service that analyzes files and URLs for viruses, worms, trojans etc., provides category of the content hosted by a domain from a variety of prominent services, provides passive DNS information, among other things. See <http://www.virustotal.com> for more information.
Maintained by Gaurav Sood. Last updated 2 years ago.
cybersecurity, scan-files, trojans, virustotal
9.4 match 11 stars 5.47 score 18 scripts 1 dependent
ropensci
targets:Dynamic Function-Oriented 'Make'-Like Declarative Pipelines
Pipeline tools coordinate the pieces of computationally demanding analysis projects. The 'targets' package is a 'Make'-like pipeline tool for statistics and data science in R. The package skips costly runtime for tasks that are already up to date, orchestrates the necessary computation with implicit parallel computing, and abstracts files as R objects. If all the current output matches the current upstream code and data, then the whole pipeline is up to date, and the results are more trustworthy than otherwise. The methodology in this package borrows from GNU 'Make' (2015, ISBN:978-9881443519) and 'drake' (2018, <doi:10.21105/joss.00550>).
Maintained by William Michael Landau. Last updated 2 days ago.
data-science, high-performance-computing, make, peer-reviewed, pipeline, r-targetopia, reproducibility, reproducible-research, targets, workflow
3.4 match 973 stars 15.20 score 4.6k scripts 22 dependents
tbates
umx:Structural Equation Modeling and Twin Modeling in R
Quickly create, run, and report structural equation models, and twin models. See '?umx' for help, and umx_open_CRAN_page("umx") for NEWS. Timothy C. Bates, Michael C. Neale, Hermine H. Maes, (2019). umx: A library for Structural Equation and Twin Modelling in R. Twin Research and Human Genetics, 22, 27-41. <doi:10.1017/thg.2019.2>.
Maintained by Timothy C. Bates. Last updated 2 days ago.
behavior-genetics, genetics, openmx, psychology, sem, statistics, structural-equation-modeling, tutorials, twin-models, umx
5.3 match 44 stars 9.45 score 472 scripts
r-lib
whoami:Username, Full Name, Email Address, 'GitHub' Username of the Current User
Look up the username and full name of the current user, the current user's email address and 'GitHub' username, using various sources of system and configuration information.
Maintained by Gábor Csárdi. Last updated 5 months ago.
5.2 match 19 stars 9.48 score 36 scripts 39 dependents
biomodhub
biomod2:Ensemble Platform for Species Distribution Modeling
Functions for species distribution modeling, calibration and evaluation, ensemble of models, ensemble forecasting and visualization. The package can consistently run up to 10 single models on a presence/absence (or presence/pseudo-absence) dataset and combine them in ensemble models and ensemble projections. A number of other evaluation and visualisation tools are also available within the package.
Maintained by Maya Gueguen. Last updated 5 days ago.
3.4 match 95 stars 13.88 score 536 scripts 7 dependents
ropenspain
CatastRo:Interface to the API 'Sede Electronica Del Catastro'
Access public spatial data available under the 'INSPIRE' directive. Tools for downloading references and addresses of properties, as well as map images.
Maintained by Diego Hernangómez. Last updated 1 month ago.
catastro, gis, maps, ropenspain, spain, spatial, static-tiles
6.0 match 22 stars 7.17 score 14 scripts
bnosac
crfsuite:Conditional Random Fields for Labelling Sequential Data in Natural Language Processing
Wraps the 'CRFsuite' library <https://github.com/chokkan/crfsuite> allowing users to fit a Conditional Random Field model and to apply it on existing data. The focus of the implementation is in the area of Natural Language Processing where this R package allows you to easily build and apply models for named entity recognition, text chunking, part of speech tagging, intent recognition or classification of any category you have in mind. Next to training, a small web application is included in the package to allow you to easily construct training data.
Maintained by Jan Wijffels. Last updated 2 years ago.
chunking, conditional-random-fields, crf, crfsuite, data-science, intent-classification, natural-language-processing, ner, nlp, cpp
6.6 match 63 stars 6.34 score 35 scripts
shikokuchuo
nanonext:NNG (Nanomsg Next Gen) Lightweight Messaging Library
R binding for NNG (Nanomsg Next Gen), a successor to ZeroMQ. NNG is a socket library for reliable, high-performance messaging over in-process, IPC, TCP, WebSocket and secure TLS transports. Implements 'Scalability Protocols', a standard for common communications patterns including publish/subscribe, request/reply and service discovery. As its own threaded concurrency framework, provides a toolkit for asynchronous programming and distributed computing. Intuitive 'aio' objects resolve automatically when asynchronous operations complete, and synchronisation primitives allow R to wait upon events signalled by concurrent threads.
Maintained by Charlie Gao. Last updated 2 days ago.
concurrency, https, ipc-message, messaging-library, nng, rpc, socket-communication, synchronization-primitives, tcp-protocol, websocket, mbedtls
4.0 match 60 stars 9.81 score 28 scripts 9 dependents
hadley
pryr:Tools for Computing on the Language
Useful tools to pry back the covers of R and understand the language at a deeper level.
Maintained by Hadley Wickham. Last updated 1 year ago.
3.3 match 204 stars 11.85 score 1.9k scripts 56 dependents
mastropi
envnames:Track User-Defined Environment Names
Set of functions to keep track of user-defined environment names (which cannot be retrieved with the built-in function environmentName()). The package also provides functionality to search for objects in environments, deal with function calling chains, and retrieve an object's memory address.
Maintained by Daniel Mastropietro. Last updated 6 years ago.
8.1 match 2 stars 4.66 score 46 scripts
ifellows
wordcloud:Word Clouds
Functionality to create pretty word clouds, visualize differences and similarity between documents, and avoid over-plotting in scatter plots with text.
Maintained by Ian Fellows. Last updated 7 years ago.
3.4 match 2 stars 10.59 score 6.9k scripts 55 dependents
jl5000
tidyged:Handle GEDCOM Files Using Tidyverse Principles
Create and summarise family tree GEDCOM files using tidy dataframes.
Maintained by Jamie Lendrum. Last updated 3 years ago.
5.6 match 8 stars 5.96 score 23 scripts 3 dependents
dboslab
expowo:An R package for mining global plant diversity and distribution data
Produces diversity estimates and species lists with associated global distribution for any vascular plant family and genus from 'Plants of the World Online' database <https://powo.science.kew.org/>, by interacting with the source code of each plant taxon page. It also creates global maps of species richness, graphics of species discoveries and nomenclatural changes over time. For more details
Maintained by Debora Zuanny. Last updated 5 months ago.
4.5 match 8 stars 7.36 score 64 scripts
bioc
sesame:SEnsible Step-wise Analysis of DNA MEthylation BeadChips
Tools for analyzing Illumina Infinium DNA methylation arrays. SeSAMe provides utilities to support analyses of multiple generations of Infinium DNA methylation BeadChips, including preprocessing, quality control, visualization and inference. SeSAMe features accurate detection calling, intelligent inference of ethnicity and sex, and advanced quality control routines.
Maintained by Wanding Zhou. Last updated 2 months ago.
dnamethylation, methylationarray, preprocessing, qualitycontrol, bioinformatics, dna-methylation, microarray
3.6 match 69 stars 9.08 score 258 scripts 1 dependent
fmmattioni
downloadthis:Implement Download Buttons in 'rmarkdown'
Implement download buttons in HTML output from 'rmarkdown' without the need for 'runtime:shiny'.
Maintained by Felipe Mattioni Maturana. Last updated 5 months ago.
3.4 match 146 stars 9.63 score 856 scripts 1 dependent
cynkra
constructive:Display Idiomatic Code to Construct Most R Objects
Prints code that can be used to recreate R objects. In a sense it is similar to 'base::dput()' or 'base::deparse()' but 'constructive' strives to use idiomatic constructors.
Maintained by Antoine Fabri. Last updated 2 days ago.
3.8 match 137 stars 8.63 score 20 scripts
rstudio
gt:Easily Create Presentation-Ready Display Tables
Build display tables from tabular data with an easy-to-use set of functions. With its progressive approach, we can construct display tables with a cohesive set of table parts. Table values can be formatted using any of the included formatting functions. Footnotes and cell styles can be precisely added through a location targeting system. The way in which 'gt' handles things for you means that you don't often have to worry about the fine details.
Maintained by Richard Iannone. Last updated 11 days ago.
docx, easy-to-use, html, latex, rtf, summary-tables
1.7 match 2.1k stars 18.36 score 20k scripts 112 dependents
kadyb
rgugik:Search and Retrieve Spatial Data from 'GUGiK'
Automatic open data acquisition from resources of Polish Head Office of Geodesy and Cartography ('Główny Urząd Geodezji i Kartografii') (<https://www.gov.pl/web/gugik>). Available datasets include various types of numeric, raster and vector data, such as orthophotomaps, digital elevation models (digital terrain models, digital surface model, point clouds), state register of borders, spatial databases, geometries of cadastral parcels, 3D models of buildings, and more. It is also possible to geocode addresses or objects using the geocodePL_get() function.
Maintained by Krzysztof Dyba. Last updated 7 days ago.
cartography, geodesy, gis, open-data, poland
4.1 match 34 stars 7.69 score 30 scripts
r-arcgis
arcgisgeocode:A Robust Interface to ArcGIS 'Geocoding Services'
A very fast and robust interface to ArcGIS 'Geocoding Services'. Provides capabilities for reverse geocoding, finding address candidates, character-by-character search autosuggestion, and batch geocoding. The public 'ArcGIS World Geocoder' is accessible for free use via 'arcgisgeocode' for all services except batch geocoding. 'arcgisgeocode' also integrates with 'arcgisutils' to provide access to custom locators or private 'ArcGIS World Geocoder' hosted on 'ArcGIS Enterprise'. Learn more in the 'Geocode service' API reference <https://developers.arcgis.com/rest/geocode/api-reference/overview-world-geocoding-service.htm>.
Maintained by Josiah Parry. Last updated 2 months ago.
4.5 match 41 stars 6.82 score 20 scripts 1 dependent
r-prof
procmaps:Portable Address Space Mapping
Portable '/proc/self/maps' as a data frame. Determine which library or other region is mapped to a specific address of a process. -- R packages can contain native code, compiled to shared libraries at build or installation time. When loaded, each shared library occupies a portion of the address space of the main process. When only a machine instruction pointer is available (e.g. from a backtrace during error inspection or profiling), the address space map determines which library this instruction pointer corresponds to.
Maintained by Kirill Müller. Last updated 2 months ago.
5.8 match 3 stars 5.18 score 3 scripts 1 dependent
kasperwelbers
corpustools:Managing, Querying and Analyzing Tokenized Text
Provides text analysis in R, focusing on the use of a tokenized text format. In this format, the positions of tokens are maintained, and each token can be annotated (e.g., part-of-speech tags, dependency relations). Prominent features include advanced Lucene-like querying for specific tokens or contexts (e.g., documents, sentences), similarity statistics for words and documents, exporting to DTM for compatibility with many text analysis packages, and the possibility to reconstruct original text from tokens to facilitate interpretation.
Maintained by Kasper Welbers. Last updated 6 months ago.
4.0 match 31 stars 7.50 score 174 scripts 1 dependent
civisanalytics
civis:R Client for the 'Civis Platform API'
A convenient interface for making requests directly to the 'Civis Platform API' <https://www.civisanalytics.com/platform/>. Full documentation available 'here' <https://civisanalytics.github.io/civis-r/>.
Maintained by Peter Cooman. Last updated 2 months ago.
3.8 match 16 stars 7.84 score 144 scripts
mazamascience
MazamaLocationUtils:Manage Spatial Metadata for Known Locations
Utility functions for discovering and managing metadata associated with spatially unique "known locations". Applications include all fields of environmental monitoring (e.g. air and water quality) where data are collected at stationary sites.
Maintained by Jonathan Callahan. Last updated 3 months ago.
5.2 match 5.64 score 108 scripts
parklab
Nozzle.R1:Nozzle Reports
The Nozzle package provides an API to generate HTML reports with dynamic user interface elements based on JavaScript and CSS (Cascading Style Sheets). Nozzle was designed to facilitate summarization and rapid browsing of complex results in data analysis pipelines where multiple analyses are performed frequently on big data sets. The package can be applied to any project where user-friendly reports need to be created.
Maintained by Nils Gehlenborg. Last updated 10 years ago.
gehlenborglab, html-report, reproducible-research
5.4 match 68 stars 5.31 score 10 scripts 2 dependents
rstudio
reticulate:Interface to 'Python'
Interface to 'Python' modules, classes, and functions. When calling into 'Python', R data types are automatically converted to their equivalent 'Python' types. When values are returned from 'Python' to R they are converted back to R types. Compatible with all versions of 'Python' >= 2.7.
Maintained by Tomasz Kalinowski. Last updated 2 days ago.
1.3 match 1.7k stars 21.07 score 18k scripts 427 dependents
rstudio
httpuv:HTTP and WebSocket Server Library
Provides low-level socket and protocol support for handling HTTP and WebSocket requests directly from within R. It is primarily intended as a building block for other packages, rather than making it particularly easy to create complete web applications using httpuv alone. httpuv is built on top of the libuv and http-parser C libraries, both of which were developed by Joyent, Inc. (See LICENSE file for libuv and http-parser license information.)
Maintained by Winston Chang. Last updated 12 months ago.
1.8 match 235 stars 15.09 score 708 scripts 2.1k dependents
transportfoundry
censusr:Collect Data from the Census API
Use the US Census API to collect summary data tables for SF1 and ACS datasets at arbitrary geographies.
Maintained by Greg Macfarlane. Last updated 7 years ago.
5.4 match 5 stars 4.95 score 18 scripts
coolbutuseless
c64asm:6502 Assembler
A simple 6502 assembler written purely in R and leveraging R data structures for pre-computing character sets and images.
Maintained by mikefc. Last updated 1 year ago.
12.1 match 3 stars 2.22 score 11 scripts
jlacko
RCzechia:Spatial Objects of the Czech Republic
Administrative regions and other spatial objects of the Czech Republic.
Maintained by Jindra Lacko. Last updated 2 days ago.
3.9 match 25 stars 6.87 score 85 scripts
rfhb
ctrdata:Retrieve and Analyze Clinical Trials Data from Public Registers
A system for querying, retrieving and analyzing protocol- and results-related information on clinical trials from four public registers, the 'European Union Clinical Trials Register' ('EUCTR', <https://www.clinicaltrialsregister.eu/>), 'ClinicalTrials.gov' (<https://clinicaltrials.gov/>, also translating queries from the retired classic interface), the 'ISRCTN' (<http://www.isrctn.com/>) and the 'European Union Clinical Trials Information System' ('CTIS', <https://euclinicaltrials.eu/>). Trial information is downloaded, converted and stored in a database ('PostgreSQL', 'SQLite', 'DuckDB' or 'MongoDB'; via package 'nodbi'). Protocols, statistical analysis plans, informed consent sheets and other documents in registers associated with trials can also be downloaded. Other functions implement trial concepts canonically across registers, identify deduplicated records, easily find and extract variables (fields) of interest even from complex nested data as used by the registers, merge variables and update queries. The package can be used for monitoring, meta- and trend-analysis of the design and conduct as well as of the results of clinical trials across registers.
Maintained by Ralf Herold. Last updated 8 hours ago.
clinical-data, clinical-research, clinical-studies, clinical-trials, ctgov, database, duckdb, mongodb, nodbi, postgresql, register, sqlite, studies, trial
3.3 match 45 stars 7.92 score 32 scripts
tarakc02
rmapzen:Client for 'Mapzen' and Related Map APIs
Provides an interface to 'Mapzen'-based APIs (including geocode.earth, Nextzen, and NYC GeoSearch) for geographic search and geocoding, isochrone calculation, and vector data to draw map tiles. See <https://www.mapzen.com/documentation/> for more information. The original Mapzen has gone out of business, but 'rmapzen' can be set up to work with any provider who implements the Mapzen API.
Maintained by Tarak Shah. Last updated 1 year ago.
4.0 match 34 stars 6.39 score 72 scripts
walkerke
mapboxapi:R Interface to 'Mapbox' Web Services
Includes support for 'Mapbox' Navigation APIs, including directions, isochrones, and route optimization; the Search API for forward and reverse geocoding; the Maps API for interacting with 'Mapbox' vector tilesets and visualizing 'Mapbox' maps in R; and 'Mapbox Tiling Service' and 'tippecanoe' for generating map tiles. See <https://docs.mapbox.com/api/> for more information about the 'Mapbox' APIs.
Maintained by Kyle Walker. Last updated 2 months ago.
3.4 match 112 stars 7.61 score 304 scripts
feddelegrand7
algo:Implement an Address Search Auto Completion Menu on 'Shiny' Text Inputs Using the 'Algolia Places' 'Javascript' Library
Allows the user to implement an address search auto completion menu on 'shiny' text inputs. This is done using the 'Algolia Places' 'JavaScript' library. See <https://community.algolia.com/places/>.
Maintained by Mohamed El Fodil Ihaddaden. Last updated 5 years ago.
4.9 match 11 stars 5.08 score 11 scripts
hrbrmstr
cymruservices:Query 'Team Cymru' 'IP' Address, Autonomous System Number ('ASN'), Border Gateway Protocol ('BGP'), Bogon and 'Malware' Hash Data Services
A toolkit for querying 'Team Cymru' <http://team-cymru.org> 'IP' address, Autonomous System Number ('ASN'), Border Gateway Protocol ('BGP'), Bogon and 'Malware' Hash Data Services.
Maintained by Bob Rudis. Last updated 7 years ago.
bogons, malware-hash-registry, r-cyber, team-cymru-webservice
6.5 match 7 stars 3.69 score 14 scripts
rpolars
polars:Lightning-Fast 'DataFrame' Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Maintained by Soren Welling. Last updated 3 days ago.
2.0 match 499 stars 12.01 score 1.0k scripts 2 dependents
taylor-arnold
cleanNLP:A Tidy Data Model for Natural Language Processing
Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back end with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.
Maintained by Taylor B. Arnold. Last updated 10 months ago.
corenlp, natural-language-processing, spacy
2.9 match 214 stars 8.39 score 229 scripts
ip2location
ip2location:Lookup for IP Address Information
Enables the user to find the country, region, district, city, coordinates, zip code, time zone, ISP, domain name, connection type, area code, weather, MCC, MNC, mobile brand name, elevation, usage type, address type, IAB category and ASN that any IP address or hostname originates from. Supports IPv4 and IPv6. Please visit <https://www.ip2location.com> to learn more. You may also want to visit <https://lite.ip2location.com> for free database download. This package requires the 'IP2Location Python' module. At the terminal, please run 'pip install IP2Location' to install the module.
Maintained by Kai Wen Ooi. Last updated 2 years ago.
geolocation, geolocation-information, ip-geolocation, ip-lookup, ip2location, lookup, r-language
5.8 match 10 stars 4.00 score 1 script
wadpac
GGIR:Raw Accelerometer Data Analysis
A tool to process and analyse data collected with wearable raw acceleration sensors as described in Migueles and colleagues (JMPB 2019), and van Hees and colleagues (JApplPhysiol 2014; PLoSONE 2015). The package has been developed and tested for binary data from 'GENEActiv' <https://activinsights.com/>, binary (.gt3x) and .csv-export data from 'Actigraph' <https://theactigraph.com> devices, and binary (.cwa) and .csv-export data from 'Axivity' <https://axivity.com>. These devices are currently widely used in research on human daily physical activity. Further, the package can handle accelerometer data files from any other sensor brand, provided that the data are stored in csv format. The package also allows for external function embedding.
Maintained by Vincent T van Hees. Last updated 2 days ago.
accelerometer, activity-recognition, circadian-rhythm, movement-sensor, sleep
1.7 match 109 stars 13.20 score 342 scripts 3 dependents
oobianom
quickcode:Quick and Essential 'R' Tricks for Better Scripts
The NOT functions, 'R' tricks, and a compilation of simple, quick, and often-used 'R' code snippets to improve your scripts. Improve the quality and reproducibility of 'R' scripts.
Maintained by Obinna Obianom. Last updated 14 days ago.
2.9 match 5 stars 7.76 score 7 scripts 6 dependents
gavinrozzi
njgeo:Tools for Geocoding Addresses in New Jersey using the 'NJOGIS' API
Provides an R interface to free geocoding REST APIs maintained by the New Jersey Office of GIS <https://njgin.nj.gov/njgin/edata/geocoding/index.html#!/> and commonly used shapefiles.
Maintained by Gavin Rozzi. Last updated 1 years ago.
new-jerseygeocodinggeocoderarcgis
8.2 match 2.70 score 3 scripts
lindbrook
packageRank:Computation and Visualization of Package Download Counts and Percentile Ranks
Compute and visualize package download counts and percentile ranks from Posit/RStudio's CRAN mirror.
Maintained by lindbrook. Last updated 4 days ago.
3.6 match 28 stars 6.13 score 27 scripts
trinker
qdapRegex:Regular Expression Removal, Extraction, and Replacement Tools
A collection of regular expression tools associated with the 'qdap' package that may be useful outside of the context of discourse analysis. Tools include removal/extraction/replacement of abbreviations, dates, dollar amounts, email addresses, hash tags, numbers, percentages, citations, person tags, phone numbers, times, and zip codes.
Maintained by Tyler Rinker. Last updated 1 years ago.
2.3 match 50 stars 9.48 score 502 scripts 41 dependents
aleksanderbl29
dawaR:An API Wrapper for 'DAWA' - 'The Danish Address Web API'
Functions for interacting with all sections of the official 'Danish Address Web API' (also known as 'DAWA') <https://api.dataforsyningen.dk>. The development of this package is completely independent from the government agency, Klimadatastyrelsen, which maintains the API.
Maintained by Aleksander Bang-Larsen. Last updated 1 months ago.
3.3 match 3 stars 6.49 score 9 scripts 1 dependents
ropengov
giscoR:Download Map Data from GISCO API - Eurostat
Tools to download data from the GISCO (Geographic Information System of the Commission) Eurostat database <https://ec.europa.eu/eurostat/web/gisco>. Global and European map data available. This package is in no way officially related to or endorsed by Eurostat.
Maintained by Diego Hernangómez. Last updated 30 days ago.
ropengovspatialapi-wrappereurostatgiscothematic-mapseurostat-dataggplot2gis
2.0 match 75 stars 10.70 score 424 scripts 5 dependents
vinhdizzo
IRexamples:Collection of Practical Institutional Research Examples and Tutorials
Provides examples of code for analyzing data or accomplishing tasks that may be useful to institutional or educational researchers.
Maintained by Vinh Nguyen. Last updated 2 years ago.
4.3 match 4 stars 5.00 score 4 scripts
ropenspain
CatastRoEus:Interface to the 'INSPIRE' services of 'Catastro de Bizkaia', 'Catastro de Gipuzkua', 'Catastro de Araba'
Access public spatial data available under the 'INSPIRE' directive. Tools for downloading references, buildings and addresses of properties in Pais Vasco (Spain).
Maintained by Jimena Salinero. Last updated 6 months ago.
basquecountrycatastromapsropenspainspain
5.7 match 3.74 score 3 scripts
paulhendricks
generator:Generate Data Containing Fake Personally Identifiable Information
Allows users to quickly and easily generate fake data containing Personally Identifiable Information (PII) through convenience functions.
Maintained by Paul Hendricks. Last updated 8 years ago.
3.5 match 24 stars 5.99 score 81 scripts
trinker
textclean:Text Cleaning Tools
Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
Maintained by Tyler Rinker. Last updated 3 years ago.
data-mungingemoticonsregextext-analysistext-cleaning
2.0 match 248 stars 10.08 score 760 scripts 22 dependents
mrcieu
ieugwasr:Interface to the 'OpenGWAS' Database API
Interface to the 'OpenGWAS' database API <https://api.opengwas.io/api/>. Includes a wrapper to make generic calls to the API, plus convenience functions for specific queries.
Maintained by Gibran Hemani. Last updated 3 days ago.
1.8 match 89 stars 10.71 score 404 scripts 6 dependents
talgalili
installr:Using R to Install Stuff on Windows OS (Such As: R, 'Rtools', 'RStudio', 'Git', and More!)
R is great for installing software. Through the 'installr' package you can automate the updating of R (on Windows, using updateR()) and install new software. Software installation is initiated through a GUI (just run installr()), or through functions such as: install.Rtools(), install.pandoc(), install.git(), and many more. The updateR() command performs the following: finding the latest R version, downloading it, running the installer, deleting the installation file, copying and updating old packages to the new R installation.
Maintained by Tal Galili. Last updated 1 years ago.
1.9 match 273 stars 10.19 score 1.2k scripts
ropensci
geonames:Interface to the "Geonames" Spatial Query Web Service
The web service at <https://www.geonames.org/> provides a number of spatial data queries, including administrative area hierarchies, city locations and some country postal code queries. A (free) username is required and rate limits exist.
Maintained by Barry Rowlingson. Last updated 6 years ago.
2.3 match 37 stars 8.45 score 165 scripts 21 dependents
paulhendricks
detector:Detect Data Containing Personally Identifiable Information
Allows users to quickly and easily detect data containing Personally Identifiable Information (PII) through convenience functions.
Maintained by Paul Hendricks. Last updated 8 years ago.
3.5 match 15 stars 5.34 score 29 scripts
tconwell
html5:Creates Valid HTML5 Strings
Generates valid HTML tag strings for HTML5 elements documented by Mozilla. Attributes are passed as named lists, with names being the attribute name and values being the attribute value. Attribute values are automatically double-quoted. To declare a DOCTYPE, wrap html() with function doctype(). Mozilla's documentation for HTML5 is available here: <https://developer.mozilla.org/en-US/docs/Web/HTML/Element>. Elements marked as obsolete are not included.
Maintained by Timothy Conwell. Last updated 2 years ago.
5.2 match 1 stars 3.65 score 1 scripts 3 dependents
wrathematics
getip:'IP' Address 'Lookup'
A micro-package for getting your 'IP' address, either the local/internal or the public/external one. Currently only 'IPv4' addresses are supported.
Maintained by Drew Schmidt. Last updated 1 years ago.
4.2 match 4 stars 4.51 score 11 scripts 3 dependents
kasperwelbers
tokenbrowser:Create Full Text Browsers from Annotated Token Lists
Create browsers for reading full texts from a token list format. Information obtained from text analyses (e.g., topic modeling, word scaling) can be used to annotate the texts.
Maintained by Kasper Welbers. Last updated 4 years ago.
3.5 match 7 stars 5.38 score 13 scripts 5 dependents
ropensci
rebird:R Client for the eBird Database of Bird Observations
A programmatic client for the eBird database (<https://ebird.org/home>), including functions for searching for bird observations by geographic location (latitude, longitude), eBird hotspots, location identifiers, by notable sightings, by region, and by taxonomic name.
Maintained by Sebastian Pardo. Last updated 1 months ago.
birdsbirdingebirddatabasedatabiologyobservationssightingsornithologyebird-apiebird-webservicesspocc
1.8 match 90 stars 10.43 score 73 scripts 6 dependents
daniels08663
KOR.addrlink:Matching Address Data to Reference Index
Matches a data set with semi-structured address data, e.g., street and house number as a concatenated string, wrongly spelled street names or non-existing house numbers to a reference index. The methods are specifically designed for German municipalities ('KOR'-community) and German address schemes.
Maintained by Daniel Schürmann. Last updated 1 years ago.
9.2 match 2.00 score
ipeagit
enderecobr:Padronizador de Endereços Brasileiros (Brazilian Addresses Standardizer)
Standardizes Brazilian addresses using different criteria. Standardization methods include only basic string manipulation, not supporting probabilistic matches between strings.
Maintained by Daniel Herszenhut. Last updated 25 days ago.
3.4 match 3 stars 5.41 score 10 scripts
davidasmith
whatthreewords:Work with the 'what3words' API for Easy Location Referencing
Use the 'what3words' API <https://developer.what3words.com/public-api> to return three words which uniquely identify every 3m x 3m square on Earth. It is also possible to return coordinates from any valid three words location. Supports multiple languages.
Maintained by David Smith. Last updated 1 years ago.
geocodinggeolocationlocationwhat3wordswhat3words-api
4.9 match 3.70 score 6 scripts
curycu
valuemap:Making Choropleth Map
You can easily visualize your 'sf' polygons or a data.frame with h3 addresses. While the 'leaflet' package is too raw for data analysis, this package can save data analysts effort and time with preset visualization options.
Maintained by Huncheol Ha. Last updated 2 years ago.
5.7 match 3 stars 3.18 score 5 scripts
laresbernardo
lares:Analytics & Machine Learning Sidekick
Auxiliary package for better/faster analytics, visualization, data mining, and machine learning tasks. With a wide variety of function families, like Machine Learning, Data Wrangling, Marketing Mix Modeling (Robyn), Exploratory, API, and Scrapper, it helps the analyst or data scientist get quick and robust results, without the need for repetitive coding or advanced R programming skills.
Maintained by Bernardo Lares. Last updated 24 days ago.
analyticsapiautomationautomldata-sciencedescriptive-statisticsh2omachine-learningmarketingmmmpredictive-modelingpuzzlerlanguagerobynvisualization
1.8 match 233 stars 9.84 score 185 scripts 1 dependents
stopsack
batchtma:Batch Effect Adjustments
Different adjustment methods for batch effects in biomarker data, such as from tissue microarrays. Some methods attempt to retain differences between batches that may be due to between-batch differences in "biological" factors that influence biomarker values.
Maintained by Konrad Stopsack. Last updated 9 months ago.
batch-effectsmeasurement-errortissue-microarray-analysis
4.7 match 1 stars 3.70 score 3 scripts
munterfi
hereR:'sf'-Based Interface to the 'HERE' REST APIs
Interface to the 'HERE' REST APIs <https://developer.here.com/develop/rest-apis>: (1) geocode and autosuggest addresses or reverse geocode POIs using the 'Geocoder' API; (2) route directions, travel distance or time matrices and isolines using the 'Routing', 'Matrix Routing' and 'Isoline Routing' APIs; (3) request real-time traffic flow and incident information from the 'Traffic' API; (4) request public transport connections and nearby stations from the 'Public Transit' API; (5) request intermodal routes using the 'Intermodal Routing' API; (6) get weather forecasts, reports on current weather conditions, astronomical information and alerts at a specific location from the 'Destination Weather' API. Locations, routes and isolines are returned as 'sf' objects.
Maintained by Merlin Unterfinger. Last updated 24 days ago.
apigeocodinggishere-technologiesisolineroutingrspatialtrafficweather
2.0 match 90 stars 8.70 score 63 scripts
eddelbuettel
pinp:'pinp' is not 'PNAS'
A 'PNAS'-alike style for 'rmarkdown', derived from the 'Proceedings of the National Academy of Sciences of the United States of America' ('PNAS') 'LaTeX' style, and adapted for use with 'markdown' and 'pandoc'.
Maintained by Dirk Eddelbuettel. Last updated 1 months ago.
2.2 match 149 stars 7.81 score 2 scripts 1 dependents
ropensci
opencage:Geocode with the OpenCage API
Geocode with the OpenCage API, either from place name to longitude and latitude (forward geocoding) or from longitude and latitude to the name and address of a location (reverse geocoding), see <https://opencagedata.com>.
Maintained by Daniel Possenriede. Last updated 2 months ago.
geocodegeocoderopencageopencage-apiopencage-geocoderpeer-reviewedplacenamesrspatial
2.0 match 87 stars 8.39 score 79 scripts
cloudyr
googleComputeEngineR:R Interface with Google Compute Engine
Interact with the 'Google Compute Engine' API in R. Lets you create, start and stop instances in the 'Google Cloud'. Support for preconfigured instances, with templates for common R needs.
Maintained by Mark Edmondson. Last updated 1 days ago.
apicloud-computingcloudyrgoogle-cloudgoogleauthrlaunching-virtual-machines
1.7 match 152 stars 9.73 score 235 scripts
rstudio
shinyvalidate:Input Validation for Shiny Apps
Improves the user experience of Shiny apps by helping to provide feedback when required inputs are missing, or input values are not valid.
Maintained by Carson Sievert. Last updated 1 years ago.
1.8 match 112 stars 9.10 score 316 scripts 13 dependents
elipousson
sfext:Extra Functions for Simple Feature Data
Extra functions with additional options for reading, writing, and transforming spatial data. Includes a variety of utility functions for working with tabular data with coordinates and distance and area units.
Maintained by Eli Pousson. Last updated 4 months ago.
2.7 match 20 stars 6.03 score 24 scripts 5 dependents
jnpeng
endogeneity:Recursive Two-Stage Models to Address Endogeneity
Various recursive two-stage models to address the endogeneity issue of treatment variables in observational studies or mediators in experiments. The details of the models are discussed in Peng (2023) <doi:10.1287/isre.2022.1113>.
Maintained by Jing Peng. Last updated 2 months ago.
8.1 match 2.00 score 2 scripts
uribo
zipangu:Japanese Utility Functions and Data
Some data handled by Japanese R users require unique operations and processing, owing to address, Kanji, and traditional year representations. 'zipangu' transforms data specific to Japan into something more general.
Maintained by Shinya Uryu. Last updated 4 months ago.
2.5 match 56 stars 6.38 score 41 scripts 1 dependents
bioc
spoon:Address the Mean-variance Relationship in Spatial Transcriptomics Data
This package addresses the mean-variance relationship in spatially resolved transcriptomics data. Precision weights are generated for individual observations using Empirical Bayes techniques. These weights are used to rescale the data and covariates, which are then used as input in spatially variable gene detection tools.
Maintained by Kinnary Shah. Last updated 1 months ago.
spatialsinglecelltranscriptomicsgeneexpressionpreprocessing
3.4 match 4.76 score 19 scripts
rmi-pacta
pacta.multi.loanbook:Run 'PACTA' on Multiple Loan Books Easily
Run Paris Agreement Capital Transition Assessment ('PACTA') analyses on multiple loan books in a structured way. Provides access to standard 'PACTA' metrics and additional 'PACTA'-related metrics for multiple loan books. Results take the form of 'csv' files and plots and are exported to user-specified project paths.
Maintained by Jacob Kastl. Last updated 2 days ago.
climate-changepactapactaversesustainable-finance
2.5 match 6.48 score 4 scripts
dkyleward
ipfr:List Balancing for Reweighting and Population Synthesis
Performs iterative proportional updating given a seed table and an arbitrary number of marginal distributions. This is commonly used in population synthesis, survey raking, matrix rebalancing, and other applications. For example, a household survey may be weighted to match the known distribution of households by size from the census. An origin/destination trip matrix might be balanced to match traffic counts. The approach used by this package is based on a paper from Arizona State University (Ye, Xin, et al. (2009) <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.537.723&rep=rep1&type=pdf>). Some enhancements have been made to their work including primary and secondary target balance/importance, general marginal agreement, and weight restriction.
Maintained by Kyle Ward. Last updated 5 years ago.
3.1 match 5 stars 5.06 score 23 scripts
dipterix
dipsaus:A Dipping Sauce for Data Analysis and Visualizations
Works as an "add-on" to packages like 'shiny', 'future', as well as 'rlang', and provides utility functions. Just like dipping sauce adding flavors to potato chips or pita bread, 'dipsaus' for data analysis and visualizations adds handy functions and enhancements to popular packages. The goal is to provide simple solutions that are frequently asked for online, such as how to synchronize 'shiny' inputs without freezing the app, or how to get memory size on 'Linux' or 'MacOS' systems. The enhancements roughly fall into these four categories: 1. 'shiny' input widgets; 2. high-performance computing using the 'future' package; 3. modify R calls and convert among numbers, strings, and other objects; 4. utility functions to get system information such as CPU chip-set, memory limit, etc.
Maintained by Zhengjia Wang. Last updated 5 days ago.
2.0 match 13 stars 7.90 score 85 scripts 3 dependents
olink-proteomics
OlinkAnalyze:Facilitate Analysis of Proteomic Data from Olink
A collection of functions to facilitate analysis of proteomic data from Olink, primarily NPX data that has been exported from Olink Software. The functions also work on QUANT data from Olink by log-transforming the QUANT data. The functions are focused on reading data, facilitating data wrangling and quality control analysis, performing statistical analysis and generating figures to visualize the results of the statistical analysis. The goal of this package is to help users extract biological insights from proteomic data run on the Olink platform.
Maintained by Kathleen Nevola. Last updated 20 days ago.
olinkproteomicsproteomics-data-analysis
1.6 match 104 stars 9.72 score 61 scripts
azure
azuremlsdk:Interface to the 'Azure Machine Learning' 'SDK'
Interface to the 'Azure Machine Learning' Software Development Kit ('SDK'). Data scientists can use the 'SDK' to train, deploy, automate, and manage machine learning models on the 'Azure Machine Learning' service. To learn more about 'Azure Machine Learning' visit the website: <https://docs.microsoft.com/en-us/azure/machine-learning/service/overview-what-is-azure-ml>.
Maintained by Diondra Peck. Last updated 3 years ago.
amlcomputeazureazure-machine-learningazuremldsimachine-learningrstudiosdk-r
1.6 match 106 stars 8.91 score 221 scripts
pachadotdev
analogsea:Interface to 'DigitalOcean'
Provides a set of functions for interacting with the 'DigitalOcean' API <https://www.digitalocean.com/>, including creating images, destroying them, rebooting, getting details on regions, and available images.
Maintained by Mauricio Vargas. Last updated 2 years ago.
1.9 match 159 stars 7.56 score 100 scripts 1 dependents
crunch-io
crunch:Crunch.io Data Tools
The Crunch.io service <https://crunch.io/> provides a cloud-based data store and analytic engine, as well as an intuitive web interface. Using this package, analysts can interact with and manipulate Crunch datasets from within R. Importantly, this allows technical researchers to collaborate naturally with team members, managers, and clients who prefer a point-and-click interface.
Maintained by Greg Freedman Ellis. Last updated 11 days ago.
1.3 match 9 stars 10.53 score 200 scripts 2 dependents
bioc
wateRmelon:Illumina DNA methylation array normalization and metrics
15 flavours of betas and three performance metrics, with methods for objects produced by methylumi and minfi packages.
Maintained by Leo C Schalkwyk. Last updated 4 months ago.
dnamethylationmicroarraytwochannelpreprocessingqualitycontrol
1.8 match 7.75 score 247 scripts 2 dependents
yuqitian35
multipleDL:Addressing Detection Limits by Cumulative Probability Models (CPMs)
Build CPMs (cumulative probability models, also known as cumulative link models) to account for detection limits (both single and multiple detection limits) in response variables. Conditional quantiles and conditional CDFs can be calculated based on fitted models. The package implements methods described in Tian, Y., Li, C., Tu, S., James, N. T., Harrell, F. E., & Shepherd, B. E. (2022). "Addressing Detection Limits with Semiparametric Cumulative Probability Models". <arXiv:2207.02815>.
Maintained by Yuqi Tian. Last updated 1 years ago.
5.1 match 2.70 score 2 scripts
richfitz
storr:Simple Key Value Stores
Creates and manages simple key-value stores. These can use a variety of approaches for storing the data. This package implements the base methods and support for file system, in-memory and DBI-based database stores.
Maintained by Rich FitzJohn. Last updated 4 years ago.
1.3 match 117 stars 10.21 score 57 scripts 33 dependents
zhukovyuri
SUNGEO:Sub-National Geospatial Data Archive: Geoprocessing Toolkit
Tools for integrating spatially-misaligned GIS datasets. Part of the Sub-National Geospatial Data Archive System.
Maintained by Yuri M. Zhukov. Last updated 10 months ago.
3.9 match 5 stars 3.42 score 8 scripts
kjhealy
gssrdoc:Document General Social Survey Variable
The General Social Survey (GSS) is a long-running, mostly annual survey of US households. It is administered by the National Opinion Research Center (NORC). This package contains a tibble with information on the survey variables, together with every variable documented as an R help page. For more information on the GSS see <http://gss.norc.org>.
Maintained by Kieran Healy. Last updated 11 months ago.
5.8 match 2.28 score 38 scripts
k-m-m
minpack.lm:R Interface to the Levenberg-Marquardt Nonlinear Least-Squares Algorithm Found in MINPACK, Plus Support for Bounds
The nls.lm function provides an R interface to lmder and lmdif from the MINPACK library, for solving nonlinear least-squares problems by a modification of the Levenberg-Marquardt algorithm, with support for lower and upper parameter bounds. The implementation can be used via nls-like calls using the nlsLM function.
Maintained by Katharine M. Mullen. Last updated 2 years ago.
1.8 match 1 stars 7.32 score 1.7k scripts 181 dependents
kgjerde
corporaexplorer:A 'Shiny' App for Exploration of Text Collections
Facilitates dynamic exploration of text collections through an intuitive graphical user interface and the power of regular expressions. The package contains 1) a helper function to convert a data frame to a 'corporaexplorerobject' and 2) a 'Shiny' app for fast and flexible exploration of a 'corporaexplorerobject'. The package also includes demo apps with which one can explore Jane Austen's novels and the State of the Union Addresses (data from the 'janeaustenr' and 'sotu' packages respectively).
Maintained by Kristian Lundby Gjerde. Last updated 7 months ago.
corporacorpusshinytext-analysis
2.3 match 65 stars 5.39 score 38 scripts
snoweye
pbdMPI:R Interface to MPI for HPC Clusters (Programming with Big Data Project)
A simplified, efficient interface to MPI for HPC clusters. It is a derivation and rethinking of the Rmpi package. pbdMPI embraces the prevalent parallel programming style on HPC clusters. Beyond the interface, a collection of functions for global work with distributed data and resource-independent RNG reproducibility is included. It is based on S4 classes and methods.
Maintained by Wei-Chen Chen. Last updated 6 months ago.
1.8 match 2 stars 7.11 score 179 scripts 3 dependents
ipea
utilsIPEA:IPEA Common Functions
The most used functions at IPEA (Instituto de Pesquisa Economica Aplicada). Most of the functions deal with Brazilian names. They can guess a woman's maiden name, extract prepositions, or extract the first name.
Maintained by Gustavo Coelho. Last updated 7 years ago.
4.5 match 2.70 score 7 scripts
coolbutuseless
c64vice:Interface to Binary Monitor in VICE C64 Emulator
Interface to the binary monitor in VICE - the c64 emulator.
Maintained by mikefc. Last updated 1 years ago.
5.6 match 2 stars 2.08 score 12 scripts
framverse
TAMMsupport:Streamline working with Terminal Area Management Modules
A convenient tool for interfacing with Terminal Area Management Modules (TAMMs) in R environments.
Maintained by Collin Edwards. Last updated 2 months ago.
3.5 match 3 stars 3.35 score 5 scripts
cran
BeyondBenford:Compare the Goodness of Fit of Benford's and Blondeau Da Silva's Digit Distributions to a Given Dataset
Compares the goodness of fit of Benford's and Blondeau Da Silva's digit distributions to a dataset. It is used to check whether the data distribution is consistent with the theoretical distributions highlighted by Blondeau Da Silva (through the dat.distr() function): this ideal theoretical distribution must be at least approximately followed by the data for the use of Blondeau Da Silva's model to be well-founded. It also plots histograms of digit distributions, both observed in the dataset and given by the two theoretical approaches (with the digit.ditr() function). Finally, it quantifies the goodness of fit via Pearson's chi-squared test (with the chi2() function).
Maintained by Blondeau Da Silva Stephane. Last updated 5 years ago.
11.3 match 1.00 score
irworkshop
campfin:Wrangle Campaign Finance Data
Explore and normalize American campaign finance data. Created by the Investigative Reporting Workshop to facilitate work on The Accountability Project, an effort to collect public data into a central, standard database that is more easily searched: <https://publicaccountability.org/>.
Maintained by Kiernan Nicholls. Last updated 1 years ago.
campaign-financedata-journalism
2.0 match 17 stars 5.66 score 54 scripts
muschellij2
rscopus:Scopus Database 'API' Interface
Uses Elsevier 'Scopus' API <https://dev.elsevier.com/sc_apis.html> to download information about authors and their citations.
Maintained by John Muschelli. Last updated 1 years ago.
1.2 match 77 stars 9.33 score 124 scripts 3 dependents
zuoyi93
ProSGPV:Penalized Regression with Second-Generation P-Values
Implementation of penalized regression with second-generation p-values for variable selection. The algorithm can handle linear regression, GLM, and Cox regression. S3 methods print(), summary(), coef(), predict(), and plot() are available for the algorithm. Technical details can be found at Zuo et al. (2021) <doi:10.1080/00031305.2021.1946150>.
Maintained by Yi Zuo. Last updated 4 years ago.
2.3 match 5 stars 4.70 score 9 scripts
bioc
gwascat:representing and modeling data in the EMBL-EBI GWAS catalog
Represent and model data in the EMBL-EBI GWAS catalog.
Maintained by VJ Carey. Last updated 5 months ago.
1.8 match 6.05 score 110 scripts 2 dependents
inbo
multimput:Using Multiple Imputation to Address Missing Data
Accompanying package for the paper: Working with population totals in the presence of missing data comparing imputation methods in terms of bias and precision. Published in 2017 in the Journal of Ornithology, volume 158, pages 603–615 (<doi:10.1007/s10336-016-1404-9>).
Maintained by Thierry Onkelinx. Last updated 17 days ago.
2.9 match 1 stars 3.62 score 14 scripts 1 dependents
judith-bourque
locateip:Locate IP Addresses with 'ip-api'
Download Internet Protocol (IP) address location and more from the 'ip-api' application programming interface (API) <https://ip-api.com/>. The package makes it easy to get the latitude, longitude, country, region, and organisation associated with the provided IP address. The information is conveniently returned in a rectangular format.
Maintained by Judith Bourque. Last updated 2 years ago.
3.8 match 2.70 score 3 scripts
thomasp85
patchwork:The Composer of Plots
The 'ggplot2' package provides a strong API for sequentially building up a plot, but does not concern itself with composition of multiple plots. 'patchwork' is a package that expands the API to allow for arbitrarily complex composition of plots by, among others, providing mathematical operators for combining multiple plots. Other packages that try to address this need (but with a different approach) are 'gridExtra' and 'cowplot'.
Maintained by Thomas Lin Pedersen. Last updated 6 months ago.
ggplot-extensionggplot2visualization
0.5 match 2.5k stars 19.79 score 82k scripts 657 dependents
schnorr
starvz:R-Based Visualization Techniques for Task-Based Applications
Performance analysis workflow that combines the power of the R language (and the tidyverse realm) and many auxiliary tools to provide a consistent, flexible, extensible, fast, and versatile framework for the performance analysis of task-based applications that run on top of the StarPU runtime (with its MPI (Message Passing Interface) layer for multi-node support). Its goal is to provide a fruitful prototypical environment to conduct performance analysis hypothesis-checking for task-based applications that run on heterogeneous (multi-GPU, multi-core) multi-node HPC (High-performance computing) platforms.
Maintained by Lucas Leandro Nesi. Last updated 5 months ago.
2.0 match 13 stars 4.94 score 27 scripts
ip2location
ip2locationio:Lookup Geolocation and Proxy Information using 'IP2Location.io' API
Query for enriched data such as country, region, city, latitude & longitude, ZIP code, time zone, Autonomous System, Internet Service Provider, domain, net speed, International direct dialing (IDD) code, area code, weather station data, mobile data, elevation, usage type, address type, advertisement category, and proxy data with an IP address. This package uses the 'IP2Location.io' API to query this data. To get started with a free API key, sign up here <https://www.ip2location.io/sign-up?ref=1>.
Maintained by IP2Location.io. Last updated 5 months ago.
9.8 match 1.00 score 1 scripts
ryantmoore
muRL:Mailmerge using R, LaTeX, and the Web
Provides mailmerge methods for reading spreadsheets of addresses and other relevant information to create standardized but customizable letters. Provides a method for mapping US ZIP codes, including those of letter recipients. Provides a method for parsing and processing html code from online job postings of the American Political Science Association.
Maintained by Ryan T. Moore. Last updated 2 years ago.
5.8 match 1.70 score 7 scripts
bayesiandemography
bage:Bayesian Estimation and Forecasting of Age-Specific Rates
Fast Bayesian estimation and forecasting of age-specific rates, probabilities, and means, based on 'Template Model Builder'.
Maintained by John Bryant. Last updated 2 months ago.
1.3 match 3 stars 7.30 score 39 scripts
seth-mg
rosetteApi:'Rosette' API
'Rosette' is an API for multilingual text analysis and information extraction. More information can be found at <https://developer.rosette.com>.
Maintained by Seth Gransky. Last updated 5 years ago.
3.6 match 2.70 score 4 scripts
azure
AzureVM:Virtual Machines in 'Azure'
Functionality for working with virtual machines (VMs) in Microsoft's 'Azure' cloud: <https://azure.microsoft.com/en-us/services/virtual-machines/>. Includes facilities to deploy, startup, shutdown, and cleanly delete VMs and VM clusters. Deployment configurations can be highly customised, and can make use of existing resources as well as creating new ones. A selection of predefined configurations is provided to allow easy deployment of commonly used Linux and Windows images, including Data Science Virtual Machines. With a running VM, execute scripts and install optional extensions. Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 2 years ago.
azureazure-sdk-razure-virtual-machinedata-science-virtual-machine
1.9 match 14 stars 5.05 score 16 scripts
muschellij2
glassdoor:Interface to 'Glassdoor' API
Interacts with the 'Glassdoor' API <https://www.glassdoor.com/developer/index.htm>. Allows the user to search job statistics, employer statistics, and job progression, where 'Glassdoor' provides a breakdown of other jobs a person did after their current one.
Maintained by John Muschelli. Last updated 6 years ago.
2.0 match 6 stars 4.59 score 13 scripts
terminological
ggrrr:Addressing Annoyances and Irritations
Visualisation hacks, tabular data helpers, fonts, caching, tidy data functions. It is a Swiss army knife, a jack of all trades.
Maintained by Robert Challen. Last updated 9 months ago.
3.3 match 1 stars 2.74 score 11 scripts
cozygene
TCA:Tensor Composition Analysis
Tensor Composition Analysis (TCA) allows the deconvolution of two-dimensional data (features by observations) coming from a mixture of heterogeneous sources into a three-dimensional matrix of signals (features by observations by sources). The TCA framework further allows testing the features in the data for different statistical relations with an outcome of interest while modeling source-specific effects; in particular, it allows looking for statistical relations between source-specific signals and an outcome. For example, TCA can deconvolve bulk tissue-level DNA methylation data (methylation sites by individuals) into a three-dimensional tensor of cell-type-specific methylation levels for each individual (i.e. methylation sites by individuals by cell types), and it allows detecting cell-type-specific statistical relations (associations) with phenotypes. For more details see Rahmani et al. (2019) <DOI:10.1038/s41467-019-11052-9>.
Maintained by Elior Rahmani. Last updated 4 years ago.
1.3 match 18 stars 7.30 score 37 scripts 2 dependents
ip2location
ip2proxy:Lookup for IP Address Proxy Information
Enables users to find IP addresses that are used as VPN anonymizers, open proxies, web proxies, and Tor exits. The package looks up the proxy IP address in the IP2Proxy BIN data file. You may visit <https://lite.ip2location.com> for a free database download.
Maintained by Kai Wen Ooi. Last updated 2 years ago.
9.1 match 1.00 score 1 scripts
elipousson
esri2sf:Create Simple Features from ArcGIS Server REST API
This package enables you to scrape geographic features directly from an ArcGIS server's REST API into R as simple features.
Maintained by Eli Pousson. Last updated 4 months ago.
1.7 match 7 stars 5.28 score 50 scripts 1 dependents
mrchypark
sendgridr:Mail Sender Using 'Sendgrid' Service
Send email using 'Sendgrid' <https://sendgrid.com/en-us> mail API(v3) <https://docs.sendgrid.com/api-reference/how-to-use-the-sendgrid-v3-api/authentication>.
Maintained by Chanyub Park. Last updated 1 years ago.
hacktoberfesthacktoberfest2021mailmailersendgrid
1.9 match 23 stars 4.66 score 10 scripts
frbcesab
rutils:A Collection of R Functions
A collection of R functions commonly used in FRB-CESAB projects.
Maintained by Nicolas Casajus. Last updated 2 months ago.
1.9 match 2 stars 4.66 score 454 scripts
paithiov909
jisx0402:Datasets Related to 'JIS X 0402:2020'
Provides datasets for handling Japanese municipality codes defined in 'JIS X 0402' and 'JIS X 0401'.
Maintained by Akiru Kato. Last updated 1 years ago.
4.0 match 3 stars 2.18 score
ropenspain
CatastRoNav:Interface to the 'INSPIRE' services of 'Catastro de Navarra'
Access public spatial data available under the 'INSPIRE' directive. Tools for downloading references, buildings and addresses of properties in Navarre (Spain).
Maintained by Diego Hernangómez. Last updated 1 months ago.
catastromapsnavarraropenspainspain
2.3 match 1 stars 3.70 score 9 scripts
alexpkeil1
qgcompint:Quantile G-Computation Extensions for Effect Measure Modification
G-computation for a set of time-fixed exposures with quantile-based basis functions, possibly under linearity and homogeneity assumptions. Effect measure modification in this method is a way to assess how the effect of the mixture varies by a binary, categorical or continuous variable. Reference: Alexander P. Keil, Jessie P. Buckley, Katie M. O'Brien, Kelly K. Ferguson, Shanshan Zhao, and Alexandra J. White (2019) A quantile-based g-computation approach to addressing the effects of exposure mixtures; <doi:10.1289/EHP5838>.
Maintained by Alexander Keil. Last updated 4 days ago.
1.8 match 4 stars 4.89 score 13 scripts
statisticsnorway
GISSB:Network Analysis on the Norwegian Road Network
A collection of GIS (Geographic Information System) functions in R, created for use in Statistics Norway. The functions are primarily related to network analysis on the Norwegian road network.
Maintained by Sindre Mikael Haugen. Last updated 20 days ago.
2.0 match 4 stars 4.30 score 8 scripts
skranz
gtree:gtree basic functionality to model and solve games
gtree basic functionality to model and solve games
Maintained by Sebastian Kranz. Last updated 4 years ago.
economic-experimentseconomicsgambitgame-theorynash-equilibrium
2.3 match 18 stars 3.79 score 23 scripts 1 dependents
rucknium
rbch:Extraction and Analysis of Data from the Bitcoin Cash (BCH) Blockchain
Issues RPC-JSON calls to 'bitcoind', the daemon of Bitcoin Cash (BCH), to extract transaction data from the blockchain. BCH is a fork of Bitcoin that permits a greater number of transactions per second. A BCH daemon is available under an MIT license from the Bitcoin Unlimited website <https://www.bitcoinunlimited.info>.
Maintained by Rucknium. Last updated 4 months ago.
3.4 match 3 stars 2.48 score 7 scripts
nashjc
nlsr:Functions for Nonlinear Least Squares Solutions - Updated 2022
Provides tools for working with nonlinear least squares problems. For the estimation of models, it offers more reliable and robust tools than nls(), where the Gauss-Newton method frequently stops with 'singular gradient' messages. This is accomplished by using, where possible, analytic derivatives to compute the matrix of derivatives and a stabilization of the solution of the estimation equations. Tools for approximate or externally supplied derivative matrices are included. Bounds and masks on parameters are handled properly.
Maintained by John C Nash. Last updated 27 days ago.
1.2 match 7.02 score 94 scripts 5 dependents
ausgis
GD:Geographical Detectors for Assessing Spatial Factors
Geographical detectors for measuring spatial stratified heterogeneity, as described in Jinfeng Wang (2010) <doi:10.1080/13658810802443457> and Jinfeng Wang (2016) <doi:10.1016/j.ecolind.2016.02.052>. Includes the optimal discretization of continuous data, four primary functions of geographical detectors, comparison of size effects of spatial unit and the visualizations of results. To use the package and to refer to the descriptions of the package, methods and case datasets, please cite Yongze Song (2020) <doi:10.1080/15481603.2020.1760434>. The model has been applied in factor exploration of road performance and multi-scale spatial segmentation for network data, as described in Yongze Song (2018) <doi:10.3390/rs10111696> and Yongze Song (2020) <doi:10.1109/TITS.2020.3001193>, respectively.
Maintained by Wenbo Lv. Last updated 4 months ago.
geographical-detectorspatial-stratified-heterogeneity
1.1 match 13 stars 7.49 score 51 scripts
dyfanjones
sagemaker.mlframework:sagemaker machine learning developed by amazon
`sagemaker` machine learning developed by amazon.
Maintained by Dyfan Jones. Last updated 3 years ago.
amazon-sagemakerawsmachine-learningsagemakersdk
3.4 match 2.48 score 2 dependents
geomarker-io
codec:Community Data Explorer for Cincinnati
This repository serves as the definition of the CoDEC data specifications and provides helpers to create, validate, release, and read CoDEC data.
Maintained by Cole Brokamp. Last updated 23 days ago.
2.0 match 4 stars 4.15 score 27 scripts
bioc
corral:Correspondence Analysis for Single Cell Data
Correspondence analysis (CA) is a matrix factorization method, and is similar to principal components analysis (PCA). Whereas PCA is designed for application to continuous, approximately normally distributed data, CA is appropriate for non-negative, count-based data that are in the same additive scale. The corral package implements CA for dimensionality reduction of a single matrix of single-cell data, as well as a multi-table adaptation of CA that leverages data-optimized scaling to align data generated from different sequencing platforms by projecting into a shared latent space. corral utilizes sparse matrices and a fast implementation of SVD, and can be called directly on Bioconductor objects (e.g., SingleCellExperiment) for easy pipeline integration. The package also includes additional options, including variations of CA to address overdispersion in count data (e.g., Freeman-Tukey chi-squared residual), as well as the option to apply CA-style processing to continuous data (e.g., proteomic TOF intensities) with the Hellinger distance adaptation of CA.
Maintained by Lauren Hsu. Last updated 5 months ago.
batcheffectdimensionreductiongeneexpressionpreprocessingprincipalcomponentsequencingsinglecellsoftwarevisualization
1.8 match 4.64 score 22 scripts
frederikziebell
RNAseqQC:Quality Control for RNA-Seq Data
Functions for semi-automated quality control of bulk RNA-seq data.
Maintained by Frederik Ziebell. Last updated 8 months ago.
1.5 match 2 stars 5.21 score 27 scripts
gforge
Greg:Regression Helper Functions
Methods for manipulating regression models and for describing these in a style adapted for medical journals. Contains functions for generating an HTML table with crude and adjusted estimates, plotting hazard ratio, plotting model estimates and confidence intervals using forest plots, extending this to comparing multiple models in a single forest plot. In addition to the descriptive methods, there are functions for the robust covariance matrix provided by the 'sandwich' package, a function for adding non-linearities to a model, and a wrapper around the 'Epi' package's Lexis() functions for time-splitting a dataset when modeling non-proportional hazards in Cox regressions.
Maintained by Max Gordon. Last updated 1 years ago.
1.3 match 6 stars 6.26 score 68 scripts
ikosmidis
cranly:Package Directives and Collaboration Networks in CRAN
Core visualizations and summaries for the CRAN package database. The package provides comprehensive methods for cleaning up and organizing the information in the CRAN package database, for building package directives networks (depends, imports, suggests, enhances, linking to) and collaboration networks, producing package dependence trees, and for computing useful summaries and producing interactive visualizations from the resulting networks and summaries. The resulting networks can be coerced to 'igraph' <https://CRAN.R-project.org/package=igraph> objects for further analyses and modelling.
Maintained by Ioannis Kosmidis. Last updated 3 years ago.
network-analysisnetwork-visualization
1.1 match 49 stars 6.85 score 32 scripts 1 dependents
myoung3
intervalaverage:Time-Weighted Averaging for Interval Data
Perform fast and memory efficient time-weighted averaging of values measured over intervals into new arbitrary intervals. This package is useful in the context of data measured or represented as constant values over intervals on a one-dimensional discrete axis (e.g. time-integrated averages of a curve over defined periods). This package was written specifically to deal with air pollution data recorded or predicted as averages over sampling periods. Data in this format often needs to be shifted to non-aligned periods or averaged up to periods of longer duration (e.g. averaging data measured over sequential non-overlapping periods to calendar years).
Maintained by Michael Young. Last updated 5 years ago.
2.4 match 3.28 score 19 scripts
statismike
shiny.reglog:Optional Login and Registration Module System for ShinyApps
RegLog system provides a set of shiny modules to handle the registration procedure for your users, along with login, credential editing, and password reset functionality. It provides support for popular SQL databases and optionally a googlesheet-based database for easy setup. For email sending it provides support for 'emayili' and 'gmailr' backends. The architecture makes customizing usability pretty straightforward. The authentication system created with shiny.reglog is designed to be optional: users don't need to be logged in to access your application, but when they are logged in, the user data can be used to read from and write to relational databases.
Maintained by Michal Kosinski. Last updated 3 years ago.
googlesheetregister-uishiny-applicationssqlite
1.2 match 14 stars 6.45 score 20 scripts
elipousson
getdata:Get Easy Access to Tabular and Spatial Data
Download and format spatial and non-spatial data with simple filtering by location.
Maintained by Eli Pousson. Last updated 5 months ago.
1.7 match 12 stars 4.46 score 18 scripts 3 dependents
spatialstatisticsupna
bigDM:Scalable Bayesian Disease Mapping Models for High-Dimensional Data
Implements several spatial and spatio-temporal scalable disease mapping models for high-dimensional count data using the INLA technique for approximate Bayesian inference in latent Gaussian models (Orozco-Acosta et al., 2021 <doi:10.1016/j.spasta.2021.100496>; Orozco-Acosta et al., 2023 <doi:10.1016/j.cmpb.2023.107403> and Vicente et al., 2023 <doi:10.1007/s11222-023-10263-x>). The creation and development of this package has been supported by Project MTM2017-82553-R (AEI/FEDER, UE) and Project PID2020-113125RB-I00/MCIN/AEI/10.13039/501100011033. It has also been partially funded by the Public University of Navarra (project PJUPNA2001).
Maintained by Aritz Adin. Last updated 7 months ago.
1.6 match 15 stars 4.88 score 10 scripts
symbolrush
osrmr:Wrapper for the 'OSRM' API
Wrapper around the 'Open Source Routing Machine (OSRM)' API <http://project-osrm.org/>. 'osrmr' works with API versions 4 and 5 and can handle servers that run locally as well as the 'OSRM' webserver.
Maintained by Adrian Staempfli. Last updated 4 years ago.
2.5 match 3.06 score 23 scripts
jayemerson
STV:Single Transferable Vote Counting
Implementations of the Single Transferable Vote counting system. By default, it uses the Cambridge method for surplus allocation and Droop method for quota calculation. Fractional surplus allocation and the Hare quota are available as options.
Maintained by John Emerson. Last updated 4 years ago.
1.8 match 3 stars 4.26 score 10 scripts
jsspaulding
rcrimeanalysis:An Implementation of Crime Analysis Methods
An implementation of functions for the analysis of crime incident or records management system data. The package implements analysis algorithms scaled for city or regional crime analysis units. The package provides functions for kernel density estimation for crime heat maps, geocoding using the 'Google Maps' API, identification of repeat crime incidents, spatio-temporal map comparison across time intervals, time series analysis (forecasting and decomposition), detection of optimal parameters for the identification of near repeat incidents, and near repeat analysis with crime network linkage.
Maintained by Jamie Spaulding. Last updated 2 years ago.
1.7 match 5 stars 4.40 score 5 scripts
cnathe
Rlabkey:Data Exchange Between R and 'LabKey' Server
The 'LabKey' client library for R makes it easy for R users to load live data from a 'LabKey' Server, <https://www.labkey.com/>, into the R environment for analysis, provided users have permissions to read the data. It also enables R users to insert, update, and delete records stored on a 'LabKey' Server, provided they have appropriate permissions to do so.
Maintained by Cory Nathe. Last updated 3 days ago.
1.7 match 4.25 score 388 scripts 1 dependents
uribo
washoku:Extra 'recipes' for Japanese Text, Date and Address Processing
In order to handle Japanese text in the feature engineering process, morphological analysis is necessary. Following the framework of `recipes`, this package provides steps that can be applied to `textrecipes` for subsequent processing.
Maintained by Shinya Uryu. Last updated 3 years ago.
2.9 match 7 stars 2.54 score 3 scripts
cran
pbr:Find a Cold One Near You
In short, this package is a locator for cool, refreshing beverages. It will find and return the nearest location where you can get a cold one.
Maintained by Paul Frater. Last updated 2 years ago.
3.4 match 2.11 score 13 scripts
mkleinsa
StackImpute:Tools for Analysis of Stacked Multiple Imputations
Provides methods for inference using stacked multiple imputations augmented with weights. The vignette provides example R code for implementation in general multiple imputation settings. For additional details about the estimation algorithm, we refer the reader to Beesley, Lauren J and Taylor, Jeremy M G (2020) “A stacked approach for chained equations multiple imputation incorporating the substantive model” <doi:10.1111/biom.13372>, and Beesley, Lauren J and Taylor, Jeremy M G (2021) “Accounting for not-at-random missingness through imputation stacking” <arXiv:2101.07954>.
Maintained by Mike Kleinsasser. Last updated 4 years ago.
2.2 match 3.18 score 7 scripts 1 dependents
fanhansen
creditmodel:Toolkit for Credit Modeling, Analysis and Visualization
Provides a highly efficient R tool suite for Credit Modeling, Analysis and Visualization. Contains infrastructure functionalities such as data exploration and preparation, missing values treatment, outliers treatment, variable derivation, variable selection, dimensionality reduction, grid search for hyper parameters, data mining and visualization, model evaluation, strategy analysis etc. This package is designed to make the development of binary classification models (machine learning based models as well as credit scorecard) simpler and faster. References include: 1. Refaat, M. (2011, ISBN: 9781447511199). Credit Risk Scorecard: Development and Implementation Using SAS; 2. Bezdek, James C. FCM: The fuzzy c-means clustering algorithm. Computers & Geosciences (0098-3004), <DOI:10.1016/0098-3004(84)90020-7>.
Maintained by Dongping Fan. Last updated 3 years ago.
2.0 match 4 stars 3.48 score 15 scripts
bioc
cicero:Predict cis-co-accessibility from single-cell chromatin accessibility data
Cicero computes putative cis-regulatory maps from single-cell chromatin accessibility data. It also extends monocle 2 for use in chromatin accessibility data.
Maintained by Hannah Pliner. Last updated 5 months ago.
sequencingclusteringcellbasedassaysimmunooncologygeneregulationgenetargetepigeneticsatacseqsinglecell
1.2 match 5.80 score 312 scripts
predictiveecology
pemisc:Miscellaneous Utilities Developed by the Predictive Ecology Group
Miscellaneous utilities developed by the Predictive Ecology Group (<http://predictiveecology.org>).
Maintained by Eliot J B McIntire. Last updated 3 months ago.
landscape-ecologysimulation-modeling
1.8 match 1 stars 3.83 score 3 scripts 5 dependents
dominikkukacka
ipbase:Client for the 'ipbase.com' IP Geolocation API
An R client for the 'ipbase.com' IP Geolocation API. The API requires registration of an API key. Basic features are free, some require a paid subscription. You can find the full API documentation at <https://ipbase.com/docs>.
Maintained by Dominik Kukacka. Last updated 2 years ago.
3.4 match 2.00 score
cran
RWsearch:Lazy Search in R Packages, Task Views, CRAN, the Web. All-in-One Download
Search by keywords in R packages, task views, CRAN, the web and display the results in the console or in txt, html or pdf files. Download the package documentation (html index, README, NEWS, pdf manual, vignettes, source code, binaries) with a single instruction. Visualize the package dependencies and CRAN checks. Compare the package versions, unload and install the packages and their dependencies in a safe order. Explore CRAN archives. Use the above functions for task view maintenance. Access web search engines from the console thanks to 80+ bookmarks. All functions accept standard and non-standard evaluation.
Maintained by Patrice Kiener. Last updated 3 days ago.
1.8 match 3.70 score
bioc
metagenomeSeq:Statistical analysis for sparse high-throughput sequencing
metagenomeSeq is designed to determine features (be it Operational Taxonomic Unit (OTU), species, etc.) that are differentially abundant between two or more groups of multiple samples. metagenomeSeq is designed to address the effects of both normalization and under-sampling of microbial communities on disease association detection and the testing of feature correlations.
Maintained by Joseph N. Paulson. Last updated 3 months ago.
immunooncologyclassificationclusteringgeneticvariabilitydifferentialexpressionmicrobiomemetagenomicsnormalizationvisualizationmultiplecomparisonsequencingsoftware
0.5 match 69 stars 12.02 score 494 scripts 7 dependents
tim-band
shinylight:Web Interface to 'R' Functions
Web front end for your 'R' functions producing plots or tables. If you have a function or set of related functions, you can make them available over the internet through a web browser. This is the same motivation as the 'shiny' package, but note that the development of 'shinylight' is not in any way linked to that of 'shiny' (beyond the use of the 'httpuv' package). You might prefer 'shinylight' to 'shiny' if you want a lighter weight deployment with easier horizontal scaling, or if you want to develop your front end yourself in JavaScript and HTML just using a lightweight remote procedure call interface to your R code on the server.
Maintained by Tim Band. Last updated 1 years ago.
1.9 match 3.18 score 1 scripts 1 dependents
chrmongeau
repec:Access RePEc Data Through API
Utilities for accessing RePEc (Research Papers in Economics) through a RESTful API. You can request a code and get detailed information at the following page: <https://ideas.repec.org/api.html>.
Maintained by Christian Alexander Mongeau Ospina. Last updated 7 years ago.
1.8 match 3 stars 3.38 score 16 scripts
epimodel
EpiModel:Mathematical Modeling of Infectious Disease Dynamics
Tools for simulating mathematical models of infectious disease dynamics. Epidemic model classes include deterministic compartmental models, stochastic individual-contact models, and stochastic network models. Network models use the robust statistical methods of exponential-family random graph models (ERGMs) from the Statnet suite of software packages in R. Standard templates for epidemic modeling include SI, SIR, and SIS disease types. EpiModel features an API for extending these templates to address novel scientific research aims. Full methods for EpiModel are detailed in Jenness et al. (2018, <doi:10.18637/jss.v084.i08>).
Maintained by Samuel Jenness. Last updated 2 months ago.
agent-based-modelingepidemicsepidemiologyinfectious-diseasesnetwork-graphcpp
0.5 match 250 stars 11.57 score 315 scripts
pik-piam
magclass:Data Class and Tools for Handling Spatial-Temporal Data
Data class for increased interoperability working with spatial-temporal data together with corresponding functions and methods (conversions, basic calculations and basic data manipulation). The class distinguishes between spatial, temporal and other dimensions to facilitate the development and interoperability of tools built for it. Additional features are name-based addressing of data and internal consistency checks (e.g. checking for the right data order in calculations).
Maintained by Jan Philipp Dietrich. Last updated 10 days ago.
0.5 match 5 stars 11.16 score 412 scripts 56 dependents
sachsmc
plotROC:Generate Useful ROC Curve Charts for Print and Interactive Use
Most ROC curve plots obscure the cutoff values and inhibit interpretation and comparison of multiple curves. This attempts to address those shortcomings by providing plotting and interactive tools. Functions are provided to generate an interactive ROC curve plot for web use, and print versions. A Shiny application implementing the functions is also included.
Maintained by Michael C. Sachs. Last updated 4 months ago.
0.5 match 87 stars 10.93 score 932 scripts 7 dependents
rempsyc
pubmedDashboard:Creating PubMed Data Visualization Dashboards
Package to facilitate the creation of data visualization dashboards through the flexdashboard and easyPubMed packages. This package is now deprecated in favour of the pubDashboard package.
Maintained by Rémi Thériault. Last updated 10 months ago.
1.8 match 4 stars 3.20 score 6 scripts
eltebioinformatics
mulea:Enrichment Analysis Using Multiple Ontologies and False Discovery Rate
Background - Traditional gene set enrichment analyses are typically limited to a few ontologies and do not account for the interdependence of gene sets or terms, resulting in overcorrected p-values. To address these challenges, we introduce mulea, an R package offering comprehensive overrepresentation and functional enrichment analysis. Results - mulea employs a progressive empirical false discovery rate (eFDR) method, specifically designed for interconnected biological data, to accurately identify significant terms within diverse ontologies. mulea expands beyond traditional tools by incorporating a wide range of ontologies, encompassing Gene Ontology, pathways, regulatory elements, genomic locations, and protein domains. This flexibility enables researchers to tailor enrichment analysis to their specific questions, such as identifying enriched transcriptional regulators in gene expression data or overrepresented protein domains in protein sets. To facilitate seamless analysis, mulea provides gene sets (in standardised GMT format) for 27 model organisms, covering 22 ontology types from 16 databases and various identifiers resulting in almost 900 files. Additionally, the muleaData ExperimentData Bioconductor package simplifies access to these pre-defined ontologies. Finally, mulea's architecture allows for easy integration of user-defined ontologies, or GMT files from external sources (e.g., MSigDB or Enrichr), expanding its applicability across diverse research areas. Conclusions - mulea is distributed as a CRAN R package. It offers researchers a powerful and flexible toolkit for functional enrichment analysis, addressing limitations of traditional tools with its progressive eFDR and by supporting a variety of ontologies. Overall, mulea fosters the exploration of diverse biological questions across various model organisms.
Maintained by Tamas Stirling. Last updated 3 months ago.
annotationdifferentialexpressiongeneexpressiongenesetenrichmentgographandnetworkmultiplecomparisonpathwaysreactomesoftwaretranscriptionvisualizationenrichmentenrichment-analysisfunctional-enrichment-analysisgene-set-enrichmentontologiestranscriptomicscpp
0.8 match 28 stars 7.36 score 34 scripts
maverickg
cxxfunplus:extend cxxfunction by saving the dynamic shared objects
extend cxxfunction by saving the dynamic shared objects for reusing across R sessions
Maintained by Jiqiang Guo. Last updated 2 years ago.
1.7 match 3.18 score 5 scripts 1 dependents
framverse
xldiff:Compare excel sheets
`xldiff` provides tools to compare excel sheets, broadly inspired by "diff"-type functions. Provided functions can read sheets of two excel files and produce a third file that highlights cells that have changed. In the case of numeric changes, the direction of change is highlighted. These tools do not account for structural changes in the sheets (e.g., the addition of a column), but are useful in tracking changed values in tables or parameter files. Utility functions developed to streamline formatting output files are also more broadly useful in programmatically formatting excel files using openxlsx.
Maintained by Collin Edwards. Last updated 2 days ago.
1.8 match 3.02 score 1 dependents
bioc
BiocCheck:Bioconductor-specific package checks
BiocCheck guides maintainers through Bioconductor best practices. It runs Bioconductor-specific package checks by searching through package code, examples, and vignettes. Maintainers are required to address all errors, warnings, and most notes produced.
Maintained by Marcel Ramos. Last updated 24 days ago.
infrastructurebioconductor-packagecore-services
0.5 match 8 stars 10.07 score 114 scripts 6 dependents
jimbrig
jimstools:Tools for R
What the package does (one paragraph).
Maintained by Jimmy Briggs. Last updated 3 years ago.
1.8 match 2 stars 3.00 score 2 scripts
ropensci
charlatan:Make Fake Data
Make fake data that looks realistic, supporting addresses, person names, dates, times, colors, coordinates, currencies, digital object identifiers ('DOIs'), jobs, phone numbers, 'DNA' sequences, doubles and integers from distributions and within a range.
Maintained by Roel M. Hogervorst. Last updated 1 months ago.
datadatasetfake-datafakerpeer-reviewed
0.5 match 296 stars 10.06 score 180 scripts 1 dependents
ausgis
localsp:Local Indicator of Stratified Power
Implements a local indicator of stratified power to analyze local spatial stratified association and demonstrate how spatial stratified association changes spatially and in local regions, as outlined in Hu et al. (2024) <doi:10.1080/13658816.2024.2437811>.
Maintained by Wenbo Lv. Last updated 2 months ago.
1.1 match 2 stars 4.60 score
stscl
sshicm:Information Consistency-Based Measures for Spatial Stratified Heterogeneity
Spatial stratified heterogeneity (SSH) denotes the coexistence of within-strata homogeneity and between-strata heterogeneity. Information consistency-based methods provide a rigorous approach to quantify SSH and evaluate its role in spatial processes, grounded in principles of geographical stratification and information theory (Bai, H. et al. (2023) <doi:10.1080/24694452.2023.2223700>; Wang, J. et al. (2024) <doi:10.1080/24694452.2023.2289982>).
Maintained by Wenbo Lv. Last updated 3 months ago.
geoinformaticsgeospatial-analysisinformation-theoryspatial-statisticsspatial-stratified-heterogeneitycpp
1.1 match 3 stars 4.65 score 2 scripts
bpfaff
rbtc:Bitcoin API
Implementation of the RPC-JSON API for Bitcoin and utility functions for address creation and content analysis of the blockchain.
Maintained by Bernhard Pfaff. Last updated 5 months ago.
4.3 match 1.15 score 14 scripts
epijim
notifyme:Send Alerts to your Cellphone and Phillips Hue Lights
Functions to flash your Hue lights, or text yourself, from R. Designed to be used with long-running scripts.
Maintained by James Black. Last updated 8 years ago.
1.8 match 1 stars 2.70 score 7 scripts
aquincum
Rexperigen:R Interface to Experigen
Provides convenience functions to communicate with an Experigen server: Experigen (<http://github.com/aquincum/experigen>) is an online framework for creating linguistic experiments, and it stores the results on a dedicated server. This package can be used to retrieve the results from the server, and it is especially helpful with registered experiments, as authentication with the server has to happen.
Maintained by Daniel Szeredi. Last updated 9 years ago.
1.6 match 1 stars 2.95 score 18 scripts
bioc
Rdisop:Decomposition of Isotopic Patterns
In high resolution mass spectrometry (HR-MS), the measured masses can be decomposed into potential element combinations (chemical sum formulas). Where additional mass/intensity information of respective isotopic peaks is available, decomposition can take this information into account to better rank the potential candidate sum formulas. To compare measured mass/intensity information with the theoretical distribution of candidate sum formulas, the latter needs to be calculated. This package implements fast algorithms to address both tasks, the calculation of isotopic distributions for arbitrary sum formulas (assuming a HR-MS resolution of roughly 30,000), and the ranked list of sum formulas fitting an observed peak or isotopic peak set.
Maintained by Steffen Neumann. Last updated 1 months ago.
immunooncologymassspectrometrymetabolomicsmass-spectrometrycpp
0.5 match 4 stars 9.14 score 111 scripts 2 dependents
bart1
move:Visualizing and Analyzing Animal Track Data
Contains functions to access movement data stored in 'movebank.org' as well as tools to visualize and statistically analyze animal movement data, including functions to calculate dynamic Brownian Bridge Movement Models. Move helps address movement ecology questions.
Maintained by Bart Kranstauber. Last updated 4 months ago.
0.5 match 8.74 score 690 scripts 3 dependents
ikosmidis
semnar:Constructing and Interacting with Databases of Presentations
Provides methods for constructing and maintaining a database of presentations in R. The presentations are either ones that the user gives or gave or presentations at a particular event or event series. The package also provides a plot method for the interactive mapping of the presentations using 'leaflet' by grouping them according to country, city, year and other presentation attributes. The markers on the map come with popups providing presentation details (title, institution, event, links to materials and events, and so on).
Maintained by Ioannis Kosmidis. Last updated 7 months ago.
1.7 match 2.70 score
alexpkeil1
qgcomp:Quantile G-Computation
G-computation for a set of time-fixed exposures with quantile-based basis functions, possibly under linearity and homogeneity assumptions. This approach estimates a regression line corresponding to the expected change in the outcome (on the link basis) given a simultaneous increase in the quantile-based category for all exposures. Works with continuous, binary, and right-censored time-to-event outcomes. Reference: Alexander P. Keil, Jessie P. Buckley, Katie M. O'Brien, Kelly K. Ferguson, Shanshan Zhao, and Alexandra J. White (2019) A quantile-based g-computation approach to addressing the effects of exposure mixtures; <doi:10.1289/EHP5838>.
Maintained by Alexander Keil. Last updated 4 days ago.
exposure exposure-mixture exposure-mixtures quantile-gcomputation survival
0.5 match 37 stars 8.73 score 70 scripts 2 dependents
kwb-r
kwb.abimo:R Package with Functions for Working with Water Balance Model ABIMO
R package with functions for working with the water balance model ABIMO (<https://www.stadtentwicklung.berlin.de/umwelt/umweltatlas/download/goedecke_et_al_abimo2019_doku.pdf>).
Maintained by Andreas Matzinger. Last updated 1 years ago.
abimo project-amarex project-basar project-flusshygiene project-keys project-kuras project-ogre project-spur water-balance-model
1.7 match 2.48 score 1 dependents
ropengov
digitransit:A client for digitransit API services
Geocode and reverse geocode addresses and points in Finland.
Maintained by Markus Kainu. Last updated 11 months ago.
2.5 match 1.70 score
bioc
PLSDAbatch:PLSDA-batch
A novel framework to correct for batch effects prior to any downstream analysis in microbiome data based on Projection to Latent Structures Discriminant Analysis. The main method is named “PLSDA-batch”. It first estimates treatment and batch variation with latent components, then subtracts batch-associated components from the data whilst preserving biological variation of interest. PLSDA-batch is highly suitable for microbiome data as it is non-parametric, multivariate and allows for ordination and data visualisation. Combined with centered log-ratio transformation for addressing uneven library sizes and compositional structure, PLSDA-batch addresses all characteristics of microbiome data that existing correction methods have ignored so far. Two other variants are proposed for 1/ unbalanced batch x treatment designs that are commonly encountered in studies with small sample sizes, and for 2/ selection of discriminative variables amongst treatment groups to avoid overfitting in classification problems. These two variants have widened the scope of applicability of PLSDA-batch to different data settings.
Maintained by Yiwen (Eva) Wang. Last updated 5 months ago.
statisticalmethod dimensionreduction principalcomponent classification microbiome batcheffect normalization visualization
0.8 match 13 stars 5.37 score 18 scripts
ropensci
dittodb:A Test Environment for Database Requests
Testing and documenting code that communicates with remote databases can be painful. Although the interaction with R is usually relatively simple (e.g. data(frames) passed to and from a database), because they rely on a separate service and the data there, testing them can be difficult to set up, unsustainable in a continuous integration environment, or impossible without replicating an entire production cluster. This package addresses that by allowing you to make recordings from your database interactions and then play them back while testing (or in other contexts) all without needing to spin up or have access to the database your code would typically connect to.
Maintained by Jonathan Keane. Last updated 11 months ago.
0.5 match 82 stars 8.04 score 49 scripts
certe-medical-epidemiology
certegis:A Certe R Package for Geographic Information Science
A Certe R package for geographic information science (GIS), using the 'sf' package and Dutch reference data. This package is part of the 'certedata' universe.
Maintained by Matthijs S. Berends. Last updated 3 months ago.
1.9 match 2.18 score 1 scripts
ropensci
ezknitr:Avoid the Typical Working Directory Pain When Using 'knitr'
An extension of 'knitr' that adds flexibility in several ways. One common source of frustration with 'knitr' is that it assumes the directory where the source file lives should be the working directory, which is often not true. 'ezknitr' addresses this problem by giving you complete control over where all the inputs and outputs are, and adds several other convenient features to make rendering markdown/HTML documents easier.
Maintained by Dean Attali. Last updated 2 years ago.
knitr peer-reviewed reproducibility rmarkdown rmd
0.5 match 115 stars 7.81 score 378 scripts
samhforbes
eyetrackingR:Eye-Tracking Data Analysis
Addresses tasks along the pipeline from raw data to analysis and visualization for eye-tracking data. Offers several popular types of analyses, including linear and growth curve time analyses, onset-contingent reaction time analyses, as well as several non-parametric bootstrapping approaches. For references to the approach see Mirman, Dixon & Magnuson (2008) <doi:10.1016/j.jml.2007.11.006>, and Barr (2008) <doi:10.1016/j.jml.2007.09.002>.
Maintained by Samuel Forbes. Last updated 2 years ago.
0.5 match 22 stars 7.84 score 60 scripts
oldlipe
Rwtss:Client for Web Time-Series Service
Allows remote access to satellite image time series provided by the web time series service (WTSS) available at servers such as <https://brazildatacube.dpi.inpe.br/wtss/>. The functions include listing the data sets available in WTSS servers, describing the contents of a data set, and retrieving a time series based on spatial location and temporal filters.
Maintained by Felipe Souza. Last updated 3 years ago.
1.8 match 2.28 score 19 scripts
itsarthurwhite
BayesLCA:Bayesian Latent Class Analysis
Bayesian Latent Class Analysis using several different methods.
Maintained by Arthur White. Last updated 5 years ago.
1.6 match 2 stars 2.48 score 38 scripts
tidy-finance
tidyfinance:Tidy Finance Helper Functions
Helper functions for empirical research in financial economics, addressing a variety of topics covered in Scheuch, Voigt, and Weiss (2023) <doi:10.1201/b23237>. The package is designed to provide shortcuts for issues extensively discussed in the book, facilitating easier application of its concepts. For more information and resources related to the book, visit <https://www.tidy-finance.org/r/index.html>.
Maintained by Christoph Scheuch. Last updated 3 months ago.
0.5 match 15 stars 7.56 score 24 scripts
kosukeimai
wru:Who are You? Bayesian Prediction of Racial Category Using Surname, First Name, Middle Name, and Geolocation
Predicts individual race/ethnicity using surname, first name, middle name, geolocation, and other attributes, such as gender and age. The method utilizes Bayes' Rule (with optional measurement error correction) to compute the posterior probability of each racial category for any given individual. The package implements methods described in Imai and Khanna (2016) "Improving Ecological Inference by Predicting Individual Ethnicity from Voter Registration Records" Political Analysis <DOI:10.1093/pan/mpw001> and Imai, Olivella, and Rosenman (2022) "Addressing census data problems in race imputation via fully Bayesian Improved Surname Geocoding and name supplements" <DOI:10.1126/sciadv.adc9824>. The package also incorporates the data described in Rosenman, Olivella, and Imai (2023) "Race and ethnicity data for first, middle, and surnames" <DOI:10.1038/s41597-023-02202-2>.
Maintained by Brandon Bertelsen. Last updated 9 months ago.
0.5 match 133 stars 7.54 score 146 scripts
dominikkukacka
emailvalidation:Client for the 'emailvalidation.io' E-Mail Validation API
An R client for the 'emailvalidation.io' e-mail verification API. The API requires registration of an API key. Basic features are free; some require a paid subscription. You can find the full API documentation at <https://emailvalidation.io/docs>.
Maintained by Dominik Kukacka. Last updated 2 years ago.
1.9 match 2.00 score
olgalezhnina
dtreg:Interact with Data Type Registries and Create Machine-Readable Data
You can load a schema from a DTR (data type registry) as an R object. Use this schema to write your data in JSON-LD (JavaScript Object Notation for Linked Data) format to make it machine readable.
Maintained by Olga Lezhnina. Last updated 29 days ago.
1.2 match 3.18 score 4 scripts
eric-hunt
htce:A set of internal tools for managing high-throughput assay data at NEB
What the package does (one paragraph).
Maintained by Eric Hunt. Last updated 9 months ago.
3.7 match 1.00 score
bioc
xenLite:Simple classes and methods for managing Xenium datasets
Define a relatively light class for managing Xenium data using Bioconductor. Address use of parquet for coordinates, SpatialExperiment for assay and sample data. Address serialization and use of cloud storage.
Maintained by Vincent Carey. Last updated 5 months ago.
0.8 match 1 stars 4.48 score 4 scripts