FSAdata:Data to Support Fish Stock Assessment ('FSA') Package
The datasets to support the Fish Stock Assessment ('FSA') package.
Maintained by Derek Ogle. Last updated 2 years ago.
216.9 match 13 stars 5.75 score 285 scripts
FSA:Simple Fisheries Stock Assessment Methods
A variety of simple fish stock assessment methods.
Maintained by Derek H. Ogle. Last updated 2 months ago.
22.2 match 68 stars 11.08 score 1.7k scripts 6 dependents
ypr:Yield Per Recruit
An implementation of equilibrium-based yield per recruit methods. Yield per recruit methods can be used to estimate the optimal yield for a fish population as described by Walters and Martell (2004) <isbn:0-691-11544-3>. The yield can be based on the number of fish caught (or harvested) or biomass caught for all fish or just large (trophy) individuals.
Maintained by Joe Thorley. Last updated 2 months ago.
30.2 match 7 stars 7.84 score 55 scripts 1 dependents
rLakeAnalyzer:Lake Physics Tools
Standardized methods for calculating common important derived physical features of lakes including water density based on temperature, thermal layers, thermocline depth, lake number, Wedderburn number, Schmidt stability and others.
Maintained by Luke Winslow. Last updated 4 years ago.
25.5 match 45 stars 9.05 score 280 scripts 1 dependents
tidypaleo:Tidy Tools for Paleoenvironmental Archives
Provides a set of functions with a common framework for age-depth model management, stratigraphic visualization, and common statistical transformations. The focus of the package is stratigraphic visualization, for which 'ggplot2' components are provided to reproduce the scales, geometries, facets, and theme elements commonly used in publication-quality stratigraphic diagrams. Helpers are also provided to reproduce the exploratory statistical summaries that are frequently included on stratigraphic diagrams. See Dunnington et al. (2021) <doi:10.18637/jss.v101.i07>.
Maintained by Dewey Dunnington. Last updated 2 years ago.
32.0 match 34 stars 6.59 score 38 scripts
nhdR:Tools for Working with the National Hydrography Dataset
Tools for working with the National Hydrography Dataset, with functions for querying, downloading, and networking both the NHD and NHDPlus datasets.
Maintained by Jemma Stachelek. Last updated 2 years ago.
31.7 match 38 stars 6.48 score 53 scripts
glatos:A package for the Great Lakes Acoustic Telemetry Observation System
Functions useful to members of the Great Lakes Acoustic Telemetry Observation System; many more broadly relevant to simulating, processing, analysing, and visualizing acoustic telemetry data.
Maintained by Christopher Holbrook. Last updated 6 months ago.
28.2 match 10 stars 6.38 score 112 scripts
lakemorpho:Lake Morphometry Metrics
Lake morphometry metrics are used by limnologists to understand, among other things, the ecological processes in a lake. Traditionally, these metrics are calculated by hand, with planimeters, and increasingly with commercial GIS products. All of these methods work; however, they are either outdated, difficult to reproduce, or require expensive licenses to use. The 'lakemorpho' package provides the tools to calculate a typical suite of these metrics from an input elevation model and lake polygon. The metrics currently supported are: fetch, major axis, minor axis, major/minor axis ratio, maximum length, maximum width, mean width, maximum depth, mean depth, shoreline development, shoreline length, surface area, and volume.
Maintained by Jeffrey W. Hollister. Last updated 6 months ago.
30.2 match 27 stars 4.44 score 34 scripts
wikilake:Scrape Lake Metadata Tables from Wikipedia
Scrape lake metadata tables from Wikipedia.
Maintained by Jemma Stachelek. Last updated 2 years ago.
27.3 match 8 stars 4.83 score 17 scripts
BSDA:Basic Statistics and Data Analysis
Data sets for book "Basic Statistics and Data Analysis" by Larry J. Kitchens.
Maintained by Alan T. Arnholt. Last updated 2 years ago.
13.5 match 7 stars 9.11 score 1.3k scripts 6 dependents
AzureStor:Storage Management in 'Azure'
Manage storage in Microsoft's 'Azure' cloud. On the admin side, 'AzureStor' includes features to create, modify and delete storage accounts. On the client side, it includes an interface to blob storage, file storage, and 'Azure Data Lake Storage Gen2': upload and download files and blobs; list containers and files/blobs; create containers; and so on. Authenticated access to storage is supported, via either a shared access key or a shared access signature (SAS). Part of the 'AzureR' family of packages.
Maintained by Hong Ooi. Last updated 2 years ago.
10.6 match 64 stars 10.72 score 298 scripts 4 dependents
LAGOSNE:Interface to the Lake Multi-Scaled Geospatial and Temporal Database
Client for programmatic access to the Lake Multi-scaled Geospatial and Temporal database, with functions for accessing lake water quality and ecological context data for the US.
Maintained by Jemma Stachelek. Last updated 2 years ago.
13.2 match 15 stars 6.77 score 98 scripts
spup:Spatial Uncertainty Propagation Analysis
Uncertainty propagation analysis in spatial environmental modelling following methodology described in Heuvelink et al. (2007) <doi:10.1080/13658810601063951> and Brown and Heuvelink (2007) <doi:10.1016/j.cageo.2006.06.015>. The package provides functions for examining the uncertainty propagation starting from input data and model parameters, via the environmental model onto model outputs. The functions include uncertainty model specification, stochastic simulation and propagation of uncertainty using Monte Carlo (MC) techniques. Uncertain variables are described by probability distributions. Both numerical and categorical data types are handled. Spatial auto-correlation within an attribute and cross-correlation between attributes are accommodated. The MC realizations may be used as input to the environmental models called from R, or externally.
Maintained by Kasia Sawicka. Last updated 1 years ago.
12.9 match 9 stars 6.31 score 57 scripts
gstat:Spatial and Spatio-Temporal Geostatistical Modelling, Prediction and Simulation
Variogram modelling; simple, ordinary and universal point or block (co)kriging; spatio-temporal kriging; sequential Gaussian or indicator (co)simulation; variogram and variogram map plotting utility functions; supports sf and stars.
Maintained by Edzer Pebesma. Last updated 9 days ago.
5.3 match 197 stars 14.78 score 4.8k scripts 57 dependents
lterdatasampler:Educational Dataset Examples from the Long Term Ecological Research Program
Curated datasets from US Long Term Ecological Research sites.
Maintained by Allison Horst. Last updated 1 years ago.
11.6 match 50 stars 6.26 score 240 scripts
rbacon:Age-Depth Modelling using Bayesian Statistics
An approach to age-depth modelling that uses Bayesian statistics to reconstruct accumulation histories for deposits, through combining radiocarbon and other dates with prior information on accumulation rates and their variability. See Blaauw & Christen (2011).
Maintained by Maarten Blaauw. Last updated 24 days ago.
10.0 match 7 stars 6.75 score 57 scripts 1 dependents
kootlake:Kootenay Lake Data
Annual Rainbow Trout, Bull Trout and Kokanee datasets for Kootenay Lake.
Maintained by Joe Thorley. Last updated 2 months ago.
20.4 match 3.30 score 4 scripts
spmodel:Spatial Statistical Modeling and Prediction
Fit, summarize, and predict for a variety of spatial statistical models applied to point-referenced and areal (lattice) data. Parameters are estimated using various methods. Additional modeling features include anisotropy, non-spatial random effects, partition factors, big data approaches, and more. Model-fit statistics are used to summarize, visualize, and compare models. Predictions at unobserved locations are readily obtainable. For additional details, see Dumelle et al. (2023) <doi:10.1371/journal.pone.0282524>.
Maintained by Michael Dumelle. Last updated 2 days ago.
7.8 match 15 stars 7.66 score 112 scripts 3 dependents
DAAG:Data Analysis and Graphics Data and Functions
Functions and data sets used in examples and exercises in the text Maindonald, J.H. and Braun, W.J. (2003, 2007, 2010) "Data Analysis and Graphics Using R", and in an upcoming Maindonald, Braun, and Andrews text that builds on this earlier text.
Maintained by W. John Braun. Last updated 11 months ago.
7.2 match 8.25 score 1.2k scripts 1 dependents
maps:Draw Geographical Maps
Display of maps. Projection code and larger maps are in separate packages ('mapproj' and 'mapdata').
Maintained by Alex Deckmyn. Last updated 2 months ago.
4.0 match 24 stars 14.70 score 19k scripts 490 dependents
sparklyr:R Interface to Apache Spark
R interface to Apache Spark, a fast and general engine for big data processing. This package supports connecting to local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end, and provides an interface to Spark's built-in machine learning algorithms.
Maintained by Edgar Ruiz. Last updated 8 days ago.
3.6 match 959 stars 15.16 score 4.0k scripts 21 dependents
alr4:Data to Accompany Applied Linear Regression 4th Edition
Datasets to Accompany S. Weisberg (2014, ISBN: 978-1-118-38608-8), "Applied Linear Regression," 4th edition. Many data files in this package are included in the `alr3` package as well, so only one of them should be used.
Maintained by Sanford Weisberg. Last updated 7 years ago.
14.9 match 1 stars 3.45 score 306 scripts
gratia:Graceful 'ggplot'-Based Graphics and Other Functions for GAMs Fitted Using 'mgcv'
Graceful 'ggplot'-based graphics and utility functions for working with generalized additive models (GAMs) fitted using the 'mgcv' package. Provides a reimplementation of the plot() method for GAMs that 'mgcv' provides, as well as 'tidyverse' compatible representations of estimated smooths.
Maintained by Gavin L. Simpson. Last updated 4 days ago.
3.8 match 216 stars 12.68 score 1.6k scripts 1 dependents
s20x:Functions for University of Auckland Course STATS 201/208 Data Analysis
A set of functions used in teaching STATS 201/208 Data Analysis at the University of Auckland. The functions are designed to make parts of R more accessible to a large undergraduate population who are mostly not statistics majors.
Maintained by James Curran. Last updated 2 years ago.
7.2 match 3 stars 6.40 score 211 scripts 3 dependents
paws:Amazon Web Services Software Development Kit
Interface to Amazon Web Services, including storage, database, and compute services, such as 'Simple Storage Service' ('S3'), 'DynamoDB' 'NoSQL' database, and 'Lambda' functions-as-a-service.
Maintained by Dyfan Jones. Last updated 2 days ago.
4.0 match 332 stars 11.25 score 177 scripts 12 dependents
MixSIAR:Bayesian Mixing Models in R
Creates and runs Bayesian mixing models to analyze biological tracer data (e.g. stable isotopes, fatty acids), which estimate the proportions of source (prey) contributions to a mixture (consumer). 'MixSIAR' is not one model, but a framework that allows a user to create a mixing model based on their data structure and research questions, via options for fixed/random effects, source data types, priors, and error terms. 'MixSIAR' incorporates several years of advances since 'MixSIR' and 'SIAR'.
Maintained by Brian Stock. Last updated 4 years ago.
4.8 match 96 stars 9.21 score 122 scripts
elevatr:Access Elevation Data from Various APIs
Several web services are available that provide access to elevation data. This package provides access to many of those services and returns elevation data either as an 'sf' simple features object from point elevation services or as a 'raster' object from raster elevation services. In future versions, 'elevatr' will drop support for 'raster' and will instead return 'terra' objects. Currently, the package supports access to the Amazon Web Services Terrain Tiles, the Open Topography Global Datasets API, and the USGS Elevation Point Query Service.
Maintained by Jeffrey Hollister. Last updated 6 months ago.
4.0 match 206 stars 11.11 score 1.3k scripts 3 dependents
DirichletReg:Dirichlet Regression
Implements Dirichlet regression models.
Maintained by Marco Johannes Maier. Last updated 4 years ago.
4.9 match 13 stars 8.70 score 222 scripts 8 dependents
cheddar:Analysis and Visualisation of Ecological Communities
Provides a flexible, extendable representation of an ecological community and a range of functions for analysis and visualisation, focusing on food web, body mass and numerical abundance data. Allows inter-web comparisons such as examining changes in community structure over environmental, temporal or spatial gradients.
Maintained by Lawrence Hudson. Last updated 8 months ago.
6.0 match 15 stars 6.86 score 195 scripts
adespatial:Multivariate Multiscale Spatial Analysis
Tools for the multiscale spatial analysis of multivariate data. Several methods are based on the use of a spatial weighting matrix and its eigenvector decomposition (Moran's Eigenvectors Maps, MEM). Several approaches are described in the review Dray et al (2012) <doi:10.1890/11-1183.1>.
Maintained by Aurélie Siberchicot. Last updated 11 days ago.
3.6 match 36 stars 11.06 score 398 scripts 2 dependents
mwlaxeref:Cross-References Lake Identifiers Between Different Data Sets
Handy helper package for cross-referencing lake identifiers among different data sets in the Midwestern United States. There are multiple different state, regional, and federal agencies that have different identifiers on lakes. This package helps you to go between them.
Maintained by Paul Frater. Last updated 1 years ago.
19.1 match 2.00 score
fma:Data Sets from "Forecasting: Methods and Applications" by Makridakis, Wheelwright & Hyndman (1998)
All data sets from "Forecasting: methods and applications" by Makridakis, Wheelwright & Hyndman (Wiley, 3rd ed., 1998).
Maintained by Rob Hyndman. Last updated 1 years ago.
4.0 match 19 stars 8.74 score 336 scripts 2 dependents
resampledata:Data Sets for Mathematical Statistics with Resampling in R
Package of data sets from "Mathematical Statistics with Resampling in R" (1st Ed. 2011, 2nd Ed. 2018) by Laura Chihara and Tim Hesterberg.
Maintained by Albert Y. Kim. Last updated 4 months ago.
6.6 match 15 stars 5.15 score 187 scripts
laketemps:Lake Temperatures Collected by In Situ and Satellite Methods from 1985-2009
Lake temperature records, metadata, and climate drivers for 291 global lakes during the time period 1985-2009. Temperature observations were collected using satellite and in situ methods. Climatic drivers and geomorphometric characteristics were also compiled and are included for each lake. Data are part of the associated publication from the Global Lake Temperature Collaboration project. See citation('laketemps') for dataset attribution.
Maintained by Jordan S Read. Last updated 8 years ago.
10.8 match 2 stars 3.04 score 11 scripts
adklakedata:Adirondack Long-Term Lake Data
Package for the access and distribution of long-term lake datasets from lakes in the Adirondack Park, northern New York state. Includes a wide variety of physical, chemical, and biological parameters from 28 lakes. Data are from multiple collection organizations and have been harmonized in both time and space for ease of reuse.
Maintained by Luke Winslow. Last updated 7 years ago.
9.6 match 1 stars 3.29 score 39 scripts
rshift:Paleoecology Functions for Regime Shift Analysis
Contains a variety of functions based around regime shift analysis of paleoecological data. Citations: Rodionov() from Rodionov (2004) <doi:10.1029/2004GL019448>; Lanzante() from Lanzante (1996) <doi:10.1002/(SICI)1097-0088(199611)16:11%3C1197::AID-JOC89%3E3.0.CO;2-L>; Hellinger_trans from Numerical Ecology, Legendre & Legendre (ISBN 9780444538680); rolling_autoc from Liu, Gao & Wang (2018) <doi:10.1016/j.scitotenv.2018.06.276>. Sample data sets lake_data & lake_RSI processed from Bush, Silman & Urrego (2004) <doi:10.1126/science.1090795>. Sample data set January_PDO from NOAA.
Maintained by Alex H. Room. Last updated 2 months ago.
6.9 match 4 stars 4.38 score 8 scripts
mixdist:Finite Mixture Distribution Models
Fit finite mixture distribution models to grouped data and conditional data by maximum likelihood using a combination of a Newton-type algorithm and the EM algorithm.
Maintained by Peter Macdonald. Last updated 7 years ago.
10.7 match 2.78 score 2 dependents
astsa:Applied Statistical Time Series Analysis
Contains data sets and scripts for analyzing time series in both the frequency and time domains including state space modeling as well as supporting the texts Time Series Analysis and Its Applications: With R Examples (5th ed), by R.H. Shumway and D.S. Stoffer. Springer Texts in Statistics, 2025, and Time Series: A Data Analysis Approach Using R. Chapman-Hall, 2019, <DOI:10.1201/9780429273285>.
Maintained by David Stoffer. Last updated 2 months ago.
3.8 match 7 stars 7.88 score 2.2k scripts 8 dependents
Ecdat:Data Sets for Econometrics
Data sets for econometrics, including political science.
Maintained by Spencer Graves. Last updated 3 months ago.
4.0 match 2 stars 7.25 score 740 scripts 3 dependents
RFishBC:Back-Calculation of Fish Length
Helps fisheries scientists collect measurements from calcified structures and back-calculate estimated lengths at previous ages using standard procedures and models. This is intended to replace much of the functionality provided by the now out-dated 'fishBC' software.
Maintained by Derek H. Ogle. Last updated 1 years ago.
6.8 match 13 stars 4.26 score 28 scripts
fishmethods:Fishery Science Methods and Models
Functions for applying a wide range of fisheries stock assessment methods.
Maintained by Gary A. Nelson. Last updated 1 months ago.
6.9 match 5 stars 4.12 score 136 scripts 1 dependents
klexdatr:Kootenay Lake Exploitation Study Data
Six relational 'tibbles' from the Kootenay Lake Large Trout Exploitation study. The study, which ran from 2008 to 2014, caught, tagged and released large Rainbow Trout and Bull Trout in Kootenay Lake by boat angling. The fish were tagged with internal acoustic tags and/or high reward external tags and subsequently detected by an acoustic receiver array as well as reported by anglers. The data are analysed by Thorley and Andrusak (2017) <doi:10.7717/peerj.2874> to estimate the natural and fishing mortality of both species.
Maintained by Joe Thorley. Last updated 2 months ago.
11.3 match 2.30 score 7 scripts
gss:General Smoothing Splines
A comprehensive package for structural multivariate function estimation using smoothing splines.
Maintained by Chong Gu. Last updated 5 months ago.
4.0 match 3 stars 6.40 score 137 dependents
yaImpute:Nearest Neighbor Observation Imputation and Evaluation Tools
Performs nearest neighbor-based imputation using one or more alternative approaches to processing multivariate data. These include methods based on canonical correlation analysis, canonical correspondence analysis, and a multivariate adaptation of the random forest classification and regression techniques of Leo Breiman and Adele Cutler. Additional methods are also offered. The package includes functions for comparing the results from running alternative techniques, detecting imputation targets that are notably distant from reference observations, detecting and correcting for bias, bootstrapping and building ensemble imputations, and mapping results.
Maintained by Jeffrey S. Evans. Last updated 6 months ago.
3.4 match 3 stars 7.40 score 94 scripts 12 dependents
coda.base:A Basic Set of Functions for Compositional Data Analysis
A minimal set of functions to perform compositional data analysis using the log-ratio approach introduced by John Aitchison (1982). Main functions have been implemented in C++ for better performance.
Maintained by Marc Comas-Cufí. Last updated 1 years ago.
3.6 match 7 stars 6.93 score 81 scripts
isotone:Active Set and Generalized PAVA for Isotone Optimization
Contains two main functions: one for solving general isotone regression problems using the pool-adjacent-violators algorithm (PAVA); another one provides a framework for active set methods for isotone optimization problems with arbitrary order restrictions. Various types of loss functions are prespecified.
Maintained by Patrick Mair. Last updated 3 months ago.
3.6 match 6.88 score 80 scripts 13 dependents
rioja:Analysis of Quaternary Science Data
Constrained clustering, transfer functions, and other methods for analysing Quaternary science data.
Maintained by Steve Juggins. Last updated 6 months ago.
3.4 match 10 stars 7.21 score 191 scripts 3 dependents
nexus:Sourcing Archaeological Materials by Chemical Composition
Exploration and analysis of compositional data in the framework of Aitchison (1986, ISBN: 978-94-010-8324-9). This package provides tools for chemical fingerprinting and source tracking of ancient materials.
Maintained by Nicolas Frerebeau. Last updated 11 days ago.
4.5 match 5.21 score 26 scripts 1 dependents
trps:Bayesian trophic position models using stan
Bayesian trophic position models using stan by leveraging 'brms' for stable isotope data. Trophic position models are derived by using equations from Post (2002) <doi:10.1890/0012-9658(2002)083[0703:USITET]2.0.CO;2>, and Huevel et al. (2024) <doi:10.1139/cjfas-2024-0028>.
Maintained by Benjamin L. Hlina. Last updated 6 hours ago.
6.7 match 3.48 score 4 scripts
Kendall:Kendall Rank Correlation and Mann-Kendall Trend Test
Computes the Kendall rank correlation and Mann-Kendall trend test. See documentation for use of block bootstrap when there is autocorrelation.
Maintained by A.I. McLeod. Last updated 3 years ago.
3.4 match 6.74 score 864 scripts 25 dependents
OTUtable:North Temperate Lakes - Microbial Observatory 16S Time Series Data and Functions
Analyses of OTU tables produced by 16S rRNA gene amplicon sequencing, as well as example data. It contains the data and scripts used in the paper Linz, et al. (2017) "Bacterial community composition and dynamics spanning five years in freshwater bog lakes," <doi:10.1128/mSphere.00169-17>.
Maintained by Alexandra Linz. Last updated 7 years ago.
10.4 match 2.20 score 53 scripts
sptotal:Predicting Totals and Weighted Sums from Spatial Data
Performs predictions of totals and weighted sums, or finite population block kriging, on spatial data using the methods in Ver Hoef (2008) <doi:10.1007/s10651-007-0035-y>. The primary outputs are an estimate of the total, mean, or weighted sum in the region, an estimated prediction variance, and a plot of the predicted and observed values. This is useful primarily to users with ecological data that are counts or densities measured on some sites in a finite area of interest. Spatial prediction for the total count or average density in the entire region can then be done using the functions in this package.
Maintained by Matt Higham. Last updated 7 months ago.
4.6 match 4 stars 4.90 score 10 scripts
networkdata:Repository of Network Datasets
The package contains a large collection of network datasets from different contexts. This includes social networks, animal networks and movie networks. All datasets are in 'igraph' format.
Maintained by David Schoch. Last updated 12 months ago.
4.5 match 143 stars 5.01 score 143 scripts
compositions:Compositional Data Analysis
Provides functions for the consistent analysis of compositional data (e.g. portions of substances) and positive numbers (e.g. concentrations) in the way proposed by J. Aitchison and V. Pawlowsky-Glahn.
Maintained by K. Gerald van den Boogaart. Last updated 1 years ago.
3.4 match 1 stars 6.35 score 36 dependents
whitebox:'WhiteboxTools' R Frontend
An R frontend for the 'WhiteboxTools' library, which is an advanced geospatial data analysis platform developed by Prof. John Lindsay at the University of Guelph's Geomorphometry and Hydrogeomatics Research Group. 'WhiteboxTools' can be used to perform common geographical information systems (GIS) analysis operations, such as cost-distance analysis, distance buffering, and raster reclassification. Remote sensing and image processing tasks include image enhancement (e.g. panchromatic sharpening, contrast adjustments), image mosaicing, numerous filtering operations, simple classification (k-means), and common image transformations. 'WhiteboxTools' also contains advanced tooling for spatial hydrological analysis (e.g. flow-accumulation, watershed delineation, stream network analysis, sink removal), terrain analysis (e.g. common terrain indices such as slope, curvatures, wetness index, hillshading; hypsometric analysis; multi-scale topographic position analysis), and LiDAR data processing. Suggested citation: Lindsay (2016) <doi:10.1016/j.cageo.2016.07.003>.
Maintained by Andrew Brown. Last updated 5 months ago.
2.3 match 173 stars 9.65 score 203 scripts 2 dependents
benford.analysis:Benford Analysis for Data Validation and Forensic Analytics
Provides tools that make it easier to validate data using Benford's Law.
Maintained by Carlos Cinelli. Last updated 6 years ago.
3.8 match 62 stars 5.66 score 74 scripts
hatchR:Predict Fish Hatch and Emergence Timing
Predict hatch and emergence timing for a wide range of wild fishes using the effective value framework (Sparks et al., (2019) <DOI:10.1139/cjfas-2017-0468>). 'hatchR' offers users access to established phenological models and the flexibility to incorporate custom parameterizations using external datasets.
Maintained by Bryan M. Maitland. Last updated 2 days ago.
3.4 match 1 stars 5.89 score
timeSeriesDataSets:Time Series Data Sets
Provides a diverse collection of time series datasets spanning various fields such as economics, finance, energy, healthcare, and more. Designed to support time series analysis in R by offering datasets from multiple disciplines, making it a valuable resource for researchers and analysts.
Maintained by Renzo Caceres Rossi. Last updated 6 months ago.
3.5 match 10 stars 5.71 score 103 scripts
loo:Efficient Leave-One-Out Cross-Validation and WAIC for Bayesian Models
Efficient approximate leave-one-out cross-validation (LOO) for Bayesian models fit using Markov chain Monte Carlo, as described in Vehtari, Gelman, and Gabry (2017) <doi:10.1007/s11222-016-9696-4>. The approximation uses Pareto smoothed importance sampling (PSIS), a new procedure for regularizing importance weights. As a byproduct of the calculations, we also obtain approximate standard errors for estimated predictive errors and for the comparison of predictive errors between models. The package also provides methods for using stacking and other model weighting techniques to average Bayesian predictive distributions.
Maintained by Jonah Gabry. Last updated 1 days ago.
1.1 match 152 stars 17.30 score 2.6k scripts 297 dependents
tswge:Time Series for Data Science
Accompanies the texts Time Series for Data Science with R by Woodward, Sadler and Robertson & Applied Time Series Analysis with R, 2nd edition by Woodward, Gray, and Elliott. It is helpful for data analysis and for time series instruction.
Maintained by Bivin Sadler. Last updated 2 years ago.
7.3 match 2.70 score 496 scripts
sfdct:Constrained Triangulation for Simple Features
Build a constrained high quality Delaunay triangulation from simple features objects, applying constraints based on input line segments, and triangle properties including maximum area, minimum internal angle. The triangulation code in 'RTriangle' uses the method of Cheng, Dey and Shewchuk (2012, ISBN:9781584887300). For a low-dependency alternative with low-quality path-based constrained triangulation see <> and for high-quality configurable triangulation see <>. Also consider comparison with the 'GEOS' lib which since version 3.10.0 includes a low quality polygon triangulation method that starts with ear clipping and refines to Delaunay.
Maintained by Michael D. Sumner. Last updated 1 years ago.
4.0 match 3 stars 4.67 score 31 scripts
grafify:Easy Graphs for Data Visualisation and Linear Models for ANOVA
Easily explore data by plotting graphs with a few lines of code. Use these ggplot() wrappers to quickly draw graphs of scatter/dots with box-whiskers, violins or SD error bars, data distributions, before-after graphs, factorial ANOVA and more. Customise graphs in many ways, for example, by choosing from colour blind-friendly palettes (12 discrete, 3 continuous and 2 divergent palettes). Use the simple code for ANOVA as ordinary (lm()) or mixed-effects linear models (lmer()), including randomised-block or repeated-measures designs, and fit non-linear outcomes as a generalised additive model (gam) using mgcv(). Obtain estimated marginal means and perform post-hoc comparisons on fitted models (via emmeans()). Also includes small datasets for practising code and teaching basics before users move on to more complex designs. See vignettes for details on usage. Citation: <doi:10.5281/zenodo.5136508>.
Maintained by Avinash R Shenoy. Last updated 2 days ago.
3.5 match 48 stars 5.31 score 107 scripts
Interface to 'Amazon Web Services' security, identity, and compliance services, including the 'Identity & Access Management' ('IAM') service for managing access to services and resources, and more.
Maintained by Dyfan Jones. Last updated 2 days ago.
2.0 match 332 stars 9.17 score 15 dependents
paws.database:'Amazon Web Services' Database Services
Interface to 'Amazon Web Services' database services, including 'Relational Database Service' ('RDS'), 'DynamoDB' 'NoSQL' database, and more.
Maintained by Dyfan Jones. Last updated 2 days ago.
2.0 match 332 stars 9.07 score 3 scripts 13 dependents
atsalibrary:Packages, data and scripts for ATSA course and lab book
This package will load the needed packages and data files for the ATSA course material when students install from GitHub.
Maintained by Elizabeth E. Holmes. Last updated 2 years ago.
7.5 match 4 stars 2.41 score 13 scripts
tsfeatures:Time Series Feature Extraction
Methods for extracting various features from time series data. The features provided are those from Hyndman, Wang and Laptev (2013) <doi:10.1109/ICDMW.2015.104>, Kang, Hyndman and Smith-Miles (2017) <doi:10.1016/j.ijforecast.2016.09.004> and from Fulcher, Little and Jones (2013) <doi:10.1098/rsif.2013.0048>. Features include spectral entropy, autocorrelations, measures of the strength of seasonality and trend, and so on. Users can also define their own feature functions.
Maintained by Rob Hyndman. Last updated 8 months ago.
1.5 match 254 stars 11.47 score 268 scripts 22 dependents
gamclass:Functions and Data for a Course on Modern Regression and Classification
Functions and data are provided that support a course that emphasizes statistical issues of inference and generalizability. The functions are designed to make it straightforward to illustrate the use of cross-validation, the training/test approach, simulation, and model-based estimates of accuracy. Methods considered are Generalized Additive Modeling, Linear and Quadratic Discriminant Analysis, Tree-based methods, and Random Forests.
Maintained by John Maindonald. Last updated 2 years ago.
3.5 match 4.82 score 44 scripts
itsmr:Time Series Analysis Using the Innovations Algorithm
Provides functions for modeling and forecasting time series data. Forecasting is based on the innovations algorithm. A description of the innovations algorithm can be found in the textbook "Introduction to Time Series and Forecasting" by Peter J. Brockwell and Richard A. Davis.
Maintained by George Weigt. Last updated 3 years ago.
7.1 match 2.34 score 218 scripts
azuremlsdk:Interface to the 'Azure Machine Learning' 'SDK'
Interface to the 'Azure Machine Learning' Software Development Kit ('SDK'). Data scientists can use the 'SDK' to train, deploy, automate, and manage machine learning models on the 'Azure Machine Learning' service. To learn more, visit the 'Azure Machine Learning' website.
Maintained by Diondra Peck. Last updated 3 years ago.
1.7 match 106 stars 8.91 score 221 scripts
bayess:Bayesian Essentials with R
Allows the reenactment of the R programs used in the book Bayesian Essentials with R without further programming. The R code is also available, so users can modify it to conduct their own simulations. Marin J.-M. and Robert C. P. (2014) <doi:10.1007/978-1-4614-8687-9>.
Maintained by Jean-Michel Marin. Last updated 1 year ago.
3.6 match 3 stars 4.01 score 68 scripts
DAIME:Effects of Changing Deposition Rates
Reverse and model the effects of changing deposition rates on geological data and rates. Based on Hohmann (2018) <doi:10.13140/RG.2.2.23372.51841>.
Maintained by Niklas Hohmann. Last updated 5 years ago.
4.5 match 3.00 score
arima2:Likelihood Based Inference for ARIMA Modeling
Estimating and analyzing autoregressive integrated moving average (ARIMA) models. The primary function in this package is arima(), which fits an ARIMA model to univariate time series data using a random restart algorithm. This approach frequently leads to models that have model likelihood greater than or equal to that of the likelihood obtained by fitting the same model using the arima() function from the 'stats' package. This package enables proper optimization of model likelihoods, which is a necessary condition for performing likelihood ratio tests. This package relies heavily on the source code of the arima() function of the 'stats' package. For more information, please see Jesse Wheeler and Edward L. Ionides (2023) <arXiv:2310.01198>.
Maintained by Jesse Wheeler. Last updated 8 months ago.
3.5 match 3 stars 3.86 score 12 scripts
GREENeR:Geospatial Regression Equation for European Nutrient Losses (GREEN)
Tools and methods to apply the model Geospatial Regression Equation for European Nutrient losses (GREEN); Grizzetti et al. (2005) <doi:10.1016/j.jhydrol.2004.07.036>; Grizzetti et al. (2008); Grizzetti et al. (2012) <doi:10.1111/j.1365-2486.2011.02576.x>; Grizzetti et al. (2021) <doi:10.1016/j.gloenvcha.2021.102281>.
Maintained by C. Alfaro. Last updated 6 months ago.
3.4 match 1 stars 4.00 score 9 scripts
ropenmeteo:Wrappers for 'Open-Meteo' API
Wrappers for the Application Programming Interface from the <> project along with helper functions. The <> project streamlines access to a range of publicly available historical and forecast meteorology data from agencies across the world.
Maintained by Quinn Thomas. Last updated 7 months ago.
2.9 match 4.62 score 14 scripts
MADPop:MHC Allele-Based Differencing Between Populations
Tools for the analysis of population differences using the Major Histocompatibility Complex (MHC) genotypes of samples having a variable number of alleles (1-4) recorded for each individual. A hierarchical Dirichlet-Multinomial model on the genotype counts is used to pool small samples from multiple populations for pairwise tests of equality. Bayesian inference is implemented via the 'rstan' package. Bootstrapped and posterior p-values are provided for chi-squared and likelihood ratio tests of equal genotype probabilities.
Maintained by Martin Lysy. Last updated 1 year ago.
3.6 match 1 stars 3.70 score 8 scripts
Lock5Data:Datasets for "Statistics: UnLocking the Power of Data"
Datasets for the third edition of "Statistics: Unlocking the Power of Data" by Lock^5. Includes versions of datasets from earlier editions.
Maintained by Robin Lock. Last updated 4 years ago.
4.5 match 2.90 score 322 scripts
flfishltm:A package for analyzing FL FWC Freshwater long-term monitoring (LTM) data
Provides functions for summarizing and analyzing FL FWC Freshwater Fish Long-term Monitoring Data.
Maintained by Jason OConnor. Last updated 8 months ago.
3.4 match 1 stars 3.70 score
mvgam:Multivariate (Dynamic) Generalized Additive Models
Fit Bayesian Dynamic Generalized Additive Models to multivariate observations. Users can build nonlinear State-Space models that can incorporate semiparametric effects in observation and process components, using a wide range of observation families. Estimation is performed using Markov Chain Monte Carlo with Hamiltonian Monte Carlo in the software 'Stan'. References: Clark & Wells (2023) <doi:10.1111/2041-210X.13974>.
Maintained by Nicholas J Clark. Last updated 7 hours ago.
1.3 match 139 stars 9.85 score 117 scripts
ecoval:Procedures for Ecological Assessment of Surface Waters
Functions for evaluating and visualizing ecological assessment procedures for surface waters containing physical, chemical and biological assessments in the form of value functions.
Maintained by Nele Schuwirth. Last updated 3 years ago.
8.9 match 1.34 score 22 scripts
RavenR:Raven Hydrological Modelling Framework R Support and Analysis
Utilities for processing input and output files associated with the Raven Hydrological Modelling Framework. Includes various plotting functions, model diagnostics, reading output files into extensible time series format, and support for writing Raven input files. The 'RavenR' package is also archived at Chlumsky et al. (2020) <doi:10.5281/zenodo.4248183>. The Raven Hydrologic Modelling Framework method can be referenced with Craig et al. (2020) <doi:10.1016/j.envsoft.2020.104728>.
Maintained by Robert Chlumsky. Last updated 4 months ago.
1.7 match 36 stars 7.06 score 20 scripts
vvcanvas:'Canvas' LMS API Integration
Allow R users to interact with the 'Canvas' Learning Management System (LMS) API (see <> for details). It provides a set of functions to access and manipulate course data, assignments, grades, users, and other resources available through the 'Canvas' API.
Maintained by Tomer Iwan. Last updated 3 days ago.
1.9 match 7 stars 6.23 score 10 scripts
avocado:Weekly Hass Avocado Sales Summary
Provides a weekly summary of Hass Avocado sales for the contiguous US from January 2017 to November 2020. See the package website for more information, documentation, and examples. Data source: Hass Avocado Board <>.
Maintained by Nikhil Agarwal. Last updated 4 years ago.
2.4 match 4.34 score 11 scripts
fishkirkko2015:Dataset of Measurements of Fish Species at Kirkkojarvi Lake, Finland
Dataset of 302 measurements of 11 fish species to accompany the manuscript "Length-weight relationships of six freshwater fish species from lake Kirkkojarvi, Finland".
Maintained by Jose Gama. Last updated 8 years ago.
10.3 match 1.00 score
VGAMdata:Data Supporting the 'VGAM' Package
Mainly data sets to accompany the VGAM package and the book "Vector Generalized Linear and Additive Models: With an Implementation in R" (Yee, 2015) <DOI:10.1007/978-1-4939-2818-7>. These are used to illustrate vector generalized linear and additive models (VGLMs/VGAMs), and associated models (Reduced-Rank VGLMs, Quadratic RR-VGLMs, Row-Column Interaction Models, and constrained and unconstrained ordination models in ecology). This package now contains some old VGAM family functions which have been replaced by newer ones (often because they are now special cases).
Maintained by Thomas Yee. Last updated 1 month ago.
3.4 match 1 stars 2.94 score 95 scripts 1 dependents
geocn:Loads Spatial Data Sets of China
Providing various commonly used spatial data related to Chinese regions in the R programming environment.
Maintained by Wenbo Lv. Last updated 3 months ago.
2.0 match 17 stars 4.93 score 10 scripts
AnnotationHubData:Transform public data resources into Bioconductor Data Structures
These recipes convert a wide variety and a growing number of public bioinformatic data sets into easily-used standard Bioconductor data structures.
Maintained by Bioconductor Package Maintainer. Last updated 4 days ago.
1.8 match 5.02 score 22 scripts 4 dependents
HubPub:Utilities to create and use Bioconductor Hubs
HubPub provides users with functionality to help with the Bioconductor Hub structures. The package provides the ability to create a skeleton of a Hub style package that the user can then populate with the necessary information. There are also functions to help add resources to the Hub package metadata files as well as publish data to the Bioconductor S3 bucket.
Maintained by Kayla Interdonato. Last updated 16 hours ago.
1.1 match 3 stars 5.18 score 4 scripts
geographer:Geography Vizualisations
Provides functions and objects to create visualisations for my Geography lessons.
Maintained by Pascal Burkhard. Last updated 21 days ago.
2.0 match 1 stars 2.78 score
forward:Robust Analysis using Forward Search
Robust analysis using forward search in linear and generalized linear regression models, as described in Atkinson, A.C. and Riani, M. (2000), Robust Diagnostic Regression Analysis, First Edition. New York: Springer.
Maintained by Ken Beath. Last updated 6 months ago.
4.5 match 1.18 score 15 scripts
syllogi:Collection of Data Sets for Teaching Purposes
Collection (syllogi in Greek) of real and fictitious data sets for teaching purposes. The real datasets were manually entered by the author from the respective references, as listed in the individual dataset documentation. The fictitious datasets are the author's own creations, which he has found useful for teaching statistics.
Maintained by Jared Studyvin. Last updated 2 months ago.
3.5 match 1.30 score
assist:A Suite of R Functions Implementing Spline Smoothing Techniques
Fit various smoothing spline models. Includes an ssr() function for smoothing spline regression, an nnr() function for nonparametric nonlinear regression, an snr() function for semiparametric nonlinear regression, an slm() function for semiparametric linear mixed-effects models, and an snm() function for semiparametric nonlinear mixed-effects models. See Wang (2011) <doi:10.1201/b10954> for an overview.
Maintained by Yuedong Wang. Last updated 2 years ago.
4.0 match 1.00 score
MCAvariants:Multiple Correspondence Analysis Variants
Provides two variants of multiple correspondence analysis (ca): multiple ca and ordered multiple ca via orthogonal polynomials of Emerson.
Maintained by Rosaria Lombardo. Last updated 2 years ago.
4.0 match 1.00 score 6 scripts
Nmix:Bayesian Inference on Univariate Normal Mixtures
A program for Bayesian analysis of univariate normal mixtures with an unknown number of components, following the approach of Richardson and Green (1997) <doi:10.1111/1467-9868.00095>. This makes use of reversible jump Markov chain Monte Carlo methods that are capable of jumping between the parameter sub-spaces corresponding to different numbers of components in the mixture. A sample from the full joint distribution of all unknown variables is thereby generated, and this can be used as a basis for a thorough presentation of many aspects of the posterior distribution.
Maintained by Peter Green. Last updated 1 year ago.
3.8 match 1.00 score
wd4tx:Access 'TWDB' Water Data For Texas
An R interface to the Texas Water Development Board ('TWDB') Water Data for Texas website <>.
Maintained by Michael Schramm. Last updated 3 years ago.
1.8 match 1.70 score
implyr:R Interface for Apache Impala
'SQL' back-end to 'dplyr' for Apache Impala, the massively parallel processing query engine for Apache 'Hadoop'. Impala enables low-latency 'SQL' queries on data stored in the 'Hadoop' Distributed File System '(HDFS)', Apache 'HBase', Apache 'Kudu', Amazon Simple Storage Service '(S3)', Microsoft Azure Data Lake Store '(ADLS)', and Dell 'EMC' 'Isilon'. See <> for more information about Impala.
Maintained by Ian Cook. Last updated 1 year ago.
0.5 match 81 stars 5.71 score 42 scripts
apcf:Adapted Pair Correlation Function
The adapted pair correlation function transfers the concept of the pair correlation function from point patterns to patterns of objects of finite size and irregular shape (e.g. lakes within a country). The pair correlation function describes the spatial distribution of objects, e.g. random, aggregated or regularly spaced. This is a reimplementation of the method suggested by Nuske et al. (2009) <doi:10.1016/j.foreco.2009.09.050> using the library 'GEOS' <doi:10.5281/zenodo.11396894>.
Maintained by Robert Nuske. Last updated 10 days ago.
0.5 match 5 stars 4.95 score 12 scripts
s3.resourcer:S3 Resource Resolver
An S3 resource is provided by Amazon Web Services S3 or an S3-compatible object store (such as Minio). The resource can be a tidy file to be downloaded from the object store, or a data lake (such as Delta Lake) Parquet file to be read by Apache Spark.
Maintained by Yannick Marcon. Last updated 2 months ago.
0.8 match 2.70 score 3 scripts
extRatum:Summary Statistics for Geospatial Features
Provides summary statistics of local geospatial features within a given geographic area. It does so by calculating the area covered by a target geospatial feature (e.g. buildings, parks, lakes, etc.). The geospatial features can be any type of geospatial data, including point, polygon or line data.
Maintained by Nikos Patias. Last updated 4 years ago.
0.5 match 2 stars 3.00 score
simfish:simulate fish tracks & acoustic detections
Facilitates simulation of fish tracks in featureless and semi-realistic environments (coastal oceans, fjords, and lakes). A user-defined raster defining the boundaries between water body and land is used to constrain simulated tracks. Fish movements are simulated (currently) as either a correlated random walk or a biased & correlated random walk, with the bias toward a defined Centre-of-Attraction. A potential function is used to ensure the tracks avoid land. The package can also be used to simulate detections of acoustically-tagged fish by acoustic receivers at user-defined locations.
Maintained by Ian Jonsen. Last updated 1 year ago.
0.5 match 2.70 score 3 scripts
SwissAir:Air Quality Data of Switzerland for One Year in 30 Min Resolution
Ozone, NOx (= sum of nitrogen monoxide and nitrogen dioxide), nitrogen monoxide, ambient temperature, dew point, wind speed and wind direction at 3 sites around Lake Lucerne in Central Switzerland, in 30 min time resolution for the year 2004.
Maintained by Christoph Hofer. Last updated 1 year ago.
0.5 match 1 stars 1.00 score 5 scripts