crawl: Fit Continuous-Time Correlated Random Walk Models to Animal Movement Data
Fit continuous-time correlated random walk models with time-indexed covariates to animal telemetry data. The model is fit using the Kalman filter on a state-space version of the continuous-time stochastic movement process.
Maintained by Devin S. Johnson. Last updated 6 months ago.
56.8 match 19 stars 6.29 score 63 scripts 3 dependents
forkonlp
N2H4: Handling Methods for Naver News Text Crawling
Provides functions to get Korean text samples from news articles on Naver, a popular news portal service in Korea.
Maintained by Chanyub Park. Last updated 1 year ago.
12.9 match 216 stars 6.11 score 20 scripts
openintrostat
openintro: Datasets and Supplemental Functions from 'OpenIntro' Textbooks and Labs
Supplemental functions and data for 'OpenIntro' resources, which include open-source textbooks and resources for introductory statistics. The package contains datasets used in our open-source textbooks along with custom plotting functions for reproducing book figures. Note that many functions and examples include color transparency; some plotting elements may not show up properly (or at all) when run in some versions of the Windows operating system.
Maintained by Mine Çetinkaya-Rundel. Last updated 3 months ago.
4.5 match 240 stars 11.39 score 6.0k scripts
julianfaraway
faraway: Datasets and Functions for Books by Julian Faraway
Books are "Linear Models with R" (CRC Press; 1st Ed. August 2004, 2nd Ed. July 2014, 3rd Ed. February 2025; ISBN 9781439887332), "Extending the Linear Model with R" (CRC Press; 1st Ed. December 2005, 2nd Ed. March 2016; ISBN 9781584884248), and "Practical Regression and ANOVA in R", contributed documentation on CRAN (now very dated).
Maintained by Julian Faraway. Last updated 1 month ago.
4.0 match 29 stars 9.43 score 1.7k scripts 1 dependents
lightbluetitan
MedDataSets: Comprehensive Medical, Disease, Treatment, and Drug Datasets
Provides an extensive collection of datasets related to medicine, diseases, treatments, drugs, and public health. This package covers topics such as drug effectiveness, vaccine trials, survival rates, infectious disease outbreaks, and medical treatments. The included datasets span various health conditions, including AIDS, cancer, bacterial infections, and COVID-19, along with information on pharmaceuticals and vaccines. These datasets are sourced from the R ecosystem and other R packages, remaining unaltered to ensure data integrity. This package serves as a valuable resource for researchers, analysts, and healthcare professionals interested in conducting medical and public health data analysis in R.
Maintained by Renzo Caceres Rossi. Last updated 5 months ago.
4.5 match 8 stars 5.68 score 60 scripts
jameshwade
gpttools: Extensions and Tools for gptstudio
gpttools is an R package that provides extensions to gptstudio to provide devtools-like functionality using the latest natural language processing (NLP) models. It is designed to make package development easier by providing a range of tools and functions that can be used to improve the quality of your package's documentation, testing, and maybe even functionality.
Maintained by James Wade. Last updated 7 months ago.
3.3 match 293 stars 7.06 score 14 scripts
henryrscharf
anipaths: Animation of Multiple Trajectories with Uncertainty
Animation of observed trajectories using spline-based interpolation (see, for example, Buderman, F. E., Hooten, M. B., Ivan, J. S. and Shenk, T. M. (2016), "A functional model for characterizing long-distance movement behaviour", Methods Ecol Evol, <doi:10.1111/2041-210X.12465>). Intended to be used for exploratory data analysis, and perhaps for preparation of presentations.
Maintained by Henry Scharf. Last updated 16 days ago.
6.3 match 2.92 score 14 scripts
dsjohnson
crawlUtils: Enhance And Integrate the {crawl} Package For Spatial Analysis Of Telemetry Output
Utility functions to augment the {crawl} package and integrate it with the {sf} package for spatial analysis of telemetry model output. The additional functions are targeted toward analysis of marine mammal telemetry, but can be used or easily modified for other situations.
Maintained by Devin S. Johnson. Last updated 6 months ago.
6.8 match 2 stars 2.60 score 1 scripts
bmcclintock
momentuHMM: Maximum Likelihood Analysis of Animal Movement Behavior Using Multivariate Hidden Markov Models
Extended tools for analyzing telemetry data using generalized hidden Markov models. Features of momentuHMM (pronounced "momentum") include data pre-processing and visualization, fitting HMMs to location and auxiliary biotelemetry or environmental data, biased and correlated random walk movement models, discrete- or continuous-time HMMs, continuous- or discrete-space movement models, approximate Langevin diffusion models, hierarchical HMMs, multiple imputation for incorporating location measurement error and missing data, user-specified design matrices and constraints for covariate modelling of parameters, random effects, decoding of the state process, visualization of fitted models, model checking and selection, and simulation. See McClintock and Michelot (2018) <doi:10.1111/2041-210X.12995>.
Maintained by Brett McClintock. Last updated 1 month ago.
1.8 match 43 stars 8.47 score 162 scripts
forkonlp
DNH4: Crawling for Daum News Text
Provides utilities to get Korean text samples from news articles on Daum, a popular news portal service in Korea.
Maintained by Chanyub Park. Last updated 3 months ago.
3.1 match 31 stars 4.49 score 6 scripts
cran
oncrawlR: Machine Learning for S.E.O
Measures different aspects of page content, structure and performance for SEO (Search Engine Optimization). Aspects covered include HTML tags used in SEO, duplicate and near-duplicate content, structured data, on-site linking structure and popularity transfer, and many other amazing things. This package can be used to generate a real, full SEO audit report, which serves to detect errors or inefficiencies on a page that can be corrected in order to optimise its performance on search engines.
Maintained by Vincent Terrasi. Last updated 5 years ago.
8.1 match 1.70 score 1 scripts
markbravington
mvbutils: General utilities, workspace organization, code and docu editing, live package maintenance, etc
Hierarchical workspace tree, code editing and backup, easy package prep, editing of packages while loaded, per-object lazy-loading, easy documentation, macro functions, and miscellaneous utilities. Needed by debug package.
Maintained by Mark V. Bravington. Last updated 7 days ago.
1.8 match 6.53 score 138 scripts 18 dependents
peterkdunn
GLMsData: Generalized Linear Model Data Sets
Data sets from the book Generalized Linear Models with Examples in R by Dunn and Smyth.
Maintained by Peter K. Dunn. Last updated 3 years ago.
3.8 match 2.61 score 220 scripts
ropensci
roadoi: Find Free Versions of Scholarly Publications via Unpaywall
This web client interfaces with Unpaywall, formerly oaDOI, a service that finds free full texts of academic papers by linking DOIs with open access journals and repositories. It provides unified access to various data sources for open access full-text links, including Crossref and the Directory of Open Access Journals (DOAJ). API usage is free and no registration is required.
Maintained by Najko Jahn. Last updated 6 months ago.
1.3 match 65 stars 7.25 score 69 scripts
bioc
GEOfastq: Downloads ENA Fastqs With GEO Accessions
GEOfastq is used to download fastq files from the European Nucleotide Archive (ENA) starting with an accession from the Gene Expression Omnibus (GEO). To do this, sample metadata is retrieved from GEO and the Sequence Read Archive (SRA). SRA run accessions are then used to construct FTP and aspera download links for fastq files generated by the ENA.
Maintained by Alex Pickering. Last updated 5 months ago.
1.8 match 4 stars 4.60 score 6 scripts
jmlondon
pathroutr: An R Package for (Re-)Routing Paths Around Barriers
The `pathroutr` package aims to provide a set of tools for routing marine animal tracks around land barriers based on the shortest path through a visibility graph network. The foundation of the package is a graph network created from a Delaunay Triangle mesh created from the vertices of land polygons within the study area. Any network edges that cross or fall completely within the land (barrier) polygons are removed.
Maintained by Josh London. Last updated 2 years ago.
1.2 match 16 stars 4.91 score 17 scripts 1 dependents
salimk
Rcrawler: Web Crawler and Scraper
Performs parallel web crawling and web scraping. It is designed to crawl, parse, and store web pages to produce data that can be used directly in analysis applications. For details see Khalil and Fakir (2017) <DOI:10.1016/j.softx.2017.04.004>.
Maintained by Salim Khalil. Last updated 5 years ago.
0.8 match 354 stars 6.89 score 110 scripts
gastonbecerra
ojsr: Crawler and Data Scraper for Open Journal System ('OJS')
Crawler for 'OJS' pages and scraper for metadata from articles. You can crawl 'OJS' archives, issues, articles, galleys, and search results. You can scrape article metadata from the head tag of the HTML, or from Open Archives Initiative ('OAI') records. Most of these functions rely on 'OJS' routing conventions.
Maintained by Gaston Becerra. Last updated 4 months ago.
0.5 match 3 stars 4.35 score 15 scripts
r-spark
sparkwarc: Load WARC Files into Apache Spark
Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This allows reading files from the Common Crawl project.
Maintained by Edgar Ruiz. Last updated 3 years ago.
0.5 match 13 stars 3.89 score 12 scripts