Showing 97 of total 97 results (show query)
hrbrmstr
ggalt:Extra Coordinate Systems, 'Geoms', Statistical Transformations, Scales and Fonts for 'ggplot2'
A compendium of new geometries, coordinate systems, statistical transformations, scales and fonts for 'ggplot2', including splines, 1d and 2d densities, univariate average shifted histograms, a new map coordinate system based on the 'PROJ.4'-library along with geom_cartogram() that mimics the original functionality of geom_map(), formatters for "bytes", a stat_stepribbon() function, increased 'plotly' compatibility and the 'StateFace' open source font 'ProPublica'. Further new functionality includes lollipop charts, dumbbell charts, the ability to encircle points and coordinate-system-based text annotations.
Maintained by Bob Rudis. Last updated 2 years ago.
geomggplot-extensionggplot2ggplot2-geomggplot2-scales
7.9 match 674 stars 12.59 score 2.3k scripts 7 dependentscirculosmeos
bytescircle:Statistics About Bytes Contained in a File as a Circle Plot
Shows statistics about bytes contained in a file as a circle graph of deviations from mean in sigma increments. The function can be useful for statistically analyze the content of files in a glimpse: text files are shown as a green centered crown, compressed and encrypted files should be shown as equally distributed variations with a very low CV (sigma/mean), and other types of files can be classified between these two categories depending on their text vs binary content, which can be useful to quickly determine how information is stored inside them (databases, multimedia files, etc).
Maintained by Roberto S. Galende. Last updated 3 years ago.
byte-valuesbytescircledeviationplotsigma
22.7 match 3 stars 4.18 score 1 scriptseddelbuettel
RProtoBuf:R Interface to the 'Protocol Buffers' 'API' (Version 2 or 3)
Protocol Buffers are a way of encoding structured data in an efficient yet extensible format. Google uses Protocol Buffers for almost all of its internal 'RPC' protocols and file formats. Additional documentation is available in two included vignettes one of which corresponds to our 'JSS' paper (2016, <doi:10.18637/jss.v071.i02>. A sufficiently recent version of 'Protocol Buffers' library is required; currently version 3.3.0 from 2017 is the stated minimum.
Maintained by Dirk Eddelbuettel. Last updated 1 months ago.
c-plus-plusprotocol-buffersprotobufcpp
7.5 match 73 stars 11.42 score 126 scripts 21 dependentsbnosac
tokenizers.bpe:Byte Pair Encoding Text Tokenization
Unsupervised text tokenizer focused on computational efficiency. Wraps the 'YouTokenToMe' library <https://github.com/VKCOM/YouTokenToMe> which is an implementation of fast Byte Pair Encoding (BPE) <https://aclanthology.org/P16-1162/>.
Maintained by Jan Wijffels. Last updated 2 years ago.
bpebyte-pair-encodingtext-miningtokenizationcpp
17.3 match 15 stars 4.56 score 48 scriptsrstudio
gt:Easily Create Presentation-Ready Display Tables
Build display tables from tabular data with an easy-to-use set of functions. With its progressive approach, we can construct display tables with a cohesive set of table parts. Table values can be formatted using any of the included formatting functions. Footnotes and cell styles can be precisely added through a location targeting system. The way in which 'gt' handles things for you means that you don't often have to worry about the fine details.
Maintained by Richard Iannone. Last updated 10 days ago.
docxeasy-to-usehtmllatexrtfsummary-tables
3.8 match 2.1k stars 18.36 score 20k scripts 112 dependentsr-lib
rlang:Functions for Base Types and Core R and 'Tidyverse' Features
A toolbox for working with base types, core R features like the condition system, and core 'Tidyverse' features like tidy evaluation.
Maintained by Lionel Henry. Last updated 19 days ago.
3.0 match 517 stars 20.53 score 9.8k scripts 15k dependentsbnosac
sentencepiece:Text Tokenization using Byte Pair Encoding and Unigram Modelling
Unsupervised text tokenizer allowing to perform byte pair encoding and unigram modelling. Wraps the 'sentencepiece' library <https://github.com/google/sentencepiece> which provides a language independent tokenizer to split text in words and smaller subword units. The techniques are explained in the paper "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing" by Taku Kudo and John Richardson (2018) <doi:10.18653/v1/D18-2012>. Provides as well straightforward access to pretrained byte pair encoding models and subword embeddings trained on Wikipedia using 'word2vec', as described in "BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages" by Benjamin Heinzerling and Michael Strube (2018) <http://www.lrec-conf.org/proceedings/lrec2018/pdf/1049.pdf>.
Maintained by Jan Wijffels. Last updated 2 years ago.
bytenatural-language-processingsentencepieceword-segmentationcpp
14.6 match 25 stars 4.10 score 8 scriptshadley
pryr:Tools for Computing on the Language
Useful tools to pry back the covers of R and understand the language at a deeper level.
Maintained by Hadley Wickham. Last updated 1 years ago.
4.8 match 204 stars 11.85 score 1.9k scripts 56 dependentsdankelley
oce:Analysis of Oceanographic Data
Supports the analysis of Oceanographic data, including 'ADCP' measurements, measurements made with 'argo' floats, 'CTD' measurements, sectional data, sea-level time series, coastline and topographic data, etc. Provides specialized functions for calculating seawater properties such as potential temperature in either the 'UNESCO' or 'TEOS-10' equation of state. Produces graphical displays that conform to the conventions of the Oceanographic literature. This package is discussed extensively by Kelley (2018) "Oceanographic Analysis with R" <doi:10.1007/978-1-4939-8844-0>.
Maintained by Dan Kelley. Last updated 14 hours ago.
3.7 match 146 stars 15.42 score 4.2k scripts 18 dependentscoolbutuseless
ctypesio:Read and Write Standard 'C' Types from Files, Connections and Raw Vectors
Interacting with binary files can be difficult because R's types are a subset of what is generally supported by 'C'. This package provides a suite of functions for reading and writing binary data (with files, connections, and raw vectors) using 'C' type descriptions. These functions convert data between 'C' types and R types while checking for values outside the type limits, 'NA' values, etc.
Maintained by Mike Cheng. Last updated 2 months ago.
7.7 match 5 stars 6.02 score 6 scripts 1 dependentshenrikbengtsson
R.oo:R Object-Oriented Programming with or without References
Methods and classes for object-oriented programming in R with or without references. Large effort has been made on making definition of methods as simple as possible with a minimum of maintenance for package developers. The package has been developed since 2001 and is now considered very stable. This is a cross-platform package implemented in pure R that defines standard S3 classes without any tricks.
Maintained by Henrik Bengtsson. Last updated 4 months ago.
3.8 match 20 stars 11.49 score 329 scripts 828 dependentsboydorr
rdiversity:Measurement and Partitioning of Similarity-Sensitive Biodiversity
Provides a framework for the measurement and partitioning of the (similarity-sensitive) biodiversity of a metacommunity and its constituent subcommunities. Richard Reeve, et al. (2016) <arXiv:1404.6520v3>.
Maintained by Richard Reeve. Last updated 3 years ago.
biodiversitydiversity-measurementpartitioning-diversity
5.4 match 8 stars 6.85 score 66 scripts 1 dependentsgagolews
stringi:Fast and Portable Character String Processing Facilities
A collection of character string/text/natural language processing tools for pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, sorting, padding, wrapping, Unicode normalisation, date-time formatting and parsing, and many more. They are fast, consistent, convenient, and - thanks to 'ICU' (International Components for Unicode) - portable across all locales and platforms. Documentation about 'stringi' is provided via its website at <https://stringi.gagolewski.com/> and the paper by Gagolewski (2022, <doi:10.18637/jss.v103.i02>).
Maintained by Marek Gagolewski. Last updated 1 months ago.
icuicu4cnatural-language-processingnlpregexregexpstring-manipulationstringistringrtexttext-processingtidy-dataunicodecpp
2.0 match 309 stars 18.31 score 10k scripts 8.6k dependentsr-lidar
lidR:Airborne LiDAR Data Manipulation and Visualization for Forestry Applications
Airborne LiDAR (Light Detection and Ranging) interface for data manipulation and visualization. Read/write 'las' and 'laz' files, computation of metrics in area based approach, point filtering, artificial point reduction, classification from geographic data, normalization, individual tree segmentation and other manipulations.
Maintained by Jean-Romain Roussel. Last updated 1 months ago.
alsforestrylaslazlidarpoint-cloudremote-sensingopenblascppopenmp
2.5 match 623 stars 14.47 score 844 scripts 8 dependentsrstudio
reticulate:Interface to 'Python'
Interface to 'Python' modules, classes, and functions. When calling into 'Python', R data types are automatically converted to their equivalent 'Python' types. When values are returned from 'Python' to R they are converted back to R types. Compatible with all versions of 'Python' >= 2.7.
Maintained by Tomasz Kalinowski. Last updated 1 days ago.
1.7 match 1.7k stars 21.07 score 18k scripts 427 dependentsr-lib
scales:Scale Functions for Visualization
Graphical scales map data to aesthetics, and provide methods for automatically determining breaks and labels for axes and legends.
Maintained by Thomas Lin Pedersen. Last updated 5 months ago.
1.7 match 419 stars 19.88 score 88k scripts 7.9k dependentsjeroen
openssl:Toolkit for Encryption, Signatures and Certificates Based on OpenSSL
Bindings to OpenSSL libssl and libcrypto, plus custom SSH key parsers. Supports RSA, DSA and EC curves P-256, P-384, P-521, and curve25519. Cryptographic signatures can either be created and verified manually or via x509 certificates. AES can be used in cbc, ctr or gcm mode for symmetric encryption; RSA for asymmetric (public key) encryption or EC for Diffie Hellman. High-level envelope functions combine RSA and AES for encrypting arbitrary sized data. Other utilities include key generators, hash functions (md5, sha1, sha256, etc), base64 encoder, a secure random number generator, and 'bignum' math methods for manually performing crypto calculations on large multibyte integers.
Maintained by Jeroen Ooms. Last updated 1 months ago.
1.8 match 65 stars 18.00 score 632 scripts 5.0k dependentsr-lib
prettyunits:Pretty, Human Readable Formatting of Quantities
Pretty, human readable formatting of quantities. Time intervals: '1337000' -> '15d 11h 23m 20s'. Vague time intervals: '2674000' -> 'about a month ago'. Bytes: '1337' -> '1.34 kB'. Rounding: '99' with 3 significant digits -> '99.0' p-values: '0.00001' -> '<0.0001'. Colors: '#FF0000' -> 'red'. Quantities: '1239437' -> '1.24 M'.
Maintained by Gabor Csardi. Last updated 7 months ago.
2.4 match 133 stars 13.46 score 86 scripts 3.3k dependentsyihui
xfun:Supporting Functions for Packages Maintained by 'Yihui Xie'
Miscellaneous functions commonly used in other packages maintained by 'Yihui Xie'.
Maintained by Yihui Xie. Last updated 2 days ago.
1.8 match 145 stars 18.18 score 916 scripts 4.4k dependentscoolbutuseless
rmonocypher:Easy Encryption of R Objects using Strong Modern Cryptography
Encrypt R objects to a raw vector or file using modern cryptographic techniques. Password-based key derivation is with 'Argon2' (<https://en.wikipedia.org/wiki/Argon2>). Objects are serialized and then encrypted using 'XChaCha20-Poly1305' (<https://en.wikipedia.org/wiki/ChaCha20-Poly1305>) which follows RFC 8439 for authenticated encryption (<https://en.wikipedia.org/wiki/Authenticated_encryption>). Cryptographic functions are provided by the included 'monocypher' 'C' library (<https://monocypher.org>).
Maintained by Mike Cheng. Last updated 1 months ago.
4.6 match 17 stars 6.25 score 4 scriptsr-lidar
rlas:Read and Write 'las' and 'laz' Binary File Formats Used for Remote Sensing Data
Read and write 'las' and 'laz' binary file formats. The LAS file format is a public file format for the interchange of 3-dimensional point cloud data between data users. The LAS specifications are approved by the American Society for Photogrammetry and Remote Sensing <https://www.asprs.org/divisions-committees/lidar-division/laser-las-file-format-exchange-activities>. The LAZ file format is an open and lossless compression scheme for binary LAS format versions 1.0 to 1.4 <https://laszip.org/>.
Maintained by Jean-Romain Roussel. Last updated 8 months ago.
3.4 match 34 stars 8.03 score 49 scripts 9 dependentsedwbaker
sonicscrewdriver:Bioacoustic Analysis and Publication Tools
Provides tools for manipulating sound files for bioacoustic analysis, and preparing analyses these for publication. The package validates that values are physically possible wherever feasible.
Maintained by Ed Baker. Last updated 1 months ago.
3.8 match 6 stars 7.12 score 26 scriptsropensci
git2r:Provides Access to Git Repositories
Interface to the 'libgit2' library, which is a pure C implementation of the 'Git' core methods. Provides access to 'Git' repositories to extract data and running some basic 'Git' commands.
Maintained by Stefan Widgren. Last updated 11 days ago.
gitgit-clientlibgit2libgit2-library
1.9 match 218 stars 13.86 score 836 scripts 49 dependentsbioc
Rarr:Read Zarr Files in R
The Zarr specification defines a format for chunked, compressed, N-dimensional arrays. It's design allows efficient access to subsets of the stored array, and supports both local and cloud storage systems. Rarr aims to implement this specifcation in R with minimal reliance on an external tools or libraries.
Maintained by Mike Smith. Last updated 17 days ago.
dataimportome-zarron-diskout-of-memoryzarrc-blosclibzstd
3.7 match 35 stars 6.68 score 21 scriptsr-gregmisc
gdata:Various R Programming Tools for Data Manipulation
Various R programming tools for data manipulation, including medical unit conversions, combining objects, character vector operations, factor manipulation, obtaining information about R objects, generating fixed-width format files, extracting components of date & time objects, operations on columns of data frames, matrix operations, operations on vectors, operations on data frames, value of last evaluated expression, and a resample() wrapper for sample() that ensures consistent behavior for both scalar and vector arguments.
Maintained by Arni Magnusson. Last updated 2 months ago.
1.8 match 9 stars 13.62 score 4.5k scripts 124 dependentsdavisvaughan
treesitter:Bindings to 'Tree-Sitter'
Provides bindings to 'Tree-sitter', an incremental parsing system for programming tools. 'Tree-sitter' builds concrete syntax trees for source files of any language, and can efficiently update those syntax trees as the source file is edited. It also includes a robust error recovery system that provides useful parse results even in the presence of syntax errors.
Maintained by Davis Vaughan. Last updated 6 months ago.
3.6 match 37 stars 6.62 score 18 scripts 2 dependentsmelff
RKernel:Yet another R kernel for Jupyter
Provides a kernel for Jupyter.
Maintained by Martin Elff. Last updated 14 days ago.
jupyterjupyter-kerneljupyter-kernelsjupyter-notebook
5.0 match 38 stars 4.60 scorerpolars
polars:Lightning-Fast 'DataFrame' Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Maintained by Soren Welling. Last updated 3 days ago.
1.9 match 499 stars 12.01 score 1.0k scripts 2 dependentsbrodieg
fansi:ANSI Control Sequence Aware String Functions
Counterparts to R string manipulation functions that account for the effects of ANSI text formatting control sequences.
Maintained by Brodie Gaslam. Last updated 10 months ago.
1.6 match 54 stars 14.18 score 136 scripts 11k dependentss-u
base64enc:Tools for base64 Encoding
Tools for handling base64 encoding. It is more flexible than the orphaned base64 package.
Maintained by Simon Urbanek. Last updated 3 years ago.
1.7 match 9 stars 12.62 score 680 scripts 4.8k dependentstiledb-inc
tiledb:Modern Database Engine for Complex Data Based on Multi-Dimensional Arrays
The modern database 'TileDB' introduces a powerful on-disk format for storing and accessing any complex data based on multi-dimensional arrays. It supports dense and sparse arrays, dataframes and key-values stores, cloud storage ('S3', 'GCS', 'Azure'), chunked arrays, multiple compression, encryption and checksum filters, uses a fully multi-threaded implementation, supports parallel I/O, data versioning ('time travel'), metadata and groups. It is implemented as an embeddable cross-platform C++ library with APIs from several languages, and integrations. This package provides the R support.
Maintained by Isaiah Norton. Last updated 3 days ago.
arrayhdfss3storage-managertiledbcpp
1.8 match 107 stars 11.96 score 306 scripts 4 dependentsqsbase
qs:Quick Serialization of R Objects
Provides functions for quickly writing and reading any R object to and from disk.
Maintained by Travers Ching. Last updated 9 days ago.
compressiondata-storageencodingserializationlibzstdlz4cpp
1.5 match 414 stars 13.91 score 2.5k scripts 51 dependentsbioc
igvR:igvR: integrative genomics viewer
Access to igv.js, the Integrative Genomics Viewer running in a web browser.
Maintained by Arkadiusz Gladki. Last updated 5 months ago.
visualizationthirdpartyclientgenomebrowsers
2.5 match 43 stars 8.31 score 118 scriptsreside-ic
ids:Generate Random Identifiers
Generate random or human readable and pronounceable identifiers.
Maintained by Rich FitzJohn. Last updated 3 years ago.
1.5 match 94 stars 13.27 score 175 scripts 165 dependentscoolbutuseless
c64asm:6502 Assembler
A simple 6502 assembler written purely in R and leveraging R data structures for pre-computing character sets and images.
Maintained by mikefc. Last updated 1 years ago.
8.9 match 3 stars 2.22 score 11 scriptsropensci
frictionless:Read and Write Frictionless Data Packages
Read and write Frictionless Data Packages. A 'Data Package' (<https://specs.frictionlessdata.io/data-package/>) is a simple container format and standard to describe and package a collection of (tabular) data. It is typically used to publish FAIR (<https://www.go-fair.org/fair-principles/>) and open datasets.
Maintained by Peter Desmet. Last updated 6 months ago.
2.0 match 30 stars 9.79 score 55 scripts 6 dependentsneonscience
neonUtilities:Utilities for Working with NEON Data
NEON data packages can be accessed through the NEON Data Portal <https://www.neonscience.org> or through the NEON Data API (see <https://data.neonscience.org/data-api> for documentation). Data delivered from the Data Portal are provided as monthly zip files packaged within a parent zip file, while individual files can be accessed from the API. This package provides tools that aid in discovering, downloading, and reformatting data prior to use in analyses. This includes downloading data via the API, merging data tables by type, and converting formats. For more information, see the readme file at <https://github.com/NEONScience/NEON-utilities>.
Maintained by Claire Lunch. Last updated 1 months ago.
1.8 match 57 stars 10.66 score 944 scripts 15 dependentscsgillespie
benchmarkme:Crowd Sourced System Benchmarks
Benchmark your CPU and compare against other CPUs. Also provides functions for obtaining system specifications, such as RAM, CPU type, and R version.
Maintained by Colin Gillespie. Last updated 10 months ago.
2.0 match 41 stars 8.96 score 118 scripts 13 dependentsdataoneorg
dataone:R Interface to the DataONE REST API
Provides read and write access to data and metadata from the DataONE network <https://www.dataone.org> of data repositories. Each DataONE repository implements a consistent repository application programming interface. Users call methods in R to access these remote repository functions, such as methods to query the metadata catalog, get access to metadata for particular data packages, and read the data objects from the data repository. Users can also insert and update data objects on repositories that support these methods.
Maintained by Matthew B. Jones. Last updated 3 years ago.
1.8 match 36 stars 9.93 score 472 scripts 3 dependentsbioc
snpStats:SnpMatrix and XSnpMatrix classes and methods
Classes and statistical methods for large SNP association studies. This extends the earlier snpMatrix package, allowing for uncertainty in genotypes.
Maintained by David Clayton. Last updated 5 months ago.
microarraysnpgeneticvariabilityzlib
1.8 match 9.41 score 674 scripts 17 dependentsrkbauer
oceanmap:A Plotting Toolbox for 2D Oceanographic Data
Plotting toolbox for 2D oceanographic data (satellite data, sea surface temperature, chlorophyll, ocean fronts & bathymetry). Recognized classes and formats include netcdf, Raster, '.nc' and '.gz' files.
Maintained by Robert K. Bauer. Last updated 1 years ago.
bathymetrychlaggplotmapping-toolsncdfoceanographic-dataremote-sensingsatellite-imspatial-datasst
3.4 match 4 stars 4.54 score 58 scripts 1 dependentss-u
PKI:Public Key Infrastucture for R Based on the X.509 Standard
Public Key Infrastucture functions such as verifying certificates, RSA encription and signing which can be used to build PKI infrastructure and perform cryptographic tasks.
Maintained by Simon Urbanek. Last updated 7 months ago.
1.8 match 18 stars 8.52 score 63 scripts 8 dependentsdipterix
dipsaus:A Dipping Sauce for Data Analysis and Visualizations
Works as an "add-on" to packages like 'shiny', 'future', as well as 'rlang', and provides utility functions. Just like dipping sauce adding flavors to potato chips or pita bread, 'dipsaus' for data analysis and visualizations adds handy functions and enhancements to popular packages. The goal is to provide simple solutions that are frequently asked for online, such as how to synchronize 'shiny' inputs without freezing the app, or how to get memory size on 'Linux' or 'MacOS' system. The enhancements roughly fall into these four categories: 1. 'shiny' input widgets; 2. high-performance computing using the 'future' package; 3. modify R calls and convert among numbers, strings, and other objects. 4. utility functions to get system information such like CPU chip-set, memory limit, etc.
Maintained by Zhengjia Wang. Last updated 4 days ago.
1.8 match 13 stars 7.90 score 85 scripts 3 dependentsropensci
redland:RDF Library Bindings in R
Provides methods to parse, query and serialize information stored in the Resource Description Framework (RDF). RDF is described at <https://www.w3.org/TR/rdf-primer/>. This package supports RDF by implementing an R interface to the Redland RDF C library, described at <https://librdf.org/docs/api/index.html>. In brief, RDF provides a structured graph consisting of Statements composed of Subject, Predicate, and Object Nodes.
Maintained by Matthew B. Jones. Last updated 1 years ago.
1.8 match 17 stars 7.85 score 98 scripts 13 dependentsdavzim
rtiktoken:A Byte-Pair-Encoding (BPE) Tokenizer for OpenAI's Large Language Models
A thin wrapper around the tiktoken-rs crate, allowing to encode text into Byte-Pair-Encoding (BPE) tokens and decode tokens back to text. This is useful to understand how Large Language Models (LLMs) perceive text.
Maintained by David Zimmermann-Kollenda. Last updated 4 months ago.
3.3 match 11 stars 4.22 score 3 scriptsdavidchall
ipaddress:Data Analysis for IP Addresses and Networks
Classes and functions for working with IP (Internet Protocol) addresses and networks, inspired by the Python 'ipaddress' module. Offers full support for both IPv4 and IPv6 (Internet Protocol versions 4 and 6) address spaces. It is specifically designed to work well with the 'tidyverse'.
Maintained by David Hall. Last updated 1 years ago.
cyberdata-analysisip-addressipv4ipv6vctrscpp
1.9 match 32 stars 7.02 score 27 scripts 2 dependentsmlampros
textTinyR:Text Processing for Small or Big Data Files
It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and incorporates functions for the calculation of (pairwise) text document dissimilarities. The source code is based on 'C++11' and exported in R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.
Maintained by Lampros Mouselimis. Last updated 1 years ago.
bhboostcpp11processingrcpprcpparmadillotextopenblascppopenmp
1.7 match 38 stars 7.64 score 244 scripts 1 dependentscoolbutuseless
rbytecode:R Byte Code Assembler/Disassembler
Assembler/Disassembler for R's byte code.
Maintained by Mike Cheng. Last updated 7 months ago.
3.6 match 35 stars 3.47 score 17 scriptscoolbutuseless
bitstreamio:Read and Write Bits from Files, Connections and Raw Vectors
Bit-level reading and writing are necessary when dealing with many file formats e.g. compressed data and binary files. Currently, R connections are manipulated at the byte level. This package wraps existing connections and raw vectors so that it is possible to read bits, bit sequences, unaligned bytes and low-bit representations of integers.
Maintained by Mike Cheng. Last updated 2 months ago.
2.6 match 3 stars 4.18 score 4 scriptscoolbutuseless
c64vice:Interface to Binary Monitor in VICE C64 Emulator
Interface to the binary monitor in VICE - the c64 emulator.
Maintained by mikefc. Last updated 1 years ago.
5.1 match 2 stars 2.08 score 12 scriptspatzaw
TKCat:Tailored Knowledge Catalog
Facilitate the management of data from knowledge resources that are frequently used alone or together in research environments. In 'TKCat', knowledge resources are manipulated as modeled database (MDB) objects. These objects provide access to the data tables along with a general description of the resource and a detail data model documenting the tables, their fields and their relationships. These MDBs are then gathered in catalogs that can be easily explored an shared. Finally, 'TKCat' provides tools to easily subset, filter and combine MDBs and create new catalogs suited for specific needs.
Maintained by Patrice Godard. Last updated 14 hours ago.
1.8 match 5 stars 6.08 score 27 scriptstrinker
numform:Tools to Format Numbers for Publication
Format numbers and plots for publication; includes the removal of leading zeros, standardization of number of digits, addition of affixes, and a p-value formatter. These tools combine the functionality of several 'base' functions such as 'paste()', 'format()', and 'sprintf()' into specific use case functions that are named in a way that is consistent with usage, making their names easy to remember and easy to deploy.
Maintained by Tyler Rinker. Last updated 3 years ago.
1.8 match 51 stars 6.06 score 151 scripts 1 dependentsshaunpwilkinson
insect:Informatic Sequence Classification Trees
Provides tools for probabilistic taxon assignment with informatic sequence classification trees. See Wilkinson et al (2018) <doi:10.7287/peerj.preprints.26812v1>.
Maintained by Shaun Wilkinson. Last updated 4 years ago.
1.7 match 14 stars 5.80 score 91 scriptscoolbutuseless
fastpng:Read and Write PNG Files with Configurable Decoder/Encoder Options
Read and write PNG images with arrays, rasters, native rasters, numeric arrays, integer arrays, raw vectors and indexed values. This PNG encoder exposes configurable internal options enabling the user to select a speed-size tradeoff. For example, disabling compression can speed up writing PNG by a factor of 50. Multiple image formats are supported including raster, native rasters, and integer and numeric arrays at color depths of 1, 2, 3 or 4. 16-bit images are also supported. This implementation uses the 'libspng' 'C' library which is available from <https://github.com/randy408/libspng/>.
Maintained by Mike Cheng. Last updated 2 months ago.
1.7 match 18 stars 5.86 score 7 scriptsgerrymanoim
humanize:Create Values for Human Consumption
An almost direct port of the 'python' 'humanize' package <https://github.com/jmoiron/humanize>. This package contains utilities to convert values into human readable forms.
Maintained by Gerry Manoim. Last updated 7 years ago.
datehumanhuman-readablenumbertime
1.9 match 13 stars 5.10 score 13 scripts 5 dependentseitsupi
neopolars:R Bindings for the 'polars' Rust Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Maintained by Tatsuya Shima. Last updated 12 hours ago.
1.9 match 40 stars 4.86 score 1 scriptsraven-computing
raven.rdf:An R Interface for Raven DataFrames (Beta0)
Provides an I/O interface between R data.frames and Raven DataFrames. Defines functions to both read and write DataFrame files, as well as serialize/deserialize data.frames/DataFrames.
Maintained by Phil Gaiser. Last updated 4 years ago.
3.4 match 2.70 scoregrimbough
FITfileR:Read FIT files using only native R code
The 'FIT' (Flexible and Interoperable Data Transfer) protocol is designed specifically for sharing data from fitness and health devices. This package allows reading 'FIT' files in pure R without any dependence on external software or SDKs.
Maintained by Mike Smith. Last updated 2 months ago.
1.6 match 56 stars 5.59 score 46 scriptsrogiersbart
ti:A single-channel UINT8 TIFF toolkit
The {ti} package provides a set of tools for working with single-channel unsigned 8-bit integer TIFF images.
Maintained by Bart Rogiers. Last updated 9 months ago.
3.6 match 2.46 score 58 scriptsnicholasjcooper
NCmisc:Miscellaneous Functions for Creating Adaptive Functions and Scripts
A set of handy functions. Includes a versatile one line progress bar, one line function timer with detailed output, time delay function, text histogram, object preview, CRAN package search, simpler package installer, Linux command install check, a flexible Mode function, top function, simulation of correlated data, and more.
Maintained by Nicholas Cooper. Last updated 2 years ago.
2.3 match 3.86 score 172 scripts 5 dependentsdavzim
RITCH:R Parser for the ITCH-Protocol
Allows to efficiently parse, filter, and write binary ITCH Files (Version 5.0) containing detailed financial transactions as distributed by NASDAQ to an R data.table.
Maintained by David Zimmermann-Kollenda. Last updated 7 months ago.
2.0 match 18 stars 4.32 score 23 scriptsevanodell
mnis:Easy Downloading Capabilities for the Members' Name Information Service
An API package for the Members' Name Information Service operated by the UK parliament. Documentation for the API itself can be found here: <http://data.parliament.uk/membersdataplatform/default.aspx>.
Maintained by Evan Odell. Last updated 4 years ago.
parliamentary-monitoringpolitical-sciencepoliticianspoliticscpp
1.7 match 4 stars 5.13 score 67 scriptscoolbutuseless
zstdlite:Fast Compression and Serialization with 'Zstandard' Algorithm
Fast, compressed serialization of R objects using the 'Zstandard' algorithm. The included zstandard connection ('zstdfile()') can be used to read/write compressed data by any code which supports R's built-in 'connections' mechanism. Dictionaries are supported for more effective compression of small data, and functions are provided for training these dictionaries. This implementation provides an R interface to advanced features of the 'Zstandard' 'C' library (available from <https://github.com/facebook/zstd>).
Maintained by Mike Cheng. Last updated 2 months ago.
1.7 match 30 stars 4.95 score 7 scriptsdustin
humanFormat:Human-Friendly Formatting Functions
Format quantities of time or bytes into human-friendly strings.
Maintained by Dustin Sallings. Last updated 3 years ago.
2.3 match 5 stars 3.40 score 9 scriptscran
notebookutils:Dummy R APIs Used in 'Azure Synapse Analytics' for Local Developments
This is a pure dummy interfaces package which mirrors 'MsSparkUtils' APIs <https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/microsoft-spark-utilities?pivots=programming-language-r> of 'Azure Synapse Analytics' <https://learn.microsoft.com/en-us/azure/synapse-analytics/> for R users, customer of Azure Synapse can download this package from CRAN for local development.
Maintained by runtimeexp. Last updated 11 months ago.
3.3 match 2.36 score 23 scriptskurthornik
tau:Text Analysis Utilities
Utilities for text analysis.
Maintained by Kurt Hornik. Last updated 5 months ago.
1.9 match 4.02 score 115 scripts 6 dependentsdenchpokepon
fedstatAPIr:Unofficial API for Fedstat (Rosstat EMISS System) for Automatic and Efficient Data Queries
An API for automatic data queries to the fedstat <https://www.fedstat.ru>, using a small set of functions with a common interface.
Maintained by Denis Krylov. Last updated 4 months ago.
1.8 match 23 stars 4.06 score 5 scriptssmaakage85
trimmer:Trim an Object
A lightweight toolkit to reduce the size of a list object. The object is minimized by recursively removing elements from the object one-by-one. The process is constrained by a reference function call specified by the user, where the target object is given as an argument. The procedure will not allow elements to be removed from the object, that will cause results from the function call to diverge from the function call with the original object.
Maintained by Lars Kjeldgaard. Last updated 5 years ago.
1.8 match 2 stars 4.00 score 9 scriptsrossellhayes
stringstatic:Dependency-Free String Operations
Provides drop-in replacements for functions from the 'stringr' package, with the same user interface. These functions have no external dependencies and can be copied directly into your package code using the 'staticimports' package.
Maintained by Alexander Rossell Hayes. Last updated 2 years ago.
1.9 match 6 stars 3.48 score 1 scriptskwb-r
kwb.fakin:Functions Used in Our Fakin Project
This package contains all functions and evaluations related to KWB project "FAKIN". Set the Subversion setting 'enable-auto-props' to 'yes' so that the 'auto-probs' options are considered.
Maintained by Hauke Sonnenberg. Last updated 3 years ago.
project-fakinresearch-data-management
1.8 match 1 stars 3.40 score 1 scriptsropensci
cld2:Google's Compact Language Detector 2
Bindings to Google's C++ library Compact Language Detector 2 (see <https://github.com/cld2owners/cld2#readme> for more information). Probabilistically detects over 80 languages in plain text or HTML. For mixed-language input it returns the top three detected languages and their approximate proportion of the total classified text bytes (e.g. 80% English and 20% French out of 1000 bytes). There is also a 'cld3' package on CRAN which uses a neural network model instead.
Maintained by Jeroen Ooms. Last updated 5 months ago.
cldcld2language-detectionlanguage-detectorcpp
0.8 match 38 stars 7.74 score 161 scripts 3 dependentsk5cents
k5:Kiernan Nicholls Miscellaneous
Quality of life functions for interactive programming. Shortcuts for common combinations of functions or different default arguments. Not to be used in production level scripts, but useful for exploring and quickly manipulating data for easy analysis. Also imports a variety of packages to facilitate the installation of those imported packages on the host machine.
Maintained by Kiernan Nicholls. Last updated 6 months ago.
1.8 match 1 stars 3.30 score 2 scriptsbioc
gdsfmt:R Interface to CoreArray Genomic Data Structure (GDS) Files
Provides a high-level R interface to CoreArray Genomic Data Structure (GDS) data files. GDS is portable across platforms with hierarchical structure to store multiple scalable array-oriented data sets with metadata information. It is suited for large-scale datasets, especially for data which are much larger than the available random-access memory. The gdsfmt package offers the efficient operations specifically designed for integers of less than 8 bits, since a diploid genotype, like single-nucleotide polymorphism (SNP), usually occupies fewer bits than a byte. Data compression and decompression are available with relatively efficient random access. It is also allowed to read a GDS file in parallel with multiple R processes supported by the package parallel.
Maintained by Xiuwen Zheng. Last updated 23 hours ago.
infrastructuredataimportbioinformaticsgds-formatgenomicscpp
0.5 match 18 stars 11.34 score 920 scripts 29 dependentsgillian-earthscope
IRISSeismic:Classes and Methods for Seismic Data Analysis
Provides classes and methods for seismic data analysis. The base classes and methods are inspired by the python code found in the 'ObsPy' python toolbox <https://github.com/obspy/obspy>. Additional classes and methods support data returned by web services provided by EarthScope. <https://service.earthscope.org/>.
Maintained by Gillian Sharer. Last updated 3 months ago.
1.8 match 3.18 score 50 scripts 1 dependentscoolbutuseless
lz4lite:Extremely Fast Compression with LZ4
Extremely fast compression of R objects with LZ4.
Maintained by mikefc. Last updated 4 years ago.
1.7 match 20 stars 3.00 score 7 scriptseddelbuettel
RcppMsgPack:'MsgPack' C++ Header Files and Interface Functions for R
'MsgPack' header files are provided for use by R packages, along with the ability to access, create and alter 'MsgPack' objects directly from R. 'MsgPack' is an efficient binary serialization format. It lets you exchange data among multiple languages like 'JSON' but it is faster and smaller. Small integers are encoded into a single byte, and typical short strings require only one extra byte in addition to the strings themselves. This package provides headers from the 'msgpack-c' implementation for C and C++(11) for use by R, particularly 'Rcpp'. The included 'msgpack-c' headers are licensed under the Boost Software License (Version 1.0); the code added by this package as well the R integration are licensed under the GPL (>= 2). See the files 'COPYRIGHTS' and 'AUTHORS' for a full list of copyright holders and contributors to 'msgpack-c'.
Maintained by Dirk Eddelbuettel. Last updated 2 months ago.
messagepackmsgpackmsgpack-headerscpp
0.8 match 20 stars 6.38 score 16 scripts 1 dependentsdyfanjones
sagemaker.mlcore:sagemaker machine learning core classes and methods
`sagemaker` machine learning core classes and methods.
Maintained by Dyfan Jones. Last updated 3 years ago.
amazon-sagemakerawsmachine-learningsagemakersdk
1.8 match 2.65 score 3 dependentsbenbruyneel
proteinDiscover:ProteinDiscover
Provides an interface to the data contained in Proteome Discoverer (Thermo Scientific) results.
Maintained by Ben Bruyneel. Last updated 1 years ago.
mass-spectrometryproteomicsproteomics-data-analysis
1.6 match 2 stars 3.00 score 2 scriptsnatverse
dracor:Decode Draco Format 3D Mesh Data
Decodes meshes and point cloud data encoded by the Draco mesh compression library from Google. Note that this is only designed for basic decoding and not intended as a full scale wrapping of the Draco library.
Maintained by Gregory Jefferis. Last updated 2 years ago.
1.7 match 2.70 score 3 scriptsjl5000
tidyged.io:Import and Export GEDCOM Files
Import and export family tree GEDCOM files to and from tidy dataframes.
Maintained by Jamie Lendrum. Last updated 3 years ago.
1.8 match 2.48 score 2 dependentskindlychung
collUtils:Auxiliary Package for CollapsABEL
This package includes some low level functions for processing PLINK input and output files.
Maintained by Kaiyin Zhong. Last updated 9 years ago.
1.8 match 2.00 score 4 scriptsandreyshabalin
filematrix:File-Backed Matrix Class with Convenient Read and Write Access
Interface for working with large matrices stored in files, not in computer memory. Supports multiple non-character data types (double, integer, logical and raw) of various sizes (e.g. 8 and 4 byte real values). Access to parts of the matrix is done by indexing, exactly as with usual R matrices. Supports very large matrices. Tested on multi-terabyte matrices. Allows for more than 2^32 rows or columns. Allows for quick addition of extra columns to a filematrix. Cross-platform as the package has R code only.
Maintained by Andrey A Shabalin. Last updated 6 years ago.
0.5 match 8 stars 6.51 score 45 scripts 2 dependentshrbrmstr
qrencoder:Quick Response Code (QR Code) / Matrix Barcode Creator
Quick Response codes (QR codes) are a type of matrix bar code and can be used to authenticate transactions, provide access to multi-factor authentication services and enable general data transfer in an image. QR codes use four standardized encoding modes (numeric, alphanumeric, byte/binary, and kanji) to efficiently store data. Matrix barcode generation is performed efficiently in C via the included 'libqrencoder' library created by Kentaro Fukuchi.
Maintained by Bob Rudis. Last updated 6 years ago.
0.5 match 61 stars 6.03 score 59 scripts 1 dependentscerte-medical-epidemiology
msgxtractr:Read Outlook '.msg' Files
Forked from hrbrmstr/msgxtractr. 'Microsoft' 'Outlook' messages can be saved in '.msg' files. Tools are provided that enable extraction of metadata, envelope, headers, body and attachments from these files.
Maintained by Bob Rudis. Last updated 6 months ago.
1.8 match 1.70 score 8 scriptspolkas
pacs:Supplementary Tools for R Packages Developers
Supplementary utils for CRAN maintainers and R packages developers. Validating the library, packages and lock files. Exploring a complexity of a specific package like evaluating its size in bytes with all dependencies. The shiny app complexity could be explored too. Assessing the life duration of a specific package version. Checking a CRAN package check page status for any errors and warnings. Retrieving a DESCRIPTION or NAMESPACE file for any package version. Comparing DESCRIPTION or NAMESPACE files between different package versions. Getting a list of all releases for a specific package. The Bioconductor is partly supported.
Maintained by Maciej Nasinski. Last updated 6 months ago.
bioconductordependencieslibrarylifedurationrenvshinytoolsutils
0.5 match 25 stars 5.70 score 8 scriptsdirkschumacher
listcomp:List Comprehensions
An implementation of list comprehensions as purely syntactic sugar with a minor runtime overhead. It constructs nested for-loops and executes the byte-compiled loops to collect the results.
Maintained by Dirk Schumacher. Last updated 3 years ago.
comprehensionslist-comprehensionslistcomprehensions
0.5 match 19 stars 5.33 score 3 scripts 7 dependentscoolbutuseless
xxhashlite:Extremely Fast Hashing of R Objects, Raw Data and Files using 'xxHash' Algorithms
Extremely fast hashing of R objects using 'xxHash'. R objects are hashed via the standard serialization mechanism in R. Raw byte vectors and strings can be handled directly for compatibility with hashes created on other systems. This implementation is a wrapper around the 'xxHash' 'C' library which is available from <https://github.com/Cyan4973/xxHash>.
Maintained by Mike Cheng. Last updated 3 months ago.
0.5 match 37 stars 4.87 score 3 scriptscran
serial:The Serial Interface Package
Enables reading and writing binary and ASCII data to RS232/RS422/RS485 or any other virtual serial interface of the computer.
Maintained by Martin Seilmayer. Last updated 5 years ago.
1.7 match 1.48 score 1 dependentsjmablog
tinieR:Shrink Image Filesizes with TinyPNG.com API
Shrink image filesizes with the TinyPNG API <https://tinypng.com>. From the TinyPNG website: "TinyPNG uses smart lossy compression techniques to reduce the file size of your files. By selectively decreasing the number of colors in the image, fewer bytes are required to store the data. The effect is nearly invisible but it makes a very large difference in file size!" TinieR works with .png and .jpg/.jpeg files, and can return the new image filepath to enable embedding in other image workflows/functions.
Maintained by James Adams. Last updated 3 years ago.
0.5 match 11 stars 4.22 score 4 scriptshrbrmstr
wand:Retrieve Magic Attributes from Files and Directories
MIME types are shorthand descriptors for file contents and can be determined from "magic" bytes in file headers, file contents or intuited from file extensions. Tools are provided to perform curated "magic" tests as well as mapping MIME types from a database of over 1,800 extension mappings.
Maintained by Bob Rudis. Last updated 5 years ago.
0.5 match 3.69 score 11 scripts 3 dependentshypertidy
ximage:Draw Images of Raster Data and Related Adornments
Draw images easily and as if doing that was considered desirable or even essential to be able to do. Set up a plot with an image, specify where that image should be placed. Image plot by default reflects the index of the image data itself, or can be specified in simple extent terms 'xmin,xmax,ymin,ymax'. Numeric matrices, integer arrays, byte arrays, character arrays, and native rasters are (or will be) supported. A little bit like 'image()' and 'rasterImage()' from the 'graphics' package but with the good features celebrated and made easier to use.
Maintained by Michael D. Sumner. Last updated 7 months ago.
0.5 match 8 stars 3.25 score 22 scriptscran
convertBCD:Convert Decimal to Binary-Coded Decimal (BCD) Form and Vice Versa
Convert BCD (raw bytes) to decimal numbers and vice versa. BCD format is used to preserve decimals exactly, as opposed to the binary rounding errors inherent in "numeric" or "floating-point" formats.
Maintained by Carl Witthoft. Last updated 9 months ago.
0.5 match 1.00 score