Showing 86 of total 86 results (show query)
gagolews
stringi:Fast and Portable Character String Processing Facilities
A collection of character string/text/natural language processing tools for pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, sorting, padding, wrapping, Unicode normalisation, date-time formatting and parsing, and many more. They are fast, consistent, convenient, and - thanks to 'ICU' (International Components for Unicode) - portable across all locales and platforms. Documentation about 'stringi' is provided via its website at <https://stringi.gagolewski.com/> and the paper by Gagolewski (2022, <doi:10.18637/jss.v103.i02>).
Maintained by Marek Gagolewski. Last updated 1 months ago.
icuicu4cnatural-language-processingnlpregexregexpstring-manipulationstringistringrtexttext-processingtidy-dataunicodecpp
14.5 match 309 stars 18.31 score 10k scripts 8.6k dependentsrpolars
polars:Lightning-Fast 'DataFrame' Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Maintained by Soren Welling. Last updated 3 days ago.
14.2 match 499 stars 12.01 score 1.0k scripts 2 dependentstrinker
textclean:Text Cleaning Tools
Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
Maintained by Tyler Rinker. Last updated 3 years ago.
data-mungingemoticonsregextext-analysistext-cleaning
15.9 match 248 stars 10.08 score 760 scripts 22 dependentsr-lib
rex:Friendly Regular Expressions
A friendly interface for the construction of regular expressions.
Maintained by Kevin Ushey. Last updated 1 years ago.
10.3 match 335 stars 14.17 score 178 scripts 95 dependentstrinker
qdapRegex:Regular Expression Removal, Extraction, and Replacement Tools
A collection of regular expression tools associated with the 'qdap' package that may be useful outside of the context of discourse analysis. Tools include removal/extraction/replacement of abbreviations, dates, dollar amounts, email addresses, hash tags, numbers, percentages, citations, person tags, phone numbers, times, and zip codes.
Maintained by Tyler Rinker. Last updated 1 years ago.
12.1 match 50 stars 9.48 score 502 scripts 41 dependentsjonclayden
ore:An R Interface to the Onigmo Regular Expression Library
Provides an alternative to R's built-in functionality for handling regular expressions, based on the Onigmo library. Offers first-class compiled regex objects, partial matching and function-based substitutions, amongst other features.
Maintained by Jon Clayden. Last updated 1 days ago.
regexregular-expressionstext-analysis
15.4 match 58 stars 7.16 score 125 scripts 6 dependentsirworkshop
campfin:Wrangle Campaign Finance Data
Explore and normalize American campaign finance data. Created by the Investigative Reporting Workshop to facilitate work on The Accountability Project, an effort to collect public data into a central, standard database that is more easily searched: <https://publicaccountability.org/>.
Maintained by Kiernan Nicholls. Last updated 1 years ago.
campaign-financedata-journalism
18.8 match 17 stars 5.66 score 54 scriptsgadenbuie
regexplain:Rstudio Addin to Explain, Test and Build Regular Expressions
A set of RStudio Addins to help interactively test and build regular expressions. Provides a Shiny gadget interface for interactively constructing the regular expression and viewing the results from common string-searching functions. The gadget interface includes a helpful regex syntax reference sheet and a library of common patterns.
Maintained by Garrick Aden-Buie. Last updated 4 years ago.
gadgetregexregex-expressionregular-expressionrstudio-addinshinystringr
22.8 match 486 stars 4.07 score 12 scriptsverbalexpressions
RVerbalExpressions:Create Regular Expressions Easily
Build regular expressions using grammar and functionality inspired by <https://github.com/VerbalExpressions>. Usage of the %>% is encouraged to build expressions in a chain-like fashion.
Maintained by Tyler Littlefield. Last updated 12 months ago.
regexregular-expressionsverbal-expressions
12.8 match 281 stars 7.17 score 105 scriptseitsupi
neopolars:R Bindings for the 'polars' Rust Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Maintained by Tatsuya Shima. Last updated 12 hours ago.
17.6 match 40 stars 4.86 score 1 scriptslrberge
stringmagic:Character String Operations and Interpolation, Magic Edition
Performs complex string operations compactly and efficiently. Supports string interpolation jointly with over 50 string operations. Also enhances regular string functions (like grep() and co). See an introduction at <https://lrberge.github.io/stringmagic/>.
Maintained by Laurent R Berge. Last updated 7 months ago.
7.2 match 15 stars 10.56 score 37 scripts 33 dependentsrichierocks
rebus.base:Core Functionality for the 'rebus' Package
Build regular expressions piece by piece using human readable code. This package contains core functionality, and is primarily intended to be used by package developers.
Maintained by Richard Cotton. Last updated 8 years ago.
15.3 match 2 stars 4.81 score 31 scripts 7 dependentstidyverse
stringr:Simple, Consistent Wrappers for Common String Operations
A consistent, simple and easy to use set of wrappers around the fantastic 'stringi' package. All function and argument names (and positions) are consistent, all functions deal with "NA"'s and zero length vectors in the same way, and the output from one function is easy to feed into the input of another.
Maintained by Hadley Wickham. Last updated 7 months ago.
3.3 match 622 stars 21.97 score 164k scripts 8.2k dependentsgbganalyst
forstringr:String Manipulation Package for Those Familiar with 'Microsoft Excel'
The goal of 'forstringr' is to enable complex string manipulation in R especially to those more familiar with LEFT(), RIGHT(), and MID() functions in Microsoft Excel. The package combines the power of 'stringr' with other manipulation packages such as 'dplyr' and 'tidyr'.
Maintained by Ezekiel Ogundepo. Last updated 6 months ago.
11.0 match 10 stars 5.89 score 26 scripts 1 dependentsrichierocks
rebus:Build Regular Expressions in a Human Readable Way
Build regular expressions piece by piece using human readable code. This package is designed for interactive use. For package development, use the rebus.* dependencies.
Maintained by Richard Cotton. Last updated 8 years ago.
9.3 match 33 stars 6.73 score 364 scripts 3 dependentsgagolews
stringx:Replacements for Base String Functions Powered by 'stringi'
English is the native language for only 5% of the World population. Also, only 17% of us can understand this text. Moreover, the Latin alphabet is the main one for merely 36% of the total. The early computer era, now a very long time ago, was dominated by the US. Due to the proliferation of the internet, smartphones, social media, and other technologies and communication platforms, this is no longer the case. This package replaces base R string functions (such as grep(), tolower(), sprintf(), and strptime()) with ones that fully support the Unicode standards related to natural language and date-time processing. It also fixes some long-standing inconsistencies, and introduces some new, useful features. Thanks to 'ICU' (International Components for Unicode) and 'stringi', they are fast, reliable, and portable across different platforms.
Maintained by Marek Gagolewski. Last updated 2 months ago.
icuicu4cnatural-language-processingnlpregexregexpstring-manipulationstringitexttext-processingunicode
11.0 match 28 stars 4.75 score 1 scriptstdhock
nc:Named Capture to Data Tables
User-friendly functions for extracting a data table (row for each match, column for each group) from non-tabular text data using regular expressions, and for melting columns that match a regular expression. Patterns are defined using a readable syntax that makes it easy to build complex patterns in terms of simpler, re-usable sub-patterns. Named R arguments are translated to column names in the output; capture groups without names are used internally in order to provide a standard interface to three regular expression 'C' libraries ('PCRE', 'RE2', 'ICU'). Output can also include numeric columns via user-specified type conversion functions.
Maintained by Toby Hocking. Last updated 2 months ago.
7.0 match 16 stars 6.85 score 46 scriptspbs-software
PBSmodelling:GUI Tools Made Easy: Interact with Models and Explore Data
Provides software to facilitate the design, testing, and operation of computer models. It focuses particularly on tools that make it easy to construct and edit a customized graphical user interface ('GUI'). Although our simplified 'GUI' language depends heavily on the R interface to the 'Tcl/Tk' package, a user does not need to know 'Tcl/Tk'. Examples illustrate models built with other R packages, including 'PBSmapping', 'PBSddesolve', and 'BRugs'. A complete user's guide 'PBSmodelling-UG.pdf' shows how to use this package effectively.
Maintained by Rowan Haigh. Last updated 4 months ago.
6.8 match 2 stars 6.76 score 120 scripts 4 dependentsbenwiseman
roperators:Additional Operators to Help you Write Cleaner R Code
Provides string arithmetic, reassignment operators, logical operators that handle missing values, and extra logical operators such as floating point equality and all or nothing. The intent is to allow R users to write code that is easier to read, write, and maintain while providing a friendlier experience to new R users from other language backgrounds (such as 'Python') who are used to concepts such as x += 1 and 'foo' + 'bar'. Includes operators for not in, easy floating point comparisons, === equivalent, and SQL-like like operations (), etc. We also added in some extra helper functions, such as OS checks, pasting in Oxford comma format, and functions to get the first, last, nth, or most common element of a vector or word in a string.
Maintained by Ben Wiseman. Last updated 2 months ago.
7.2 match 4 stars 5.77 score 49 scripts 2 dependentsrdatatable
data.table:Extension of `data.frame`
Fast aggregation of large data (e.g. 100GB in RAM), fast ordered joins, fast add/modify/delete of columns by group using no copies at all, list columns, friendly and fast character-separated-value read/write. Offers a natural and flexible syntax, for faster development.
Maintained by Tyson Barrett. Last updated 1 days ago.
1.8 match 3.7k stars 23.53 score 230k scripts 4.6k dependentstabima
effectR:Predicts Oomycete Effectors
Predicts cytoplasmic effector proteins using genomic data by searching for motifs of interest using regular expression searches and hidden Markov models (HMM) based in Haas et al. (2009) <doi:10.1038/nature08358>.
Maintained by Javier Tabima. Last updated 6 years ago.
8.6 match 1 stars 4.59 score 13 scriptsmarkfairbanks
tidytable:Tidy Interface to 'data.table'
A tidy interface to 'data.table', giving users the speed of 'data.table' while using tidyverse-like syntax.
Maintained by Mark Fairbanks. Last updated 2 months ago.
3.4 match 458 stars 11.41 score 732 scripts 10 dependentsgdemin
expss:Tables, Labels and Some Useful Functions from Spreadsheets and 'SPSS' Statistics
Package computes and displays tables with support for 'SPSS'-style labels, multiple and nested banners, weights, multiple-response variables and significance testing. There are facilities for nice output of tables in 'knitr', 'Shiny', '*.xlsx' files, R and 'Jupyter' notebooks. Methods for labelled variables add value labels support to base R functions and to some functions from other packages. Additionally, the package brings popular data transformation functions from 'SPSS' Statistics and 'Excel': 'RECODE', 'COUNT', 'COUNTIF', 'VLOOKUP' and etc. These functions are very useful for data processing in marketing research surveys. Package intended to help people to move data processing from 'Excel' and 'SPSS' to R.
Maintained by Gregory Demin. Last updated 11 months ago.
excellabelslabels-supportmsexcelpivot-tablesrecodespssspss-statisticstablesvariable-labelsvlookup
3.3 match 84 stars 11.00 score 1.8k scripts 4 dependentsrstudio
gt:Easily Create Presentation-Ready Display Tables
Build display tables from tabular data with an easy-to-use set of functions. With its progressive approach, we can construct display tables with a cohesive set of table parts. Table values can be formatted using any of the included formatting functions. Footnotes and cell styles can be precisely added through a location targeting system. The way in which 'gt' handles things for you means that you don't often have to worry about the fine details.
Maintained by Richard Iannone. Last updated 10 days ago.
docxeasy-to-usehtmllatexrtfsummary-tables
1.7 match 2.1k stars 18.36 score 20k scripts 112 dependentsropensci
beautier:'BEAUti' from R
'BEAST2' (<https://www.beast2.org>) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. 'BEAUti 2' (which is part of 'BEAST2') is a GUI tool that allows users to specify the many possible setups and generates the XML file 'BEAST2' needs to run. This package provides a way to create 'BEAST2' input files without active user input, but using R function calls instead.
Maintained by Richรจl J.C. Bilderbeek. Last updated 22 days ago.
bayesianbeastbeast2beautiphylogenetic-inferencephylogenetics
3.5 match 13 stars 8.76 score 198 scripts 5 dependentsknausb
vcfR:Manipulate and Visualize VCF Data
Facilitates easy manipulation of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices of data. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file (*.vcf.gz). It also may be converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and familiar R software.
Maintained by Brian J. Knaus. Last updated 22 days ago.
genomicspopulation-geneticspopulation-genomicsrcppvcf-datavisualizationzlibcpp
2.3 match 254 stars 13.59 score 3.1k scripts 19 dependentsr-lib
lintr:A 'Linter' for R Code
Checks adherence to a given style, syntax errors and possible semantic issues. Supports on the fly checking of R code edited with 'RStudio IDE', 'Emacs', 'Vim', 'Sublime Text', 'Atom' and 'Visual Studio Code'.
Maintained by Michael Chirico. Last updated 8 days ago.
1.7 match 1.2k stars 17.00 score 916 scripts 33 dependentsdesiquintans
librarian:Install, Update, Load Packages from CRAN, 'GitHub', and 'Bioconductor' in One Step
Automatically install, update, and load 'CRAN', 'GitHub', and 'Bioconductor' packages in a single function call. By accepting bare unquoted names for packages, it's easy to add or remove packages from the list.
Maintained by Desi Quintans. Last updated 3 months ago.
3.6 match 54 stars 7.63 score 410 scripts 1 dependentsjl5000
tidyged.internals:Internal Data Structures for the 'Tidyged' Package
Represent data structures described in the GEDCOM 5.5.5 Specification as tidy data.
Maintained by Jamie Lendrum. Last updated 3 years ago.
9.1 match 2.88 score 1 scripts 5 dependentstazinho
snakecase:Convert Strings into any Case
A consistent, flexible and easy to use tool to parse and convert strings into cases like snake or camel among others.
Maintained by Malte Grosser. Last updated 2 years ago.
camelcasecaseconversionpascalcasesnake-case
1.8 match 150 stars 13.99 score 744 scripts 290 dependentscanmod
iidda:Processing Infectious Disease Datasets in IIDDA.
Part of an open toolchain for processing infectious disease datasets available through the IIDDA data repository.
Maintained by Steve Walker. Last updated 4 months ago.
4.0 match 6.07 score 133 scripts 3 dependentshughjonesd
huxtable:Easily Create and Style Tables for LaTeX, HTML and Other Formats
Creates styled tables for data presentation. Export to HTML, LaTeX, RTF, 'Word', 'Excel', and 'PowerPoint'. Simple, modern interface to manipulate borders, size, position, captions, colours, text styles and number formatting. Table cells can span multiple rows and/or columns. Includes a 'huxreg' function for creation of regression tables, and 'quick_*' one-liners to print data to a new document.
Maintained by David Hugh-Jones. Last updated 11 days ago.
htmlhuxtablelatexmicrosoft-wordpowerpointreproducible-researchtables
1.7 match 323 stars 13.93 score 1.9k scripts 16 dependentsrenkun-ken
rlist:A Toolbox for Non-Tabular Data Manipulation
Provides a set of functions for data manipulation with list objects, including mapping, filtering, grouping, sorting, updating, searching, and other useful functions. Most functions are designed to be pipeline friendly so that data processing with lists can be chained.
Maintained by Kun Ren. Last updated 2 years ago.
1.7 match 206 stars 13.73 score 2.2k scripts 123 dependentsjuba
questionr:Functions to Make Surveys Processing Easier
Set of functions to make the processing and analysis of surveys easier : interactive shiny apps and addins for data recoding, contingency tables, dataset metadata handling, and several convenience functions.
Maintained by Julien Barnier. Last updated 7 hours ago.
1.7 match 83 stars 12.62 score 1.1k scripts 19 dependentsthinkr-open
lozen:Management tools for missions
Management tools for missions (internal and external). Includes weekly, GL projects, etc.
Maintained by Sรฉbastien Rochette. Last updated 12 months ago.
3.8 match 7 stars 5.42 score 14 scriptseagerai
fastai:Interface to 'fastai'
The 'fastai' <https://docs.fast.ai/index.html> library simplifies training fast and accurate neural networks using modern best practices. It is based on research in to deep learning best practices undertaken at 'fast.ai', including 'out of the box' support for vision, text, tabular, audio, time series, and collaborative filtering models.
Maintained by Turgut Abdullayev. Last updated 11 months ago.
audiocollaborative-filteringdarknetdarknet-image-classificationfastaimedicalobject-detectiontabulartextvision
2.0 match 118 stars 9.40 score 76 scriptsrstudio
pointblank:Data Validation and Organization of Metadata for Local and Remote Tables
Validate data in data frames, 'tibble' objects, 'Spark' 'DataFrames', and database tables. Validation pipelines can be made using easily-readable, consecutive validation steps. Upon execution of the validation plan, several reporting options are available. User-defined thresholds for failure rates allow for the determination of appropriate reporting actions. Many other workflows are available including an information management workflow, where the aim is to record, collect, and generate useful information on data tables.
Maintained by Richard Iannone. Last updated 9 days ago.
data-assertionsdata-checkerdata-dictionariesdata-framesdata-inferencedata-managementdata-profilerdata-qualitydata-validationdata-verificationdatabase-tableseasy-to-understandreporting-toolschema-validationtesting-toolsyaml-configuration
1.8 match 932 stars 10.59 score 284 scriptsqile0317
FastUtils:Fast, Readable Utility Functions
A wide variety of tools for general data analysis, wrangling, spelling, statistics, visualizations, package development, and more. All functions have vectorized implementations whenever possible. Exported names are designed to be readable, with longer names possessing short aliases.
Maintained by Qile Yang. Last updated 4 months ago.
scientific-computingutilitiesutilitycpp
3.7 match 2 stars 4.95 score 2 scriptsrossellhayes
stringstatic:Dependency-Free String Operations
Provides drop-in replacements for functions from the 'stringr' package, with the same user interface. These functions have no external dependencies and can be copied directly into your package code using the 'staticimports' package.
Maintained by Alexander Rossell Hayes. Last updated 2 years ago.
5.2 match 6 stars 3.48 score 1 scriptspoissonconsulting
chk:Check User-Supplied Function Arguments
For developers to check user-supplied function arguments. It is designed to be simple, fast and customizable. Error messages follow the tidyverse style guide.
Maintained by Joe Thorley. Last updated 2 months ago.
1.5 match 48 stars 11.89 score 22 scripts 95 dependentsgrunwaldlab
metacoder:Tools for Parsing, Manipulating, and Graphing Taxonomic Abundance Data
Reads, plots, and manipulates large taxonomic data sets, like those generated from modern high-throughput sequencing, such as metabarcoding (i.e. amplification metagenomics, 16S metagenomics, etc). It provides a tree-based visualization called "heat trees" used to depict statistics for every taxon in a taxonomy using color and size. It also provides various functions to do common tasks in microbiome bioinformatics on data in the 'taxmap' format defined by the 'taxa' package. The 'metacoder' package is described in the publication by Foster et al. (2017) <doi:10.1371/journal.pcbi.1005404>.
Maintained by Zachary Foster. Last updated 1 months ago.
community-diversityhierarchicalmetabarcodingpcrtaxonomytreescpp
1.8 match 140 stars 9.64 score 328 scriptsgwangjinkim
regreplaceR:Match and Replace Strings Based on Named Groups in Regular Expressions
An R6 class "Replacer" provided by the package simplifies working with regex patterns containing named groups. It allows easy retrieval of matched portions and targeted replacements by group name, improving both code clarity and maintainability.
Maintained by Gwang-Jin Kim. Last updated 6 months ago.
4.1 match 4.00 score 2 scriptswilliam-swl
baizer:Useful Functions for Data Processing
In ancient Chinese mythology, Bai Ze is a divine creature that knows the needs of everything. 'baizer' provides data processing functions frequently used by the author. Hope this package also knows what you want!
Maintained by William Song. Last updated 1 years ago.
dataframenumbersstringstidyverse
4.0 match 6 stars 3.95 score 5 scripts 1 dependentsmoodymudskipper
unglue:Extract Matched Substrings Using a Pattern
Use syntax inspired by the package 'glue' to extract matched substrings in a more intuitive and compact way than by using standard regular expressions.
Maintained by Antoine Fabri. Last updated 3 months ago.
1.8 match 161 stars 8.65 score 119 scripts 4 dependentspsyteachr
webexercises:Create Interactive Web Exercises in 'R Markdown' (Formerly 'webex')
Functions for easily creating interactive web pages using 'R Markdown' that students can use in self-guided learning.
Maintained by Lisa DeBruine. Last updated 2 years ago.
2.0 match 32 stars 6.86 score 458 scriptsbioc
autonomics:Unified Statistical Modeling of Omics Data
This package unifies access to Statistal Modeling of Omics Data. Across linear modeling engines (lm, lme, lmer, limma, and wilcoxon). Across coding systems (treatment, difference, deviation, etc). Across model formulae (with/without intercept, random effect, interaction or nesting). Across omics platforms (microarray, rnaseq, msproteomics, affinity proteomics, metabolomics). Across projection methods (pca, pls, sma, lda, spls, opls). Across clustering methods (hclust, pam, cmeans). It provides a fast enrichment analysis implementation. And an intuitive contrastogram visualisation to summarize contrast effects in complex designs.
Maintained by Aditya Bhagwat. Last updated 2 months ago.
softwaredataimportpreprocessingdimensionreductionprincipalcomponentregressiondifferentialexpressiongenesetenrichmenttranscriptomicstranscriptiongeneexpressionrnaseqmicroarrayproteomicsmetabolomicsmassspectrometry
2.3 match 5.95 score 5 scriptsabjur
abjutils:Useful Tools for Jurimetrical Analysis Used by the Brazilian Jurimetrics Association
The Brazilian Jurimetrics Association (ABJ in Portuguese, see <https://abj.org.br/> for more information) is a non-profit organization which aims to investigate and promote the use of statistics and probability in the study of Law and its institutions. This package implements general purpose tools used by ABJ, such as functions for sampling and basic manipulation of Brazilian lawsuits identification number. It also implements functions for text cleaning, such as accentuation removal.
Maintained by Caio Lente. Last updated 1 years ago.
1.8 match 55 stars 6.76 score 78 scripts 1 dependentstjarkvandemerwe
tidyprompt:Prompt Large Language Models and Enhance Their Functionality
Easily construct prompts and associated logic for interacting with large language models (LLMs). 'tidyprompt' introduces the concept of prompt wraps, which are building blocks that you can use to quickly turn a simple prompt into a complex one. Prompt wraps do not just modify the prompt text, but also add extraction and validation functions that will be applied to the response of the LLM. This ensures that the user gets the desired output. 'tidyprompt' can add various features to prompts and their evaluation by LLMs, such as structured output, automatic feedback, retries, reasoning modes, autonomous R function calling, and R code generation and evaluation. It is designed to be compatible with any LLM provider that offers chat completion.
Maintained by Luka Koning. Last updated 1 months ago.
1.8 match 16 stars 6.56 score 9 scriptsluisdva
unheadr:Handle Data with Messy Header Rows and Broken Values
Verb-like functions to work with messy data, often derived from spreadsheets or parsed PDF tables. Includes functions for unwrapping values broken up across rows, relocating embedded grouping values, and to annotate meaningful formatting in spreadsheet files.
Maintained by Luis D. Verde Arregoitia. Last updated 10 months ago.
1.7 match 61 stars 6.44 score 45 scriptsjibarozzo
nplyr:A Grammar of Nested Data Manipulation
Provides functions for manipulating nested data frames in a list-column using 'dplyr' <https://dplyr.tidyverse.org/> syntax. Rather than unnesting, then manipulating a data frame, 'nplyr' allows users to manipulate each nested data frame directly. 'nplyr' is a wrapper for 'dplyr' functions that provide tools for common data manipulation steps: filtering rows, selecting columns, summarising grouped data, among others.
Maintained by Bolรญvar Aponte Rolรณn. Last updated 1 months ago.
1.6 match 120 stars 6.56 score 1 dependentsnifu-no
saros.base:Base Tools for Semi-Automatic Reporting of Ordinary Surveys
Scaffold an entire web-based report using template chunks, based on a small chapter overview and a dataset. Highly adaptable with prefixes, suffixes, translations, etc. Also contains tools for password-protecting, e.g. for each organization's report on a website. Developed for the common case of a survey across multiple organizations/sites where each organization wants to obtain results for their organization compared with everyone else. See 'saros' (<https://CRAN.R-project.org/package=saros>) for tools used for authors in the drafted reports.
Maintained by Stephan Daus. Last updated 1 months ago.
1.7 match 1 stars 5.98 score 7 scriptsbioc
ISAnalytics:Analyze gene therapy vector insertion sites data identified from genomics next generation sequencing reads for clonal tracking studies
In gene therapy, stem cells are modified using viral vectors to deliver the therapeutic transgene and replace functional properties since the genetic modification is stable and inherited in all cell progeny. The retrieval and mapping of the sequences flanking the virus-host DNA junctions allows the identification of insertion sites (IS), essential for monitoring the evolution of genetically modified cells in vivo. A comprehensive toolkit for the analysis of IS is required to foster clonal trackign studies and supporting the assessment of safety and long term efficacy in vivo. This package is aimed at (1) supporting automation of IS workflow, (2) performing base and advance analysis for IS tracking (clonal abundance, clonal expansions and statistics for insertional mutagenesis, etc.), (3) providing basic biology insights of transduced stem cells in vivo.
Maintained by Francesco Gazzo. Last updated 3 months ago.
biomedicalinformaticssequencingsinglecell
1.8 match 3 stars 5.83 score 15 scriptshypertidy
ceramic:Download Online Imagery Tiles
Download imagery tiles to a standard cache and load the data into raster objects. Facilities for 'AWS' terrain <https://registry.opendata.aws/terrain-tiles/> terrain and 'Mapbox' <https://www.mapbox.com/> servers are provided.
Maintained by Michael Sumner. Last updated 1 years ago.
1.6 match 92 stars 6.44 score 121 scriptstgve
tgver:Turing Geovisualization Engine R package
Turing Geovisualization Engine R package for geospatial visualization and analysis.
Maintained by Layik Hama. Last updated 2 years ago.
geospatial-analysisgeospatial-visualization
1.8 match 5 stars 5.48 score 3 scriptsbioc
CoreGx:Classes and Functions to Serve as the Basis for Other 'Gx' Packages
A collection of functions and classes which serve as the foundation for our lab's suite of R packages, such as 'PharmacoGx' and 'RadioGx'. This package was created to abstract shared functionality from other lab package releases to increase ease of maintainability and reduce code repetition in current and future 'Gx' suite programs. Major features include a 'CoreSet' class, from which 'RadioSet' and 'PharmacoSet' are derived, along with get and set methods for each respective slot. Additional functions related to fitting and plotting dose response curves, quantifying statistical correlation and calculating area under the curve (AUC) or survival fraction (SF) are included. For more details please see the included documentation, as well as: Smirnov, P., Safikhani, Z., El-Hachem, N., Wang, D., She, A., Olsen, C., Freeman, M., Selby, H., Gendoo, D., Grossman, P., Beck, A., Aerts, H., Lupien, M., Goldenberg, A. (2015) <doi:10.1093/bioinformatics/btv723>. Manem, V., Labie, M., Smirnov, P., Kofia, V., Freeman, M., Koritzinksy, M., Abazeed, M., Haibe-Kains, B., Bratman, S. (2018) <doi:10.1101/449793>.
Maintained by Benjamin Haibe-Kains. Last updated 5 months ago.
softwarepharmacogenomicsclassificationsurvival
1.5 match 6.53 score 63 scripts 6 dependentstdhock
atime:Asymptotic Timing
Computing and visualizing comparative asymptotic timings of different algorithms and code versions. Also includes functionality for comparing empirical timings with expected references such as linear or quadratic, <https://en.wikipedia.org/wiki/Asymptotic_computational_complexity> Also includes functionality for measuring asymptotic memory and other quantities.
Maintained by Toby Hocking. Last updated 2 months ago.
1.3 match 7 stars 7.31 score 95 scriptsmatthewheun
RCLabels:Manipulate Matrix Row and Column Labels with Ease
Functions to assist manipulation of matrix row and column labels for all types of matrix mathematics where row and column labels are to be respected.
Maintained by Matthew Heun. Last updated 11 days ago.
2.0 match 1 stars 4.78 score 5 scripts 2 dependentsandrie
surveydata:Tools to Work with Survey Data
Data obtained from surveys contains information not only about the survey responses, but also the survey metadata, e.g. the original survey questions and the answer options. The 'surveydata' package makes it easy to keep track of this metadata, and to easily extract columns with specific questions.
Maintained by Andrie de Vries. Last updated 2 years ago.
1.7 match 23 stars 5.68 score 42 scriptsidslme
IDSL.MXP:Parser for mzML, mzXML, and netCDF Files (Mass Spectrometry Data)
A tiny parser to extract mass spectra data and metadata table of mass spectrometry acquisition properties from mzML, mzXML and netCDF files introduced in <doi:10.1021/acs.jproteome.2c00120>.
Maintained by Dinesh Barupal. Last updated 2 years ago.
exposomegc-ms-datalc-ms-datalipidomicsmass-spectrometrymetabolomicsmzmlmzxmlnetcdf
2.0 match 5 stars 4.57 score 1 scripts 5 dependentsosthomas
ggtikz:Post-Process 'ggplot2' Plots with 'TikZ' Code Using Plot Coordinates
Annotation of 'ggplot2' plots with arbitrary 'TikZ' code, using absolute data or relative plot coordinates.
Maintained by Oliver Thomas. Last updated 9 months ago.
1.8 match 10 stars 4.85 score 14 scriptsambiorix-web
slighe:Use 'path-to-regex' 'NPM' library
Use 'path-to-regex' 'NPM' library.
Maintained by John Coene. Last updated 3 years ago.
3.6 match 4 stars 2.30 scoreidslme
IDSL.UFA:United Formula Annotation (UFA) for HRMS Data Processing
A pipeline to annotate chromatography peaks from the 'IDSL.IPA' workflow <doi:10.1021/acs.jproteome.2c00120> with molecular formulas of a prioritized chemical space using an isotopic profile matching approach. The 'IDSL.UFA' workflow only requires mass spectrometry level 1 (MS1) data for formula annotation. The 'IDSL.UFA' methods was described in <doi:10.1021/acs.analchem.2c00563> .
Maintained by Dinesh Barupal. Last updated 2 years ago.
exposomeisotopic-distribution-calculationslipidomicsmass-spectrometrymetabolomicsmolecular-formulae-assignmentsmall-molecule
2.0 match 8 stars 4.08 score 1 scripts 1 dependentscurso-r
scryr:An Interface to the 'Scryfall' API
A simple, light, and robust interface between R and the 'Scryfall' card data API <https://scryfall.com/docs/api>.
Maintained by Caio Lente. Last updated 3 years ago.
1.3 match 17 stars 6.09 score 18 scriptscoolbutuseless
flexo:Simple Tools for Lexing/Parsing Text Data
Simple tools for lexing/parsing text data.
Maintained by mikefc. Last updated 3 years ago.
1.9 match 7 stars 4.32 score 8 scriptsguokai8
fctutils:Advanced Factor Manipulation Utilities
Provides a collection of utility functions for manipulating and analyzing factor vectors in R. It offers tools for filtering, splitting, combining, and reordering factor levels based on various criteria. The package is designed to simplify common tasks in categorical data analysis, making it easier to work with factors in a flexible and efficient manner.
Maintained by Kai Guo. Last updated 5 months ago.
1.7 match 2 stars 4.60 score 4 scriptsgiocomai
castarter:Content Analysis Starter Toolkit
Consistent approaches for basic web scraping, text mining and word frequency analysis of textual datasets
Maintained by Giorgio Comai. Last updated 11 hours ago.
1.7 match 3 stars 4.52 score 2 scriptsrichierocks
rebus.datetimes:Date and Time Extensions for the 'rebus' Package
Build regular expressions piece by piece using human readable code. This package contains date and time functionality, and is primarily intended to be used by package developers.
Maintained by Richard Cotton. Last updated 2 years ago.
2.0 match 1 stars 3.78 score 4 scripts 4 dependentsagdamsbo
stRoke:Clinical Stroke Research
A collection of tools for clinical trial data management and analysis in research and teaching. The package is mainly collected for personal use, but any use beyond that is encouraged. This package has migrated functions from 'agdamsbo/daDoctoR', and new functions has been added. Version follows months and year. See NEWS/Changelog for release notes. This package includes sampled data from the TALOS trial (Kraglund et al (2018) <doi:10.1161/STROKEAHA.117.020067>). The win_prob() function is based on work by Zou et al (2022) <doi:10.1161/STROKEAHA.121.037744>. The age_calc() function is based on work by Becker (2020) <doi:10.18637/jss.v093.i02>.
Maintained by Andreas Gammelgaard Damsbo. Last updated 5 months ago.
1.8 match 4.18 score 7 scriptstin900
vvauditor:Creates Assertion Tests
Offers a comprehensive set of assertion tests to help users validate the integrity of their data. These tests can be used to check for specific conditions or properties within a dataset and help ensure that data is accurate and reliable. The package is designed to make it easy to add quality control checks to data analysis workflows and to aid in identifying and correcting any errors or inconsistencies in data.
Maintained by Tomer Iwan. Last updated 1 months ago.
1.8 match 4.03 score 7 scriptsidslme
IDSL.FSA:Fragmentation Spectra Analysis (FSA)
The 'IDSL.FSA' package was designed to annotate standard .msp (mass spectra format) and .mgf (Mascot generic format) files using mass spectral entropy similarity, dot product (cosine) similarity, and normalized Euclidean mass error (NEME) followed by intelligent pre-filtering steps for rapid spectra searches. 'IDSL.FSA' also provides a number of modules to convert and manipulate .msp and .mgf files. The 'IDSL.FSA' workflow was integrated in the 'IDSL.CSA' and 'IDSL.NPA' packages introduced in <doi:10.1021/acs.analchem.3c00376>.
Maintained by Dinesh Barupal. Last updated 7 months ago.
fragmentation-spectramass-spectrometrymassbankmgfmgf-parsermspmsp-parserspectral-entropy
2.0 match 1 stars 3.48 score 2 dependentsabhi-1u
rebib:Convert and Aggregate Bibliographies
Authors working with 'LaTeX' articles use the built-in bibliography options and 'BibTeX' files. While this might work with 'LaTeX', it does not function well with Web articles. As a way out, 'rebib' offers tools to convert and combine bibliographies from both sources.
Maintained by Abhishek Ulayil. Last updated 5 months ago.
1.2 match 1 stars 5.68 score 9 scripts 1 dependentsskranz
stringtools:Tools for working with strings in R
Tools for working with strings in R
Maintained by Sebastian Kranz. Last updated 3 years ago.
1.8 match 2 stars 3.66 score 29 scripts 26 dependentsuweblock
lazysql:Lazy SQL Programming
Helper functions to build SQL statements under program control for dbGetQuery, dbSendQuery, sqldf, etc. They are intended to increase speed of coding and to reduce coding errors. Arguments are carefully checked, in particular SQL identifiers such as names of tables or columns. More patterns will be added as required.
Maintained by Uwe Block. Last updated 9 years ago.
1.8 match 7 stars 3.54 score 6 scriptsgregfrog
WebAnalytics:Web Server Log Analysis
Provides Apache and IIS log analytics for transaction performance, client populations and workload definitions.
Maintained by Greg Hunt. Last updated 1 years ago.
1.6 match 3.78 score 12 scriptsmasterclm
mclm:Mastering Corpus Linguistics Methods
Read, inspect and process corpus files for quantitative corpus linguistics. Obtain concordances via regular expressions, tokenize texts, and compute frequencies and association measures. Useful for collocation analysis, keywords analysis and variationist studies (comparison of linguistic variants and of linguistic varieties).
Maintained by Mariana Montes. Last updated 2 years ago.
1.9 match 1 stars 3.24 score 35 scriptstheogrost
NUSS:Mixed N-Grams and Unigram Sequence Segmentation
Segmentation of short text sequences - like hashtags - into the separated words sequence, done with the use of dictionary, which may be built on custom corpus of texts. Unigram dictionary is used to find most probable sequence, and n-grams approach is used to determine possible segmentation given the text corpus.
Maintained by Oskar Kosch. Last updated 8 months ago.
1.8 match 3.00 score 8 scriptsidslme
IDSL.SUFA:Simplified UFA
A simplified version of the 'IDSL.UFA' package to calculate isotopic profiles and adduct formulas from molecular formulas with no dependency on other R packages for online tools and educational mass spectrometry courses. The 'IDSL.SUFA' package also provides an ancillary module to process user-defined adduct formulas.
Maintained by Dinesh Barupal. Last updated 2 years ago.
adduct-formulaisotopic-distribution-calculations
2.0 match 2.70 scorecb12991
expstudy:Tools for Actuarial Experience Studies
Experiences studies are an integral component of the actuarial control cycle. Regardless of the decrement or policyholder behavior of interest, the analyses conducted is often the same. Ultimately, this package aims to reduce time spent writing the same code used for different experience studies, therefore increasing the time for to uncover new insights inherit within the relevant experience.
Maintained by Cody Buehler. Last updated 9 months ago.
1.8 match 1 stars 2.70 score 7 scriptscran
PepMapViz:A Versatile Toolkit for Peptide Mapping, Visualization, and Comparative Exploration
A versatile R visualization package that empowers researchers with comprehensive visualization tools for seamlessly mapping peptides to protein sequences, identifying distinct domains and regions of interest, accentuating mutations, and highlighting post-translational modifications, all while enabling comparisons across diverse experimental conditions. Potential applications of 'PepMapViz' include the visualization of cross-software mass spectrometry results at the peptide level for specific protein and domain details in a linearized format and post-translational modification coverage across different experimental conditions; unraveling insights into disease mechanisms. It also enables visualization of major histocompatibility complex-presented peptides in different antibody regions predicting immunogenicity in antibody drug development.
Maintained by Zhenru Zhou. Last updated 4 months ago.
immunogenicitymassspectrometryproteomicspeptidomicssoftwarevisualization
1.8 match 2.70 scoredyfanjones
sagemaker.common:R6sagemaker lower level api calls
`R6sagemaker` lower level api calls.
Maintained by Dyfan Jones. Last updated 3 years ago.
amazon-sagemakerawssagemakersdk
1.7 match 2.78 score 4 dependentscoolbutuseless
c64asm:6502 Assembler
A simple 6502 assembler written purely in R and leveraging R data structures for pre-computing character sets and images.
Maintained by mikefc. Last updated 1 years ago.
1.8 match 3 stars 2.22 score 11 scriptscran
DCEtool:Efficient and Accessible Discrete Choice Experiments
Design, conduct and analyze 'DCEs' from a virtual interface in shiny. Reference: Perez-Troncoso, D. (2022) <https://github.com/danielpereztr/DCEtool>.
Maintained by Daniel Perez Troncoso. Last updated 2 years ago.
1.7 match 2 stars 2.30 score 3 scriptselipousson
isstatic:Dependency-Free Object Tests
Convenience functions for checking class inheritance, extracting attributes, basic type conversion, and miscellaneous string manipulation. working with sf, ggplot2, and other packages.
Maintained by Eli Pousson. Last updated 2 years ago.
1.8 match 3 stars 2.18 score 1 scriptscran
wdnr.gis:Pull Spatial Layers from 'WDNR ArcGIS REST API'
Functions for finding and pulling data from the 'Wisconsin Department of Natural Resources ArcGIS REST APIs' <https://dnrmaps.wi.gov/arcgis/rest/services> and <https://dnrmaps.wi.gov/arcgis2/rest/services>.
Maintained by Paul Frater. Last updated 1 years ago.
1.8 match 1 stars 2.00 score 3 scripts