Showing 11 of total 11 results (show query)
tidyverse
dplyr:A Grammar of Data Manipulation
A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
Maintained by Hadley Wickham. Last updated 26 days ago.
4.8k stars 24.68 score 659k scripts 7.8k dependentstidyverse
tibble:Simple Data Frames
Provides a 'tbl_df' class (the 'tibble') with stricter checking and better formatting than the traditional data frame.
Maintained by Kirill Müller. Last updated 9 days ago.
693 stars 22.85 score 47k scripts 11k dependentsigraph
igraph:Network Analysis and Visualization
Routines for simple graphs and network analysis. It can handle large graphs very well and provides functions for generating random and regular graphs, graph visualization, centrality methods and much more.
Maintained by Kirill Müller. Last updated 5 days ago.
complex-networksgraph-algorithmsgraph-theorymathematicsnetwork-analysisnetwork-graphfortranlibxml2glpkopenblascpp
584 stars 21.13 score 31k scripts 1.9k dependentscolearendt
tidyjson:Tidy Complex 'JSON'
Turn complex 'JSON' data into tidy data frames.
Maintained by Cole Arendt. Last updated 2 years ago.
192 stars 10.64 score 522 scripts 7 dependentsropensci
taxa:Classes for Storing and Manipulating Taxonomic Data
Provides classes for storing and manipulating taxonomic data. Most of the classes can be treated like base R vectors (e.g. can be used in tables as columns and can be named). Vectorized classes can store taxon names and authorities, taxon IDs from databases, taxon ranks, and other types of information. More complex classes are provided to store taxonomic trees and user-defined data associated with them.
Maintained by Zachary Foster. Last updated 1 years ago.
taxonomybiologyhierarchydata-cleaningtaxon
47 stars 6.79 score 217 scriptsdfriend21
quadtree:Region Quadtrees for Spatial Data
Provides functionality for working with raster-like quadtrees (also called “region quadtrees”), which allow for variable-sized cells. The package allows for flexibility in the quadtree creation process. Several functions defining how to split and aggregate cells are provided, and custom functions can be written for both of these processes. In addition, quadtrees can be created using other quadtrees as “templates”, so that the new quadtree's structure is identical to the template quadtree. The package also includes functionality for modifying quadtrees, querying values, saving quadtrees to a file, and calculating least-cost paths using the quadtree as a resistance surface.
Maintained by Derek Friend. Last updated 2 years ago.
19 stars 6.34 score 58 scriptsbioc
structToolbox:Data processing & analysis tools for Metabolomics and other omics
An extensive set of data (pre-)processing and analysis methods and tools for metabolomics and other omics, with a strong emphasis on statistics and machine learning. This toolbox allows the user to build extensive and standardised workflows for data analysis. The methods and tools have been implemented using class-based templates provided by the struct (Statistics in R Using Class-based Templates) package. The toolbox includes pre-processing methods (e.g. signal drift and batch correction, normalisation, missing value imputation and scaling), univariate (e.g. ttest, various forms of ANOVA, Kruskal–Wallis test and more) and multivariate statistical methods (e.g. PCA and PLS, including cross-validation and permutation testing) as well as machine learning methods (e.g. Support Vector Machines). The STATistics Ontology (STATO) has been integrated and implemented to provide standardised definitions for the different methods, inputs and outputs.
Maintained by Gavin Rhys Lloyd. Last updated 1 months ago.
workflowstepmetabolomicsbioconductor-packagedimslc-msmachine-learningmultivariate-analysisstatisticsunivariate
10 stars 6.26 score 12 scriptsbioc
struct:Statistics in R Using Class-based Templates
Defines and includes a set of class-based templates for developing and implementing data processing and analysis workflows, with a strong emphasis on statistics and machine learning. The templates can be used and where needed extended to 'wrap' tools and methods from other packages into a common standardised structure to allow for effective and fast integration. Model objects can be combined into sequences, and sequences nested in iterators using overloaded operators to simplify and improve readability of the code. Ontology lookup has been integrated and implemented to provide standardised definitions for methods, inputs and outputs wrapped using the class-based templates.
Maintained by Gavin Rhys Lloyd. Last updated 5 months ago.
5.91 score 76 scripts 3 dependentscole-brokamp
fr:Frictionless Standards
A "tabular-data-resource" (<https://specs.frictionlessdata.io/tabular-data-resource/>) is a simple format to describe a singular tabular data resource such as a CSV file. It includes support both for metadata such as author and title and a schema to describe the data, for example the types of the fields/columns in the data. Create a tabular-data-resource by providing a data.frame and specifying metadata. Write and read tabular-data-resources to and from disk.
Maintained by Cole Brokamp. Last updated 5 months ago.
3 stars 5.28 score 63 scriptsnetcoupler
NetCoupler:Inference of Causal Links Between a Network and an External Variable
The 'NetCoupler' algorithm identifies potential direct effects of correlated, high-dimensional variables formed as a network with an external variable. The external variable may act as the dependent/response variable or as an independent/predictor variable to the network.
Maintained by Luke Johnston. Last updated 1 years ago.
6 stars 4.78 score 7 scriptsmasterclm
mclm:Mastering Corpus Linguistics Methods
Read, inspect and process corpus files for quantitative corpus linguistics. Obtain concordances via regular expressions, tokenize texts, and compute frequencies and association measures. Useful for collocation analysis, keywords analysis and variationist studies (comparison of linguistic variants and of linguistic varieties).
Maintained by Mariana Montes. Last updated 2 years ago.
1 stars 3.24 score 35 scripts