R-universe search: exports:as_data

Showing 11 of total 11 results (show query)

tidyverse

dplyr:A Grammar of Data Manipulation

A fast, consistent tool for working with data frame like objects, both in memory and out of memory.

Maintained by Hadley Wickham. Last updated 26 days ago.

data-manipulation grammar cpp

4.8k stars 24.68 score 659k scripts 7.8k dependents

tidyverse

tibble:Simple Data Frames

Provides a 'tbl_df' class (the 'tibble') with stricter checking and better formatting than the traditional data frame.

Maintained by Kirill Müller. Last updated 9 days ago.

tidy-data

693 stars 22.85 score 47k scripts 11k dependents

igraph

igraph:Network Analysis and Visualization

Routines for simple graphs and network analysis. It can handle large graphs very well and provides functions for generating random and regular graphs, graph visualization, centrality methods and much more.

Maintained by Kirill Müller. Last updated 5 days ago.

complex-networks graph-algorithms graph-theory mathematics network-analysis network-graph fortran libxml2 glpk openblas cpp

584 stars 21.13 score 31k scripts 1.9k dependents

colearendt

tidyjson:Tidy Complex 'JSON'

Turn complex 'JSON' data into tidy data frames.

Maintained by Cole Arendt. Last updated 2 years ago.

192 stars 10.64 score 522 scripts 7 dependents

ropensci

taxa:Classes for Storing and Manipulating Taxonomic Data

Provides classes for storing and manipulating taxonomic data. Most of the classes can be treated like base R vectors (e.g. can be used in tables as columns and can be named). Vectorized classes can store taxon names and authorities, taxon IDs from databases, taxon ranks, and other types of information. More complex classes are provided to store taxonomic trees and user-defined data associated with them.

Maintained by Zachary Foster. Last updated 1 years ago.

taxonomy biology hierarchy data-cleaning taxon

47 stars 6.79 score 217 scripts

dfriend21

quadtree:Region Quadtrees for Spatial Data

Provides functionality for working with raster-like quadtrees (also called “region quadtrees”), which allow for variable-sized cells. The package allows for flexibility in the quadtree creation process. Several functions defining how to split and aggregate cells are provided, and custom functions can be written for both of these processes. In addition, quadtrees can be created using other quadtrees as “templates”, so that the new quadtree's structure is identical to the template quadtree. The package also includes functionality for modifying quadtrees, querying values, saving quadtrees to a file, and calculating least-cost paths using the quadtree as a resistance surface.

Maintained by Derek Friend. Last updated 2 years ago.

cpp

19 stars 6.34 score 58 scripts

bioc

structToolbox:Data processing & analysis tools for Metabolomics and other omics

An extensive set of data (pre-)processing and analysis methods and tools for metabolomics and other omics, with a strong emphasis on statistics and machine learning. This toolbox allows the user to build extensive and standardised workflows for data analysis. The methods and tools have been implemented using class-based templates provided by the struct (Statistics in R Using Class-based Templates) package. The toolbox includes pre-processing methods (e.g. signal drift and batch correction, normalisation, missing value imputation and scaling), univariate (e.g. ttest, various forms of ANOVA, Kruskal–Wallis test and more) and multivariate statistical methods (e.g. PCA and PLS, including cross-validation and permutation testing) as well as machine learning methods (e.g. Support Vector Machines). The STATistics Ontology (STATO) has been integrated and implemented to provide standardised definitions for the different methods, inputs and outputs.

Maintained by Gavin Rhys Lloyd. Last updated 1 months ago.

workflowstep metabolomics bioconductor-package dims lc-ms machine-learning multivariate-analysis statistics univariate

10 stars 6.26 score 12 scripts

bioc

struct:Statistics in R Using Class-based Templates

Defines and includes a set of class-based templates for developing and implementing data processing and analysis workflows, with a strong emphasis on statistics and machine learning. The templates can be used and where needed extended to 'wrap' tools and methods from other packages into a common standardised structure to allow for effective and fast integration. Model objects can be combined into sequences, and sequences nested in iterators using overloaded operators to simplify and improve readability of the code. Ontology lookup has been integrated and implemented to provide standardised definitions for methods, inputs and outputs wrapped using the class-based templates.

Maintained by Gavin Rhys Lloyd. Last updated 5 months ago.

workflowstep

5.91 score 76 scripts 3 dependents

cole-brokamp

fr:Frictionless Standards

A "tabular-data-resource" (<https://specs.frictionlessdata.io/tabular-data-resource/>) is a simple format to describe a singular tabular data resource such as a CSV file. It includes support both for metadata such as author and title and a schema to describe the data, for example the types of the fields/columns in the data. Create a tabular-data-resource by providing a data.frame and specifying metadata. Write and read tabular-data-resources to and from disk.

Maintained by Cole Brokamp. Last updated 5 months ago.

3 stars 5.28 score 63 scripts

netcoupler

NetCoupler:Inference of Causal Links Between a Network and an External Variable

The 'NetCoupler' algorithm identifies potential direct effects of correlated, high-dimensional variables formed as a network with an external variable. The external variable may act as the dependent/response variable or as an independent/predictor variable to the network.

Maintained by Luke Johnston. Last updated 1 years ago.

6 stars 4.78 score 7 scripts

masterclm

mclm:Mastering Corpus Linguistics Methods

Read, inspect and process corpus files for quantitative corpus linguistics. Obtain concordances via regular expressions, tokenize texts, and compute frequencies and association measures. Useful for collocation analysis, keywords analysis and variationist studies (comparison of linguistic variants and of linguistic varieties).

Maintained by Mariana Montes. Last updated 2 years ago.

corpus linguistics cpp

1 stars 3.24 score 35 scripts