R-universe search: modelselection

Showing 14 of total 14 results (show query)

jonasrieger

ldaPrototype:Prototype of Multiple Latent Dirichlet Allocation Runs

Determine a Prototype from a number of runs of Latent Dirichlet Allocation (LDA) measuring its similarities with S-CLOP: A procedure to select the LDA run with highest mean pairwise similarity, which is measured by S-CLOP (Similarity of multiple sets by Clustering with Local Pruning), to all other runs. LDA runs are specified by its assignments leading to estimators for distribution parameters. Repeated runs lead to different results, which we encounter by choosing the most representative LDA run as prototype.

Maintained by Jonas Rieger. Last updated 2 years ago.

latent-dirichlet-allocation lda model-selection modelselection reliability text-mining textdata topic-model topic-models topic-similarities topicmodeling topicmodelling

10.0 match 8 stars 4.44 score 23 scripts 1 dependents

hmorlon

RPANDA:Phylogenetic ANalyses of DiversificAtion

Implements macroevolutionary analyses on phylogenetic trees. See Morlon et al. (2010) <DOI:10.1371/journal.pbio.1000493>, Morlon et al. (2011) <DOI:10.1073/pnas.1102543108>, Condamine et al. (2013) <DOI:10.1111/ele.12062>, Morlon et al. (2014) <DOI:10.1111/ele.12251>, Manceau et al. (2015) <DOI:10.1111/ele.12415>, Lewitus & Morlon (2016) <DOI:10.1093/sysbio/syv116>, Drury et al. (2016) <DOI:10.1093/sysbio/syw020>, Manceau et al. (2016) <DOI:10.1093/sysbio/syw115>, Morlon et al. (2016) <DOI:10.1111/2041-210X.12526>, Clavel & Morlon (2017) <DOI:10.1073/pnas.1606868114>, Drury et al. (2017) <DOI:10.1093/sysbio/syx079>, Lewitus & Morlon (2017) <DOI:10.1093/sysbio/syx095>, Drury et al. (2018) <DOI:10.1371/journal.pbio.2003563>, Clavel et al. (2019) <DOI:10.1093/sysbio/syy045>, Maliet et al. (2019) <DOI:10.1038/s41559-019-0908-0>, Billaud et al. (2019) <DOI:10.1093/sysbio/syz057>, Lewitus et al. (2019) <DOI:10.1093/sysbio/syz061>, Aristide & Morlon (2019) <DOI:10.1111/ele.13385>, Maliet et al. (2020) <DOI:10.1111/ele.13592>, Drury et al. (2021) <DOI:10.1371/journal.pbio.3001270>, Perez-Lamarque & Morlon (2022) <DOI:10.1111/mec.16478>, Perez-Lamarque et al. (2022) <DOI:10.1101/2021.08.30.458192>, Mazet et al. (2023) <DOI:10.1111/2041-210X.14195>, Drury et al. (2024) <DOI:10.1016/j.cub.2023.12.055>.

Maintained by Hélène Morlon. Last updated 2 months ago.

5.0 match 24 stars 8.50 score 255 scripts

tomasfryda

h2o:R Interface for the 'H2O' Scalable Machine Learning Platform

R interface for 'H2O', the scalable open source machine learning platform that offers parallelized implementations of many supervised and unsupervised machine learning algorithms such as Generalized Linear Models (GLM), Gradient Boosting Machines (including XGBoost), Random Forests, Deep Neural Networks (Deep Learning), Stacked Ensembles, Naive Bayes, Generalized Additive Models (GAM), ANOVA GLM, Cox Proportional Hazards, K-Means, PCA, ModelSelection, Word2Vec, as well as a fully automatic machine learning algorithm (H2O AutoML).

Maintained by Tomas Fryda. Last updated 1 years ago.

4.3 match 3 stars 8.20 score 7.8k scripts 11 dependents

emkayoh

Dark:The Analysis of Dark Adaptation Data

The recovery of visual sensitivity in a dark environment is known as dark adaptation. In a clinical or research setting the recovery is typically measured after a dazzling flash of light and can be described by the Mahroo, Lamb and Pugh (MLP) model of dark adaptation. The functions in this package take dark adaptation data and use nonlinear regression to find the parameters of the model that 'best' describe the data. They do this by firstly, generating rapid initial objective estimates of data adaptation parameters, then a multi-start algorithm is used to reduce the possibility of a local minimum. There is also a bootstrap method to calculate parameter confidence intervals. The functions rely upon a 'dark' list or object. This object is created as the first step in the workflow and parts of the object are updated as it is processed.

Maintained by Jeremiah MF Kelly. Last updated 2 months ago.

6.6 match 5.18 score 30 scripts

davidrusi

mombf:Model Selection with Bayesian Methods and Information Criteria

Model selection and averaging for regression and mixtures, inclusing Bayesian model selection and information criteria (BIC, EBIC, AIC, GIC).

Maintained by David Rossell. Last updated 1 months ago.

openblas cpp openmp

3.0 match 7 stars 7.89 score 73 scripts 1 dependents

mpierrejean

jointseg:Joint Segmentation of Multivariate (Copy Number) Signals

Methods for fast segmentation of multivariate signals into piecewise constant profiles and for generating realistic copy-number profiles. A typical application is the joint segmentation of total DNA copy numbers and allelic ratios obtained from Single Nucleotide Polymorphism (SNP) microarrays in cancer studies. The methods are described in Pierre-Jean, Rigaill and Neuvial (2015) <doi:10.1093/bib/bbu026>.

Maintained by Morgane Pierre-Jean. Last updated 6 years ago.

cpp

3.0 match 6 stars 6.50 score 44 scripts 2 dependents

tdhock

penaltyLearning:Penalty Learning

Implementations of algorithms from Learning Sparse Penalties for Change-point Detection using Max Margin Interval Regression, by Hocking, Rigaill, Vert, Bach <http://proceedings.mlr.press/v28/hocking13.html> published in proceedings of ICML2013.

Maintained by Toby Dylan Hocking. Last updated 6 months ago.

cpp

3.0 match 16 stars 6.13 score 129 scripts 2 dependents

aagillet

MorphoRegions:Analysis of Regionalization Patterns in Serially Homologous Structures

Computes the optimal number of regions (or subdivisions) and their position in serial structures without a priori assumptions and to visualize the results. After reducing data dimensionality with the built-in function for data ordination, regions are fitted as segmented linear regressions along the serial structure. Every region boundary position and increasing number of regions are iteratively fitted and the best model (number of regions and boundary positions) is selected with an information criterion. This package expands on the previous 'regions' package (Jones et al. (2018) <doi:10.1126/science.aar3126>) with improved computation and more fitting and plotting options.

Maintained by Amandine Gillet. Last updated 4 months ago.

3.3 match 4.30 score 6 scripts

bioc

INSPEcT:Modeling RNA synthesis, processing and degradation with RNA-seq data

INSPEcT (INference of Synthesis, Processing and dEgradation rates from Transcriptomic data) RNA-seq data in time-course experiments or steady-state conditions, with or without the support of nascent RNA data.

Maintained by Stefano de Pretis. Last updated 5 months ago.

sequencing rnaseq generegulation timecourse systemsbiology

3.0 match 4.38 score 9 scripts

haghish

autoEnsemble:Automated Stacked Ensemble Classifier for Severe Class Imbalance

An AutoML algorithm is developed to construct homogeneous or heterogeneous stacked ensemble models using specified base-learners. Various criteria are employed to identify optimal models, enhancing diversity among them and resulting in more robust stacked ensembles. The algorithm optimizes the model by incorporating an increasing number of top-performing models to create a diverse combination. Presently, only models from 'h2o.ai' are supported.

Maintained by E. F. Haghish. Last updated 12 hours ago.

ai algorithm automated-machine-learning automl automl-algorithms ensemble ensemble-learning h2o h2oai machine-learning machinelearning metalearning stack-ensemble stacked-ensembles stacking

3.0 match 5 stars 4.20 score 21 scripts

bioc

STATegRa:Classes and methods for multi-omics data integration

Classes and tools for multi-omics data integration.

Maintained by David Gomez-Cabrero. Last updated 5 months ago.

software statisticalmethod clustering dimensionreduction principalcomponent

3.0 match 4.15 score 3 scripts

vsousa

poolABC:Approximate Bayesian Computation with Pooled Sequencing Data

Provides functions to simulate Pool-seq data under models of demographic formation and to import Pool-seq data from real populations. Implements two ABC algorithms for performing parameter estimation and model selection using Pool-seq data. Cross-validation can also be performed to assess the accuracy of ABC estimates and model choice. Carvalho et al., (2022) <doi:10.1111/1755-0998.13834>.

Maintained by João Carvalho. Last updated 2 years ago.

3.3 match 1 stars 3.70 score 3 scripts

bioc

DaMiRseq:Data Mining for RNA-seq data: normalization, feature selection and classification

The DaMiRseq package offers a tidy pipeline of data mining procedures to identify transcriptional biomarkers and exploit them for both binary and multi-class classification purposes. The package accepts any kind of data presented as a table of raw counts and allows including both continous and factorial variables that occur with the experimental setting. A series of functions enable the user to clean up the data by filtering genomic features and samples, to adjust data by identifying and removing the unwanted source of variation (i.e. batches and confounding factors) and to select the best predictors for modeling. Finally, a "stacking" ensemble learning technique is applied to build a robust classification model. Every step includes a checkpoint that the user may exploit to assess the effects of data management by looking at diagnostic plots, such as clustering and heatmaps, RLE boxplots, MDS or correlation plot.

Maintained by Mattia Chiesa. Last updated 5 months ago.

sequencing rnaseq classification immunooncology openjdk

2.3 match 5.32 score 7 scripts 1 dependents

hhhelfer

HCmodelSets:Regression with a Large Number of Potential Explanatory Variables

Software for performing the reduction, exploratory and model selection phases of the procedure proposed by Cox, D.R. and Battey, H.S. (2017) <doi:10.1073/pnas.1703764114> for sparse regression when the number of potential explanatory variables far exceeds the sample size. The software supports linear regression, likelihood-based fitting of generalized linear regression models and the proportional hazards model fitted by partial likelihood.

Maintained by H. Battey. Last updated 2 years ago.

2.3 match 2 stars 4.00 score 5 scripts