Showing 17 of 17 results

kurthornik

NLP: Natural Language Processing Infrastructure

Basic classes and methods for Natural Language Processing.

Maintained by Kurt Hornik. Last updated 4 months ago.

6 stars 9.42 score 1.0k scripts 127 dependents

bioc

ViSEAGO: ViSEAGO: a Bioconductor package for clustering biological functions using Gene Ontology and semantic similarity

The main objective of the ViSEAGO package is to carry out data mining of biological functions and establish links between genes involved in the study. We developed ViSEAGO in R to facilitate functional Gene Ontology (GO) analysis of complex experimental designs with multiple comparisons of interest. It allows users to study large-scale datasets together and visualize GO profiles to capture biological knowledge. The acronym stands for three major concepts of the analysis: Visualization, Semantic similarity and Enrichment Analysis of Gene Ontology. It provides access to the latest GO annotations, which are retrieved from one of the NCBI EntrezGene, Ensembl or Uniprot databases for several species. Using available R packages and novel developments, ViSEAGO extends classical functional GO analysis to focus on functional coherence by aggregating closely related biological themes while studying multiple datasets at once. It provides both a synthetic and a detailed view, using interactive functionalities that respect the GO graph structure and ensure the functional coherence supplied by semantic similarity. ViSEAGO has been successfully applied to several datasets from different species with a variety of biological questions. Results can be easily shared between bioinformaticians and biologists, enhancing reporting capabilities while maintaining reproducibility.

Maintained by Aurelien Brionne. Last updated 3 months ago.

software annotation go genesetenrichment multiplecomparison clustering visualization

6.64 score 22 scripts

bioc

geneXtendeR: Optimized Functional Annotation Of ChIP-seq Data

geneXtendeR optimizes the functional annotation of ChIP-seq peaks by exploring relative differences in annotating ChIP-seq peak sets to variable-length gene bodies. In contrast to prior techniques, geneXtendeR considers peak annotations beyond just the closest gene, allowing users to see peak summary statistics for the first-closest gene, second-closest gene, ..., n-closest gene whilst ranking the output according to biologically relevant events and iteratively comparing the fidelity of peak-to-gene overlap across a user-defined range of upstream and downstream extensions on the original boundaries of each gene's coordinates. Since different ChIP-seq peak callers produce different differentially enriched peaks with a large variance in peak length distribution and total peak count, annotating peak lists with their nearest genes can often be a noisy process. As such, the goal of geneXtendeR is to robustly link differentially enriched peaks with their respective genes, thereby aiding experimental follow-up and validation in designing primers for a set of prospective gene candidates during qPCR.

Maintained by Bohdan Khomtchouk. Last updated 5 months ago.

chipseq genetics annotation genomeannotation differentialpeakcalling coverage peakdetection chiponchip histonemodification dataimport naturallanguageprocessing visualization go software bioconductor bioinformatics c chip-seq computational-biology epigenetics functional-annotation

9 stars 3.95 score 5 scripts

daniel-jg

IntervalSurgeon: Operating on Integer-Bounded Intervals

Manipulate integer-bounded intervals including finding overlaps, piling and merging.

Maintained by Daniel Greene. Last updated 1 year ago.

cpp

2.73 score 18 scripts 1 dependents