Showing 4 of 4 results
kasperwelbers
corpustools: Managing, Querying and Analyzing Tokenized Text
Provides text analysis in R, focusing on the use of a tokenized text format. In this format, the positions of tokens are maintained, and each token can be annotated (e.g., part-of-speech tags, dependency relations). Prominent features include advanced Lucene-like querying for specific tokens or contexts (e.g., documents, sentences), similarity statistics for words and documents, exporting to DTM for compatibility with many text analysis packages, and the possibility to reconstruct original text from tokens to facilitate interpretation.
Maintained by Kasper Welbers. Last updated 7 months ago.
31 stars 7.50 score 174 scripts 1 dependents
dwulff
text2sdg: Detecting UN Sustainable Development Goals in Text
The United Nations’ Sustainable Development Goals (SDGs) have become an important guideline for organisations to monitor and plan their contributions to social, economic, and environmental transformations. The 'text2sdg' package is an open-source analysis package that identifies SDGs in text using scientifically developed query systems, opening up the opportunity to monitor any type of text-based data, such as scientific output or corporate publications. For more information regarding the methodology see Meier, Mata & Wulff (2022) <arXiv:2110.05856>.
Maintained by Dominik S. Meier. Last updated 7 months ago.
natural-language-processing, sustainability, sustainable-development, sustainable-development-goals
18 stars 6.13 score 9 scripts
agusnieto77
ACEP: Análisis Computacional de Eventos de Protesta (Computational Analysis of Protest Events)
The 'ACEP' library provides functions for the computational analysis of protest events. It also includes databases with collections of notes on protests and dictionaries of conflict-related words. The dictionary collection brings together dictionaries from different sources.
Maintained by Agustín Nieto. Last updated 1 year ago.
computer-aided-detection, conflict-analysis, conflict-detection, dictionaries, nlp-keywords-extraction, protest-events, text-mining, visualization
10 stars 5.48 score 9 scripts
omstuhler
semgram: Extracting Semantic Motifs from Textual Data
A framework for extracting semantic motifs around entities in textual data. It implements an entity-centered semantic grammar that distinguishes six classes of motifs: actions of an entity, treatments of an entity, agents acting upon an entity, patients acted upon by an entity, characterizations of an entity, and possessions of an entity. Motifs are identified by applying a set of extraction rules to a parsed text object that includes part-of-speech tags and dependency annotations, such as those generated by 'spacyr'. For further reference, see Stuhler (2022) <doi:10.1177/00491241221099551>.
Maintained by Oscar Stuhler. Last updated 3 years ago.
24 stars 4.08 score 4 scripts