Showing 4 of total 4 results (show query)
junhewk
RcppMeCab:'rcpp' Wrapper for 'mecab' Library
R package based on 'Rcpp' for 'MeCab': Yet Another Part-of-Speech and Morphological Analyzer. The purpose of this package is providing a seamless developing and analyzing environment for CJK texts. This package utilizes parallel programming for providing highly efficient text preprocessing 'posParallel()' function. For installation, please refer to README.md file.
Maintained by Junhewk Kim. Last updated 7 months ago.
11.5 match 25 stars 5.30 score 40 scriptsyixuan
showtext:Using Fonts More Easily in R Graphs
Making it easy to use various types of fonts ('TrueType', 'OpenType', Type 1, web fonts, etc.) in R graphs, and supporting most output formats of R graphics including PNG, PDF and SVG. Text glyphs will be converted into polygons or raster images, hence after the plot has been created, it no longer relies on the font files. No external software such as 'Ghostscript' is needed to use this package.
Maintained by Yixuan Qiu. Last updated 1 years ago.
fontgraphicsgraphics-devicer-graphicsfreetype
1.5 match 487 stars 12.92 score 10k scripts 35 dependentsdschuhmacher
kanjistat:A Statistical Framework for the Analysis of Japanese Kanji Characters
Various tools and data sets that support the study of kanji, including their morphology, decomposition and concepts of distance and similarity between them.
Maintained by Dominic Schuhmacher. Last updated 10 months ago.
1.8 match 4 stars 4.90 score 6 scriptsmacmillancontentscience
piecemaker:Tools for Preparing Text for Tokenizers
Tokenizers break text into pieces that are more usable by machine learning models. Many tokenizers share some preparation steps. This package provides those shared steps, along with a simple tokenizer.
Maintained by Jon Harmon. Last updated 2 years ago.
1.8 match 3.48 score 6 scripts 2 dependents