Showing 1 of total 1 results (show query)
ropensci
rtika:R Interface to 'Apache Tika'
Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
Maintained by Sasha Goodman. Last updated 2 years ago.
extract-metadataextract-textjavaparsepdf-filespeer-reviewedtesseracttika
55 stars 6.00 score 12 scripts