Showing 18 of total 18 results (show query)
rmhogervorst
imdb:Download IMDB Series Information Into a Dataframe
Download movie and series information using the open movie database http://www.omdbapi.com and return tidy results. To pull relevant information from the API you have to get a key from the omdb website.
Maintained by Roel M. Hogervorst. Last updated 7 years ago.
66.8 match 11 stars 2.92 score 15 scriptserictleung
pixarfilms:Pixar Films and Achievements
Data about Disney Pixar films provided by Wikipedia. This package contains data about the films, the people involved, and their awards.
Maintained by Eric Leung. Last updated 13 hours ago.
datadata-sciencedatapackagedisneyimdbimdb-datasetpixarpixar-filmsweb-scrapingwikipedia
20.7 match 20 stars 7.42 score 23 scripts 1 dependentsdselivanov
text2vec:Modern Text Mining Framework for R
Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. This package provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.
Maintained by Dmitriy Selivanov. Last updated 7 months ago.
glovelatent-dirichlet-allocationnatural-language-processingtext-miningtopic-modelingvectorizationword-embeddingsword2veccpp
4.0 match 860 stars 13.48 score 1.3k scripts 23 dependentsindrajeetpatil
ggstatsplot:'ggplot2' Based Plots with Statistical Details
Extension of 'ggplot2', 'ggstatsplot' creates graphics with details from statistical tests included in the plots themselves. It provides an easier syntax to generate information-rich plots for statistical analysis of continuous (violin plots, scatterplots, histograms, dot plots, dot-and-whisker plots) or categorical (pie and bar charts) data. Currently, it supports the most common types of statistical approaches and tests: parametric, nonparametric, robust, and Bayesian versions of t-test/ANOVA, correlation analyses, contingency table analysis, meta-analysis, and regression analyses. References: Patil (2021) <doi:10.21105/joss.03236>.
Maintained by Indrajeet Patil. Last updated 19 days ago.
bayes-factorsdatasciencedatavizeffect-sizeggplot-extensionhypothesis-testingnon-parametric-statisticsregression-modelsstatistical-analysis
3.4 match 2.1k stars 14.49 score 3.0k scripts 1 dependentsindrajeetpatil
statsExpressions:Tidy Dataframes and Expressions with Statistical Details
Utilities for producing dataframes with rich details for the most common types of statistical approaches and tests: parametric, nonparametric, robust, and Bayesian t-test, one-way ANOVA, correlation analyses, contingency table analyses, and meta-analyses. The functions are pipe-friendly and provide a consistent syntax to work with tidy data. These dataframes additionally contain expressions with statistical details, and can be used in graphing packages. This package also forms the statistical processing backend for 'ggstatsplot'. References: Patil (2021) <doi:10.21105/joss.03236>.
Maintained by Indrajeet Patil. Last updated 19 days ago.
bayesian-inferencebayesian-statisticscontingency-tablecorrelationeffectsizemeta-analysisparametricrobustrobust-statisticsstatistical-detailsstatistical-teststidy
3.6 match 312 stars 10.97 score 146 scripts 2 dependentsinsileco
omdbr:Retrieve Movies Metadata from OMDb Database
This package retrieves movies metadata (title, actors, year, genres, etc.) using the OMDb API <http://www.omdbapi.com/> (an API key is required). Cover can also be downloaded from the YTS API <https://yts.mx/api> (no API key required).
Maintained by Nicolas Casajus. Last updated 1 years ago.
coverimdbmetadatamoviesomdb-apiyts-api
12.9 match 4 stars 2.30 score 2 scriptsrstudio
keras3:R Interface to 'Keras'
Interface to 'Keras' <https://keras.io>, a high-level neural networks API. 'Keras' was developed with a focus on enabling fast experimentation, supports both convolution based networks and recurrent networks (as well as combinations of the two), and runs seamlessly on both CPU and GPU devices.
Maintained by Tomasz Kalinowski. Last updated 4 days ago.
1.8 match 845 stars 13.57 score 264 scripts 2 dependentsfcharte
mldr.datasets:R Ultimate Multilabel Dataset Repository
Large collection of multilabel datasets along with the functions needed to export them to several formats, to make partitions, and to obtain bibliographic information.
Maintained by David Charte. Last updated 6 years ago.
5.1 match 8 stars 4.68 score 120 scriptseagerai
fastai:Interface to 'fastai'
The 'fastai' <https://docs.fast.ai/index.html> library simplifies training fast and accurate neural networks using modern best practices. It is based on research in to deep learning best practices undertaken at 'fast.ai', including 'out of the box' support for vision, text, tabular, audio, time series, and collaborative filtering models.
Maintained by Turgut Abdullayev. Last updated 11 months ago.
audiocollaborative-filteringdarknetdarknet-image-classificationfastaimedicalobject-detectiontabulartextvision
2.3 match 118 stars 9.40 score 76 scriptshadley
ggplot2movies:Movies Data
A dataset about movies. This was previously contained in ggplot2, but has been moved its own package to reduce the download size of ggplot2.
Maintained by Hadley Wickham. Last updated 10 years ago.
3.5 match 9 stars 5.92 score 556 scriptst-kalinowski
keras:R Interface to 'Keras'
Interface to 'Keras' <https://keras.io>, a high-level neural networks 'API'. 'Keras' was developed with a focus on enabling fast experimentation, supports both convolution based networks and recurrent networks (as well as combinations of the two), and runs seamlessly on both 'CPU' and 'GPU' devices.
Maintained by Tomasz Kalinowski. Last updated 11 months ago.
1.8 match 10.82 score 10k scripts 54 dependentsimpaug
FilmsGmooG:IMDb Film Ratings from the Summer of 2022
Average rating and number of votes reported by IMDb for films and shorts with over 100 votes in 2022. The data are analysed in Chapter 3 of the Book 'Getting (more out of) Graphics' (Antony Unwin, CRC Press 2024).
Maintained by Antony Unwin. Last updated 8 months ago.
10.3 match 1.70 scoreemilhvitfeldt
textdata:Download and Load Various Text Datasets
Provides a framework to download, parse, and store text datasets on the disk and load them when needed. Includes various sentiment lexicons and labeled text data sets for classification and analysis.
Maintained by Emil Hvitfeldt. Last updated 10 months ago.
1.8 match 75 stars 9.66 score 1.4k scripts 1 dependentssfeuerriegel
SentimentAnalysis:Dictionary-Based Sentiment Analysis
Performs a sentiment analysis of textual contents in R. This implementation utilizes various existing dictionaries, such as Harvard IV, or finance-specific dictionaries. Furthermore, it can also create customized dictionaries. The latter uses LASSO regularization as a statistical approach to select relevant terms based on an exogenous response variable.
Maintained by Nicolas Proellochs. Last updated 2 years ago.
2.0 match 149 stars 8.34 score 242 scripts 1 dependentsamalan-constat
SouthParkRshiny:Data and 'Shiny' Application for the Show 'SouthPark'
Ratings, votes, swear words and sentiments are analysed for the show 'SouthPark' through a 'Shiny' application after web scraping from 'IMDB' and the website <https://southpark.fandom.com/wiki/South_Park_Archives>.
Maintained by Amalan Mahendran. Last updated 1 years ago.
5.5 match 1 stars 2.70 scoremlverse
torchdatasets:Ready to Use Extra Datasets for Torch
Provides datasets in a format that can be easily consumed by torch 'dataloaders'. Handles data downloading from multiple sources, caching and pre-processing so users can focus only on their model implementations.
Maintained by Daniel Falbel. Last updated 5 days ago.
1.8 match 15 stars 5.65 score 99 scriptscran
imdbapi:Get Movie, Television Data from the 'imdb' Database
Provides API access to the <http://imdbapi.net> which maintains metadata about movies, games and television shows through a public API.
Maintained by Yuan Li. Last updated 7 years ago.
4.7 match 1.00 score 9 scriptsxuwenzhu20
MatTransMix:Clustering with Matrix Gaussian and Matrix Transformation Mixture Models
Provides matrix Gaussian mixture models, matrix transformation mixture models and their model-based clustering results. The parsimonious models of the mean matrices and variance covariance matrices are implemented with a total of 196 variations. For more information, please check: Xuwen Zhu, Shuchismita Sarkar, and Volodymyr Melnykov (2021), "MatTransMix: an R package for matrix model-based clustering and parsimonious mixture modeling", <doi:10.1007/s00357-021-09401-9>.
Maintained by Xuwen Zhu. Last updated 2 months ago.
4.5 match 1.00 score