Showing 8 of total 8 results (show query)
salimk
Rcrawler:Web Crawler and Scraper
Performs parallel web crawling and web scraping. It is designed to crawl, parse and store web pages to produce data that can be directly used for analysis application. For details see Khalil and Fakir (2017) <DOI:10.1016/j.softx.2017.04.004>.
Maintained by Salim Khalil. Last updated 5 years ago.
crawlercrawlersscraperwebcrawlerwebscraperwebscrapingwebscrapping
30.0 match 354 stars 6.89 score 110 scriptsfeddelegrand7
ralger:Easy Web Scraping
The goal of 'ralger' is to facilitate web scraping in R.
Maintained by Mohamed El Fodil Ihaddaden. Last updated 8 months ago.
dataextractionwebcrawlingwebscraper-websitewebscraping
17.5 match 155 stars 7.41 score 33 scriptsropensci
robotstxt:A 'robots.txt' Parser and 'Webbot'/'Spider'/'Crawler' Permissions Checker
Provides functions to download and parse 'robots.txt' files. Ultimately the package makes it easy to check if bots (spiders, crawler, scrapers, ...) are allowed to access specific resources on a domain.
Maintained by Jordan Bradford. Last updated 4 months ago.
crawlerpeer-reviewedrobotstxtscraperspiderwebscraping
10.0 match 68 stars 10.43 score 414 scripts 7 dependentsropensci
webchem:Chemical Information from the Web
Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.
Maintained by Tamás Stirling. Last updated 3 months ago.
cas-numberchemical-informationchemspideridentifierropensciwebscraping
10.0 match 165 stars 10.31 score 173 scripts 10 dependentsdmi3kno
polite:Be Nice on the Web
Be responsible when scraping data from websites by following polite principles: introduce yourself, ask for permission, take slowly and never ask twice.
Maintained by Dmytro Perepolkin. Last updated 2 years ago.
crawlermemoiserate-limiterrobotstxtrvestscraperwebscraping
10.0 match 327 stars 8.98 score 596 scripts 5 dependentsvinhdizzo
IRexamples:Collection of Practical Institutional Research Examples and Tutorials
Provides examples of code for analyzing data or accomplishing tasks that may be useful to institutional or educational researchers.
Maintained by Vinh Nguyen. Last updated 2 years ago.
9.0 match 4 stars 5.00 score 4 scriptsmatt-dray
altcheckr:Assess Image Alt Text on a Web Page
Scrape image element attributes from a webpage, detect alternative (alt) text and assess it with simple heuristics. Alt text is important for users of assistive technologies, like screen readers, for understanding the content of images. This package should be used in conjunction with other accessibility assessment tools for more comprehensive coverage.
Maintained by Matt Dray. Last updated 4 years ago.
accessibilityalt-textwebscraping
10.0 match 7 stars 3.54 score 6 scripts