scrape

Scrape content off the internet, quickly.

This is a simple tool for downloading files from a straightforwardly constructed HTML page. Supply an XPath expression with --xpath to narrow down what gets scraped.
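To illustrate how an XPath expression narrows a scrape, here is a minimal sketch that uses only the Python standard library. It is not scrape's own implementation: the function name find_links, the sample markup, and the example.com URLs are all hypothetical. It also assumes the page is well-formed XML, because xml.etree.ElementTree supports only a subset of XPath and cannot parse arbitrary HTML.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin


def find_links(markup, base_url, xpath=".//a", ext=None):
    """Return absolute URLs from the href of each element matched by xpath.

    Hypothetical helper for illustration; assumes well-formed markup.
    """
    root = ET.fromstring(markup)
    urls = []
    for element in root.findall(xpath):
        href = element.get("href")
        if href is None:
            continue  # matched element carries no link
        # Optional extension filter, accepting "mp4" or ".mp4"
        if ext and not href.lower().endswith("." + ext.lower().lstrip(".")):
            continue
        # Resolve relative links against the page URL
        urls.append(urljoin(base_url, href))
    return urls


# Made-up page: two downloads inside a div, plus an unrelated link
PAGE = """<html><body>
<div id="downloads">
<a href="files/sample_640x360.mp4">mp4</a>
<a href="files/sample_640x360.webm">webm</a>
</div>
<a href="/about.html">About</a>
</body></html>"""

links = find_links(
    PAGE,
    "https://example.com/videos/",
    xpath=".//div[@id='downloads']/a",
    ext="mp4",
)
print(links)
```

Without the XPath and extension filters, the same call would also pick up the .webm file and the unrelated About link, which is the kind of noise the narrowing is meant to avoid.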

Resource files for testing scrape:

sample_1280x720

sample_1920x1080

sample_2560x1440

sample_3840x2160

sample_640x360

sample_960x400_ocean_with_audio

sample_960x540

sample_640x360 (mp4)

Usage (Auto-Generated)

usage: scrape [-h] [--xpath XPATH] [--ext EXT] [-o DIRNAME]
              {simple-link-scraper,simple-image-scraper} page

Scrape content off the internet, quickly.

positional arguments:
  {simple-link-scraper,simple-image-scraper}
                        The type of content to be scraped.
  page                  Webpage url.

options:
  -h, --help            show this help message and exit
  --xpath XPATH         Optionally specify the XPath
  --ext EXT             Optionally specify the file extension
  -o DIRNAME, --output DIRNAME
                        Output directory.
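
The help text above has the shape argparse produces. As a rough sketch, a parser along the following lines would accept the same arguments. This is a reconstruction from the usage message, not the tool's actual source: the destination names scraper and dirname are assumptions, and so are the example URL and XPath. Whether --xpath should select elements or attributes is not stated in the help text.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented usage (reconstruction only)."""
    parser = argparse.ArgumentParser(
        prog="scrape",
        description="Scrape content off the internet, quickly.",
    )
    # "scraper" is an assumed destination name; the help shows only the choices
    parser.add_argument(
        "scraper",
        choices=["simple-link-scraper", "simple-image-scraper"],
        help="The type of content to be scraped.",
    )
    parser.add_argument("page", help="Webpage url.")
    parser.add_argument("--xpath", help="Optionally specify the XPath")
    parser.add_argument("--ext", help="Optionally specify the file extension")
    parser.add_argument(
        "-o", "--output",
        dest="dirname",  # assumed from the DIRNAME metavar in the help text
        metavar="DIRNAME",
        help="Output directory.",
    )
    return parser


# Hypothetical invocation: scrape PNG images from a gallery into ./images
args = build_parser().parse_args([
    "simple-image-scraper",
    "https://example.com/gallery",
    "--xpath", "//div[@id='gallery']//img",
    "--ext", "png",
    "-o", "images",
])
print(args.scraper, args.page, args.xpath, args.ext, args.dirname)
```

The argument list passed to parse_args corresponds to running scrape with the same tokens on the command line, scraper type and page URL first, then the optional flags.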