Use wget to download / scrape a full website



********************************************
Using wget to download/scrape an entire site
********************************************

Resources: https://www.gnu.org/software/wget/manual/wget.html

Example 1: Basic Example
wget https://docs.scrapy.org/en/latest/

Example 2: Basic Template

wget \
  --recursive \
  --no-clobber \
  --page-requisites \
  --html-extension \
  --convert-links \
  --restrict-file-names=unix \
  --domains docs.scrapy.org \
  --no-parent \
  https://docs.scrapy.org/en/latest/
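
The template is easier to adapt once each option's purpose is spelled out. Below is a minimal sketch that wraps the same options in a shell function with one comment per flag. The function name mirror_site and the WGET override variable are hypothetical, introduced here only for illustration; the option descriptions paraphrase the wget manual.

```shell
# Hypothetical helper: builds the basic template's option list one flag at a time.
# Set WGET=echo to print the arguments instead of downloading anything.
mirror_site() {
    url=$1
    set -- --recursive                       # follow links found in downloaded pages
    set -- "$@" --no-clobber                 # skip files that already exist locally
    set -- "$@" --page-requisites            # also fetch images, CSS, and scripts each page needs
    set -- "$@" --html-extension             # save pages with an .html suffix (older name for --adjust-extension)
    set -- "$@" --convert-links              # rewrite links so the copy can be browsed offline
    set -- "$@" --restrict-file-names=unix   # escape characters that are awkward in Unix file names
    set -- "$@" --domains=docs.scrapy.org    # only follow links within this domain
    set -- "$@" --no-parent                  # never ascend above the starting directory
    ${WGET:-wget} "$@" "$url"
}
```

Calling the function as WGET=echo mirror_site https://docs.scrapy.org/en/latest/ prints the assembled arguments as a dry run; leaving out WGET=echo performs the actual download.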

Example 3: More Advanced Template
wget \
  --recursive \
  --no-clobber \
  --page-requisites \
  --html-extension \
  --convert-links \
  --restrict-file-names=unix \
  --domains docs.scrapy.org \
  --wait=5 \
  --limit-rate=20K \
  --user-agent=Mozilla \
  --level=10 \
  --output-file /home/melvin/test/activity.txt \
  --no-parent \
  https://docs.scrapy.org/en/latest/ &
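
Because this command runs in the background and writes its messages to a log file, the log is the only place to see progress. A rough sketch for checking it follows. The function name count_saved is hypothetical, and it assumes English log messages in which every finished file produces a line containing "saved [", which may differ between wget versions and locales.

```shell
# Hypothetical helper: count completed downloads recorded in a wget log.
# Assumes each finished file is logged on a line containing "saved ["
# (for example: ... - 'index.html' saved [1234/1234]).
count_saved() {
    grep -c ' saved \[' "$1"
}
```

For the run above, count_saved /home/melvin/test/activity.txt would report how many files have been written so far, and tail -f on the same file follows the log live.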

Other options
--random-wait (varies the pause between requests from 0.5 to 1.5 times the --wait value)