Offline mirror with wget
From Noah.org
Jump to: navigation, search
This downloads the given document and all parts it needs to be viewed offline. The number set by —cut-dirs must match the number of parent directories in the URL (dir1 and dir2).
wget —mirror —no-parent —page-requisites —convert-links —no-host-directories —cut-dirs=2 —directory-prefix=OUTPUT_DIR http://www.example.org/dir1/dir2/index.html
—mirror : Mirror is equivalent to “-r -N -l inf —no-remove-listing” (basically, infinite recursion).
—no-parent : Do not follow links that ascend to the parent directory. Only follow links that are under the given URL.
—page-requisites : Download all page requisites necessary to display the page (images, CSS, javascript, etc.).
—convert-links : Convert links in the pages so that they work locally relative to the OUTPUT_DIR.
—no-host-directories : Don’t create host name directories.
—cut-dirs=n : Remove n directories from the path of the URL.
—directory-prefix=
wget -r -nc -l 10 --no-remove-listing -w 1 -p --no-host-directories --convert-links --no-parent --html-extension --directory-prefix=~/projects/www http://www.example.pl