httrack Command: Examples, Options, and Usage

常用示例

Mirror a website

httrack [https://example.com]

Mirror a website

httrack [https://example.com] -O [path/to/output]

Mirror a website

httrack [https://example.com] -r[3]

Continue an interrupted download

httrack --continue

Update an existing mirror

httrack --update

Mirror a website

httrack [https://example.com] -P [proxy:8080]

Mirror only specific file types

httrack [https://example.com] +*.pdf +*.jpg -*

Limit download speed

httrack [https://example.com] -A[102400]

说明

httrack is a website copier that downloads websites to a local directory for offline browsing. It preserves the original site structure, converting links to work locally. The mirrored site can be browsed offline using any web browser. The tool follows links to specified depths, downloads files, and reconstructs relative paths. It supports HTTP and HTTPS protocols, authentication, cookies, and proxy servers. Filters control which files are downloaded using wildcard patterns. HTTrack can update previously mirrored sites, downloading only changed files. It handles interrupted downloads gracefully with the continue option. The webhttrack command provides a browser-based graphical interface.

参数

-O, --path _PATH_: Output/project path
-w, --mirror: Mirror websites (default mode)
-W, --mirror-wizard: Mirror websites with interactive wizard
-g, --get-files: Get files without mirroring structure
-i, --continue: Continue an interrupted download
-r _N_, --depth _N_: Set link depth limit (default: unlimited for mirror mode)
-m _N_, --max-files _N_: Maximum number of files to download
-M _N_, --max-size _N_: Maximum total size in bytes
-E _N_, --max-time _N_: Maximum mirror time in seconds
-A _N_, --max-rate _N_: Maximum transfer rate (bytes/second)
-c _N_, --sockets _N_: Number of simultaneous connections
-T _N_, --timeout _N_: Connection timeout in seconds
-R _N_, --retries _N_: Number of retry attempts
-P, --proxy _HOST:PORT_: Use proxy server
-K _N_, --keep-links _N_: Keep original link format (0=relative, 2=absolute)
-x, --replace-external: Replace external links with error page
-n, --near: Get non-HTML files near links
-t, --test: Test links only, do not download
-q, --quiet: Quiet mode, no output
-v, --verbose: Verbose output
-s0, --robots=0: Ignore robots.txt
-F _STRING_: Set the User-Agent header for HTTP requests
-N _N_: Structure type for saved files naming convention
-%c _N_: Maximum number of connections per second

FAQ

What is the httrack command used for?

httrack is a website copier that downloads websites to a local directory for offline browsing. It preserves the original site structure, converting links to work locally. The mirrored site can be browsed offline using any web browser. The tool follows links to specified depths, downloads files, and reconstructs relative paths. It supports HTTP and HTTPS protocols, authentication, cookies, and proxy servers. Filters control which files are downloaded using wildcard patterns. HTTrack can update previously mirrored sites, downloading only changed files. It handles interrupted downloads gracefully with the continue option. The webhttrack command provides a browser-based graphical interface.

How do I run a basic httrack example?

Run `httrack [https://example.com]` in a terminal, then adjust file names, paths, flags, or remote targets for your system.

What does -O, --path _PATH_ do in httrack?

Output/project path

httrack 命令

常用示例

说明

参数

FAQ

相关命令