HTTrack

From Encoresoup - The Ultimate Guide to Free/Open Source Software

Jump to: navigation, search
This article contains content from the Wikipedia article:
HTTrack
history contributors
HTTrack is a free and open source website copier and offline browser by Xavier Roche, licensed under the GNU General Public License. It allows one to download World Wide Web sites from the Internet to a local computer. By default, HTTrack arranges the downloaded site by the original site's relative link-structure. The downloaded (or "mirrored") website can be browsed by opening a page of the site in a browser.

HTTrack can also update an existing mirrored site and resume interrupted downloads. HTTrack is fully configurable by options and by filters (include/exclude), and has an integrated help system. There is a basic command line version and two GUI versions (WinHTTrack and WebHTrack); the former can be part of scripts and cron jobs.

HTTrack uses a web crawler to download a website. Some parts of the website may not be downloaded by default due to the robots exclusion protocol unless disabled during the program. HTTrack can follow links that are generated with basic JavaScript and inside Applets or Flash, but not complex links (generated using functions or expressions) or server-side image maps.

[edit] See also

  • Robots Exclusion Standard
  • Web crawler

[edit] External links



Personal tools

Pico USB Flash Drive (8Gb) [ThinkGeek] Media Tank - HD and DVD Enclosure + card reader [ThinkGeek]Just For Fun [Amazon]