Created
October 31, 2011 09:55
-
-
Save deepak/1327210 to your computer and use it in GitHub Desktop.
httrack to mirror a website along with links on another domain
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#/usr/bin/env sh | |
# httrack to mirror a website along with links on another domain | |
# http://pomozok.wordpress.com/2011/01/19/ians-linker-notes/ has a bunch of links on what is a compiler | |
# these links are on another website domain. | |
# httrack by default will download only the links on the same domain | |
# passing a +<url> option will donload the other links | |
# also passes a -r2 option to restrict the depth, we only want the directly linked files not the whole domain | |
# passing -W runs it in wizard mode, it prompts for other domains it finds and you can then selectively download them | |
# disadvantage is that it will require user interaction ie. not scriptable | |
httrack http://pomozok.wordpress.com/2011/01/19/ians-linker-notes/ -O "/Users/deepak/websites/ian's linker notes" -%v -r2 http://www.airs.com/blog/* |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment