Skip to content

Instantly share code, notes, and snippets.

View enrico-atzeni's full-sized avatar

Enrico Atzeni enrico-atzeni

View GitHub Profile
@enrico-atzeni
enrico-atzeni / crawl.sh
Last active March 11, 2023 14:36 — forked from antoineMoPa/crawl.sh
Bash Web Crawler
#!/bin/bash
# bash web crawler
# $ bash crawl.sh http://example.com 1
# NOTE: this version ignores static files, see in last pipe of the visit function
# TODO:
# 1. avoid loops by calling same urls ony once per entire script
# 2. do not use txt lists