Skip to content

Instantly share code, notes, and snippets.

View gabldotink's full-sized avatar
👓

Gabriel “gabldotink” gabldotink

👓
View GitHub Profile
@gabldotink
gabldotink / urllist
Last active June 5, 2023 20:37
bash function to get a plaintext list of URLs with wget — recursive, no parent
#!/usr/bin/env bash
# urllist | source: https://gist.github.com/gabldotink/91a749a83dbc75af71e720999d0bbfd6
# usage: urllist url > file.csv
wget --spider --no-parent --mirror \
--force-html -e robots=off "${1}" 2>&1 \
| grep '^--' | awk '{print $3}' | awk '!seen[$0]++'