Skip to content

Instantly share code, notes, and snippets.

@yshalsager
Last active March 18, 2022 13:21
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save yshalsager/dbb516cab80afa7753054267db801cb9 to your computer and use it in GitHub Desktop.
Save yshalsager/dbb516cab80afa7753054267db801cb9 to your computer and use it in GitHub Desktop.
#!/bin/bash
# Scrapes mp3 files links from audio.islamweb.com links into csv data: url, filename
function islamweb_get_mp3() {
URL=$1
PAGES="${2:-1}"
for page in $(seq 1 $PAGES); do
case $URL in
*pageno=*)
URL=$(echo $URL | sed "s|pageno=[0-9]\+|pageno=$page|g");;
*)
URL=$(echo $URL | sed "s|page=|pageno=$page\&page=|g");;
esac
echo "$page" && echo "$URL"
curl -s "$URL" | grep -Po "<a url='(https.*mp3)' title_data=\"(.*)\" id_data=\"" | sed -e "s|<a url='||g" -e "s|' title_data=\"|,|g" -e 's|" id_data="|.mp3|g';
done
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment