@JaimeObregon
Created January 15, 2020 11:41
Downloads the index of Telediario newscasts (RTVE) and concatenates the pages into one large file, pages.html
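#!/bin/bash
# Make sure the output directory exists before wget writes into it.
mkdir -p downloads/html
# Fetch each of the 638 index pages (pageSize=15 entries per page).
for page in $(seq 1 638); do
    wget "https://www.rtve.es/alacarta/interno/contenttable.shtml?pbq=$page&orderCriteria=DESC&modl=TOC&locale=es&pageSize=15&ctx=45030&typeFilter=39816&advSearchOpen=false" \
        -O "downloads/html/page.$page.html"
done
# Note: the glob expands in lexical order (page.1, page.10, page.100, ...), not numeric order.
cat downloads/html/page.*.html > downloads/html/pages.html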
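The final `cat` step relies on shell glob expansion, which sorts file names lexically, so `page.10.html` lands before `page.2.html` in the combined file. If the order of entries in `pages.html` matters, a minimal sketch of a numeric-order concatenation could look like the following. The function name `concat_pages` and its two arguments are hypothetical, introduced here only for illustration; they are not part of the original script.

```shell
#!/bin/bash
# Hypothetical helper (not in the original gist): concatenate
# page.1.html .. page.N.html in numeric order into DIR/pages.html.
# Usage: concat_pages DIR LAST_PAGE
concat_pages() {
    local dir=$1 last=$2 page
    : > "$dir/pages.html"    # start from an empty output file
    for page in $(seq 1 "$last"); do
        # Append pages one by one so the order follows the page number.
        cat "$dir/page.$page.html" >> "$dir/pages.html"
    done
}
```

With the layout used above, this would be called as `concat_pages downloads/html 638` once all pages have been downloaded.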