Last active
January 1, 2019 01:37
-
-
Save xpepper/5872399 to your computer and use it in GitHub Desktop.
This script will download all the published rubytapas. Each episode, with all the attachments, will be downloaded in a folder named after the episode number. For example, in the folder "64" will be downloaded:
064-yield-or-enumerate.html,
064-yield-or-enumerate.mp4, 064-yield-or-enumerate.rb You need to set your credentials (username and passwor…
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# username = "my_username" | |
# pwd = "my_password" | |
# target_path = "my_target_path" | |
# saving auth cookie | |
system %Q{wget --save-cookies /tmp/cookie.txt --keep-session-cookies --post-data "username=#{username}&password=#{pwd}" -O - \ | |
https://rubytapas.dpdcart.com/subscriber/login?__dpd_cart=d08391e6-5fe2-4400-8b27-2dc17b413027} | |
(25..600).each do |i| | |
%x[wget -q -S --spider --load-cookies /tmp/cookie.txt https://rubytapas.dpdcart.com/subscriber/download?file_id=#{i} > spider.log 2>&1] | |
head_log = File.read("spider.log") | |
if head_log =~ /Content-Disposition.*filename="(.+)\.(.+)"/ | |
name, extension = $1, $2 | |
puts "Processing #{name}.#{extension}..." | |
if name && extension && !name.empty? && !extension.empty? | |
if Dir.glob("#{target_path}/**/#{name}.#{extension}").size == 0 | |
puts "Downloading #{name}.#{extension}..." | |
filename = "#{name}.#{extension}" | |
if filename =~ /.*(\d{3}).*/ | |
target_folder = $1 | |
target_fullpath = "#{target_path}/#{target_folder}" | |
Dir.mkdir(target_fullpath) unless File.exist?(target_fullpath) | |
puts "*** wget --load-cookies /tmp/cookie.txt --output-document='#{target_path}/#{target_folder}/#{name}.#{extension}' https://rubytapas.dpdcart.com/subscriber/download?file_id=#{i}" | |
system "wget --load-cookies /tmp/cookie.txt --output-document='#{target_path}/#{target_folder}/#{name}.#{extension}' https://rubytapas.dpdcart.com/subscriber/download?file_id=#{i}" | |
end | |
end | |
end | |
end | |
end |
To anyone who wants downloads past episode 114, I would advise against this script. In order to get the latest episodes, I tweaked the max for the id range from 600 to 56000, but the files on rubytapas.dpdcart.com aren't in sequential order, so the brute force id scan wastes a lot of time trying to locate files on the server.
In case you’re looking to download the video files from the new RubyTapas 2.0, check out my downloader
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@kevinjonesevans I'm sorry, this script is really quick & dirty, so I didn't consider password with special chars... But I guess you should simply URL-encode the password before using it in the WGET request, isn't it?