Skip to content

Instantly share code, notes, and snippets.

@bartolsthoorn
Created July 23, 2012 12:38
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save bartolsthoorn/d272861d88d80a7bf232 to your computer and use it in GitHub Desktop.
Save bartolsthoorn/d272861d88d80a7bf232 to your computer and use it in GitHub Desktop.
Scraper class (for jobs)
#!/usr/bin/env rake
require "resque/tasks"
task 'resque:setup' do
require_relative 'scraper.rb'
end
require 'resque'
require 'resque/plugins/lock'
class Scraper
extend Resque::Plugins::Lock
@queue = :url_serve
#def self.lock(host, url)
# host
#end
def self.perform(host, url)
File.open('scraper.log', 'a') do |f|
f.write(Time.now.to_s+" "+url+"\n")
end
return url
end
end
require_relative 'scraper.rb'
requests = [
{
:host => "bartolsthoorn.nl",
:url => "http://bartolsthoorn.nl"
},
{
:host => "bartlab.nl",
:url => "http://bartlab.nl"
},
{
:host => "bartolsthoorn.nl",
:url => "http://bartolsthoorn.nl/contact"
},
{
:host => "bartlab.nl",
:url => "http://bartlab.nl/contact"
}
]
#Does not work:
#Resque.redis = 'localhost:6379'
requests.each do |request|
#Equivalent: Scraper.perform request[:host], request[:url]
Resque.enqueue(Scraper, request[:host], request[:url])
end
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment