Skip to content

Instantly share code, notes, and snippets.

@victusfate
Forked from max-mapper/chicago-inspections.rb
Created December 5, 2010 18:18
Show Gist options
  • Save victusfate/729315 to your computer and use it in GitHub Desktop.
Save victusfate/729315 to your computer and use it in GitHub Desktop.
require 'net/http'
require 'uri'
require 'nokogiri'
url = URI.parse('http://webapps.cityofchicago.org/healthinspection/inspectionresultrow.jsp')
request = Net::HTTP::Post.new(url.path)
request.set_form_data({"REST"=>" ", "STR_NBR"=>"", "STR_NBR2"=>"", "STR_DIRECTION"=>"", "STR_NM"=>"", "ZIP"=>""})
response = Net::HTTP.new(url.host, url.port).start {|http| http.request(request)}
doc = Nokogiri::HTML.parse(response.body)
doc.search('#results tr').each do |tr|
info = tr.text.each_line.map(&:strip).delete_if {|l| l.strip == ""}.join('\n')
ScraperWiki.save(unique_keys=['text'], data = {'text' => info})
end
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment