Skip to content

Instantly share code, notes, and snippets.

@HunnyJummani
Created December 23, 2019 13:40
Show Gist options
  • Save HunnyJummani/458bfbc1968dd61161e85494211ddaa4 to your computer and use it in GitHub Desktop.
Save HunnyJummani/458bfbc1968dd61161e85494211ddaa4 to your computer and use it in GitHub Desktop.
Static Web Scraping- Kimurai ( Ruby on Rails )
class WebScrapper < Kimurai::Base
@name = "web_scrapper_spider"
@engine = :mechanize
@start_urls = ["https://metaruby.com/"]
@config = {
user_agent: "Chrome/68.0.3440.84"
}
def parse(response, url:, data: { })
blogs = []
response.xpath("//table[@class='topic-list']//tbody//tr").each do |tr|
scrapped_data = {
title: tr.at('td[1]//span').text,
category: tr.at('td[1]//div//span').text,
date: tr.at('td[3]').text.strip
}
blogs << scrapped_data
save_to "results.json", scrapped_data.as_json, format: :json
end
end
end
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment