Skip to content

Instantly share code, notes, and snippets.

View majacaci00's full-sized avatar

Mario Carrillo majacaci00

View GitHub Profile
@majacaci00
majacaci00 / embedded_map.ipynb
Created November 8, 2017 18:59
embedded sf map
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@majacaci00
majacaci00 / indeed_spider.py
Created November 2, 2016 06:39
In class lab use this file in your "spiders" folder of a scrapy project. Make sure you set your "DOWNLOAD_DELAY" to 4 seconds while you're testing your spider. Remove the delay once you've debugged your spider and then let it fly. Please try to avoid running your crawling processes at full speed more than necessary!
## scrapy crawl indeed_base -o indeed_raw.json
# -*- coding: utf-8 -*-from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor
import scrapy
from indeed.items import IndeedItem
from scrapy.spiders import CrawlSpider, Rule
from bs4 import BeautifulSoup
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.