Skip to content

Instantly share code, notes, and snippets.

View Seraph2000's full-sized avatar

Seraphina Anderson Seraph2000

View GitHub Profile
@Seraph2000
Seraph2000 / gujarat_spider.py
Created September 17, 2018 11:26
Extension of a scrapy spider, which automates the process of [i] downloading pdfs from website, [ii] converting pdfs to plaintext, [iii] extracting info using regular expressions, [iv] collates info into individuals csv files, [v]is able to load data to AWS server
# -*- coding: utf-8 -*-
import scrapy
import os
import re
import json
import csv
import time
import sys
import glob
import shutil
@Seraph2000
Seraph2000 / items.json
Last active December 18, 2015 15:40
Oxygen Crawler
[,
,
{"code": "Heather-Mini-Dress", "name": "For Love and Lemons Heather Mini Dress", "gbp_price": 240.0, "sale_discount": "None", "gender": "F", "stock_status": ["XS - Available", "S - Available", "M - Sold Out", "L - Sold Out"], "clothing_type": "A", "usd_price": "360.00", "designer": "For Love and Lemons", "link": "http://www.oxygenboutique.com/Heather-Mini-Dress.aspx", "raw_color": "red", "images": ["http://www.oxygenboutique.com/GetImage/cT0xMDAmdz04MDAmaD02MDAmUEltZz1mYjkzZGM1OS0xNTBlLTQyZTAtYmU3OS00NzM0MjBiZGRiNjYuanBn0.jpg", "http://www.oxygenboutique.com/GetImage/cT0xMDAmdz04MDAmaD02MDAmUEltZz1kY2EwYTNmNS1jMjhhLTQxOGYtOTc0My1hNmEyNDM4MDgwMTMuanBn0.jpg"], "eur_price": "328.80", "description": "RELATED. Heather Mini Dress by for Love and Lemons. This gorgeous, warm red dress was just made for the festive season! Featuring sheer puff sleeves and velvet embroidery, we guarantee you'll fall in love with this dress!.Heather Mini Dress by for Love and Lemons."},
,
,
,
,
,
,
,