Skip to content

Instantly share code, notes, and snippets.

@scrapehero-code
Last active June 18, 2019 12:47
Show Gist options
  • Save scrapehero-code/8e3f9bc0f77eccc354b909f0a586ed26 to your computer and use it in GitHub Desktop.
Save scrapehero-code/8e3f9bc0f77eccc354b909f0a586ed26 to your computer and use it in GitHub Desktop.
Extract product details such as product name, pricing, rating, reviews and more from Overstock.com using Web Scraper Chrome Extension and google chrome
{
"_id":"overstock_new",
"startUrl":[
"https://www.overstock.com/Home-Garden/Casual-Dinnerware/Gibson,/brand,/6451/subcat.html"
],
"selectors":[
{
"id":"product",
"type":"SelectorLink",
"parentSelectors":[
"_root",
"next"
],
"selector":"a.productCardLink",
"multiple":true,
"delay":0
},
{
"id":"next",
"type":"SelectorLink",
"parentSelectors":[
"_root",
"next"
],
"selector":"a.next",
"multiple":false,
"delay":0
},
{
"id":"product_name",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"h1",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"brand_name",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"#brand-name a",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"rating",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"span.overall-rating",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"review_count",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"p.product-info-review-count",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"price",
"type":"SelectorElementAttribute",
"parentSelectors":[
"product"
],
"selector":"span[class=\"monetary-price-value\"]",
"multiple":false,
"extractAttribute":"content",
"delay":0
},
{
"id":"currency",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"span.price-symbol",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"shipping",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"div.free-shipping-message",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"item_details",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"div.col-xs-6:nth-of-type(1) div.content-block",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"item_specifications",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"div:nth-of-type(2) div.toggle-content",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"category_breadcrumbs",
"type":"SelectorGroup",
"parentSelectors":[
"product"
],
"selector":".breadcrumbs span",
"delay":0,
"extractAttribute":""
}
]
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment