Skip to content

Instantly share code, notes, and snippets.

@scrapehero
Created October 25, 2018 13:17
Show Gist options
  • Save scrapehero/7b69057643b7dab78ca6770398531f96 to your computer and use it in GitHub Desktop.
Save scrapehero/7b69057643b7dab78ca6770398531f96 to your computer and use it in GitHub Desktop.
JSON sitemap for scraping the amazon bestseller list using Web Scraper Chrome Extension
{
"_id":"amazon_bestsellers",
"startUrl":[
"https://www.amazon.com/Best-Sellers-Pet-Supplies/zgbs/pet-supplies/ref=zg_bs_nav_0"
],
"selectors":[
{
"id":"product",
"type":"SelectorElement",
"parentSelectors":[
"_root",
"next"
],
"selector":"li.zg-item-immersion",
"multiple":true,
"delay":0
},
{
"id":"next",
"type":"SelectorLink",
"parentSelectors":[
"_root",
"next"
],
"selector":"li.a-last a",
"multiple":false,
"delay":0
},
{
"id":"rank",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"span.zg-badge-text",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"name",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"div.p13n-sc-truncated",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"ratings",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"a.a-size-small",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"rating",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"span.a-icon-alt",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"price",
"type":"SelectorText",
"parentSelectors":[
"product"
],
"selector":"span.p13n-sc-price",
"multiple":false,
"regex":"",
"delay":0
},
{
"id":"image",
"type":"SelectorImage",
"parentSelectors":[
"product"
],
"selector":"img",
"multiple":false,
"delay":0
}
]
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment