Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save scrapehero-code/e666dcda7594e0a88abeb873cda2fc75 to your computer and use it in GitHub Desktop.
Save scrapehero-code/e666dcda7594e0a88abeb873cda2fc75 to your computer and use it in GitHub Desktop.
Scraper to extract details from Wayfair.com such as product name, seller, rating, reviews, price and more using Web Scraper Chrome Extension and Google Chrome
{
"_id": "wayfair",
"startUrl": [
"https://www.wayfair.com/outdoor/sb0/hammocks-with-stands-c1864031.html"
],
"selectors": [
{
"id": "links",
"type": "SelectorLink",
"parentSelectors": [
"_root",
"paginate"
],
"selector": "a.ProductCardReviews",
"multiple": true,
"delay": 0
},
{
"id": "paginate",
"type": "SelectorLink",
"parentSelectors": [
"_root",
"paginate"
],
"selector": "a.Pagination-icon--next",
"multiple": false,
"delay": 0
},
{
"id": "product_name",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": "h1",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "sold_by",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".ProductDetailInfoBlock-header-manu a",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "sale_price",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".StandardPriceBlock > div.BasePriceBlock",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "original_price",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".SuggestedRetailPrice span",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "features",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".ProductOverviewInformation-content div:nth-of-type(2) ul",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "no_of_reviews",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".ProductDetailInfoBlock-header-rating p.pl-ReviewStars-reviews",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "rating",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": "p[class=\"pl-VisuallyHidden\"]",
"multiple": false,
"regex": "",
"delay": 0
},
{
"id": "product_details",
"type": "SelectorText",
"parentSelectors": [
"links"
],
"selector": ".ProductOverviewInformation-content div:nth-of-type(3) ul",
"multiple": false,
"regex": "",
"delay": 0
}
]
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment