Skip to content

Instantly share code, notes, and snippets.

@ringods
Created January 26, 2017 21:10
Show Gist options
  • Save ringods/3ac30de8ee28fadefa318ae857b365c5 to your computer and use it in GitHub Desktop.
Save ringods/3ac30de8ee28fadefa318ae857b365c5 to your computer and use it in GitHub Desktop.
Delicious Bookmark Scraping
{
"_id": "delicious",
"selectors": [
{
"delay": "",
"id": "per_page_bookmarks_enumeration",
"multiple": false,
"parentSelectors": [
"_root"
],
"selector": "div.profileMidpanel",
"type": "SelectorElement"
},
{
"delay": "",
"id": "single_bookmark_data",
"multiple": true,
"parentSelectors": [
"per_page_bookmarks_enumeration"
],
"selector": "div.articleThumbBlockOuter",
"type": "SelectorElement"
},
{
"delay": "",
"id": "bookmark_title",
"multiple": false,
"parentSelectors": [
"single_bookmark_data"
],
"regex": "",
"selector": "a.title",
"type": "SelectorText"
},
{
"delay": "",
"id": "bookmark_link",
"multiple": false,
"parentSelectors": [
"single_bookmark_data"
],
"regex": "",
"selector": "div.articleInfoPan p:nth-of-type(1)",
"type": "SelectorText"
},
{
"delay": "",
"extractAttribute": "a",
"id": "tag",
"parentSelectors": [
"single_bookmark_data"
],
"selector": "ul.tagName li",
"type": "SelectorGroup"
},
{
"delay": "",
"id": "description",
"multiple": false,
"parentSelectors": [
"single_bookmark_data"
],
"regex": "",
"selector": "div.thumbTBriefTxt p:nth-of-type(2)",
"type": "SelectorText"
}
],
"startUrl": [
"https://del.icio.us/<your_account_id>",
"https://del.icio.us/<your_account_id>?&page=[2-<your_maximum_page>]"
]
}
@jezmck
Copy link

jezmck commented Apr 23, 2017

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment