@hanleybrand
Created March 17, 2015 13:47
Metadata crawl example for a future art metadata scraper idea. Source:
<a href="http://www.comicartfans.com/gallerypiece.asp?piece=1016448">SOURCE</a>
<table border="0" cellpadding="5" cellspacing="0" width="100%">
<tbody><tr>
<td width="40%" valign="top">
<h2 class="head">Artwork Details</h2>
<table width="100%" border="0" cellspacing="0" cellpadding="3">
<tbody><tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Title:</b></td>
<td><b>VEBER Les maisons sont des visages 1899</b></td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Artist: </b></td>
<td><b><a href="/searchresult.asp?txtsearch=Jean %20VEBER" rel="nofollow">Jean &nbsp;VEBER</a></b> (All)</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Media Type:</b></td>
<td>Mixed Media</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Art Type:</b></td>
<td><a href="/Comic_Art/Other.asp">Other</a></td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>For Sale Status: </b></td>
<td>
NFS
</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Views: </b></td>
<td>145</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Likes on CAF: </b></td>
<td style="line-height:14px;">
<div id="likecount-loadbottom">0</div>
<div id="likecount-addbottom" style="display:none;">1</div>
</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Comments: </b></td>
<td><a href="#Comments">0</a></td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Added to Site: </b></td>
<td>5/24/2013</td>
</tr>
<tr>
<td width="1%" style="height:18px; text-align:right" nowrap=""><b>Comic Art Archive: </b></td>
<td>
<div class="lc"></div>
</td>
</tr>
</tbody></table>
</td>
<td width="60%" valign="top">
<h2 class="head">Description</h2>
<span class="art-desc">Lithographie de 1899 d'après un tableau de l'auteur exposé au Salon la même année.Tirage 50 épreuves.<br><br>Le tableau eut un immense succès et reste une des plus célèbres oeuvres de Veber<br><br>C'est l'aboutissement d'un travail sur les façades d'abord publié sous forme de dessins humoristiques dans "Le Rire" en 1896.<br><br>Cette image influença plusieurs artistes comme Gus BOFA ou Maurice RADIGUET (image additionnelle).<br><br>Vous pouvez découvrir l'oeuvre extraordinaire de Jean VEBER sur son site "La Houille Rouge" à l'adresse jeanveber.com<br>Allez visiter!</span>
<h2 class="head">Social/Sharing</h2>
<div class="lc"></div>
<div class="share-split">
<div id="fb-root" class=" fb_reset"><div style="position: absolute; top: -10000px; height: 0px; width: 0px;"><div><iframe name="fb_xdm_frame_http" frameborder="0" allowtransparency="true" scrolling="no" id="fb_xdm_frame_http" aria-hidden="true" title="Facebook Cross Domain Communication Frame" tabindex="-1" src="http://static.ak.facebook.com/connect/xd_arbiter/6Dg4oLkBbYq.js?version=41#channel=f3677882b8&amp;origin=http%3A%2F%2Fwww.comicartfans.com" style="border: none;"></iframe><iframe name="fb_xdm_frame_https" frameborder="0" allowtransparency="true" scrolling="no" id="fb_xdm_frame_https" aria-hidden="true" title="Facebook Cross Domain Communication Frame" tabindex="-1" src="https://s-static.ak.facebook.com/connect/xd_arbiter/6Dg4oLkBbYq.js?version=41#channel=f3677882b8&amp;origin=http%3A%2F%2Fwww.comicartfans.com" style="border: none;"></iframe></div></div><div style="position: absolute; top: -10000px; height: 0px; width: 0px;"><div></div></div></div>
<div class="fb-like fb_iframe_widget" data-href="http://www.comicartfans.com/gallerypiece.asp?piece=1016448" data-send="false" data-layout="box_count" data-width="450" data-show-faces="false" fb-xfbml-state="rendered" fb-iframe-plugin-query="app_id=&amp;container_width=109&amp;href=http%3A%2F%2Fwww.comicartfans.com%2Fgallerypiece.asp%3Fpiece%3D1016448&amp;layout=box_count&amp;locale=en_US&amp;sdk=joey&amp;send=false&amp;show_faces=false&amp;width=450"><span style="vertical-align: bottom; width: 47px; height: 61px;"><iframe name="fdbb2bf28" width="450px" height="1000px" frameborder="0" allowtransparency="true" scrolling="no" title="fb:like Facebook Social Plugin" src="http://www.facebook.com/plugins/like.php?app_id=&amp;channel=http%3A%2F%2Fstatic.ak.facebook.com%2Fconnect%2Fxd_arbiter%2F6Dg4oLkBbYq.js%3Fversion%3D41%23cb%3Df3f90ebe0c%26domain%3Dwww.comicartfans.com%26origin%3Dhttp%253A%252F%252Fwww.comicartfans.com%252Ff3677882b8%26relation%3Dparent.parent&amp;container_width=109&amp;href=http%3A%2F%2Fwww.comicartfans.com%2Fgallerypiece.asp%3Fpiece%3D1016448&amp;layout=box_count&amp;locale=en_US&amp;sdk=joey&amp;send=false&amp;show_faces=false&amp;width=450" class="" style="border: none; visibility: visible; width: 47px; height: 61px;"></iframe></span></div>
</div>
<div class="share-split">
<div id="___plusone_0" style="text-indent: 0px; margin: 0px; padding: 0px; background-color: transparent; border-style: none; float: none; line-height: normal; font-size: 1px; vertical-align: baseline; display: inline-block; width: 50px; height: 60px; background-position: initial initial; background-repeat: initial initial;"><iframe frameborder="0" hspace="0" marginheight="0" marginwidth="0" scrolling="no" style="position: static; top: 0px; width: 50px; margin: 0px; border-style: none; left: 0px; visibility: visible; height: 60px;" tabindex="0" vspace="0" width="100%" id="I0_1426599771489" name="I0_1426599771489" src="https://apis.google.com/u/0/se/0/_/+1/fastbutton?usegapi=1&amp;size=tall&amp;origin=http%3A%2F%2Fwww.comicartfans.com&amp;url=http%3A%2F%2Fwww.comicartfans.com%2Fgallerypiece.asp%3Fpiece%3D1016448&amp;gsrc=3p&amp;ic=1&amp;jsh=m%3B%2F_%2Fscs%2Fapps-static%2F_%2Fjs%2Fk%3Doz.gapi.en.RgKP7RQuasM.O%2Fm%3D__features__%2Fam%3DIQ%2Frt%3Dj%2Fd%3D1%2Ft%3Dzcms%2Frs%3DAGLTcCOEO9Bkq7TwwzfqE01GsA5GDKwt7A#_methods=onPlusOne%2C_ready%2C_close%2C_open%2C_resizeMe%2C_renderstart%2Concircled%2Cdrefresh%2Cerefresh%2Conload&amp;id=I0_1426599771489&amp;parent=http%3A%2F%2Fwww.comicartfans.com&amp;pfname=&amp;rpctoken=55009928" data-gapiattached="true" title="+1"></iframe></div>
</div>
<div class="share-split" style="padding-top:18px;">
<br>
<a class="PIN_1426599771537_pin_it_button_20 PIN_1426599771537_pin_it_button_en_20_gray PIN_1426599771537_pin_it_button_inline_20 PIN_1426599771537_pin_it_above_20" data-pin-href="//www.pinterest.com/pin/create/button/?guid=dB8UhKPfXcyf-1&amp;url=http%3A%2F%2Fwww%2Ecomicartfans%2Ecom%2Fgallerypiece%2Easp%3Fpiece%3D1016448&amp;media=http%3A%2F%2Fcdn%2Ecomicartfans%2Ecom%2FImages%2FCategory%5F37430%2Fsubcat%5F78556%2FnPyzKxnB%5F1111140947421%2Ejpg&amp;description=VEBER+Les+maisons+sont+des+visages+1899" data-pin-log="button_pinit" data-pin-config="above"><span class="PIN_1426599771537_pin_it_button_count" id="PIN_1426599771537_pin_count_0"><i></i>0</span></a>
</div>
<div class="lc"></div>
<div class="share"><a href="javascript:DisplayOne('sharebox1');">Share this item on a blog/forum/website</a></div>
<div name="sharebox" id="sharebox1" style="display: none;">
<p align="center" style="font-size:11px">To share this item on your favorite blog, forum or website, just copy the html from the box below.</p>
<form name="share">
<textarea class="share-textarea" name="sharecode" onclick="select_all();">&lt;table cellpadding='5' style='width:250px; border: 1px solid #efefef; border-radius:5px; background-color: #fafafa; font-family: Tahoma, lucida, sans-serif;'&gt;
&lt;tr&gt;&lt;td&gt;&lt;a href='http://www.comicartfans.com' target='_blank'&gt;&lt;img src='http://www.comicartfans.com/lib/images/sharelogo.png' border='0' /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td style='text-align:center;'&gt;&lt;b&gt;&lt;a href='http://www.comicartfans.com/gallerypiece.asp?piece=1016448' target='_blank' style='color:#000000; font-size: 14px; line-height: 16px; text-decoration: none;'&gt;VEBER Les maisons sont des visages 1899&lt;/a&gt;&lt;/b&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style='padding-left: 27px;'&gt;&lt;div style='width: 200px; height: 200px; text-align:center; border-radius: 3px; overflow: hidden; position: relative;'&gt;&lt;a href='http://www.comicartfans.com/gallerypiece.asp?piece=1016448' target='_blank'&gt;&lt;img style='max-width: 320px; width: expression(this.width &gt; 320 ? 320: true); max-height: 320px; height: expression(this.height &gt; 320 ? 320: true); 'src='http://www.comicartfans.com/Images/Category_37430/subcat_78556/thumbs/nPyzKxnB_1111140947421.jpg'&gt;&lt;/a&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td style='text-align:center;'&gt;&lt;p style='font-size:12px;'&gt;&lt;a href='http://www.comicartfans.com/GalleryDetail.asp?GCat=37430' target='_blank' style='color:#fe6a23; text-decoration: none;'&gt;Click Here&lt;/a&gt; to visit my Art Gallery on &lt;a href='http://www.comicartfans.com' target='_blank' style='color:#fe6a23; text-decoration: none;'&gt;comicartfans.com&lt;/a&gt;!&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</textarea>
</form>
<p align="center" style="margin:0; padding:0"><a href="javascript:DisplayOne('sharebox2');" style="font-size: 11px">[ click to close ]</a></p>
</div>
<br>
</td>
</tr>
</tbody></table>
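As a rough illustration of the scraper idea, the sketch below pulls the label/value pairs out of an "Artwork Details" table like the one above, using only Python's standard-library `html.parser`. The names `DetailsTableParser`, `parse_details`, and `SAMPLE` are hypothetical, and `SAMPLE` is a trimmed copy of a few rows from the captured markup. It assumes each field is a `<tr>` with exactly two `<td>` cells and a label ending in a colon, which may not hold for other pages on the site.

```python
from html.parser import HTMLParser


class DetailsTableParser(HTMLParser):
    """Collect two-cell rows whose first cell is a label ending in ':'."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._row = None  # cell texts of the innermost open <tr>, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            # A nested <tr> simply replaces the outer row being tracked.
            self._row = []
        elif tag == "td" and self._row is not None:
            self._row.append("")

    def handle_data(self, data):
        # Only keep text that falls inside a <td> of the tracked row.
        if self._row:
            self._row[-1] += data

    def handle_endtag(self, tag):
        if tag != "tr" or self._row is None:
            return
        # Collapse runs of whitespace (including newlines) in each cell.
        cells = [" ".join(cell.split()) for cell in self._row]
        self._row = None
        if len(cells) == 2 and cells[0].endswith(":"):
            self.fields[cells[0].rstrip(":").strip()] = cells[1]


def parse_details(html):
    """Return a dict mapping field labels to their text values."""
    parser = DetailsTableParser()
    parser.feed(html)
    parser.close()
    return parser.fields


# Trimmed sample based on the captured markup (assumption: same row layout).
SAMPLE = """
<table><tbody><tr><td>
<h2 class="head">Artwork Details</h2>
<table><tbody>
<tr><td><b>Title:</b></td><td><b>VEBER Les maisons sont des visages 1899</b></td></tr>
<tr><td><b>Media Type:</b></td><td>Mixed Media</td></tr>
<tr><td><b>For Sale Status: </b></td><td>
NFS
</td></tr>
<tr><td><b>Added to Site: </b></td><td>5/24/2013</td></tr>
</tbody></table>
</td></tr></tbody></table>
"""

fields = parse_details(SAMPLE)
for label, value in fields.items():
    print(f"{label}: {value}")
```

Rows whose value cell holds several child elements (for example the "Likes on CAF" row, which has a visible and a hidden counter) will have all of their text concatenated by this sketch, so a real scraper would likely need per-field handling.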