@zcorpan
zcorpan / results-20170223-081749.csv
Created February 23, 2017 07:43
SELECT page, url, REGEXP_EXTRACT(body, r'(.{20}\bclientInformation\b.{20})') AS match
page,url,match
http://www.elm.sa/,http://www.elm.sa/_layouts/search.js?rev=mNvuYQIlFFUBb3Q8Ktm7hw%3D%3D,lse;if(null!=window.clientInformation)d=window.clientInfo
http://www.alonely.com.cn/,http://images.sohu.com/cs/jsfile/js/c.js,"n""liebao""}if(window.clientInformation&&window.clientInfor"
http://www.mkelectric.com/,http://www.mkelectric.com/_layouts/search.js?rev=BjP0%2BmPXUFhF7kDZmHIaVg%3D%3D,lse;if(null!=window.clientInformation)d=window.clientInfo
http://www.jimmychoo.com/,http://d16fk4ms6rqz1v.cloudfront.net/capture/jimmychoo.js,"ew r(this.api),this.clientInformation=this.getClientInfor"
http://www.86y.org/,http://images.sohu.com/cs/jsfile/js/f.js,"n""liebao""}if(window.clientInformation&&window.clientInfor"
http://www.zuilxy.com/,http://images.sohu.com/cs/jsfile/js/c.js,"n""liebao""}if(window.clientInformation&&window.clientInfor"
http://www.kilimall.co.ke/,http://script.kilimall.co.ke/js/kui/babel.min.js,":!1,clearTimeout:!1,clientInformation:!1,ClientRect:!1,Cl"
http://www.zwijsen.nl/,http://wpg.blue
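The `REGEXP_EXTRACT` call in the query above keeps 20 characters of context on each side of the whole word `clientInformation`. A minimal Python sketch of the same extraction follows. The function name `extract_match` and the sample body are hypothetical and only illustrate the pattern; the original query runs in BigQuery, not Python.

```python
import re

# Same pattern as the query's REGEXP_EXTRACT call: 20 characters of context
# on each side of the whole word "clientInformation".
PATTERN = re.compile(r"(.{20}\bclientInformation\b.{20})")


def extract_match(body):
    """Return the first match with its surrounding context, or None."""
    found = PATTERN.search(body)
    return found.group(1) if found else None


# Hypothetical response body, made up for illustration.
sample_body = (
    "var d=null;var e=false;"
    "if(null!=window.clientInformation)d=window.clientInformation.platform;"
)
print(extract_match(sample_body))
```

Because `.` does not match a newline and the pattern needs 20 characters on both sides, an occurrence close to the start or end of a line or body is not reported.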
zcorpan / nav.js
Last active February 10, 2017 10:30 — forked from tobie/nav.js
"use strict";
(function(global) {
var CSS_TEXT = "float: left; padding: 5px 15px; border-left: 1px solid #999; cursor: pointer;";
var CSS_BORDER = "outline: 2px dashed #00F";
var container = document.createElement("div");
var display = document.createElement("div");
var previous = document.createElement("div");
var next = document.createElement("div");
var position = document.createElement("span");
display.appendChild(position);
zcorpan / no_lang_hyphens_auto.csv
Created January 20, 2017 12:19
Pages in httparchive without <html lang>, with hyphens:auto
page,url,match
http://www.connectto.com/,https://www.connectto.com/wp-content/themes/CherryFramework/css/style.css,hyphens:auto
http://www.blackberry-10.fr/,http://www.blackberry-10.fr/sheetstyle.css?v1043339944221,hyphens: auto
http://www.shelby.pro/,https://p3chat.com/dist/css/p3.css?20160421080459,hyphens:auto
http://www.amsterdam.nl/,https://www.amsterdam.nl/views/css/933c105ddb36d8cd6e6dc297aa3cda7a.css,hyphens:auto
http://www.beadedladies.com/,"http://beadedladies.com/wp-content/plugins/bwp-minify/min/?f=wp-content/plugins/contact-form-7/includes/css/styles.css,wp-content/plugins/the-growth-jacker/includes/public/css/tgj-styles.css,wp-content/plugins/the-growth-jacker/includes/public/js/reveal/reveal.css,wp-content/themes/twentysixteen/genericons/genericons.css,wp-content/themes/twentysixteen/style.css",hyphens:auto
http://www.poracaso.com/,http://poracaso.com/wp-content/themes/poracaso-2015/library/css/style.css?ver=4.4.9,hyphens:auto
http://www.personalityjunkie.com/,http://personalityjunkie.com/wp-co
zcorpan / results-20170119-091935.csv
Created January 19, 2017 08:25
Pages in httparchive with attributes that start with `_` or non-standard attributes containing `-`
page,url,match
http://www.zz91.com/,http://www.zz91.com/,<a action-data='newyear' onclick='bookmarks(this);'>
http://www.randewoo.ru/,http://www.randewoo.ru/,"<li ng-repeat=""tip in search_tips"">"
http://www.smitehighlight.com/,http://www.smitehighlight.com/,"<html ng-app=""app"" lang=""en"">"
http://www.grabmyessay.com/,http://www.grabmyessay.com/,"<body class-on-popup=""not-scroll"" ng-app=""essayapp"">"
http://www.solwayscuba.com/,http://www.solwayscuba.com/,"<span currency-id=""2"">"
http://www.turkkitap.de/,http://www.turkkitap.de/,"<img max-width=200px src=""http://www.turkkitap.de/images/campagne_images/1483176638.jpg"">"
http://www.xxhh.com/,http://www.xxhh.com/,"<img _src=""/static/uploads/custom/20160727/07235e9180351c79.jpg"" id=""slide-img-3"" height=""150"" width=""600"" alt=""大湿兄小湿妹"" onclick=""_hmt.push(['_trackevent', 'fun', 'click', 'fun-slide-3']);"" />"
http://www.luxresorts.com/,http://www.luxresorts.com/,"<form ng-controller=""bookingcontroller"">"
http://www.compteur-visite
zcorpan / results-20170117-133147.csv
Created January 17, 2017 12:35
Newlines and < in src/href in httparchive
page,url,match
http://www.juara.net/,http://www.juara.net/,"<script src=""//
<http://widget.crowdynews.com/juara_juara-horizontal.js>
widget.crowdynews.com/juara_juara-horizontal.js""
async=""true"">"
http://www.cyclenews.com/,http://www.cyclenews.com/,"<img
height=""1"" width=""1"" style=""display:none""
src=""https://www.facebook.com/tr?id=1508971226068396&ev=pageview&noscript=1</></noscript><meta
property=""fb:pages"" content=""97064620277"" />"
http://www.meta-lol.com/,http://www.meta-lol.com/,"<img src=""-->
zcorpan / results-20170117-125808.csv
Created January 17, 2017 12:27
Newlines (ignoring leading/trailing only) in src/href in httparchive
page,url,match
http://www.chemao.com/,http://www.chemao.com/,"<a rel=""nofollow"" href=""http://122.224.75.236/wzba/login.do?method=hdurl&doamin=null&id=330105000251506&shid=1223.0aff_name=com.rouger.
gs.main.userinfoaff&aff_action=qyhzdetail&page_url=showdetail""
target=""_blank"">"
http://www.washingtonexaminer.com/,http://www.washingtonexaminer.com/,"<a class=""event-tracking"" href=""/politics/white-
house"" target="""" data-action=""menu"" data-label=""politics_white_house"">"
http://www.wikilover.com/,http://www.wikilover.com/,"<iframe allowtransparency='true' frameborder='0' scrolling='no' src='//www.facebook.com/plugins/likebox.php?
href=https://www.facebook.com/wikidulich/&width=402&height=255&colorscheme=light&show_faces=true&show_border=false&stream=false&header=false'
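The gist does not show the query behind these rows. As a rough sketch of the stated criterion (a newline inside a `src` or `href` value, not counting leading or trailing whitespace), one could check a tag as below. The attribute pattern and the name `has_interior_newline` are assumptions for illustration, not the author's query, and only quoted attribute values are handled.

```python
import re

# Assumed pattern (not taken from the gist): a quoted src or href value.
# DOTALL lets the value span several lines.
ATTR = re.compile(r"""\b(?:src|href)\s*=\s*(["'])(.*?)\1""", re.DOTALL | re.IGNORECASE)


def has_interior_newline(tag):
    """True if any quoted src/href value contains a newline after trimming."""
    for _quote, value in ATTR.findall(tag):
        # strip() drops leading/trailing whitespace, so only interior
        # newlines remain to be detected.
        if re.search(r"[\r\n]", value.strip()):
            return True
    return False


tag = '<a class="event-tracking" href="/politics/white-\nhouse" target="">'
print(has_interior_newline(tag))
```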
zcorpan / results-20170111-152822.csv
Created January 11, 2017 23:30
Resources in httparchive:har.2016_12_15_chrome_requests_bodies matching '(\bbackground-position\s*:\s*(?:\d+(?:\.\d+)(?:[a-z]+)|top|left|bottom|right)\s+(?:\d+(?:\.\d+)(?:[a-z]+)|top|left|bottom|right)\s+(?:\d+(?:\.\d+)(?:[a-z]+)|top|left|bottom|right)\s*[;\}])')
page,url,match
http://www.legrand.us/,http://www.legrand.us/assets/css/legrand.min-vts17150.23_08_56.7901499.css,background-position:right 2.5vw top}
http://www.rado.com/,https://www.rado.com/sites/default/files/advagg_css/css__BXkJWLORBCp9dTQv8UaWBaeuGawwSSuZ3VsiFVzV-yc__Cw4hw9bM0Vk18I_wjwNMVzZM2_3DwVfpGt-LO8u6tKw__D7xkni5JJk6oLtBY9N-HdopecSoZZr-5FllXqHNcoe0.css,background-position:left top 0.4em;
http://www.wgv.de/,https://www.wgv.de/media/layout/css/main.css,background-position:top right 1.25rem;
http://www.bygg.no/,http://bygg-ext.cdn.aptoma.no/fronts/bygg-front-full/styles.css,background-position: right 1.5em bottom;
http://www.bridelice.fr/,http://www.bridelice.fr/sites/default/files/public/css/css_tcocgz_B-Nqg7tEkzwaCA0UFnjIn_PsjBSO6o_VOoWs.css,background-position:left top 3.2rem}
http://www.athena-innovation.gr/,http://www.athena-innovation.gr/,background-position: left top bottom;
http://www.trabber.es/,https://www.trabber.es/css/styles.css?20569fd0c7c92e189a72c4833fba89e3.cache.css,background-positi
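The pattern in the description above looks for `background-position` declarations with exactly three whitespace-separated values. A small Python sketch of the same pattern is shown here. The names `VALUE`, `PATTERN` and `three_value_match` are hypothetical, and the original was run as a BigQuery regular expression, so this is only an approximation of its behavior. Note that, as written, the numeric alternative requires a decimal part and a unit, so `2.5vw` matches but `10px` or `0` does not.

```python
import re

# One value, copied from the pattern in the description: a decimal number
# with a unit, or one of the four edge keywords.
VALUE = r"(?:\d+(?:\.\d+)(?:[a-z]+)|top|left|bottom|right)"

# Three values separated by whitespace, terminated by ";" or "}".
PATTERN = re.compile(
    r"(\bbackground-position\s*:\s*"
    + VALUE + r"\s+" + VALUE + r"\s+" + VALUE
    + r"\s*[;\}])"
)


def three_value_match(css):
    """Return the first three-value declaration found, or None."""
    found = PATTERN.search(css)
    return found.group(1) if found else None


print(three_value_match(".a{background-position:right 2.5vw top}"))
print(three_value_match(".b{background-position: left top;}"))
```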
zcorpan / results-20170105-141014.csv
Created January 5, 2017 13:13
Pages in httparchive:har.2016_12_15_chrome_requests_bodies with <area shape=default> and no nohref
page,url,match
http://www.narutopedia.eu/,http://www.narutopedia.eu/,"<area shape=""default"" alt=""startseite"" href=""http://www.narutopedia.eu/"">"
http://www.awardspace.net/,https://www.awardspace.com/,"<area shape=""default"" href=""/signup?gid=1603&#038;months_1=12"" target="""">"
http://www.magicoveneto.it/,http://www.magicoveneto.it/,"<area shape=""default"" alt=""click on the map"" />"
http://www.tilelife.co.jp/,http://www.tilelife.co.jp/,"<area shape=""default"" href=""http://www.tilelife.co.jp/news/"" alt=""¤ªãî¤é¤»"">"
http://www.metalaficion.com/,http://foro.metalaficion.com/,"<area shape=""default"" href=""http://hp.martinerni.ch/congenialrope.php?ld=2"" />"
http://www.mudcat.org/,http://www.mudcat.org/,"<area shape=default href=""http://mudcat.org"">"
http://www.yaoko-net.com/,http://www.yaoko-net.com/,"<area shape=""default"" href=""store/store01/"" alt=""チラシ・店舗検索のページへ"">"
http://www.fiddlesalad.com/,http://pythonfiddle.s3.amazonaws.com/js/codemirror.fiddle.js.gz?87,"
zcorpan / results-20161202-212844.csv
Created December 6, 2016 10:26
httparchive pages containing <address> and <article> + analysis
page,use,notes,not contact information
http://www.spartak.ru/,contact information for site,,
http://www.makumaku.jp/,company postal address,,
http://www.wtxl.com/,contact information for site,,
http://www.mhc.kr/,Request removal of post + email,,1
http://www.udd.cl/,Physical location for upcoming event,<address> applies to <article>,1
http://www.chinatimes.com/,Copyright + link to general company info,,1
http://www.gloucestertimes.com/,contact information for site,,
http://www.tribdem.com/,contact information for site,,
http://www.hvrsd.org/,contact information for site,,
zcorpan / results-20161116-143944.csv
Created November 16, 2016 13:43
httparchive pages using markup in iframes
page,url,match
http://www.ilgirlsdaily.com/,http://www.ilgirlsdaily.com/,"<iframe id=""ca30f4c7df"" name=""ca30f4c7df"" src=""//us-ads.openx.net/w/1.0/afr?auid=501694&cb=insert_random_number_here"" frameborder=""0"" scrolling=""no"" width=""160"" height=""600""><a href=""//us-ads.openx.net/w/1.0/rc?cs=ca30f4c7df&cb=insert_random_number_here"" >"
http://www.sitemeter.com/,http://www.sitemeter.com/,<iframe src=footer_banner_test.html width=800 height=175 frameborder=0> </div>
http://www.casimirocastillojal.gob.mx/,http://www.casimirocastillojal.gob.mx/,"<iframe width=""100%"" height=""380"" src=""imagenes/gob_abierto.jpg"" frameborder=""0"" allowfullscreen=""allowfullscreen"" scrolling=""no""><a href=""http://www.google.com"">"
http://www.intimshop.ru/,http://www.intimshop.ru/,"<iframe src=""//ads.adfox.ru/243010/getcode?pp=g&amp;ps=cdqc&amp;p2=fghl&amp;p3=a&amp;p4=a&amp;pct=a&amp;plp=a&amp;pli=a&amp;pop=a&amp;pr=' + pr + '&amp;pt=b&amp;pd=' + addate.getdate() + '&amp;pw=' + addate.getday() + '&amp;pv=' + addat