Skip to content

Instantly share code, notes, and snippets.

View waldoj's full-sized avatar

Waldo Jaquith waldoj

View GitHub Profile
@waldoj
waldoj / industry_codes.txt
Created November 2, 2014 19:04
Every possible type of business, according to the Virginia State Corporation Commission.
GENERAL
ELECTRIC
TELEPHONE
GAS
WATER
WATER-SEWER
SEWER
RADIO COMMON CARRIER
BANKS AND CREDIT UNIONS
FEDERAL BANKS
@waldoj
waldoj / virginia-tmsm.csv
Created October 24, 2014 15:04
Every trademark and service mark registered with the Virginia State Corporation Commission, as of October 23, 2014.
type file_number term
SM 6185 "Joe D. Roofer" (& design)
SM 8482 "The Original Home of Brunswick Stew"
SM 5322 "Warren-ty" Services
SM 8935 $3.00 Car Wash Free Vacuums TLC (& design)
SM 10584 & Live It Now (& design in color)
SM 7482 1-800-STARNES
SM 11064 1-888-LOCK-U-UP
SM 7888 1st Choice women's health center (& design)
SM 3070 1st Step Financial
@waldoj
waldoj / legislative_tags.csv
Last active November 13, 2015 14:26
A collection of 11,000 sets of crowdsourced, moderated tags, applied to legislation by hundreds of people over the course of eight years. Sourced from richmondsunlight.com.
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 3 columns, instead of 8. in line 1.
art,blacksburg,home rule
business,electricity,energy,regulation,results,scc,ssc,utility
business,electricity,energy,regulation,results,scc,ssc,utility
amendment,business,clean air,freedom,loss,marshall,newman,regulation,restaurant,sb,smoking
amendment,business,clean air,freedom,loss,marshall,newman,regulation,restaurant,sb,smoking
alcohol,bar,business,firearm,gun,regulation,restaurant,weapon
alcohol,bar,business,firearm,gun,regulation,restaurant,weapon
court,judge,supreme court,term limits
court,judge,supreme court,term limits
deed,foreclosure,hb,house,mortgage
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 6 columns, instead of 5. in line 1.
,street-1,street-2,city,state,zip
Suite 300,11350 Random Hills Road,Fairfax,VA,22030
3625 CORNELL ROAD,,FAIRFAX,VA,22030
3625 CORNELL ROAD,,FAIRFAX,VA,22030
1330 Mercer Lane,,McLean,VA,22101
Suite 225,333 South Glebe Road,Arlington,VA,22204
71 WEST FIRST AVENUE,POST OFFICE BOX 430,ALBERTA,VA,23821
5221 INDIAN RIVER RD,,VIRGINIA BEACH,VA,23464
1045 COFFEE RD,,LYNCHBURG,VA,24503
1919 GABLES LANE,,VIENNA,VA,22182
@waldoj
waldoj / rates.json
Created June 18, 2014 02:04
An imagined JSON response from a utility's API, specifying the rate per kWh for power now and over the next 24 hours. This provision of data would allow households to automatically adjust energy use in response to demand-based pricing.
[
{
"provider": "Dominion",
"url": "http://www.dom.com/",
"api_url": "http://api.dom.com/",
"documentation": "http://api.dom.com/docs/",
"state": "Virginia",
"rate": "12.3",
"units": "kWh",
"rate_units": "cents",
@waldoj
waldoj / virginia-servers.csv
Created April 11, 2014 04:04
The HTTP-header-reported server for every subdomain of virginia.gov, every agency website, and every college/university website. Where no "Server" header exists, the string "Unknown" is used.
http://abc.virginia.gov Apache
http://agencies.virginia.gov/ Microsoft-HTTPAPI/2.0
http://apa.virginia.gov/ Unknown
http://boa.virginia.gov/ Microsoft-IIS/6.0
http://bos.virginia.gov/ Microsoft-IIS/7.5
http://chr.vipnet.org/ Unknown
http://commonhelp.virginia.gov/ Unknown
http://commonwealth.virginia.gov/ Unknown
http://www.dbhds.virginia.gov/ Microsoft-IIS/5.0
http://dcjs.virginia.gov/ Microsoft-IIS/7.5

Keybase proof

I hereby claim:

  • I am waldoj on github.
  • I am waldo (https://keybase.io/waldo) on keybase.
  • I have a public key whose fingerprint is 8CC4 1FF2 95C7 5E3B D30A 2CF2 E148 6A8B FA29 F947

To claim this, I am signing this object:

@waldoj
waldoj / scc_subscribers.md
Last active April 19, 2016 06:27
Virginia State Corporation Commission database subscribers.

The Virginia State Corporation Commission (SCC) charges for bulk data of corporate registrations—$150/month for weekly updates, with a minimum three-month contract. I asked them for a list of their customers for this service. They are as follows:

Dun & Bradstreet
899 Eaton Ave.
Bethlehem, PA 18025

Seisint, Inc. (LexisNexis)
6601 Park of Commerce Blvd.
Boca Raton, FL 33487

@waldoj
waldoj / spam_template.txt
Created January 10, 2014 20:28
A blog spammer failed to parse her variables, and sent me the raw content. I thought it was interesting.
{
{I have|I’ve} been {surfing|browsing} online more than {three|3|2|4} hours
today, yet I never found any interesting article like yours.
{It’s|It is} pretty worth enough for me. {In my opinion|Personally|In
my view}, if all {webmasters|site owners|website owners|web owners} and bloggers made good content as you did, the
{internet|net|web} will be {much more|a lot more} useful
than ever before.|
I {couldn’t|could not} {resist|refrain from} commenting.
{Very well|Perfectly|Well|Exceptionally well} written!|
{I will|I’ll} {right away|immediately} {take hold of|grab|clutch|grasp|seize|snatch} your {rss|rss feed} as I {can not|can’t} {in finding|find|to find} your
@waldoj
waldoj / blog-entry-changes.md
Created November 22, 2013 19:52
Changes to @benbalter's recent blog entry.

The internet has a particular way of doing things. It's an ethos driven by a desire for resilience, for interoperability, for speed and efficiency — for preferring pragmatism over perfection. There's an unspoken set of rules born out of the hacker ethic. It's about elegant solutions, not over engineered ones, and it's what makes the internet what it is. Put another way, the internet forces us to not simply to press upload, but to reimagine desktop technologies as potential vehicles for collaboration.

There's been some talk recently, about the promise of using GitHub for data, with the excitement for the platform's disruptive potential being counterbalanced by criticism that there are use cases for which it's not ideal. That's going to be true for any technology, and like any technology, you don't solve for scale on