

Alex Litel alexlitel

  • Los Angeles, CA
@alexlitel
alexlitel / README.MD
Last active March 4, 2024 01:49
GSA Slack Channels (alphabetically sorted)

Alphabetically sorted list below of the channels on the Slack instance used by the General Services Administration, taken from the response to a FOIA request by @rebeccawilliams.

alexlitel / README.md
Last active May 21, 2020 04:05
Congressional robots.txt parser

Simple script that extracts data from https://github.com/unitedstates/congress-legislators and checks each member's URL for a robots.txt file. If one exists, the script checks whether it contains disallow rules covering the entire site, which keep constituents from easily accessing data on the site.
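
As an illustration of the site-wide disallow check described above, here is a minimal Python sketch. It is not the gist's actual script: the function name `has_global_disallow` is hypothetical, and the parsing is deliberately simplified (it ignores `Allow` overrides and wildcard patterns, and only looks for a literal `Disallow: /`).

```python
def has_global_disallow(robots_txt: str, user_agent: str = "*") -> bool:
    """Return True if a group naming `user_agent` contains `Disallow: /`.

    Simplified illustration: Allow overrides and wildcards are ignored.
    """
    agents = []       # user-agents named by the group currently being read
    in_rules = False  # True once the current group has started listing rules
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:
                # A User-agent line after rules starts a new group.
                agents, in_rules = [], False
            agents.append(value.lower())
        elif field in ("allow", "disallow"):
            in_rules = True
            if field == "disallow" and value == "/" and user_agent.lower() in agents:
                return True
    return False
```

Passing the text of a fetched robots.txt file to this function gives a rough answer for the "global disallow rule" question; a production check would also need to account for `Allow` rules that carve out exceptions.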

Datasets below:

  • senate_access.csv - Senate site data
  • house_access.csv - House site data

Both datasets have columns for member data, whether a robots.txt file exists, whether there is a global disallow rule, and whether any page on the public site has a disallow rule.

The following MOC sites had rules preventing Google from properly indexing them:

alexlitel / README.md
Last active October 9, 2020 04:35
State legislature robots.txt parser

Simple script using got to fetch the robots.txt files of state legislature websites, listed in the manually collated dataset state_urls.csv, and check whether Google and other accessible means of finding resources are blocked. The resulting dataset can be found in state_access.csv.
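
The Googlebot check can be sketched with Python's standard-library `urllib.robotparser`, used here as a stand-in for the got-based script. The helper name `googlebot_blocked` is hypothetical, and the robots.txt text is passed in directly so the example runs without network access.

```python
from urllib.robotparser import RobotFileParser


def googlebot_blocked(robots_txt: str, path: str = "/") -> bool:
    """Return True if the given robots.txt text forbids Googlebot from `path`.

    Hypothetical helper for illustration; fetching the file is left out.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines rather than a URL.
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch("Googlebot", path)
```

A site whose robots.txt yields `True` for the root path is effectively hidden from Google search, which is the condition the script reports on.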

One state legislative body blocks public access using a robots.txt file: the Missouri State Senate (http://www.senate.mo.gov).

alexlitel / timmons_campaigntweets.md
Created January 23, 2020 02:26
William Timmons' tweets from office account
alexlitel / hunter_officetweets.md
Created January 16, 2020 03:44
Duncan Hunter tweets from deleted office account
alexlitel / lofgren_campaigntweets.md
Created January 16, 2020 03:41
Zoe Lofgren tweets from deactivated campaign account
alexlitel / tweets_by_source.md
Last active October 21, 2019 02:16
Tweet breakdown by source
alexlitel / collinstweets_office.md
Last active October 2, 2019 02:33
Chris Collins office account tweets
alexlitel / collinstweets.md
Created October 1, 2019 05:21
Chris Collins tweets from deleted campaign account
alexlitel / goodentweets.md
Created September 19, 2019 04:57
Gooden tweets from deleted office account