Alphabetically sorted list below of Slack channels on the Slack instance used by the General Services Administration from response to FOIA request by @rebeccawilliams
Simple script extracting data from https://github.com/unitedstates/congress-legislators, and checking the member URLs for robots.txt files, and if so, checking if there are disallow rules in the robots.txt
encompassing the entire site, and preventing constituents from being able to easy access data on the site.
Datasets below:
senate_access.csv
- Senate site datahouse_access.csv
- House site data
Both datasets have columns for data about members, whether robots.txt file exists, whether there is a global disallow rule, or any page on the public site with a disallow rule.
Found the following MOC sites had rules prohibiting Google from properly indexing them:
- Sen. Mitch McConnell (R-KY) - https://www.mcconnell.senate.gov
Simple script using got to check robots.txt files of state legislature websites from a manually collated dataset state_urls.csv
to check if Google and accessible means of finding resources are blocked. The resulting dataset can be found in state_access.csv
.
There is one state legislative body blocking public access using a robots.txt: the Missouri State Senate http://www.senate.mo.gov
Jump to tweets by date
Jump to tweets by date
Jump to tweets by date
Data is sorted by source type, then chamber, type, and state if relevant. Jump to counts by source
Jump to tweets by date
Jump to tweets by date
Jump to tweets by date