Skip to content

Instantly share code, notes, and snippets.

View edsu's full-sized avatar

Ed Summers edsu

View GitHub Profile
name twitter
Susan A. Davis https://twitter.com/repsusandavis
Martha Roby https://twitter.com/repmartharoby
F. James Sensenbrenner, Jr. https://twitter.com/jimpressoffice
Joseph P. Kennedy III https://twitter.com/repjoekennedy
Denny Heck https://twitter.com/repdennyheck
Donna E. Shalala https://twitter.com/repshalala
Steve Watkins https://twitter.com/rep_watkins
Kendra S. Horn https://twitter.com/repkendrahorn
Ben McAdams https://twitter.com/repbenmcadams
We can make this file beautiful and searchable if this error is corrected: It looks like row 8 should actually have 9 columns, instead of 6. in line 7.
name,twitter,twitter_ok,youtube,youtube_ok,facebook,facebook_ok,instagram,instagram_ok
Lamar Alexander,https://twitter.com/senalexander,True,https://www.youtube.com/user/lamaralexander,True,,,https://www.instagram.com/senlamaralexander,True
Michael B. Enzi,https://twitter.com/senatorenzi,True,https://www.youtube.com/user/senatorenzi,True,https://www.facebook.com/mikeenzi,True,https://www.instagram.com/senatorenzi,True
Pat Roberts,https://twitter.com/senpatroberts,True,https://www.youtube.com/user/senpatroberts,True,https://www.facebook.com/senpatroberts,True,,
Tom Udall,https://twitter.com/senatortomudall,True,https://www.youtube.com/user/senatortomudall,True,https://www.facebook.com/senatortomudall,True,https://www.instagram.com/senatortomudall,True
Justin Amash,,,,,https://www.facebook.com/repjustinamash,True,,
Rob Bishop,https://twitter.com/reprobbishop,True,https://www.youtube.com/user/congressmanbishop,True,https://www.facebook.com/reprobbishop,True,,
Wm. Lacy Clay,,,,,https://www.facebook.com/1091354058
id:
bioguide: P000598
thomas: '01910'
govtrack: 412308
opensecrets: N00029127
votesmart: 106220
fec:
- H8CO02137
cspan: 1031300
wikipedia: Jared Polis
@edsu
edsu / search_tweets_all.sh
Last active January 31, 2021 02:49
use twitter's search client to retrieve all possible data for a tweet from the v2 api
search_tweets.py \
--query obama \
--results-per-call 100 \
--tweet-fields id,text,attachments,author_id,context_annotations,conversation_id,created_at,entities,geo,in_reply_to_user_id,lang,possibly_sensitive,public_metrics,referenced_tweets,reply_settings,source,withheld \
--user-fields id,name,username,created_at,description,entities,location,pinned_tweet_id,profile_image_url,protected,public_metrics,url,verified,withheld \
--media-fields media_key,type,duration_ms,height,preview_image_url,public_metrics,width \
--poll-fields id,options,duration_minutes,end_datetime,voting_status \
--place-fields full_name,id,contained_within,country,country_code,geo,name,place_type \
--expansions author_id,referenced_tweets.id,in_reply_to_user_id,attachments.media_keys,attachments.poll_ids,geo.place_id,entities.mentions.username,referenced_tweets.id.author_id \
--filename-prefix obama \
@edsu
edsu / notes-00000.json
Last active January 26, 2021 15:21
$ curl https://ton.twitter.com/birdwatch-public-data/notes/1DSGKVEVHWGZVA FMGTEZXQRG8MBTEG0IDU1ZPS2U3FZJIE6UA07MK1KSP3FW/notes-00000.tsv | csvjson --indent 2 > notes-00000.json
[
{
"noteId": 1.3527968784384246e+18,
"participantId": "6B6171E803FBBF58397EC4A0AF76A2FF",
"createdAtMillis": 1611366884227.0,
"tweetId": 1.3527966185048924e+18,
"classification": "MISINFORMED_OR_POTENTIALLY_MISLEADING",
"believable": "BELIEVABLE_BY_MANY",
"harmful": "LITTLE_HARM",
"validationDifficulty": "EASY",
#!/usr/bin/env python
import internetarchive
ia = internetarchive.get_session()
for result in ia.search_items('collection:mediahistory creator:National Association of Educational Broadcasters'):
ia_id = result['identifier']
item = ia.get_item(ia_id)
item.download(files=[ia_id + '_meta.xml'], destdir="meta", no_directory=True)
#!/usr/bin/env python
import re
import csv
import internetarchive as ia
def item_summary(item_id):
item = ia.get_item(item_id)
size = 0
date bytes
2020-02-22 924153122530
2020-02-23 1965467061008
2020-02-24 1867608336805
2020-02-25 1779999943858
2020-02-26 1985847458884
2020-02-27 1888468270741
2020-02-28 3475697622991
2020-02-29 4306188573766
2020-03-01 4463740524509
We can't make this file beautiful and searchable because it's too large.
url,archive_url
http://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613095658/http://twitter.com/realDonaldTrump/status/1006837823469735936
https://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613095659/https://twitter.com/realDonaldTrump/status/1006837823469735936
https://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613100034/https://twitter.com/realDonaldTrump/status/1006837823469735936
https://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613102120/https://twitter.com/realDonaldTrump/status/1006837823469735936
https://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613105147/https://twitter.com/realDonaldTrump/status/1006837823469735936
https://twitter.com/realDonaldTrump/status/1006837823469735936,http://web.archive.org/web/20180613105159/https://twitter.com/realDonaldTrump/status/1006837823469735936
ht