Skip to content

Instantly share code, notes, and snippets.

@wragge
Created April 11, 2017 11:25
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save wragge/66e0792825c9cf2676902a96a4a61d13 to your computer and use it in GitHub Desktop.
Save wragge/66e0792825c9cf2676902a96a4a61d13 to your computer and use it in GitHub Desktop.
Most significant words (via TF-IDF) in titles of questions asked in the House of Reps for each decade 1900-1979
1900
kanakas 0.0725222278008
stripper 0.0604896801498
employes 0.0599783041901
increments 0.0539607103026
creswell 0.0528185096992
drawback 0.0528185096992
masters 0.0528185096992
slanders 0.0528185096992
strachan 0.0528185096992
tenderers 0.0528185096992
commanding 0.0493690889945
driver 0.0493690889945
encampments 0.0493690889945
entries 0.0493690889945
fay 0.0493690889945
hutton 0.0493690889945
lyster 0.0493690889945
microbes 0.0493690889945
postmistress 0.0493690889945
shelling 0.0468741428254
1910
transports 0.064347188132
anzacs 0.0509506673685
luxuries 0.0496785277758
troopships 0.0482850565135
germans 0.0477977574988
expeditionary 0.0477049782283
outbreak 0.047091258264
jellicoe 0.0467446454993
moisture 0.0450226124939
chinn 0.0430703324897
eighteen 0.0430703324897
possessions 0.0430703324897
employes 0.0427415473633
tender 0.0427415473633
cologne 0.0408165906247
eau 0.0408165906247
eligibles 0.0408165906247
intoxicating 0.0408165906247
karri 0.0408165906247
relatives 0.0408165906247
1920
expropriation 0.0512935773343
abrahams 0.0501418984731
biloela 0.0475186003739
mayoh 0.0475186003739
main 0.0465838851207
mawson 0.046002641179
steamers 0.0435965086544
kyogle 0.0433763274495
preservatives 0.0423866526278
roma 0.0423866526278
tanunda 0.0423866526278
mandated 0.0416159707008
bedford 0.040168685688
borer 0.040168685688
pernicious 0.040168685688
phosphates 0.040168685688
separators 0.040168685688
sumatra 0.040168685688
wembley 0.040168685688
willis 0.040168685688
1930
abyssinian 0.062744589306
italo 0.062744589306
primage 0.0536222857968
freer 0.0531517291577
resident 0.0531517291577
matson 0.0482372473098
default 0.0449780335826
extraction 0.0449780335826
kyeema 0.0449780335826
militia 0.0447321747364
cut 0.0433853936673
yampi 0.0419687617864
boock 0.0407761762846
clocks 0.0407761762846
docking 0.0407761762846
unemploymentrelief 0.0407761762846
ottawa 0.0403600186213
broadcasting 0.0393207241873
wheatgrowers 0.0384583405146
subsidized 0.0384359450204
1940
clothes 0.065158862529
woods 0.0612251793633
lend 0.0583966441197
bretton 0.0575961895315
producer 0.0575961895315
impressment 0.052778374829
man 0.0505110702411
rationalization 0.0503051719268
unrra 0.0488941234405
rationing 0.0475717914479
immobilization 0.0473342817064
abbco 0.0455905270044
blain 0.0455905270044
disposals 0.0446526042379
deficits 0.0444115005646
effort 0.0439761991363
falstein 0.0436136209716
kenmore 0.0436136209716
marginal 0.0436136209716
pegging 0.0436136209716
1950
filling 0.0600582199482
studios 0.0555860255351
korea 0.054285819182
poliomyelitis 0.054285819182
atomic 0.0527062470959
malaya 0.0524980080916
weapons 0.0503254624897
myxomatosis 0.0500026390335
colombo 0.0495999069527
uranium 0.0479474306082
television 0.0476820586852
dollars 0.047005999257
summit 0.047005999257
formosa 0.0461313943595
jordan 0.0456702754752
peking 0.0456702754752
shipowners 0.0456702754752
aged 0.045586858027
radiation 0.0451919466458
cortisone 0.0441772658153
1960
vietnam 0.0809811514004
voyager 0.0687939560597
irian 0.0661766882395
malaysia 0.0553606640559
decentralisation 0.0547152426729
nuclear 0.052007638407
practices 0.0518650595878
ord 0.0508305493074
restrictive 0.05014421813
television 0.0496794479187
hydroelectric 0.0492527394978
offshore 0.0476986053596
nigeria 0.0474383104415
resettlement 0.0474383104415
tourism 0.0470862974172
investment 0.0466766130973
pty 0.0464408760342
standardisation 0.0464408760342
f111 0.0457585538869
mirage 0.0457585538869
1970
medibank 0.0635744122185
program 0.0609571772184
freeze 0.055414865083
indexation 0.0537167694044
vietnam 0.052665425041
actu 0.0520826678978
programs 0.0509786981225
solo 0.0505860193084
whitlam 0.0473861997431
organisation 0.0473619484136
ranger 0.046847362325
avoidance 0.0462822299678
aurukun 0.0456881040645
funding 0.0456881040645
pecuniary 0.0450618479999
pricing 0.0450618479999
pty 0.0444410436995
cambodian 0.0443997869126
natural 0.0439076749186
equalisation 0.0436975770305
@Sufiness
Copy link

Thank you! I'm so curious if you can do the same for 1980, 1990, 2010?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment