@wragge
wragge / moad-election-speeches-tfidf.txt
Created November 22, 2016 07:27
Top twenty trigrams from each speech in MoAD's collection of election speeches weighted by TF-IDF value.
1901-EDMUND-BARTON
30 june 1900 0.0452017025048
as to appointments 0.0364684489678
barton and lyne 0.0364684489678
be taken over 0.0364684489678
direct taxation by 0.0364684489678
from 30 june 0.0364684489678
if he has 0.0364684489678
illegible handwritten notes 0.0364684489678
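
The listing above does not say how the weights were computed. As a rough sketch only, the following pure-Python example ranks word trigrams per document with one common TF-IDF variant (relative term frequency multiplied by log(N / document frequency)). The tokenisation, the weighting formula, the function names `trigrams` and `top_trigrams`, and the toy input are all assumptions made for illustration. They will not reproduce the values shown above.

```python
import math
import re
from collections import Counter


def trigrams(text):
    """Lowercase the text, split it into word tokens, and return word trigrams."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [" ".join(tokens[i:i + 3]) for i in range(len(tokens) - 2)]


def top_trigrams(speeches, n=20):
    """Return the n highest-weighted trigrams for each speech.

    `speeches` maps a label to the full text of one speech.
    Assumed weight: (relative frequency in the speech) * log(N / document frequency).
    """
    counts = {label: Counter(trigrams(text)) for label, text in speeches.items()}

    # Document frequency: in how many speeches does each trigram occur?
    doc_freq = Counter()
    for counter in counts.values():
        doc_freq.update(counter.keys())
    total_docs = len(speeches)

    results = {}
    for label, counter in counts.items():
        total = sum(counter.values())
        weights = {
            gram: (freq / total) * math.log(total_docs / doc_freq[gram])
            for gram, freq in counter.items()
        }
        # Sort by descending weight, then alphabetically so ties are stable.
        ranked = sorted(weights.items(), key=lambda item: (-item[1], item[0]))
        results[label] = ranked[:n]
    return results


# Toy input for illustration, not the MoAD speeches.
speeches = {
    "A": "direct taxation by the states and direct taxation by the commonwealth",
    "B": "the states and the commonwealth will share revenue",
}
for label, ranked in top_trigrams(speeches, n=3).items():
    print(label)
    for gram, weight in ranked:
        print(f"{gram} {weight:.4f}")
```

Trigrams that occur in every document get a weight of zero under this variant, so phrases shared by all speeches drop out of the ranking.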
@wragge
wragge / dfat_documents_summary.txt
Created November 26, 2016 06:28
First attempt at processing DFAT's collection of historical documents
10573 documents
10367 dates found (98% of documents)
8777 NAA references found (83% of documents)
2481 NAA barcodes found (23% of documents, 28% of references)
3615 unique NAA references found
961 unique NAA barcodes found
VOLUME 1: 1937-38:
337 documents
335 dates found (99% of documents)
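
The percentages in this summary can be checked from the counts alone. The short sketch below recomputes them. The helper name `pct` is hypothetical, and rounding to the nearest whole number is an assumption, since the summary does not say whether figures were rounded or truncated.

```python
def pct(part, whole):
    """Return part/whole as a whole-number percentage (rounded to nearest)."""
    return round(100 * part / whole)


# Counts copied from the overall summary above.
documents = 10573
dates = 10367
references = 8777
barcodes = 2481

print(f"{dates} dates found ({pct(dates, documents)}% of documents)")
print(f"{references} NAA references found ({pct(references, documents)}% of documents)")
print(
    f"{barcodes} NAA barcodes found ({pct(barcodes, documents)}% of documents, "
    f"{pct(barcodes, references)}% of references)"
)

# Volume 1 counts from the summary above.
print(f"335 dates found ({pct(335, 337)}% of documents)")
```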
@wragge
wragge / title_words_totals.txt
Created December 22, 2016 02:37
Frequency of words in Trove newspaper titles -- December 2016
advertiser - 183
times - 111
news - 88
advocate - 84
chronicle - 74
gazette - 64
herald - 57
australian - 56
journal - 47
standard - 45
@wragge
wragge / hansard-interjections-fascist.md
Last active January 28, 2017 02:43
Interjections less than 50 characters long including the word 'fascist' from the Commonwealth Parliament between 1901 and 1980.
@wragge
wragge / hansard-interjections-white.md
Last active January 28, 2017 02:44
Interjections less than 50 characters long including the word 'white' from the Commonwealth Parliament between 1901 and 1980.
@wragge
wragge / dh2017-titlewithheld.md
Created February 18, 2017 01:52
Proposal for DH2017

[Title withheld] -- access and surveillance in the archives

The Australian Security Intelligence Organisation (ASIO) was established in 1949 amidst Cold War fears of spies and secrets (Horner, 2014). In the decades that followed, ASIO compiled many thousands of dossiers on people and organisations who might pose a threat to the nation -- these included communists, writers, academics, scientists, unionists, and Indigenous rights activists. One historian has estimated that hundreds of thousands of files were created (McKnight, 2014).

Under the Australian Archives Act, the public has a right to access government records more than twenty years old. This is subject to exemptions on grounds such as national security and privacy. Exemptions are assessed and applied as part of a process known as 'access examination' (National Archives of Australia, 2016).

Unlike other government agencies, ASIO does not have to disclose information about its records. You can, however, ask whether they hold a file on a particular

@wragge
wragge / trove-translator-testing.md
Last active February 20, 2017 02:45
Install new Trove translator for testing

Trove translator testing

I've created a new Trove translator for Zotero which takes over the capture of newspaper articles from my old 'Australian newspapers' translator, and adds support for most of the other Trove zones as well.

I've submitted a pull request to have the new translator added to the Zotero repository, but if you're impatient and would like to help test it out, here's how to manually install it.

  • First find out where your translators folder is...
  • Open Zotero and choose 'Preferences > Advanced > Files & Folders'
  • Click on 'Show data directory'
  • Your Zotero data directory should open up -- it will include a directory called 'translators'
@wragge
wragge / functions-in-recordsearch.md
Last active March 31, 2017 03:04
Functions currently used in RecordSearch
| Term | Number of agencies | Included in thesaurus |
| --- | --- | --- |
| administrative law | 169 | crsthesaurus, agift1, agift2, agift3, recordsearch |
| administrative services | 3 | crsthesaurus, recordsearch |
| agriculture | 118 | crsthesaurus, recordsearch |
| air force | 21 | crsthesaurus, agift2, agift3, recordsearch |
| air force administration | 23 | crsthesaurus, recordsearch |
| air force commands | 77 | crsthesaurus, recordsearch |
| air operations | 140 | crsthesaurus, recordsearch |
| air safety | 6 | crsthesaurus, recordsearch |
@wragge
wragge / question-titles-decade-tfidf.txt
Created April 11, 2017 11:25
Most significant words (via TF-IDF) in titles of questions asked in the House of Reps for each decade 1900-1979
1900
kanakas 0.0725222278008
stripper 0.0604896801498
employes 0.0599783041901
increments 0.0539607103026
creswell 0.0528185096992
drawback 0.0528185096992
masters 0.0528185096992
slanders 0.0528185096992
@wragge
wragge / hansard-speaker-similarities.md
Last active April 27, 2017 13:41
Similarities between 1970s members of the House of Reps, based on their speeches in Hansard (showing 10 most similar)

HOWSON, Peter (LP)

* VINER, Ian (LP)                          0.990147362266
* HUNT, Ralph (NCP/NP)                     0.989726340705
* BOWEN, Nigel (LP)                        0.989052644404
* SWARTZ, Reginald (LP)                    0.988994723622
* NEWMAN, Kevin (LP)                       0.988793740506
* GROOM, Ray (LP)                          0.988658546299
* EVERINGHAM, Douglas (ALP)                0.988432787038
* STEWART, Francis (ALP)                   0.98833413308
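
The listing does not name the similarity measure used. One common choice for comparing speakers by their words is cosine similarity between term-count (or TF-IDF) vectors, sketched below. The measure itself, the function names `cosine_similarity` and `most_similar`, and the toy speaker data are assumptions for illustration, not the method behind the scores above.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-weight vectors (dict-like)."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(name, vectors, n=10):
    """Return the n speakers most similar to `name`, highest score first."""
    scores = [
        (other, cosine_similarity(vectors[name], vector))
        for other, vector in vectors.items()
        if other != name
    ]
    return sorted(scores, key=lambda item: -item[1])[:n]


# Toy word counts for three hypothetical speakers.
vectors = {
    "SPEAKER A": Counter("tax tax budget".split()),
    "SPEAKER B": Counter("tax budget budget".split()),
    "SPEAKER C": Counter("wool wheat".split()),
}
for other, score in most_similar("SPEAKER A", vectors):
    print(f"* {other:<40} {score}")
```

With raw counts over a large shared vocabulary, scores tend to cluster near 1, so a weighting such as TF-IDF is often applied before comparing vectors.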