Skip to content

Instantly share code, notes, and snippets.

@shawngraham
shawngraham / demo.R
Last active May 16, 2022 19:32
demo for students of gist use
# demo for sharing a script
# in scripts, a hashtag (pound sign) = a comment (in markdown, it means a header)
# we create a variable called y, and give it a _string_ of text; in this case, the direct URL to a csv file
y <- "https://raw.githubusercontent.com/shawngraham/exercise/gh-pages/CND.csv"
# now we use the read.csv command to read the file address stored in 'y'; the results go into a dataframe called 'documents2'
documents2 <- read.csv(y)
@shawngraham
shawngraham / retrieve-theses-metadata.R
Created January 17, 2022 17:49
script to generate a table of metadata concerning theses in CURVE repository; curve is changing soon so this will become dated
library(rvest)
library(dplyr)
library(xml2)
library(stringr)
library(purrr)
base_url <- "https://curve.carleton.ca"
theses = data.frame()
Artifact Position_X Position_Y chip name
242 1110.973076 480.0500947 /Users/shawngraham/Documents/chips2/object1/0000000001.png
252 772.6520572 434.331038 /Users/shawngraham/Documents/chips2/object2/0000000001.png
1231 626.3510759 672.0701326 /Users/shawngraham/Documents/chips2/object3/0000000001.png
35 2059.369176 1630.107527 /Users/shawngraham/Documents/chips2/object4/0000000001.png
35 594.3477363 1275.56168 /Users/shawngraham/Documents/chips2/object4/0000000002.png
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@shawngraham
shawngraham / bib.xsl
Last active October 15, 2020 21:40
sample metadata from jstor-dfr, and an xsl file meant to turn it into a citations.tsv with: xsltproc bib.xsl *.xml > citations.tsv
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output method="text"/>
<xsl:template match="/">
<xsl:for-each select="article/front">
"<xsl:value-of select="article-meta/contrib-group/contrib/string-name/surname" />, <xsl:value-of select="article-meta/contrib-group/contrib/string-name/given-names" />" <xsl:text>&#9;</xsl:text><xsl:value-of select="article-meta/pub-date/year" /> <xsl:text>&#9;</xsl:text> "<xsl:value-of select="article-meta/title-group/article-title" />" <xsl:text>&#9;"</xsl:text><xsl:value-of select="journal-meta/journal-title-group/journal-title"/>"<xsl:text>&#9;</xsl:text> "<xsl:value-of select="article-meta/volume" />(<xsl:value-of select="article-meta/issue" />)" <xsl:text>&#9;</xsl:text> "<xsl:value-of select="article-meta/page-range" />" <xsl:text>&#9;</xsl:text>"<xsl:value-of select="article-meta/article-id" />" <xsl:text>&#9;</xsl:text><xsl:value-of select="article-meta/self-uri" /><xsl:text>&#xA;</xsl:text>
</xsl:for-each>
@shawngraham
shawngraham / entities.csv
Last active July 22, 2020 19:47
giving nertwork a spin on chapbooks from the national library of scotland. cancelled out of the script because my machine was running hot; did extract some 13 000 entities though
doc entity entityType count
104184105 ABERDEEN organization 1
104184105 Navy organization 1
104184105 Sceptre organization 1
104184105 1 person 1
104184105 Brodie person 2
104184105 Cromar person 2
104184105 Earl person 1
104184105 Fife person 3
104184105 Glen person 1
#!/usr/bin/python
## Split audio files into chunks
## Daniel Pett 1/5/2020
__author__ = 'portableant'
## Tested on Python 2.7.13
import argparse
import os
import speech_recognition as sr
@shawngraham
shawngraham / getNotes.scpt
Created March 11, 2020 15:35
extract notes from skim to clipboard
(* Inspired and modified based on http://drosophiliac.com/2012/09/an-academic-notetaking-workflow.html and https://gist.github.com/smargh/6068104 *)
(* PROPERTIES *)
property LF : (ASCII character 10)
property tid : AppleScript's text item delimiters
(* THE SCRIPT *)
tell application "Skim"
set the clipboard to ""
activate