Data Science Training for Librarians, DTU Denmark, 7-9 December 2016


Live notes, so an incomplete, partial record of what actually happened.

Tags: dst4l

My asides in {}

Stream/Deck: http://www.dst4l.info/schedule.html


The Data Savvy Librarian - Christopher Erdmann

Not just tutorials, but also talks that frame learning tools/techniques.

Goals:

  • gain an understanding of the research data lifecycle
  • train librarians with skills
  • create a data-centric culture in libraries
  • grow a community
  • empower librarians (so IT people don't assume librarians can't do stuff)

#dst4l 2016 kickoff - what a great start! looking forward to 3 exciting days of data wrangling :) pic.twitter.com/UwxeYphVvh

— Rainer Mesi (@raineralias) December 7, 2016

Countdown to #DST4L 2016 is over, watch the stream of the #datascience program https://t.co/FymD0fDR1H #dst4l #libraryfutures

— DST4L (@DST4L) December 7, 2016

Challenges: colleagues not getting why you need to know this; you may be separated off from other people. So, management are an important target.

But also, DST4L as a service.


The importance of open heritage data - Henriette Roued-Cunliffe

Many people work with data, for example on the internet, so access to datasets is important. The importance of letting go: if you don't, people won't use your locked-up data.

#dst4l @HenrietteRoued speaking about levels of openness https://t.co/rhqUC1LHn4

— Chris Erdmann (@libcce) December 7, 2016

Things to consider:

  1. make metadata that enables stuff to be found online
  2. make your licence obvious
  3. check that the institution actively supports reuse
  4. make the data available in a machine-readable format
  5. make the data available via a well-described API or web service

Much of this is about making experimentation possible, about making it possible to dare to fail forward.


From Counting to Connecting: exploring academia as a complex socio-technical system of information transformation - Pedro Parraguez Ruiz

@parraguezr

How do we go from counting (e.g. a census) to connecting (via complex analysis)? Not something that was, until recently, possible.

Perfection is not the point when you just want to get a sense of the landscape.


Grappling with data - James Baker

Run through of the first two parts of http://data-lessons.github.io/library-shell/


Grappling with data - Thomas Padilla

Tidy Data .. Massive work goes into curating datasets before a workshop, leaving attendees poorly situated to reapply tools/approaches on their data .. So OpenRefine is a way of doing the data prep ..

  • remove inconsistencies
  • split multi-variable fields
  • fix inconsistent terminology
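
The three prep steps above can be sketched in plain Python. This is not OpenRefine and not what was shown in the session; it is a minimal stand-in to make the steps concrete. The record fields (`title`, `subjects`), the `;` separator and the `SYNONYMS` table are all hypothetical.

```python
import re

# Hypothetical lookup of variant terms -> preferred term (an assumption for illustration).
SYNONYMS = {"newspapers": "newspaper", "news paper": "newspaper"}


def normalise(value):
    """Remove inconsistencies: collapse whitespace, trim, lowercase."""
    return re.sub(r"\s+", " ", value).strip().lower()


def split_field(value, sep=";"):
    """Split a multi-variable field into separate, non-empty values."""
    return [part.strip() for part in value.split(sep) if part.strip()]


def tidy(record):
    """Apply all three steps to one record (a dict with hypothetical keys)."""
    subjects = [normalise(s) for s in split_field(record["subjects"])]
    # Fix inconsistent terminology by mapping variants to one preferred term.
    subjects = [SYNONYMS.get(s, s) for s in subjects]
    return {"title": record["title"].strip(), "subjects": subjects}


# Made-up sample records, only to show the shape of the input.
records = [
    {"title": "  The Times ", "subjects": "News Paper;  London "},
    {"title": "Daily Post", "subjects": "newspapers; Wales"},
]

tidy_records = [tidy(r) for r in records]
for row in tidy_records:
    print(row)
```

In OpenRefine the same steps are done interactively (trimming, splitting cells, clustering similar values), which is part of why it suits attendees preparing their own data.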

Data from data .. useful to think about affordances .. value of the data about the thing doesn't negate the value of the thing, it adds to it ..

http://thomaspadilla.org/projects/scaredtodeath/#/5


Some admin...

Creative Commons Licence
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Exceptions: embeds to and from external sources, and direct quotations from speakers
