
Marie-José Kolly mkolly

#!/bin/bash
#Marie-José Kolly via NZZ Storytelling, July 2017: How we scraped the publication database of the Basel Committee on Banking Supervision and analyzed their supervisory texts in order to visualize quantity and quality of regulatory text over time
#Feedback welcome by e-mail (marie-jose.kolly[at]nzz.ch) or on Twitter ([at]mjkolly)
#Article presenting results: www.nzz.ch/ld.1304103
#risk per file, corpus 3
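#The rest of this script is not shown here. As a minimal sketch of what a
#"risk per file" count could look like, the function below counts whole-word,
#case-insensitive occurrences of "risk" in each .txt file of a directory.
#The function name, the directory argument and the .txt layout are assumptions
#made for illustration, not taken from the original script.

```shell
# Hypothetical helper: print "<file name><TAB><count of 'risk'>" for each .txt file.
count_risk_per_file() {
  local dir="$1"   # assumed: a directory of plain-text files, one per publication
  local f n
  for f in "$dir"/*.txt; do
    [ -e "$f" ] || continue   # skip if the glob matched nothing
    # -o prints each match on its own line, -i ignores case, -w matches whole words only
    n=$(grep -o -i -w 'risk' "$f" | wc -l | tr -d ' ')
    printf '%s\t%s\n' "$(basename "$f")" "$n"
  done
}
```

#Usage (directory name is a placeholder): count_risk_per_file corpus3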
#Marie-José Kolly via NZZ Storytelling, July 2017: How we scraped the publication database of the Basel Committee on Banking Supervision and analyzed their supervisory texts in order to visualize quantity and quality of regulatory text over time
#Feedback welcome by e-mail (marie-jose.kolly[at]nzz.ch) or on Twitter ([at]mjkolly)
#Article presenting results: www.nzz.ch/ld.1304103
library(rvest)
library(pdftools)
library(tm)
library(ggplot2)
library(magrittr)
#NZZ Storytelling, January 2017: This describes how we scraped the FIS database in order to visualize and categorize skiers' careers
#Feedback welcome by e-mail (marie-jose.kolly[at]nzz.ch) or on Twitter ([at]mjkolly)
#Main article that presents graphics and other results: http://nzz.ch/-ld.139656
#Article that describes our methods: http://nzz.ch/-ld.142634
#load some useful packages
library(rvest)
library(directlabels)