Skip to content

Instantly share code, notes, and snippets.

Keybase proof

I hereby claim:

  • I am datawookie on github.
  • I am datawookie (https://keybase.io/datawookie) on keybase.
  • I have a public key whose fingerprint is F835 3643 5C64 6895 617E B3D1 196E B595 5F70 8E76

To claim this, I am signing this object:

install.packages(c(
"aws.s3",
"broom",
"CEoptim",
"fst",
"furrr",
"geosphere",
"ggmap",
"gmapsdistance",
"googledrive",
@datawookie
datawookie / unicode symbols
Last active April 8, 2024 15:29
Useful Unicode characters
https://github.com/muan/unicode-emoji-json
° — degree
× — times
💣 — bomb
💥 — explosion
🔥 — fire
🎉 — tada
🧯 — fire extinguisher
@datawookie
datawookie / quotes-data.txt
Last active July 11, 2020 03:23
Quotes suitable for many occasions
But invariably, simple models and a lot of data trump more elaborate models based on less data.
— Halevy, Alon, Peter Norvig, and Fernando Pereira. 2009. The Unreasonable Effectiveness of Data. IEEE Intelligent Systems 24 (2) (March): 8-12. doi:10.1109/MIS.2009.36.
Data is not information, information is not knowledge, knowledge is not understanding, understanding is not wisdom.
— Clifford Stoll
Errors using inadequate data are much less than those using no data at all.
— Charles Babbage
I never guess. It is a capital mistake to theorize before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts.
@datawookie
datawookie / springer-books.R
Last active March 1, 2020 17:04
Springer Books
library(DBI)
library(readr)
library(htmlTable)
library(dplyr)
library(janitor)
springer_search <- read_csv("https://link.springer.com/search/csv?previous-end-year=2020&facet-content-type=%22Book%22&date-facet-mode=in&previous-start-year=2020&facet-language=%22En%22&showAll=false&query=&facet-start-year=2020&facet-end-year=2020")
springer_search <- springer_search %>%
clean_names() %>%
@datawookie
datawookie / s3-backup.sh
Last active January 16, 2019 02:23
Compress file and copy to S3
#!/bin/bash
# $ s3-backup data-file.txt s3://my-backup-folder
#
# Install under /usr/bin/ if it will be run from cron.
FILEPATH=$1
FILENAME=`basename $FILEPATH`
FILEDATE=`date +%Y%m%d-%H%M`-$FILENAME
@datawookie
datawookie / dated-backup.sh
Last active November 7, 2018 09:44
Created a dated version of a file and persist on S3.
#!/bin/bash
# Created a dated version of a file and persist on S3.
#
# Usage:
#
# dated-backup.sh file-to-backup.txt s3://name-of-bucket/
FILE=$1
BUCKET=$2
functions { # User-defined functions [Optional]
}
data { # Data (external) [Optional]
}
transformed data { # Preprocess data [Optional]
}
parameters { # Model parameters (must precede model) [Mandatory]
}
transformed parameters { # Preprocess parameters [Optional]
@datawookie
datawookie / Productivity Hacks - Buffon Needle.md
Last active March 17, 2018 03:43
Files for Productivity Hacks talk.

Some files for talk about Productivity Hacks.

Setup:

Launch a t2.2xlarge instance (8 cores) on AWS.

  1. Start a tmux session. Create two horizontal panes (Ctrl-b ").
  2. sudo apt update && sudo apt install -y htop r-base
  3. wget -O buffon-needle.R http://bit.ly/2HieqWc
@datawookie
datawookie / Productivity Hacks - Travelling Salesman.md
Last active February 19, 2018 12:36
Files for Productivity Hacks talk.

Some files for talk about Productivity Hacks.