Skip to content

Instantly share code, notes, and snippets.

@kjschiroo
kjschiroo / keybase.md
Created January 30, 2023 16:54
keybase.md

Keybase proof

I hereby claim:

  • I am kjschiroo on github.
  • I am kjschiroo (https://keybase.io/kjschiroo) on keybase.
  • I have a public key ASDBgNOpUy1RA94eJOPJTVKh2f5oWByEBZMpK51-hMhVUQo

To claim this, I am signing this object:

@kjschiroo
kjschiroo / parquet.py
Created June 7, 2019 03:58
Writing a parquet file with pyarrow
import pyarrow as pa
import pyarrow.parquet as pq
column_1 = pa.array([1, 4, 8], pa.int32())
column_2 = pa.array([True, False, True], pa.bool_())
data = [column_1, column_2]
names = ['my_int_column', 'my_bool_column']
batch = pa.RecordBatch.from_arrays(data, names)
table = pa.Table.from_batches([batch])
@kjschiroo
kjschiroo / my_code.py
Created November 5, 2018 22:35
Self-mocking: my_code.py
import streamer
# put records onto some kinesis stream
streamer.put_records(records)
@kjschiroo
kjschiroo / insta
Last active September 30, 2017 21:37
Instantly start up an instance
insta () {
docker run --rm -it $1 /bin/bash || docker run --rm -it $1 /bin/sh
}
@kjschiroo
kjschiroo / dock
Last active September 30, 2017 21:35
A shell function to quickly connect to a docker container
dock () {
docker exec -it $1 /bin/bash || docker exec -it $1 /bin/sh;
}
module load soft/python/anaconda;
pip install --user mwapi;
pip install --user mwparserfromhell;
pip install --user git+git://github.com/kjschiroo/wikirefclassify.git;
pip install --user git+git://github.com/kjschiroo/wikirefs.git;
CREATE TABLE student_visual_edits
SELECT visual.date as date, visual.edits AS visual_edits, total.edits AS total_edits
FROM (
SELECT DATE(r.rev_timestamp) as date, Count(*) as edits
FROM enwiki_p.revision_userindex as r
JOIN students as s
ON r.rev_user_text = s.name
JOIN enwiki_p.change_tag as t
ON t.ct_rev_id = r.rev_id
WHERE t.ct_tag LIKE "visualeditor%"
CREATE TABLE labeled_page
SELECT
article.page_id,
article.page_namespace,
article.page_title,
cl.cl_type AS category_type,
cl.cl_to AS label,
DATE_FORMAT(cl.cl_timestamp, "%Y%m%d%H%i%S") AS last_update
FROM enwiki_p.categorylinks cl
JOIN enwiki_p.page tp ON
@kjschiroo
kjschiroo / jupyter_tunnel.sh
Created May 20, 2016 18:50
Start jupyter on remote server and start a tunnel to that server (@ UMN)
# start jupyter on remote server at umn
farpy()
{
ssh $1 "module load soft/python/anaconda;
cd ${1}_scratch/notebooks;
screen -S jupyter -d -m sh;
screen -S jupyter -X stuff \"jupyter notebook --no-browser --port=4258\n\"";
ssh -N -f -L localhost:8888:localhost:4258 $1;
xdg-open http://localhost:8888;
setterm -term linux -back red -clear;