Skip to content

Instantly share code, notes, and snippets.

View khellan's full-sized avatar

Knut Hellan khellan

View GitHub Profile
Abigail - Nabby, Abby, Gail
Abraham - Abe, Bram
Adelaida - Ida, Idly
Alan - Al
Alastair - Al, Alex
Albert - Al, Bert
Alexander - Alex, Lex, Xander, Sander, Sandy
Alexandra - Alex, Ali, Lexie, Sandy
Alfred - Al, Alf, Alfie, Fred, Fredo
Alonzo - Lonnie
@khellan
khellan / batch_deleter.py
Created September 21, 2018 11:15
Batchwise deletion of malformed HBase row keys. It will not stop when done so it needs monitoring.
import happybase
connection = happybase.Connection(HBASE_MASTER_IP)
table = connection.table(TABLE_NAME)
while True:
batch = table.batch()
for key, _ in table.scan(columns=[COLUMN_NAMES], filter="RowFilter(=, 'regexstring:.*\x09.*')", limit=10000):
batch.delete(key)
batch.send()
print(key)
@khellan
khellan / README.md
Last active March 30, 2020 13:41
Sentencepiece 0.1.85 for Python 3.8 on OSX/Mac

Download the file and install it:

pipenv install <path to local wheel>

There you go.