Skip to content

Instantly share code, notes, and snippets.

View ikegami-yukino's full-sized avatar

IKEGAMI Yukino ikegami-yukino

View GitHub Profile
@ikegami-yukino
ikegami-yukino / madoka_bayes.py
Last active August 29, 2015 13:57
Standard Naive Bayes and Complement Naive Bayes using madoka
#-*- coding: utf-8 -*-
import numpy as np
from collections import Counter, defaultdict
import madoka
NUM_DOCS_INDEX = '[[NUM_DOCS]]'
ALL_WORD_INDEX = '[[ALL]]'
class TFIDF(object):
@ikegami-yukino
ikegami-yukino / pig_.sh
Last active August 29, 2015 14:04
Apache Pig Installation on Ubuntu
wget http://ftp.kddilabs.jp/infosystems/apache/pig/latest/pig-0.13.0.tar.gz
tar -xvf pig-0.13.0.tar.gz
sudo mv pig-0.13.0 /usr/local/pig
rm pig-0.13.0.tar.gz
echo 'export PIG_HOME=/usr/local/pig' >> ~/.bashrc
echo 'export PATH=$PATH:$PIG_HOME/bin' >> ~/.bashrc
echo 'export PIG_CLASSPATH=$HADOOP_HOME/conf/' >> ~/.bashrc
source ~/.bashrc
pig -h
@ikegami-yukino
ikegami-yukino / vim_pig.sh
Last active August 29, 2015 14:07
Pig Latin syntax coloring for Vim
#!/bin/sh
git clone https://github.com/motus/pig.vim.git /tmp/pig.vim
mkdir ~/.vim/syntax/
mkdir ~/.vim/ftdetect/
cp /tmp/pig.vim/syntax/pig.vim ~/.vim/syntax/
cp /tmp/pig.vim/ftdetect/pig.vim ~/.vim/ftdetect/
rm -r /tmp/pig.vim
@ikegami-yukino
ikegami-yukino / longest_contiguous_common_subsequence.py
Created October 15, 2014 09:59
Longest Contiguous Common Subsequence
def to_ngrams(s, minimum_n):
"""Generate n-grams (len(string) >= n >= minimum) from string
Params:
<str> s
<int> minimum
Return:
<set <str>> ngrams
"""
ngrams = []
length = len(s)
@ikegami-yukino
ikegami-yukino / file0.txt
Last active August 29, 2015 14:13
PythonでMeCabの制約付き解析を使う ref: http://qiita.com/yukinoi/items/4e7afb5e72b3a46da0f2
# -*- coding: utf-8 -*-
import re
import MeCab
from MeCab import MECAB_ANY_BOUNDARY, MECAB_INSIDE_TOKEN, MECAB_TOKEN_BOUNDARY
DICINFO_KEYS = ('charset', 'filename', 'lsize', 'rsize', 'size', 'type', 'version')
class Tagger(MeCab.Tagger):
'''
DO WHAT THE FUCK YOU WANT TO PUBLIC LICENSE
Version 2, December 2004
Copyright (C) 2004 Sam Hocevar <sam@hocevar.net>
Everyone is permitted to copy and distribute verbatim or modified
copies of this license document, and changing it is allowed as long
as the name is changed.
@ikegami-yukino
ikegami-yukino / jpcommon_label.c.patch
Created February 26, 2015 18:44
OpenJtalk 1.08 の間延びするバグ修正パッチ
*** jpcommon/jpcommon_label.c.old 2015-02-27 03:35:32.000000000 +0900
--- jpcommon/jpcommon_label.c 2015-02-27 03:36:34.000000000 +0900
*************** static int index_accent_phrase_in_breath
*** 296,301 ****
--- 296,302 ----
if (index == a)
break;
}
+ if (i > 3) i = 3;
return i;
@ikegami-yukino
ikegami-yukino / mac_notify.sh
Last active August 29, 2015 14:17
Send message to the Mac OSX notification center
function notify() {
echo -e "display notification \"$1\" with title \"Terminal\" subtitle \"`date +"%F %T"`\"" | osascript
}
session = requests.Session()
session.mount('http://', requests.adapters.HTTPAdapter(max_retries=3))
session.mount('https://', requests.adapters.HTTPAdapter(max_retries=3))
response = session.get(lp, headers={'User-agent': UA}, timeout=1)
@ikegami-yukino
ikegami-yukino / Grass.itermcolors
Created May 5, 2015 07:25
Mac Terminal Grass style for iTerm2
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
<key>Ansi 0 Color</key>
<dict>
<key>Blue Component</key>
<real>0.0</real>
<key>Green Component</key>
<real>0.0</real>