IKEGAMI Yukino ikegami-yukino
@ikegami-yukino
ikegami-yukino / install_sentencepiece_on_mac.sh
Last active July 5, 2018 04:21
Install SentencePiece on macOS
brew install autoconf automake libtool protobuf
pushd .
git clone --depth=1 https://github.com/google/sentencepiece.git /tmp/sentencepiece
cd /tmp/sentencepiece
perl -i -pe 's/libtoolize/glibtoolize/' autogen.sh
./autogen.sh
./configure
make
make check
@ikegami-yukino
ikegami-yukino / mecab-skkserv.sh
Last active March 21, 2017 17:02
Installing mecab-skkserv on macOS Sierra
wget http://www.chasen.org/~taku/software/mecab-skkserv/mecab-skkserv-0.03.tar.gz
tar xzf mecab-skkserv-0.03.tar.gz
cd mecab-skkserv-0.03
ls *|xargs nkf -w --overwrite
./configure --with-charset=utf8
echo 'cost-factor = 700' >>dicrc
perl -i -ne '$i++; print if ($i != 36 && $i != 37 && $i != 38 && $i != 44 && $i != 45 && $i != 46 && $i != 47 && $i != 48)' mecab-skkserv.cpp
make
make install
@ikegami-yukino
ikegami-yukino / steps.sh
Last active April 30, 2020 04:24 — forked from albertstartup/steps.sh
AWS GPU, Ubuntu 16.04, NVIDIA driver 367, CUDA 8
# Required download
# cudnn-8.0-linux-x64-v5.1.tgz
curl -L -o cuda_8.0.44_linux.run https://developer.nvidia.com/compute/cuda/8.0/prod/local_installers/cuda_8.0.44_linux-run
curl -L -O http://us.download.nvidia.com/XFree86/Linux-x86_64/367.27/NVIDIA-Linux-x86_64-367.27.run
sudo apt-get install build-essential
sudo apt-get install "linux-image-extra-$(uname -r)"
sudo sh cuda_8.0.44_linux.run
echo -e "export CUDA_HOME=/usr/local/cuda\nexport PATH=\$PATH:\$CUDA_HOME/bin\nexport LD_LIBRARY_PATH=\$LD_LIBRARY_PATH:\$CUDA_HOME/lib64" >> ~/.bashrc
@ikegami-yukino
ikegami-yukino / unidic_yomi.py
Last active January 24, 2017 05:11
Extract pairs of alphabetic words and their readings from UniDic
import re
import os
import glob
re_pair = re.compile(r"^([ァ-ンー]+)-([a-zA-Z '\-()]+)")
UNIDIC_PATH = 'path to UniDic directory'
with open('result.tsv', 'w') as out_fd:
    for csvfile in glob.glob(os.path.join(UNIDIC_PATH, '*.csv')):
        with open(csvfile) as dic_fd:
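The preview above is cut off before the pattern is applied, so here is a minimal sketch of what `re_pair` captures. The helper name `extract_pair` and the sample strings are hypothetical, and which UniDic CSV column the original script matches against is not shown, so treat the inputs as assumptions.

```python
import re

# Same pattern as in the gist, written as a raw string.
re_pair = re.compile(r"^([ァ-ンー]+)-([a-zA-Z '\-()]+)")


def extract_pair(text):
    """Return (alphabetic word, katakana reading), or None if text does not match.

    Hypothetical helper: the original script's loop body is not visible.
    """
    m = re_pair.match(text)
    if m is None:
        return None
    reading, word = m.group(1), m.group(2)
    return word.strip(), reading


# Hypothetical inputs in a "reading-word" shape.
print(extract_pair("アップル-apple"))
print(extract_pair("ニューヨーク-New York,名詞"))
print(extract_pair("林檎"))
```

The second group stops at the first character outside the listed set, so a trailing comma-separated field is left out of the match.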
#!/usr/bin/env bash
# from "Using the morphological analysis system JUMAN with Python 3"
# https://abeerforyou.com/?p=715
set -eu
pushd . > /dev/null
cd /tmp
curl -L -o juman-7.01.tar.bz2 'http://nlp.ist.i.kyoto-u.ac.jp/DLcounter/lime.cgi?down=http://nlp.ist.i.kyoto-u.ac.jp/nl-resource/juman/juman-7.01.tar.bz2&name=juman-7.01.tar.bz2'
--- linear.cpp 2015-09-27 07:03:33.000000000 +0900
+++ new_linear.cpp 2016-04-09 01:32:23.000000000 +0900
@@ -2685,9 +2685,10 @@ double predict_probability(const struct
         double label=predict_values(model_, x, prob_estimates);
         for(i=0;i<nr_w;i++)
-                prob_estimates[i]=1/(1+exp(-prob_estimates[i]));
+                prob_estimates[i]=exp(prob_estimates[i]);
         if(nr_class==2) // for binary classification
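To make the effect of the changed line concrete, the sketch below compares the two per-score transforms in plain Python. It assumes the transformed scores are afterwards divided by their sum, a step the excerpt above does not show. Under that assumption the patched line yields a softmax over the decision values. All function names are hypothetical.

```python
import math


def sigmoid_scores(decision_values):
    """Original line: squash each decision value with the logistic function."""
    return [1.0 / (1.0 + math.exp(-v)) for v in decision_values]


def exp_scores(decision_values):
    """Patched line: exponentiate each decision value instead."""
    return [math.exp(v) for v in decision_values]


def normalize(scores):
    """Assumed follow-up step (not in the excerpt): divide by the sum."""
    total = sum(scores)
    return [s / total for s in scores]


values = [1.0, 2.0, 3.0]  # hypothetical decision values for three classes
print(normalize(sigmoid_scores(values)))
print(normalize(exp_scores(values)))  # softmax of the decision values
```

The sigmoid variant compresses large gaps between decision values before normalizing, while the exponential variant preserves them, so the top class gets a larger share of the probability mass.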
@ikegami-yukino
ikegami-yukino / RandomForest2Code.ipynb
Created January 8, 2016 20:16
Convert a scikit-learn RandomForest model to Python code
@ikegami-yukino
ikegami-yukino / DecisionTree2Code.ipynb
Created January 8, 2016 19:25
Convert a scikit-learn decision tree to rule-based code
@ikegami-yukino
ikegami-yukino / check_es_synonym.py
Last active March 26, 2020 04:56
Code for checking an Elasticsearch synonym dictionary
import os
import shutil
import tempfile
import tcptest
from elasticsearch import Elasticsearch
SYNONYMS_PATH = "/tmp/wikipedia_synonym.txt"
settings = {