Skip to content

Instantly share code, notes, and snippets.

View fasiha's full-sized avatar
💭
🧘‍♂️🐻

Ahmed Fasih fasiha

💭
🧘‍♂️🐻
View GitHub Profile
# Put new words in a CSV with this format
# 表層形,左文脈ID,右文脈ID,コスト,品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用形,活用型,原形,読み,発音
# surface_form,left_context_id,right_context_id,cost,part_of_speech,pos_division_1,pos_division_2,pos_division_3,inflection_type,inflection_style,lemma,reading,pronunciation
$ echo "fasihsignal,-1,-1,100,名詞,一般,*,*,*,*,fasihsignal,ファシシグナル,ファシシグナル" > a.csv
# Then use mecab-dict-index to compile the csv into a .dic file, based on an existing mecab dictionary file
$ /usr/local/Cellar/mecab/0.996/libexec/mecab/mecab-dict-index -d/usr/local/Cellar/mecab/0.996/lib/mecab/dic/ipadic/ -u a.dic -f utf8 -t utf8 a.csv
# The use it
$ mecab -ua.dic
@fasiha
fasiha / Dockerfile
Created February 2, 2018 18:15 — forked from klokan/Dockerfile
GDAL in Docker - stable GDAL with JP2KAK, MRSID and ECW: https://registry.hub.docker.com/u/klokantech/gdal/
FROM debian:7
ENV DEBIAN_FRONTEND noninteractive
RUN apt-get -qq update \
&& apt-get -qq -y --no-install-recommends install \
autoconf \
automake \
build-essential \
curl \
@fasiha
fasiha / gist:dd5eb7942c79571dd5895e05f27e8bb9
Created January 18, 2018 02:48 — forked from masayu-a/gist:b3ce862336e47736e84f
English translations of UniDic inflection types, prepared by Irena Srdanovic, 18.1.2013 and 22.1.2013
Inflection type (Ja) Inflection type (En) Inflection type - description (En)
カ行変格 ka_irr kahen_verb.irregular
サ行変格 sa_irr sahen_verb.irregular
ザ行変格 za_irr zahen_verb.irregular
上一段-ア行 V1i.a kamiichidan_verb_i_row.a_column
上一段-カ行 V1i.ka kamiichidan_verb_i_row.ka_column
上一段-ガ行 V1i.ga kamiichidan_verb_i_row.ga_column
上一段-ザ行 V1i.za kamiichidan_verb_i_row.za_column
上一段-タ行 V1i.ta kamiichidan_verb_i_row.ta_column
@fasiha
fasiha / gist:76e542db29b659b3b4ec02d3cd4f2c89
Created January 18, 2018 02:48 — forked from masayu-a/gist:3e11168f9330e2d83a68
English translations of UniDic inflection forms, prepared by Irena Srdanovic, 18.1.2013 and 22.1.2013
Inflected forms (Ja) Inflected forms (En) Inflected forms (En)- description
ク語法 ku_wrd ku_wording
仮定形-一般 Cond.g conditional.general(katei)
仮定形-融合 Cond.int conditional.integrated(katei)
命令形 Imp imperative(meirei)
已然形-一般 Real.g realis.general(izen)
已然形-補助 Real.aux realis.auxiliary(izen)
意志推量形 Vol_tent volitional_tentative(ishi_suiryo)
未然形-サ Irr.sa irrealis.sa(mizen)

Quotes and notes from John K Nelson, “A year in the life of a Shinto shrine".

The practitioners of Shinto hope that anything we can see or since that is full of power, mysterious, marvelous, uncontrolled, strange, or simply beyond our abilities of comprehension is what constitutes the Kami. Therefore, to translate the word as "god "or "deity" is not quite what the Japanese, from ancient times till now, have in mind. P.27

Then suddenly dawned on me – so this is what Shinto holes is divine! Not a text of dubious miracles or what someone maybe said or a particular structure but the actual phenomena of the world itself. P.26

It began an appeal to citizens in need of strength, look, health, new opportunities, a spouse, first pregnancy, protection from sickness and natural disasters, and the perseverance necessary to rebuild shattered lives. We can imagine the comfort of visits to the shrine must have provided at the time – to come from a shantytown structure or damage dwelling (90% of the cities buildings rece

@fasiha
fasiha / packages.md
Last active June 23, 2017 14:42 — forked from sriveravi/packages.md
Apps and packages to install on fresh macOS
@fasiha
fasiha / gist:59fd33c2075b10fdd0df
Created November 9, 2015 01:44 — forked from masayu-a/gist:e3eee0637c07d4019ec9
English Tagset for UniDic - English translations of UniDic POS tags, prepared by Irena Srdanovic, 17.1.2013, checked by Ogiso, Den and Maekawa
PoS(Ja) PoS (En) PoS (En) - descriptions
代名詞 Pron pronoun
副詞 Adv adverb
助動詞 Aux auxiliary_verb
助詞-係助詞 P.bind particle(binding)
助詞-副助詞 P.adv particle(adverbial)
助詞-接続助詞 P.conj particle(conjunctive)
助詞-格助詞 P.case particle(case)
助詞-準体助詞 P.nom particle(nominal)
@fasiha
fasiha / freq.rb
Last active August 29, 2015 14:23 — forked from stephenmac7/freq.rb
# Lemma Frequency Report
# Gem Depends: ve
# System Depends: mecab, mecab-ipadic-utf-8
# Usage: ruby freq.rb [FILE]
require 've'
def main
# Input from stdin or args, UTF-8 required
contents = ARGF.read
# I'm using aozora bunko text as input, so the rubies need to be removed
#!/usr/bin/perl
# This is a script to change KyTea's POS tags in Japanese to English
# abbreviations
use strict;
use utf8;
use Getopt::Long;
use List::Util qw(sum min max shuffle);
binmode STDIN, ":utf8";
binmode STDOUT, ":utf8";
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.