PoS(Ja) | PoS (En) | PoS (En) - descriptions |
---|---|---|
代名詞 | Pron | pronoun |
副詞 | Adv | adverb |
助動詞 | Aux | auxiliary_verb |
助詞-係助詞 | P.bind | particle(binding) |
助詞-副助詞 | P.adv | particle(adverbial) |
助詞-接続助詞 | P.conj | particle(conjunctive) |
助詞-格助詞 | P.case | particle(case) |
助詞-準体助詞 | P.nom | particle(nominal) |
# Put new words in a CSV with this format | |
# 表層形,左文脈ID,右文脈ID,コスト,品詞,品詞細分類1,品詞細分類2,品詞細分類3,活用形,活用型,原形,読み,発音 | |
# surface_form,left_context_id,right_context_id,cost,part_of_speech,pos_division_1,pos_division_2,pos_division_3,inflection_type,inflection_style,lemma,reading,pronunciation | |
$ echo "fasihsignal,-1,-1,100,名詞,一般,*,*,*,*,fasihsignal,ファシシグナル,ファシシグナル" > a.csv | |
# Then use mecab-dict-index to compile the csv into a .dic file, based on an existing mecab dictionary file | |
$ /usr/local/Cellar/mecab/0.996/libexec/mecab/mecab-dict-index -d/usr/local/Cellar/mecab/0.996/lib/mecab/dic/ipadic/ -u a.dic -f utf8 -t utf8 a.csv | |
# The use it | |
$ mecab -ua.dic |
FROM debian:7 | |
ENV DEBIAN_FRONTEND noninteractive | |
RUN apt-get -qq update \ | |
&& apt-get -qq -y --no-install-recommends install \ | |
autoconf \ | |
automake \ | |
build-essential \ | |
curl \ |
Inflection type (Ja) | Inflection type (En) | Inflection type - description (En) |
---|---|---|
カ行変格 | ka_irr | kahen_verb.irregular |
サ行変格 | sa_irr | sahen_verb.irregular |
ザ行変格 | za_irr | zahen_verb.irregular |
上一段-ア行 | V1i.a | kamiichidan_verb_i_row.a_column |
上一段-カ行 | V1i.ka | kamiichidan_verb_i_row.ka_column |
上一段-ガ行 | V1i.ga | kamiichidan_verb_i_row.ga_column |
上一段-ザ行 | V1i.za | kamiichidan_verb_i_row.za_column |
上一段-タ行 | V1i.ta | kamiichidan_verb_i_row.ta_column |
Inflected forms (Ja) | Inflected forms (En) | Inflected forms (En)- description |
---|---|---|
ク語法 | ku_wrd | ku_wording |
仮定形-一般 | Cond.g | conditional.general(katei) |
仮定形-融合 | Cond.int | conditional.integrated(katei) |
命令形 | Imp | imperative(meirei) |
已然形-一般 | Real.g | realis.general(izen) |
已然形-補助 | Real.aux | realis.auxiliary(izen) |
意志推量形 | Vol_tent | volitional_tentative(ishi_suiryo) |
未然形-サ | Irr.sa | irrealis.sa(mizen) |
Quotes and notes from John K Nelson, “A year in the life of a Shinto shrine".
The practitioners of Shinto hope that anything we can see or since that is full of power, mysterious, marvelous, uncontrolled, strange, or simply beyond our abilities of comprehension is what constitutes the Kami. Therefore, to translate the word as "god "or "deity" is not quite what the Japanese, from ancient times till now, have in mind. P.27
Then suddenly dawned on me – so this is what Shinto holes is divine! Not a text of dubious miracles or what someone maybe said or a particular structure but the actual phenomena of the world itself. P.26
It began an appeal to citizens in need of strength, look, health, new opportunities, a spouse, first pregnancy, protection from sickness and natural disasters, and the perseverance necessary to rebuild shattered lives. We can imagine the comfort of visits to the shrine must have provided at the time – to come from a shantytown structure or damage dwelling (90% of the cities buildings rece
- Homebrew
- Ruby https://www.ruby-lang.org/en/
- Octave https://www.gnu.org/software/octave/
- MacVim http://www.vim.org/
- Python https://www.python.org/
- Python3 https://www.python.org/
- pip
- jupyter
- pygments
- numpy
# Lemma Frequency Report | |
# Gem Depends: ve | |
# System Depends: mecab, mecab-ipadic-utf-8 | |
# Usage: ruby freq.rb [FILE] | |
require 've' | |
def main | |
# Input from stdin or args, UTF-8 required | |
contents = ARGF.read | |
# I'm using aozora bunko text as input, so the rubies need to be removed |
#!/usr/bin/perl | |
# This is a script to change KyTea's POS tags in Japanese to English | |
# abbreviations | |
use strict; | |
use utf8; | |
use Getopt::Long; | |
use List::Util qw(sum min max shuffle); | |
binmode STDIN, ":utf8"; | |
binmode STDOUT, ":utf8"; |