Skip to content

Instantly share code, notes, and snippets.

@hnishi
Created July 19, 2020 14:27
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save hnishi/610bb552bf2a0a3c429f3844c5d66559 to your computer and use it in GitHub Desktop.
Save hnishi/610bb552bf2a0a3c429f3844c5d66559 to your computer and use it in GitHub Desktop.
mecab-ipadic-NEologd-in-colab.ipynb
Display the source blob
Display the rendered blob
Raw
{
"nbformat": 4,
"nbformat_minor": 0,
"metadata": {
"colab": {
"name": "mecab-ipadic-NEologd-in-colab.ipynb",
"provenance": [],
"collapsed_sections": [],
"toc_visible": true,
"authorship_tag": "ABX9TyOg7k3bYz7/IqjFOWEa6AmG",
"include_colab_link": true
},
"kernelspec": {
"name": "python3",
"display_name": "Python 3"
}
},
"cells": [
{
"cell_type": "markdown",
"metadata": {
"id": "view-in-github",
"colab_type": "text"
},
"source": [
"<a href=\"https://colab.research.google.com/gist/hnishi/610bb552bf2a0a3c429f3844c5d66559/mecab-ipadic-neologd-in-colab.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "3-Lh16_xJR1e",
"colab_type": "text"
},
"source": [
"# mecab-ipadic-NEologd を google colab で使う"
]
},
{
"cell_type": "code",
"metadata": {
"id": "Dz7598vvMs_c",
"colab_type": "code",
"colab": {
"base_uri": "https://localhost:8080/",
"height": 1000
},
"outputId": "4cf92818-0f95-4116-9856-bd0b8bac4b44"
},
"source": [
"# MeCabのインストール\n",
"!apt install mecab libmecab-dev mecab-ipadic-utf8\n",
"!pip install mecab-python3\n",
"\n",
"# mecab-ipadic-NEologdのインストール\n",
"!apt install git make curl xz-utils file\n",
"!git clone --depth 1 https://github.com/neologd/mecab-ipadic-neologd.git\n",
"!echo yes | mecab-ipadic-neologd/bin/install-mecab-ipadic-neologd -n -a\n",
"\n",
"# Ref: https://qiita.com/Fulltea/items/90f6ebe6dcceaf64eaef\n",
"# Ref: https://qiita.com/SUZUKI_Masaya/items/685000d569452585210c\n",
"\n",
"!ln -s /etc/mecabrc /usr/local/etc/mecabrc\n",
"# Ref: https://qiita.com/Naritoshi/items/8f55d7d5cce9ce414395"
],
"execution_count": 2,
"outputs": [
{
"output_type": "stream",
"text": [
"Reading package lists... Done\n",
"Building dependency tree \n",
"Reading state information... Done\n",
"The following package was automatically installed and is no longer required:\n",
" libnvidia-common-440\n",
"Use 'apt autoremove' to remove it.\n",
"The following additional packages will be installed:\n",
" libmecab2 mecab-ipadic mecab-jumandic mecab-jumandic-utf8 mecab-utils\n",
"The following NEW packages will be installed:\n",
" libmecab-dev libmecab2 mecab mecab-ipadic mecab-ipadic-utf8 mecab-jumandic\n",
" mecab-jumandic-utf8 mecab-utils\n",
"0 upgraded, 8 newly installed, 0 to remove and 35 not upgraded.\n",
"Need to get 29.0 MB of archives.\n",
"After this operation, 277 MB of additional disk space will be used.\n",
"Get:1 http://archive.ubuntu.com/ubuntu bionic/universe amd64 libmecab2 amd64 0.996-5 [257 kB]\n",
"Get:2 http://archive.ubuntu.com/ubuntu bionic/universe amd64 libmecab-dev amd64 0.996-5 [308 kB]\n",
"Get:3 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab-utils amd64 0.996-5 [4,856 B]\n",
"Get:4 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab-jumandic-utf8 all 7.0-20130310-4 [16.2 MB]\n",
"Get:5 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab-jumandic all 7.0-20130310-4 [2,212 B]\n",
"Get:6 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab-ipadic all 2.7.0-20070801+main-1 [12.1 MB]\n",
"Get:7 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab amd64 0.996-5 [132 kB]\n",
"Get:8 http://archive.ubuntu.com/ubuntu bionic/universe amd64 mecab-ipadic-utf8 all 2.7.0-20070801+main-1 [3,522 B]\n",
"Fetched 29.0 MB in 2s (15.9 MB/s)\n",
"Selecting previously unselected package libmecab2:amd64.\n",
"(Reading database ... 144465 files and directories currently installed.)\n",
"Preparing to unpack .../0-libmecab2_0.996-5_amd64.deb ...\n",
"Unpacking libmecab2:amd64 (0.996-5) ...\n",
"Selecting previously unselected package libmecab-dev.\n",
"Preparing to unpack .../1-libmecab-dev_0.996-5_amd64.deb ...\n",
"Unpacking libmecab-dev (0.996-5) ...\n",
"Selecting previously unselected package mecab-utils.\n",
"Preparing to unpack .../2-mecab-utils_0.996-5_amd64.deb ...\n",
"Unpacking mecab-utils (0.996-5) ...\n",
"Selecting previously unselected package mecab-jumandic-utf8.\n",
"Preparing to unpack .../3-mecab-jumandic-utf8_7.0-20130310-4_all.deb ...\n",
"Unpacking mecab-jumandic-utf8 (7.0-20130310-4) ...\n",
"Selecting previously unselected package mecab-jumandic.\n",
"Preparing to unpack .../4-mecab-jumandic_7.0-20130310-4_all.deb ...\n",
"Unpacking mecab-jumandic (7.0-20130310-4) ...\n",
"Selecting previously unselected package mecab-ipadic.\n",
"Preparing to unpack .../5-mecab-ipadic_2.7.0-20070801+main-1_all.deb ...\n",
"Unpacking mecab-ipadic (2.7.0-20070801+main-1) ...\n",
"Selecting previously unselected package mecab.\n",
"Preparing to unpack .../6-mecab_0.996-5_amd64.deb ...\n",
"Unpacking mecab (0.996-5) ...\n",
"Selecting previously unselected package mecab-ipadic-utf8.\n",
"Preparing to unpack .../7-mecab-ipadic-utf8_2.7.0-20070801+main-1_all.deb ...\n",
"Unpacking mecab-ipadic-utf8 (2.7.0-20070801+main-1) ...\n",
"Setting up libmecab2:amd64 (0.996-5) ...\n",
"Setting up mecab-utils (0.996-5) ...\n",
"Setting up mecab-ipadic (2.7.0-20070801+main-1) ...\n",
"Compiling IPA dictionary for Mecab. This takes long time...\n",
"reading /usr/share/mecab/dic/ipadic/unk.def ... 40\n",
"emitting double-array: 100% |###########################################| \n",
"/usr/share/mecab/dic/ipadic/model.def is not found. skipped.\n",
"reading /usr/share/mecab/dic/ipadic/Postp.csv ... 146\n",
"reading /usr/share/mecab/dic/ipadic/Auxil.csv ... 199\n",
"reading /usr/share/mecab/dic/ipadic/Prefix.csv ... 221\n",
"reading /usr/share/mecab/dic/ipadic/Postp-col.csv ... 91\n",
"reading /usr/share/mecab/dic/ipadic/Symbol.csv ... 208\n",
"reading /usr/share/mecab/dic/ipadic/Noun.number.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adj.csv ... 27210\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adverbal.csv ... 795\n",
"reading /usr/share/mecab/dic/ipadic/Noun.verbal.csv ... 12146\n",
"reading /usr/share/mecab/dic/ipadic/Noun.place.csv ... 72999\n",
"reading /usr/share/mecab/dic/ipadic/Interjection.csv ... 252\n",
"reading /usr/share/mecab/dic/ipadic/Noun.name.csv ... 34202\n",
"reading /usr/share/mecab/dic/ipadic/Filler.csv ... 19\n",
"reading /usr/share/mecab/dic/ipadic/Noun.nai.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adverb.csv ... 3032\n",
"reading /usr/share/mecab/dic/ipadic/Noun.csv ... 60477\n",
"reading /usr/share/mecab/dic/ipadic/Verb.csv ... 130750\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adjv.csv ... 3328\n",
"reading /usr/share/mecab/dic/ipadic/Suffix.csv ... 1393\n",
"reading /usr/share/mecab/dic/ipadic/Noun.proper.csv ... 27327\n",
"reading /usr/share/mecab/dic/ipadic/Others.csv ... 2\n",
"reading /usr/share/mecab/dic/ipadic/Noun.demonst.csv ... 120\n",
"reading /usr/share/mecab/dic/ipadic/Adnominal.csv ... 135\n",
"reading /usr/share/mecab/dic/ipadic/Noun.org.csv ... 16668\n",
"reading /usr/share/mecab/dic/ipadic/Conjunction.csv ... 171\n",
"reading /usr/share/mecab/dic/ipadic/Noun.others.csv ... 151\n",
"emitting double-array: 100% |###########################################| \n",
"reading /usr/share/mecab/dic/ipadic/matrix.def ... 1316x1316\n",
"emitting matrix : 100% |###########################################| \n",
"\n",
"done!\n",
"update-alternatives: using /var/lib/mecab/dic/ipadic to provide /var/lib/mecab/dic/debian (mecab-dictionary) in auto mode\n",
"Setting up libmecab-dev (0.996-5) ...\n",
"Setting up mecab-jumandic-utf8 (7.0-20130310-4) ...\n",
"Compiling Juman dictionary for Mecab.\n",
"reading /usr/share/mecab/dic/juman/unk.def ... 37\n",
"emitting double-array: 100% |###########################################| \n",
"reading /usr/share/mecab/dic/juman/Postp.csv ... 108\n",
"reading /usr/share/mecab/dic/juman/Demonstrative.csv ... 97\n",
"reading /usr/share/mecab/dic/juman/Prefix.csv ... 90\n",
"reading /usr/share/mecab/dic/juman/Noun.keishiki.csv ... 8\n",
"reading /usr/share/mecab/dic/juman/Noun.suusi.csv ... 49\n",
"reading /usr/share/mecab/dic/juman/Assert.csv ... 34\n",
"reading /usr/share/mecab/dic/juman/Wikipedia.csv ... 167709\n",
"reading /usr/share/mecab/dic/juman/Special.csv ... 158\n",
"reading /usr/share/mecab/dic/juman/Noun.hukusi.csv ... 81\n",
"reading /usr/share/mecab/dic/juman/Emoticon.csv ... 972\n",
"reading /usr/share/mecab/dic/juman/ContentW.csv ... 551145\n",
"reading /usr/share/mecab/dic/juman/AuxV.csv ... 593\n",
"reading /usr/share/mecab/dic/juman/Suffix.csv ... 2128\n",
"reading /usr/share/mecab/dic/juman/Auto.csv ... 18931\n",
"reading /usr/share/mecab/dic/juman/Rengo.csv ... 1118\n",
"reading /usr/share/mecab/dic/juman/Noun.koyuu.csv ... 7964\n",
"emitting double-array: 100% |###########################################| \n",
"reading /usr/share/mecab/dic/juman/matrix.def ... 1876x1876\n",
"emitting matrix : 100% |###########################################| \n",
"\n",
"done!\n",
"Setting up mecab-ipadic-utf8 (2.7.0-20070801+main-1) ...\n",
"Compiling IPA dictionary for Mecab. This takes long time...\n",
"reading /usr/share/mecab/dic/ipadic/unk.def ... 40\n",
"emitting double-array: 100% |###########################################| \n",
"/usr/share/mecab/dic/ipadic/model.def is not found. skipped.\n",
"reading /usr/share/mecab/dic/ipadic/Postp.csv ... 146\n",
"reading /usr/share/mecab/dic/ipadic/Auxil.csv ... 199\n",
"reading /usr/share/mecab/dic/ipadic/Prefix.csv ... 221\n",
"reading /usr/share/mecab/dic/ipadic/Postp-col.csv ... 91\n",
"reading /usr/share/mecab/dic/ipadic/Symbol.csv ... 208\n",
"reading /usr/share/mecab/dic/ipadic/Noun.number.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adj.csv ... 27210\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adverbal.csv ... 795\n",
"reading /usr/share/mecab/dic/ipadic/Noun.verbal.csv ... 12146\n",
"reading /usr/share/mecab/dic/ipadic/Noun.place.csv ... 72999\n",
"reading /usr/share/mecab/dic/ipadic/Interjection.csv ... 252\n",
"reading /usr/share/mecab/dic/ipadic/Noun.name.csv ... 34202\n",
"reading /usr/share/mecab/dic/ipadic/Filler.csv ... 19\n",
"reading /usr/share/mecab/dic/ipadic/Noun.nai.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adverb.csv ... 3032\n",
"reading /usr/share/mecab/dic/ipadic/Noun.csv ... 60477\n",
"reading /usr/share/mecab/dic/ipadic/Verb.csv ... 130750\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adjv.csv ... 3328\n",
"reading /usr/share/mecab/dic/ipadic/Suffix.csv ... 1393\n",
"reading /usr/share/mecab/dic/ipadic/Noun.proper.csv ... 27327\n",
"reading /usr/share/mecab/dic/ipadic/Others.csv ... 2\n",
"reading /usr/share/mecab/dic/ipadic/Noun.demonst.csv ... 120\n",
"reading /usr/share/mecab/dic/ipadic/Adnominal.csv ... 135\n",
"reading /usr/share/mecab/dic/ipadic/Noun.org.csv ... 16668\n",
"reading /usr/share/mecab/dic/ipadic/Conjunction.csv ... 171\n",
"reading /usr/share/mecab/dic/ipadic/Noun.others.csv ... 151\n",
"emitting double-array: 100% |###########################################| \n",
"reading /usr/share/mecab/dic/ipadic/matrix.def ... 1316x1316\n",
"emitting matrix : 100% |###########################################| \n",
"\n",
"done!\n",
"update-alternatives: using /var/lib/mecab/dic/ipadic-utf8 to provide /var/lib/mecab/dic/debian (mecab-dictionary) in auto mode\n",
"Setting up mecab (0.996-5) ...\n",
"Compiling IPA dictionary for Mecab. This takes long time...\n",
"reading /usr/share/mecab/dic/ipadic/unk.def ... 40\n",
"emitting double-array: 100% |###########################################| \n",
"/usr/share/mecab/dic/ipadic/model.def is not found. skipped.\n",
"reading /usr/share/mecab/dic/ipadic/Postp.csv ... 146\n",
"reading /usr/share/mecab/dic/ipadic/Auxil.csv ... 199\n",
"reading /usr/share/mecab/dic/ipadic/Prefix.csv ... 221\n",
"reading /usr/share/mecab/dic/ipadic/Postp-col.csv ... 91\n",
"reading /usr/share/mecab/dic/ipadic/Symbol.csv ... 208\n",
"reading /usr/share/mecab/dic/ipadic/Noun.number.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adj.csv ... 27210\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adverbal.csv ... 795\n",
"reading /usr/share/mecab/dic/ipadic/Noun.verbal.csv ... 12146\n",
"reading /usr/share/mecab/dic/ipadic/Noun.place.csv ... 72999\n",
"reading /usr/share/mecab/dic/ipadic/Interjection.csv ... 252\n",
"reading /usr/share/mecab/dic/ipadic/Noun.name.csv ... 34202\n",
"reading /usr/share/mecab/dic/ipadic/Filler.csv ... 19\n",
"reading /usr/share/mecab/dic/ipadic/Noun.nai.csv ... 42\n",
"reading /usr/share/mecab/dic/ipadic/Adverb.csv ... 3032\n",
"reading /usr/share/mecab/dic/ipadic/Noun.csv ... 60477\n",
"reading /usr/share/mecab/dic/ipadic/Verb.csv ... 130750\n",
"reading /usr/share/mecab/dic/ipadic/Noun.adjv.csv ... 3328\n",
"reading /usr/share/mecab/dic/ipadic/Suffix.csv ... 1393\n",
"reading /usr/share/mecab/dic/ipadic/Noun.proper.csv ... 27327\n",
"reading /usr/share/mecab/dic/ipadic/Others.csv ... 2\n",
"reading /usr/share/mecab/dic/ipadic/Noun.demonst.csv ... 120\n",
"reading /usr/share/mecab/dic/ipadic/Adnominal.csv ... 135\n",
"reading /usr/share/mecab/dic/ipadic/Noun.org.csv ... 16668\n",
"reading /usr/share/mecab/dic/ipadic/Conjunction.csv ... 171\n",
"reading /usr/share/mecab/dic/ipadic/Noun.others.csv ... 151\n",
"emitting double-array: 100% |###########################################| \n",
"reading /usr/share/mecab/dic/ipadic/matrix.def ... 1316x1316\n",
"emitting matrix : 100% |###########################################| \n",
"\n",
"done!\n",
"Setting up mecab-jumandic (7.0-20130310-4) ...\n",
"Processing triggers for man-db (2.8.3-2ubuntu0.1) ...\n",
"Processing triggers for libc-bin (2.27-3ubuntu1) ...\n",
"/sbin/ldconfig.real: /usr/local/lib/python3.6/dist-packages/ideep4py/lib/libmkldnn.so.0 is not a symbolic link\n",
"\n",
"Collecting mecab-python3\n",
"\u001b[?25l Downloading https://files.pythonhosted.org/packages/8b/06/2aeff86243c88580ccf78b136d403ce5e0a1eed9091103157f01e806499f/mecab_python3-1.0.1-cp36-cp36m-manylinux2010_x86_64.whl (3.5MB)\n",
"\u001b[K |████████████████████████████████| 3.5MB 2.9MB/s \n",
"\u001b[?25hInstalling collected packages: mecab-python3\n",
"Successfully installed mecab-python3-1.0.1\n",
"Reading package lists... Done\n",
"Building dependency tree \n",
"Reading state information... Done\n",
"make is already the newest version (4.1-9.1ubuntu1).\n",
"make set to manually installed.\n",
"xz-utils is already the newest version (5.2.2-1.3).\n",
"xz-utils set to manually installed.\n",
"curl is already the newest version (7.58.0-2ubuntu3.9).\n",
"git is already the newest version (1:2.17.1-1ubuntu0.7).\n",
"The following package was automatically installed and is no longer required:\n",
" libnvidia-common-440\n",
"Use 'apt autoremove' to remove it.\n",
"The following additional packages will be installed:\n",
" libmagic-mgc libmagic1\n",
"The following NEW packages will be installed:\n",
" file libmagic-mgc libmagic1\n",
"0 upgraded, 3 newly installed, 0 to remove and 35 not upgraded.\n",
"Need to get 275 kB of archives.\n",
"After this operation, 5,297 kB of additional disk space will be used.\n",
"Get:1 http://archive.ubuntu.com/ubuntu bionic-updates/main amd64 libmagic-mgc amd64 1:5.32-2ubuntu0.4 [184 kB]\n",
"Get:2 http://archive.ubuntu.com/ubuntu bionic-updates/main amd64 libmagic1 amd64 1:5.32-2ubuntu0.4 [68.6 kB]\n",
"Get:3 http://archive.ubuntu.com/ubuntu bionic-updates/main amd64 file amd64 1:5.32-2ubuntu0.4 [22.1 kB]\n",
"Fetched 275 kB in 1s (427 kB/s)\n",
"Selecting previously unselected package libmagic-mgc.\n",
"(Reading database ... 144643 files and directories currently installed.)\n",
"Preparing to unpack .../libmagic-mgc_1%3a5.32-2ubuntu0.4_amd64.deb ...\n",
"Unpacking libmagic-mgc (1:5.32-2ubuntu0.4) ...\n",
"Selecting previously unselected package libmagic1:amd64.\n",
"Preparing to unpack .../libmagic1_1%3a5.32-2ubuntu0.4_amd64.deb ...\n",
"Unpacking libmagic1:amd64 (1:5.32-2ubuntu0.4) ...\n",
"Selecting previously unselected package file.\n",
"Preparing to unpack .../file_1%3a5.32-2ubuntu0.4_amd64.deb ...\n",
"Unpacking file (1:5.32-2ubuntu0.4) ...\n",
"Setting up libmagic-mgc (1:5.32-2ubuntu0.4) ...\n",
"Setting up libmagic1:amd64 (1:5.32-2ubuntu0.4) ...\n",
"Setting up file (1:5.32-2ubuntu0.4) ...\n",
"Processing triggers for man-db (2.8.3-2ubuntu0.1) ...\n",
"Processing triggers for libc-bin (2.27-3ubuntu1) ...\n",
"/sbin/ldconfig.real: /usr/local/lib/python3.6/dist-packages/ideep4py/lib/libmkldnn.so.0 is not a symbolic link\n",
"\n",
"Cloning into 'mecab-ipadic-neologd'...\n",
"remote: Enumerating objects: 75, done.\u001b[K\n",
"remote: Counting objects: 100% (75/75), done.\u001b[K\n",
"remote: Compressing objects: 100% (74/74), done.\u001b[K\n",
"remote: Total 75 (delta 5), reused 54 (delta 0), pack-reused 0\u001b[K\n",
"Unpacking objects: 100% (75/75), done.\n",
"[install-mecab-ipadic-NEologd] : Start..\n",
"[install-mecab-ipadic-NEologd] : Check the existance of libraries\n",
"[install-mecab-ipadic-NEologd] : find => ok\n",
"[install-mecab-ipadic-NEologd] : sort => ok\n",
"[install-mecab-ipadic-NEologd] : head => ok\n",
"[install-mecab-ipadic-NEologd] : cut => ok\n",
"[install-mecab-ipadic-NEologd] : egrep => ok\n",
"[install-mecab-ipadic-NEologd] : mecab => ok\n",
"[install-mecab-ipadic-NEologd] : mecab-config => ok\n",
"[install-mecab-ipadic-NEologd] : make => ok\n",
"[install-mecab-ipadic-NEologd] : curl => ok\n",
"[install-mecab-ipadic-NEologd] : sed => ok\n",
"[install-mecab-ipadic-NEologd] : cat => ok\n",
"[install-mecab-ipadic-NEologd] : diff => ok\n",
"[install-mecab-ipadic-NEologd] : tar => ok\n",
"[install-mecab-ipadic-NEologd] : unxz => ok\n",
"[install-mecab-ipadic-NEologd] : xargs => ok\n",
"[install-mecab-ipadic-NEologd] : grep => ok\n",
"[install-mecab-ipadic-NEologd] : iconv => ok\n",
"[install-mecab-ipadic-NEologd] : patch => ok\n",
"[install-mecab-ipadic-NEologd] : which => ok\n",
"[install-mecab-ipadic-NEologd] : file => ok\n",
"[install-mecab-ipadic-NEologd] : openssl => ok\n",
"[install-mecab-ipadic-NEologd] : awk => ok\n",
"\n",
"[install-mecab-ipadic-NEologd] : mecab-ipadic-NEologd is already up-to-date\n",
"\n",
"[install-mecab-ipadic-NEologd] : mecab-ipadic-NEologd will be install to /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd\n",
"\n",
"[install-mecab-ipadic-NEologd] : Make mecab-ipadic-NEologd\n",
"[make-mecab-ipadic-NEologd] : Start..\n",
"[make-mecab-ipadic-NEologd] : Check local seed directory\n",
"[make-mecab-ipadic-NEologd] : Check local seed file\n",
"[make-mecab-ipadic-NEologd] : Check local build directory\n",
"[make-mecab-ipadic-NEologd] : create /content/mecab-ipadic-neologd/libexec/../build\n",
"[make-mecab-ipadic-NEologd] : Download original mecab-ipadic file\n",
"[make-mecab-ipadic-NEologd] : Try to access to https://ja.osdn.net\n",
"[make-mecab-ipadic-NEologd] : Try to download from https://ja.osdn.net/frs/g_redir.php?m=kent&f=mecab%2Fmecab-ipadic%2F2.7.0-20070801%2Fmecab-ipadic-2.7.0-20070801.tar.gz\n",
" % Total % Received % Xferd Average Speed Time Time Time Current\n",
" Dload Upload Total Spent Left Speed\n",
" 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0\n",
"100 11.6M 100 11.6M 0 0 1391k 0 0:00:08 0:00:08 --:--:-- 2427k\n",
"Hash value of /content/mecab-ipadic-neologd/libexec/../build/mecab-ipadic-2.7.0-20070801.tar.gz matched\n",
"[make-mecab-ipadic-NEologd] : Decompress original mecab-ipadic file\n",
"mecab-ipadic-2.7.0-20070801/\n",
"mecab-ipadic-2.7.0-20070801/README\n",
"mecab-ipadic-2.7.0-20070801/AUTHORS\n",
"mecab-ipadic-2.7.0-20070801/COPYING\n",
"mecab-ipadic-2.7.0-20070801/ChangeLog\n",
"mecab-ipadic-2.7.0-20070801/INSTALL\n",
"mecab-ipadic-2.7.0-20070801/Makefile.am\n",
"mecab-ipadic-2.7.0-20070801/Makefile.in\n",
"mecab-ipadic-2.7.0-20070801/NEWS\n",
"mecab-ipadic-2.7.0-20070801/aclocal.m4\n",
"mecab-ipadic-2.7.0-20070801/config.guess\n",
"mecab-ipadic-2.7.0-20070801/config.sub\n",
"mecab-ipadic-2.7.0-20070801/configure\n",
"mecab-ipadic-2.7.0-20070801/configure.in\n",
"mecab-ipadic-2.7.0-20070801/install-sh\n",
"mecab-ipadic-2.7.0-20070801/missing\n",
"mecab-ipadic-2.7.0-20070801/mkinstalldirs\n",
"mecab-ipadic-2.7.0-20070801/Adj.csv\n",
"mecab-ipadic-2.7.0-20070801/Adnominal.csv\n",
"mecab-ipadic-2.7.0-20070801/Adverb.csv\n",
"mecab-ipadic-2.7.0-20070801/Auxil.csv\n",
"mecab-ipadic-2.7.0-20070801/Conjunction.csv\n",
"mecab-ipadic-2.7.0-20070801/Filler.csv\n",
"mecab-ipadic-2.7.0-20070801/Interjection.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.adjv.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.adverbal.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.demonst.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.nai.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.name.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.number.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.org.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.others.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.place.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.proper.csv\n",
"mecab-ipadic-2.7.0-20070801/Noun.verbal.csv\n",
"mecab-ipadic-2.7.0-20070801/Others.csv\n",
"mecab-ipadic-2.7.0-20070801/Postp-col.csv\n",
"mecab-ipadic-2.7.0-20070801/Postp.csv\n",
"mecab-ipadic-2.7.0-20070801/Prefix.csv\n",
"mecab-ipadic-2.7.0-20070801/Suffix.csv\n",
"mecab-ipadic-2.7.0-20070801/Symbol.csv\n",
"mecab-ipadic-2.7.0-20070801/Verb.csv\n",
"mecab-ipadic-2.7.0-20070801/char.def\n",
"mecab-ipadic-2.7.0-20070801/feature.def\n",
"mecab-ipadic-2.7.0-20070801/left-id.def\n",
"mecab-ipadic-2.7.0-20070801/matrix.def\n",
"mecab-ipadic-2.7.0-20070801/pos-id.def\n",
"mecab-ipadic-2.7.0-20070801/rewrite.def\n",
"mecab-ipadic-2.7.0-20070801/right-id.def\n",
"mecab-ipadic-2.7.0-20070801/unk.def\n",
"mecab-ipadic-2.7.0-20070801/dicrc\n",
"mecab-ipadic-2.7.0-20070801/RESULT\n",
"[make-mecab-ipadic-NEologd] : Configure custom system dictionary on /content/mecab-ipadic-neologd/libexec/../build/mecab-ipadic-2.7.0-20070801-neologd-20200709\n",
"checking for a BSD-compatible install... /usr/bin/install -c\n",
"checking whether build environment is sane... yes\n",
"checking whether make sets $(MAKE)... yes\n",
"checking for working aclocal-1.4... missing\n",
"checking for working autoconf... missing\n",
"checking for working automake-1.4... missing\n",
"checking for working autoheader... missing\n",
"checking for working makeinfo... missing\n",
"checking for a BSD-compatible install... /usr/bin/install -c\n",
"checking for mecab-config... /usr/bin/mecab-config\n",
"configure: creating ./config.status\n",
"config.status: creating Makefile\n",
"[make-mecab-ipadic-NEologd] : Encode the character encoding of system dictionary resources from EUC_JP to UTF-8\n",
"./../../libexec/iconv_euc_to_utf8.sh ./Postp.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Auxil.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Prefix.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Postp-col.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Symbol.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.number.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Adj.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.adverbal.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.verbal.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.place.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Interjection.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.name.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Filler.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.nai.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Adverb.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Verb.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.adjv.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Suffix.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.proper.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Others.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.demonst.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Adnominal.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.org.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Conjunction.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./Noun.others.csv \n",
"rm ./Postp.csv \n",
"rm ./Auxil.csv \n",
"rm ./Prefix.csv \n",
"rm ./Postp-col.csv \n",
"rm ./Symbol.csv \n",
"rm ./Noun.number.csv \n",
"rm ./Adj.csv \n",
"rm ./Noun.adverbal.csv \n",
"rm ./Noun.verbal.csv \n",
"rm ./Noun.place.csv \n",
"rm ./Interjection.csv \n",
"rm ./Noun.name.csv \n",
"rm ./Filler.csv \n",
"rm ./Noun.nai.csv \n",
"rm ./Adverb.csv \n",
"rm ./Noun.csv \n",
"rm ./Verb.csv \n",
"rm ./Noun.adjv.csv \n",
"rm ./Suffix.csv \n",
"rm ./Noun.proper.csv \n",
"rm ./Others.csv \n",
"rm ./Noun.demonst.csv \n",
"rm ./Adnominal.csv \n",
"rm ./Noun.org.csv \n",
"rm ./Conjunction.csv \n",
"rm ./Noun.others.csv \n",
"./../../libexec/iconv_euc_to_utf8.sh ./matrix.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./char.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./feature.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./pos-id.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./right-id.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./left-id.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./unk.def \n",
"./../../libexec/iconv_euc_to_utf8.sh ./rewrite.def \n",
"rm ./matrix.def \n",
"rm ./char.def \n",
"rm ./feature.def \n",
"rm ./pos-id.def \n",
"rm ./right-id.def \n",
"rm ./left-id.def \n",
"rm ./unk.def \n",
"rm ./rewrite.def \n",
"mv ./Suffix.csv.utf8 ./Suffix.csv \n",
"mv ./feature.def.utf8 ./feature.def \n",
"mv ./Adj.csv.utf8 ./Adj.csv \n",
"mv ./Conjunction.csv.utf8 ./Conjunction.csv \n",
"mv ./Noun.name.csv.utf8 ./Noun.name.csv \n",
"mv ./Verb.csv.utf8 ./Verb.csv \n",
"mv ./Filler.csv.utf8 ./Filler.csv \n",
"mv ./Interjection.csv.utf8 ./Interjection.csv \n",
"mv ./Noun.demonst.csv.utf8 ./Noun.demonst.csv \n",
"mv ./char.def.utf8 ./char.def \n",
"mv ./Noun.adjv.csv.utf8 ./Noun.adjv.csv \n",
"mv ./Postp.csv.utf8 ./Postp.csv \n",
"mv ./Adnominal.csv.utf8 ./Adnominal.csv \n",
"mv ./Noun.nai.csv.utf8 ./Noun.nai.csv \n",
"mv ./unk.def.utf8 ./unk.def \n",
"mv ./Noun.proper.csv.utf8 ./Noun.proper.csv \n",
"mv ./Prefix.csv.utf8 ./Prefix.csv \n",
"mv ./Noun.number.csv.utf8 ./Noun.number.csv \n",
"mv ./Noun.verbal.csv.utf8 ./Noun.verbal.csv \n",
"mv ./Noun.place.csv.utf8 ./Noun.place.csv \n",
"mv ./Noun.adverbal.csv.utf8 ./Noun.adverbal.csv \n",
"mv ./Auxil.csv.utf8 ./Auxil.csv \n",
"mv ./left-id.def.utf8 ./left-id.def \n",
"mv ./rewrite.def.utf8 ./rewrite.def \n",
"mv ./Symbol.csv.utf8 ./Symbol.csv \n",
"mv ./Others.csv.utf8 ./Others.csv \n",
"mv ./Noun.org.csv.utf8 ./Noun.org.csv \n",
"mv ./right-id.def.utf8 ./right-id.def \n",
"mv ./matrix.def.utf8 ./matrix.def \n",
"mv ./pos-id.def.utf8 ./pos-id.def \n",
"mv ./Adverb.csv.utf8 ./Adverb.csv \n",
"mv ./Postp-col.csv.utf8 ./Postp-col.csv \n",
"mv ./Noun.csv.utf8 ./Noun.csv \n",
"mv ./Noun.others.csv.utf8 ./Noun.others.csv \n",
"[make-mecab-ipadic-NEologd] : Fix yomigana field of IPA dictionary\n",
"patching file Noun.csv\n",
"patching file Noun.place.csv\n",
"patching file Verb.csv\n",
"patching file Noun.verbal.csv\n",
"patching file Noun.name.csv\n",
"patching file Noun.adverbal.csv\n",
"patching file Noun.csv\n",
"patching file Noun.name.csv\n",
"patching file Noun.org.csv\n",
"patching file Noun.others.csv\n",
"patching file Noun.place.csv\n",
"patching file Noun.proper.csv\n",
"patching file Noun.verbal.csv\n",
"patching file Prefix.csv\n",
"patching file Suffix.csv\n",
"patching file Noun.proper.csv\n",
"patching file Noun.csv\n",
"patching file Noun.name.csv\n",
"patching file Noun.org.csv\n",
"patching file Noun.place.csv\n",
"patching file Noun.proper.csv\n",
"patching file Noun.verbal.csv\n",
"patching file Noun.name.csv\n",
"patching file Noun.org.csv\n",
"patching file Noun.place.csv\n",
"patching file Noun.proper.csv\n",
"patching file Suffix.csv\n",
"patching file Noun.demonst.csv\n",
"patching file Noun.csv\n",
"patching file Noun.name.csv\n",
"[make-mecab-ipadic-NEologd] : Copy user dictionary resource\n",
"[make-mecab-ipadic-NEologd] : Install adverb entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-adverb-dict-seed.20150623.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install interjection entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-interjection-dict-seed.20170216.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install noun orthographic variant entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-common-noun-ortho-variant-dict-seed.20170228.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install noun orthographic variant entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-proper-noun-ortho-variant-dict-seed.20161110.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install entries of orthographic variant of a noun used as verb form using /content/mecab-ipadic-neologd/libexec/../seed/neologd-noun-sahen-conn-ortho-variant-dict-seed.20160323.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install frequent adjective orthographic variant entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-adjective-std-dict-seed.20151126.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install infrequent adjective orthographic variant entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-adjective-exp-dict-seed.20151126.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install adjective verb orthographic variant entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-adjective-verb-dict-seed.20160324.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install infrequent datetime representation entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-date-time-infreq-dict-seed.20190415.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install infrequent quantity representation entries using /content/mecab-ipadic-neologd/libexec/../seed/neologd-quantity-infreq-dict-seed.20190415.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Install entries of ill formed words using /content/mecab-ipadic-neologd/libexec/../seed/neologd-ill-formed-words-dict-seed.20170127.csv.xz\n",
"[make-mecab-ipadic-NEologd] : Re-Index system dictionary\n",
"reading ./unk.def ... 40\n",
"emitting double-array: 100% |###########################################| \n",
"./model.def is not found. skipped.\n",
"reading ./Postp.csv ... 146\n",
"reading ./Auxil.csv ... 199\n",
"reading ./neologd-ill-formed-words-dict-seed.20170127.csv ... 60616\n",
"reading ./Prefix.csv ... 224\n",
"reading ./neologd-adjective-verb-dict-seed.20160324.csv ... 20268\n",
"reading ./Postp-col.csv ... 91\n",
"reading ./Symbol.csv ... 208\n",
"reading ./Noun.number.csv ... 42\n",
"reading ./Adj.csv ... 27210\n",
"reading ./Noun.adverbal.csv ... 808\n",
"reading ./Noun.verbal.csv ... 12150\n",
"reading ./mecab-user-dict-seed.20200709.csv ... 3208856\n",
"reading ./Noun.place.csv ... 73194\n",
"reading ./Interjection.csv ... 252\n",
"reading ./neologd-quantity-infreq-dict-seed.20190415.csv ... 229216\n",
"reading ./neologd-interjection-dict-seed.20170216.csv ... 4701\n",
"reading ./neologd-date-time-infreq-dict-seed.20190415.csv ... 16866\n",
"reading ./Noun.name.csv ... 34215\n",
"reading ./Filler.csv ... 19\n",
"reading ./Noun.nai.csv ... 42\n",
"reading ./neologd-adverb-dict-seed.20150623.csv ... 139792\n",
"reading ./Adverb.csv ... 3032\n",
"reading ./Noun.csv ... 60734\n",
"reading ./Verb.csv ... 130750\n",
"reading ./neologd-common-noun-ortho-variant-dict-seed.20170228.csv ... 152869\n",
"reading ./neologd-adjective-std-dict-seed.20151126.csv ... 507812\n",
"reading ./neologd-adjective-exp-dict-seed.20151126.csv ... 1051146\n",
"reading ./neologd-noun-sahen-conn-ortho-variant-dict-seed.20160323.csv ... 26058\n",
"reading ./Noun.adjv.csv ... 3328\n",
"reading ./Suffix.csv ... 1448\n",
"reading ./Noun.proper.csv ... 27493\n",
"reading ./Others.csv ... 2\n",
"reading ./Noun.demonst.csv ... 120\n",
"reading ./Adnominal.csv ... 135\n",
"reading ./Noun.org.csv ... 17149\n",
"reading ./Conjunction.csv ... 171\n",
"reading ./Noun.others.csv ... 153\n",
"reading ./neologd-proper-noun-ortho-variant-dict-seed.20161110.csv ... 138379\n",
"emitting double-array: 100% |###########################################| \n",
"reading ./matrix.def ... 1316x1316\n",
"emitting matrix : 100% |###########################################| \n",
"\n",
"done!\n",
"[make-mecab-ipadic-NEologd] : Make custom system dictionary on /content/mecab-ipadic-neologd/libexec/../build/mecab-ipadic-2.7.0-20070801-neologd-20200709\n",
"make: Nothing to be done for 'all'.\n",
"[make-mecab-ipadic-NEologd] : Finish..\n",
"[install-mecab-ipadic-NEologd] : Get results of tokenize test\n",
"[test-mecab-ipadic-NEologd] : Start..\n",
"[test-mecab-ipadic-NEologd] : Replace timestamp from 'git clone' date to 'git commit' date\n",
"[test-mecab-ipadic-NEologd] : Get buzz phrases\n",
" % Total % Received % Xferd Average Speed Time Time Time Current\n",
" Dload Upload Total Spent Left Speed\n",
"100 1431 100 1431 0 0 1364 0 0:00:01 0:00:01 --:--:-- 1364\n",
"[test-mecab-ipadic-NEologd] : Get difference between default system dictionary and mecab-ipadic-NEologd\n",
"[test-mecab-ipadic-NEologd] : Tokenize phrase using default system dictionary\n",
"[test-mecab-ipadic-NEologd] : Tokenize phrase using mecab-ipadic-NEologd\n",
"[test-mecab-ipadic-NEologd] : Get result of diff\n",
"[test-mecab-ipadic-NEologd] : Please check difference between default system dictionary and mecab-ipadic-NEologd\n",
"\n",
"default system dictionary\t |\tmecab-ipadic-NEologd\n",
"倍 返し \t\t\t |\t倍返し \n",
"6 連勝 \t\t\t\t |\t6連勝 \n",
"増田 大輝 \t\t\t |\t増田大輝 \n",
"岡本 和 真 \t\t\t |\t岡本和真 \n",
"アビスパ 福岡 \t\t\t |\tアビスパ福岡 \n",
"みや ぞ ん \t\t\t |\tみやぞん \n",
"ミスター サンデー \t\t |\tミスターサンデー \n",
"霜降り 明星 \t\t\t |\t霜降り明星 \n",
"\n",
"[test-mecab-ipadic-NEologd] : Finish..\n",
"\n",
"[install-mecab-ipadic-NEologd] : Please check the list of differences in the upper part.\n",
"\n",
"[install-mecab-ipadic-NEologd] : Do you want to install mecab-ipadic-NEologd? Type yes or no.\n",
"[install-mecab-ipadic-NEologd] : OK. Let's install mecab-ipadic-NEologd.\n",
"[install-mecab-ipadic-NEologd] : Start..\n",
"[install-mecab-ipadic-NEologd] : /usr/lib/x86_64-linux-gnu/mecab/dic isn't current user's directory\n",
"[install-mecab-ipadic-NEologd] : Sudo make install to /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd\n",
"make[1]: Entering directory '/content/mecab-ipadic-neologd/build/mecab-ipadic-2.7.0-20070801-neologd-20200709'\n",
"make[1]: Nothing to be done for 'install-exec-am'.\n",
"/bin/bash ./mkinstalldirs /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd\n",
"mkdir /usr/lib/x86_64-linux-gnu/mecab\n",
"mkdir /usr/lib/x86_64-linux-gnu/mecab/dic\n",
"mkdir /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd\n",
" /usr/bin/install -c -m 644 ./matrix.bin /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/matrix.bin\n",
" /usr/bin/install -c -m 644 ./char.bin /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/char.bin\n",
" /usr/bin/install -c -m 644 ./sys.dic /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/sys.dic\n",
" /usr/bin/install -c -m 644 ./unk.dic /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/unk.dic\n",
" /usr/bin/install -c -m 644 ./left-id.def /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/left-id.def\n",
" /usr/bin/install -c -m 644 ./right-id.def /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/right-id.def\n",
" /usr/bin/install -c -m 644 ./rewrite.def /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/rewrite.def\n",
" /usr/bin/install -c -m 644 ./pos-id.def /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/pos-id.def\n",
" /usr/bin/install -c -m 644 ./dicrc /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd/dicrc\n",
"make[1]: Leaving directory '/content/mecab-ipadic-neologd/build/mecab-ipadic-2.7.0-20070801-neologd-20200709'\n",
"\n",
"[install-mecab-ipadic-NEologd] : Install completed.\n",
"[install-mecab-ipadic-NEologd] : When you use MeCab, you can set '/usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd' as a value of '-d' option of MeCab.\n",
"[install-mecab-ipadic-NEologd] : Usage of mecab-ipadic-NEologd is here.\n",
"Usage:\n",
" $ mecab -d /usr/lib/x86_64-linux-gnu/mecab/dic/mecab-ipadic-neologd ...\n",
"\n",
"[install-mecab-ipadic-NEologd] : Finish..\n",
"[install-mecab-ipadic-NEologd] : Finish..\n"
],
"name": "stdout"
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "zktJso9mNKgz",
"colab_type": "code",
"colab": {}
},
"source": [
"import MeCab\n",
"import subprocess\n",
"\n",
"cmd='echo `mecab-config --dicdir`\"/mecab-ipadic-neologd\"'\n",
"path = (subprocess.Popen(cmd, stdout=subprocess.PIPE,\n",
" shell=True).communicate()[0]).decode('utf-8')\n",
" \n",
"m=MeCab.Tagger(\"-d {0}\".format(path))"
],
"execution_count": 3,
"outputs": []
},
{
"cell_type": "code",
"metadata": {
"id": "oxAc4p50NVK6",
"colab_type": "code",
"colab": {
"base_uri": "https://localhost:8080/",
"height": 204
},
"outputId": "0b0d0c7c-fe2b-41ff-9979-105fd0afcf14"
},
"source": [
"# 以下の出力が一致していれば、NEologd を使えている\n",
"\n",
"print(m.parse(\"彼女はペンパイナッポーアッポーペンと恋ダンスを踊った。\"))\n",
"#>>\n",
"#彼女 名詞,代名詞,一般,*,*,*,彼女,カノジョ,カノジョ\n",
"#は 助詞,係助詞,*,*,*,*,は,ハ,ワ\n",
"#ペンパイナッポーアッポーペン 名詞,固有名詞,一般,*,*,*,Pen-Pineapple-Apple-Pen,ペンパイナッポーアッポーペン,ペンパイナッポーアッポーペン\n",
"#と 助詞,並立助詞,*,*,*,*,と,ト,ト\n",
"#恋ダンス 名詞,固有名詞,一般,*,*,*,恋ダンス,コイダンス,コイダンス\n",
"#を 助詞,格助詞,一般,*,*,*,を,ヲ,ヲ\n",
"#踊っ 動詞,自立,*,*,五段・ラ行,連用タ接続,踊る,オドッ,オドッ\n",
"#た 助動詞,*,*,*,特殊・タ,基本形,た,タ,タ\n",
"#。 記号,句点,*,*,*,*,。,。,。\n",
"#EOS"
],
"execution_count": 4,
"outputs": [
{
"output_type": "stream",
"text": [
"彼女\t名詞,代名詞,一般,*,*,*,彼女,カノジョ,カノジョ\n",
"は\t助詞,係助詞,*,*,*,*,は,ハ,ワ\n",
"ペンパイナッポーアッポーペン\t名詞,固有名詞,一般,*,*,*,Pen-Pineapple-Apple-Pen,ペンパイナッポーアッポーペン,ペンパイナッポーアッポーペン\n",
"と\t助詞,並立助詞,*,*,*,*,と,ト,ト\n",
"恋ダンス\t名詞,固有名詞,一般,*,*,*,恋ダンス,コイダンス,コイダンス\n",
"を\t助詞,格助詞,一般,*,*,*,を,ヲ,ヲ\n",
"踊っ\t動詞,自立,*,*,五段・ラ行,連用タ接続,踊る,オドッ,オドッ\n",
"た\t助動詞,*,*,*,特殊・タ,基本形,た,タ,タ\n",
"。\t記号,句点,*,*,*,*,。,。,。\n",
"EOS\n",
"\n"
],
"name": "stdout"
}
]
},
{
"cell_type": "code",
"metadata": {
"id": "kag-alZRcpYR",
"colab_type": "code",
"colab": {
"base_uri": "https://localhost:8080/",
"height": 119
},
"outputId": "fec469dc-79c1-41cb-9ff8-cf0e297c018c"
},
"source": [
"# 「神ってる」が、1単語として認識されていれば、NEologd を使えている (以下の出力例では、Neologd を使っていないため、そうなっていない)\n",
"# /etc/mecabrc を書き換えることで、デフォルトで使う辞書を NEologd に設定できる。\n",
"# 手順は、以下の記事を参考にすれば良い(例えば sed を使う)\n",
"# https://qiita.com/Naritoshi/items/8f55d7d5cce9ce414395\n",
"%%bash\n",
"mecab \n",
"メカブって神ってる"
],
"execution_count": 5,
"outputs": [
{
"output_type": "stream",
"text": [
"メカブ\t名詞,固有名詞,組織,*,*,*,*\n",
"って\t助詞,格助詞,連語,*,*,*,って,ッテ,ッテ\n",
"神\t名詞,一般,*,*,*,*,神,カミ,カミ\n",
"って\t助詞,格助詞,連語,*,*,*,って,ッテ,ッテ\n",
"る\t助動詞,*,*,*,文語・ル,基本形,る,ル,ル\n",
"EOS\n"
],
"name": "stdout"
}
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "xYa67BXpcjaM",
"colab_type": "text"
},
"source": [
"## References\n",
"\n",
"https://qiita.com/SUZUKI_Masaya/items/685000d569452585210c\n",
"\n",
"https://qiita.com/Naritoshi/items/8f55d7d5cce9ce414395"
]
}
]
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment