Skip to content

Instantly share code, notes, and snippets.

@hideojoho
Last active January 2, 2018 07:52
Show Gist options
  • Save hideojoho/715fca9bc5ddbca1f4cb91e80ba3a380 to your computer and use it in GitHub Desktop.
Save hideojoho/715fca9bc5ddbca1f4cb91e80ba3a380 to your computer and use it in GitHub Desktop.
How to divide Japanese texts using Mecab on Ubuntu

How to divide Japanese texts using Mecab on Ubuntu

  • OS: Ubuntu 16.04 LTS

Installation

$ sudo apt-get -y install mecab libmecab-dev mecab-ipadic-utf8 git make curl xz-utils file
$ git clone --depth 1 https://github.com/neologd/mecab-ipadic-neologd.git
$ cd mecab-ipadic-neologd
$ ./bin/install-mecab-ipadic-neologd -n

Usage

$ mecab -Owakati -d /usr/lib/mecab/dic/mecab-ipadic-neologd file

Source

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment