Google BigQueryで始めるビッグデータ処理入門
- 2章巨大データを扱うシステム
- 2.3.2 形態素解析を使用した日本語ワードカウント
Multiple main classes detected, select one to run: | |
[1] Sample1 | |
[2] Sample2 | |
[3] Sample3 | |
[4] TwitterFollowerScoreRanking | |
Enter number: 4 | |
[info] Running TwitterFollowerScoreRanking |
Google BigQueryで始めるビッグデータ処理入門
TwitterStream twitterStream = new TwitterStreamFactory().getInstance(); | |
twitterStream.setOAuthConsumer(twitterModel.getConsumerKey(), | |
twitterModel.getConsumerSecret()); | |
twitterStream.setOAuthAccessToken(new AccessToken(twitterModel | |
.getAccessToken(), twitterModel.getAccessToken_secret())); | |
// MyStatusAdapterクラスでTwitterのStatusクラスを処理する | |
twitterStream.addListener(new MyStatusAdapter(applicationConfParser, bufferedWriter)); | |
ArrayList<String> track = new ArrayList<String>(); | |
track.addAll(Arrays.asList(Application.searchKeyword.split(","))); |
sbt -mem 10000 (10G) |
[mnt]# wc -l ./data/20150405_182615.cl.txt | |
9920199 ./data/20150405_182615.cl.txt | |
sbt | |
run ../../data/20150405_182615.cl.txt ./dictionary/anime_2015_2Q.txt 100 | |
15/06/02 16:21:01 INFO ContextHandler: stopped o.s.j.s.ServletContextHandler{/stages/stage/kill,null} | |
15/06/02 16:21:01 INFO ContextHandler: stopped o.s.j.s.ServletContextHandler{/,null} | |
15/06/02 16:21:01 INFO ContextHandler: stopped o.s.j.s.ServletContextHandler{/static,null} |
org.h2.jdbc.JdbcSQLException: データベースはすでに閉じられています (VM終了時の自動データベースクローズを無効にするためには、db URLに ";DB_CLOSE_ON_EXIT=FALSE" を追加してください) | |
Database is already closed (to disable automatic closing at VM shutdown, add ";DB_CLOSE_ON_EXIT=FALSE" to the db URL) [90121-180] |
yum -y groups install "GNOME Desktop" | |
startx |
http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf | |
http://www.cs.berkeley.edu/~matei/papers/2011/tr_spark.pdf | |
http://www.cs.berkeley.edu/~matei/papers/2010/hotcloud_spark.pdf |
[root@~]# service vboxdrv setup | |
Stopping VirtualBox kernel modules [ OK ] | |
Uninstalling old VirtualBox DKMS kernel modules [ OK ] | |
Trying to register the VirtualBox kernel modules using DKMSError! echo | |
Your kernel headers for kernel 3.10.0-229.el7.x86_64 cannot be found at | |
/lib/modules/3.10.0-229.el7.x86_64/build or /lib/modules/3.10.0-229.el7.x86_64/source. | |
[FAILED] | |
(Failed, trying without DKMS) | |
Recompiling VirtualBox kernel modules [FAILED] | |
(Look at /var/log/vbox-install.log to find out what went wrong) |
コントロールパネルのコンソールから | |
# yum -y groups install "GNOME Desktop" | |
startx | |
日本語を選択 | |
デスクトップ起動 | |
OracleのサイトからVirtualVm RPMインストール | |
CDHのVMをインストール |