Kazuhiro Serizawa serihiro

@serihiro
serihiro / sed.sh
Created July 28, 2017 09:40
Preprocessing
# http://archive.ics.uci.edu/ml/datasets/NSF+Research+Award+Abstracts+1990-2003
find . -print | grep -i '.*[.]txt' | xargs cat | sed -ne '/^Abstract\s\{1,\}:$/, $p' | sed -ne '3, $p' > merged.txt
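A rough Ruby sketch of the same extraction idea, handled one file at a time. It assumes each award file has a header line of the form `Abstract    :` followed by the indented abstract text. `extract_abstract` is a hypothetical helper, not part of the gist, and unlike the sed pipeline it drops blank lines and joins the abstract into a single line.

```ruby
# Hypothetical helper (an assumption, not from the original gist):
# returns the text that follows the "Abstract    :" header line,
# with blank lines dropped and indentation stripped.
def extract_abstract(text)
  lines = text.lines
  # Find the header line, e.g. "Abstract    :\n"
  header = lines.index { |line| line.match?(/\AAbstract\s+:\s*\z/) }
  return '' if header.nil?

  lines.drop(header + 1)   # everything after the header
       .map(&:strip)       # remove indentation and newlines
       .reject(&:empty?)   # skip blank lines
       .join(' ')
end

sample = <<~TEXT
  Title       : Example award
  Abstract    :

                First line of the abstract
                continues here.
TEXT

puts extract_abstract(sample)
```

To build a merged file, each path returned by `Dir.glob('**/*.txt')` could be read and passed through the helper, with the results written one per line.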
serihiro / json_lexer_sample.rb
Created August 11, 2017 02:53
JSON Lexer sample(wip)
require 'strscan'
sample = "{ \"name \" : \"Tarou\" } "
scanner = StringScanner.new(sample)
state = 'waiting_key'
brace_open = false
buffer = ''
output = []
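The preview above stops after the setup. As a minimal sketch of how a `StringScanner` loop can turn that sample into tokens, assuming only flat objects with string keys and string values: `lex_flat_json` and the token names are hypothetical, and this version does not use the `state` / `brace_open` variables from the gist.

```ruby
require 'strscan'

# Hypothetical helper (not the author's implementation): splits a flat JSON
# object into [type, value] token pairs.
def lex_flat_json(source)
  scanner = StringScanner.new(source)
  tokens = []
  until scanner.eos?
    if scanner.skip(/\s+/)
      next                                  # ignore whitespace between tokens
    elsif scanner.scan(/\{/)
      tokens << [:lbrace, '{']
    elsif scanner.scan(/\}/)
      tokens << [:rbrace, '}']
    elsif scanner.scan(/:/)
      tokens << [:colon, ':']
    elsif scanner.scan(/,/)
      tokens << [:comma, ',']
    elsif scanner.scan(/"((?:[^"\\]|\\.)*)"/)
      tokens << [:string, scanner[1]]       # text between the quotes, escapes left as-is
    else
      raise ArgumentError, "unexpected character at offset #{scanner.pos}"
    end
  end
  tokens
end

sample = "{ \"name \" : \"Tarou\" } "
p lex_flat_json(sample)
# => [[:lbrace, "{"], [:string, "name "], [:colon, ":"], [:string, "Tarou"], [:rbrace, "}"]]
```

Numbers, booleans, `null`, arrays, and nested objects are left out to keep the sketch short.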
serihiro / draft.md
Last active September 17, 2017 23:43
[WIP]Rails5 mini_blog tutorial contents

Contents

#0 set up Ruby 2.4.2 and Rails 5.1.4

#1 rails new

#2 generate model Post

#3 generate PostController and views

#4 generate User model, implement LoginController and its views

#5 associate Post and User models

#6 introduce user following feature

#7 set up RSpec and implement controller specs

#8 introduce fav feature

serihiro / asf_jira_note.md
Last active September 30, 2017 01:48
JIRA
project in('HADOOP','HDFS','MAPREDUCE','YARN') AND status = 'Open' AND labels='newbie' AND assignee is EMPTY
serihiro / note.md
Last active September 30, 2017 04:30
hadoop build on mac note

my env

$ system_profiler SPSoftwareDataType
Software:

    System Software Overview:

      System Version: OS X 10.11.6 (15G1611)
      Kernel Version: Darwin 15.6.0
serihiro / note.md
Last active September 30, 2017 12:27
spark build on mac note
serihiro / destroy_all_secrets.sh
Created October 20, 2017 08:33
One-liner to delete all digdag secrets
digdag secrets --project mydag | for secret in `sed -n '1,$p'`; do digdag secrets --project mydag --delete $secret; done
serihiro / design_doc.md
Last active November 18, 2017 00:51
Simple MapReduce Design

Driver script image

require 'simple_map_reduce'

c = SimpleMapReduce::Driver::Config.new
job = SimpleMapReduce::Driver::Job.new(config: c)
job.map_task = map_task_object
job.reduce_task = reduce_task_object
job.input_s3_file_path = 's3://....'