Created
August 15, 2017 08:51
-
-
Save emesday/0cc7bf0072fe8195b041118f3409c7bf to your computer and use it in GitHub Desktop.
나무위키:데이터베이스 덤프 변환 - one json object per line
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# namuwiki_170327기준 나무위키 덤프는 Json Array로 저장되어 분산 처리를 할 수 없음 | |
# 분산 처리를 위해서 한 라인에 하나의 Json Object로 저장할 필요가 있음 | |
# input: namuwiki_170327.json | |
# output: output.json | |
jq -nc --stream 'fromstream(1|truncate_stream(inputs))' namuwiki_170327.json > output.json | |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment