@sharifsalah
Created January 13, 2014 14:50
Sample command that converts an S3 log file into a comma-separated format suitable for importing into Google BigQuery.
# Copyright (c) 2014 Sharif Salah - Released under The MIT License.
# Permission is hereby granted, free of charge, to any person
# obtaining a copy of this software and associated documentation
# files (the "Software"), to deal in the Software without
# restriction, including without limitation the rights to use,
# copy, modify, merge, publish, distribute, sublicense, and/or sell
# copies of the Software, and to permit persons to whom the
# Software is furnished to do so, subject to the following
# conditions:
# The above copyright notice and this permission notice shall be
# included in all copies or substantial portions of the Software.
# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
# EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES
# OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
# NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT
# HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
# WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING
# FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR
# OTHER DEALINGS IN THE SOFTWARE.
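# Run from the directory that holds the log files: cat * reads every file there.
# The double quote is both the input and output field separator, so the
# odd-numbered fields are the text outside quotes. Blank lines are skipped,
# each whitespace character outside quotes becomes a comma, and quoted
# strings are printed unchanged.
cat * | awk 'BEGIN {FS = OFS = "\""} /^[[:blank:]]*$/ {next} {for (i=1; i<=NF; i+=2) gsub(/[[:space:]]/,",",$i)} 1'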
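To see what the command does, here is a small sketch that runs the same awk program on a single made-up line. The sample line is a simplified, hypothetical stand-in for a log record, not real S3 output, and the `convert` function name is introduced only for this illustration.

```shell
# Hypothetical, simplified log line: space-separated fields with a quoted
# request string that itself contains spaces.
line='bucket1 192.0.2.3 "GET /my key.txt HTTP/1.1" 200 -'

# Same awk program as above, wrapped in a function (name is illustrative)
# so it reads from standard input instead of from `cat *`.
convert() {
  awk 'BEGIN {FS = OFS = "\""} /^[[:blank:]]*$/ {next} {for (i=1; i<=NF; i+=2) gsub(/[[:space:]]/,",",$i)} 1'
}

# Feed the sample line followed by a blank line; the blank line is dropped.
out=$(printf '%s\n\n' "$line" | convert)
printf '%s\n' "$out"
# prints: bucket1,192.0.2.3,"GET /my key.txt HTTP/1.1",200,-
```

Spaces inside the quoted request survive, while every space outside quotes becomes a comma. One consequence worth checking on real data: any unquoted field that contains a space, such as a bracketed timestamp with a time zone offset, would be split into two columns, so the table schema may need to account for that.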