Skip to content

Instantly share code, notes, and snippets.

View y2k-shubham's full-sized avatar
🏁
Chasing Checkpoints

Shubham Gupta y2k-shubham

🏁
Chasing Checkpoints
View GitHub Profile
@y2k-shubham
y2k-shubham / sql_find_and_replace_regex.txt
Created October 16, 2019 04:37
SQL-query-keyword-capitalization
Find regex: `(?<!\w)(insert|is|into|with|as|select|from|where|between|and|or|distinct|case|when|then|else|end|on|null|not|inner|outer|full|join|left|right|group|by|order|limit|in)(?!\w)`
Replace regex: `\U$1\E`
@y2k-shubham
y2k-shubham / create_users.sh
Created July 5, 2019 19:46
Bash script to (1) Read list of usernames from text file (2) Create linux users (3) Set and display random passwords
#!/bin/bash
for user in $( cat users.txt ); do
# create user
sudo useradd --create-home $user
# generate random password
PASSWD="$(pwgen -cnsv -N 1 32)"
# set password
echo $user:$PASSWD | chpasswd
# print message
echo "Created user $user with password $PASSWD"
@y2k-shubham
y2k-shubham / csv_serde_template.sql
Last active April 22, 2019 16:22
Hive Create table DDL statement for CSV file
-- DO NOT use this; see comment below
CREATE EXTERNAL TABLE `db_name.table_name`(
`column_1` string,
`column_2` string)
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
'escapeChar'='\\',
'quoteChar'='"',
'separatorChar'=',')
@y2k-shubham
y2k-shubham / multi_cmd_ssh_operator.py
Last active March 3, 2019 18:42
[Apache-Airflow] ssh_utils & MultiCmdSSHOperator
from typing import List, Optional, Dict, Any
from airflow.contrib.hooks.ssh_hook import SSHHook
from airflow.contrib.operators.ssh_operator import SSHOperator
from airflow.exceptions import AirflowException
from airflow.utils import apply_defaults
import ssh_utils
@y2k-shubham
y2k-shubham / install_pyenv.sh
Created January 4, 2019 11:17 — forked from ysaotome/install_pyenv.sh
pyenv install for CentOS 6.5 x86_64
#!/bin/zsh
# pyenv install for CentOS 6.5 x86_64
yum install -y gcc gcc-c++ make git patch openssl-devel zlib-devel readline-devel sqlite-devel bzip2-devel
git clone git://github.com/yyuu/pyenv.git ~/.pyenv
export PATH="$HOME/.pyenv/bin:$PATH"
eval "$(pyenv init -)"
@y2k-shubham
y2k-shubham / create_pool.py
Created December 17, 2018 07:10
Code snippet to create Airflow Pool
from airflow.settings import Session
from airflow.utils.db import provide_session
from airflow.models import Pool
# hive_pool is just an example, you might want to create some other types of pools such as for MySQL
@provide_session
def create_hive_pool(session: Optional[Session] = None) -> None:
pool = Pool(pool=pool_templates['hive_name'],
slots=1,
description=pool_templates['hive_description'])
@y2k-shubham
y2k-shubham / server.log
Created December 4, 2018 09:53
Presto-S3-Communication-Faillure-Stacktrace-[/var/log/presto/server.log]
2018-12-04T12:29:54.433+0530 WARN hive-hive-63 com.facebook.presto.hive.util.ResumableTasks ResumableTask completed exceptionally
java.lang.NoClassDefFoundError: Could not initialize class com.amazon.ws.emr.hadoop.fs.util.EmrFsUtils
at com.amazon.ws.emr.hadoop.fs.s3n.S3Credentials.initialize(S3Credentials.java:45)
at com.amazon.ws.emr.hadoop.fs.HadoopConfigurationAWSCredentialsProvider.<init>(HadoopConfigurationAWSCredentialsProvider.java:26)
at com.amazon.ws.emr.hadoop.fs.guice.DefaultAWSCredentialsProviderFactory.getAwsCredentialsProviderChain(DefaultAWSCredentialsProviderFactory.java:44)
at com.amazon.ws.emr.hadoop.fs.guice.DefaultAWSCredentialsProviderFactory.getAwsCredentialsProvider(DefaultAWSCredentialsProviderFactory.java:28)
at com.amazon.ws.emr.hadoop.fs.guice.EmrFSProdModule.getAwsCredentialsProvider(EmrFSProdModule.java:65)
at com.amazon.ws.emr.hadoop.fs.guice.EmrFSProdModule.createS3Configuration(EmrFSProdModule.java:81)
at com.amazon.ws.emr.hadoop.fs.guice.EmrFSProdModule.createAmazonS3Li
@y2k-shubham
y2k-shubham / create-sbt-project-structure.sh
Created September 26, 2018 06:01
Create non-existent sub-directories using Bash
#/bin/sh
# create directories
mkdir -p /{main,test}/{resources,scala/com/zomato}
mkdir -p {nonsvn,project,target}
# create build.sbt file
echo '
name := "ProjectName"
version := "1.0"
@y2k-shubham
y2k-shubham / ConnectionPoolManager.scala
Created September 6, 2018 13:59
ScalikeJdbc ConnectionPool wrapper utility
import java.sql.Connection
import com.typesafe.scalalogging.Logger
import com.company.utils.jdbc.MySQLConfig
import scalikejdbc.{ConnectionPool, ConnectionPoolSettings, DB, using}
import scala.util.control.NonFatal
object ConnectionPoolManager {
@y2k-shubham
y2k-shubham / An integration test for Presto (UDF) plugin
Last active August 21, 2018 14:28
Presto UDFs integration test
Reference files for integration test of a Presto plugin containing hashing-related UDFs