Skip to content

Instantly share code, notes, and snippets.

View shrijeet's full-sized avatar

Shrijeet shrijeet

  • Redwood City, CA
View GitHub Profile
@shrijeet
shrijeet / TabDelimToProtoMessage.java
Created April 11, 2012 00:45
Protobuf message from a tab delimited record
package com.example;
import java.io.*;
import java.util.List;
import java.util.regex.Pattern;
import com.google.protobuf.Descriptors.FieldDescriptor;
import com.google.protobuf.Descriptors.FieldDescriptor.JavaType;
import com.example.generated.LogFileProtos.LogFile;
@shrijeet
shrijeet / quickstart-hdfs.properties
Created July 9, 2016 19:34
kafka-connect-hdfs quickstart-hdfs.properties
name=hdfs-sink
connector.class=io.confluent.connect.hdfs.HdfsSinkConnector
tasks.max=1
topics=test_hdfs
hdfs.url=hdfs://10.211.55.101:8020
flush.size=3
hive.integration=true
hive.metastore.uris=thrift://localhost:9083
schema.compatibility=BACKWARD
format.class=io.confluent.connect.hdfs.parquet.ParquetFormat
@shrijeet
shrijeet / error_trace_hung_client.java
Created October 19, 2012 01:15
Error stack track for asynchbase client getting hung
12/09/23 15:02:18 ERROR async.RegionClient: Unexpected exception from downstream on [id: 0x5cac6a45, /172.22.8.8:44666 => /172.22.4.46:60020]
java.io.IOException: Connection reset by peer
at sun.nio.ch.FileDispatcher.read0(Native Method)
at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:21)
at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:233)
at sun.nio.ch.IOUtil.read(IOUtil.java:200)
at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:236)
at org.jboss.netty.channel.socket.nio.NioWorker.read(NioWorker.java:63)
at org.jboss.netty.channel.socket.nio.AbstractNioWorker.processSelectedKeys(AbstractNioWorker.java:373)
at org.jboss.netty.channel.socket.nio.AbstractNioWorker.run(AbstractNioWorker.java:247)
package com.example;
import com.google.common.hash.Funnel;
import com.google.common.hash.PrimitiveSink;
public enum StringFunnel implements Funnel<String> {
INSTANCE;
@Override
public void funnel(String from, PrimitiveSink into) {
@shrijeet
shrijeet / KeyValue.java
Created September 18, 2013 01:55
KeyValue size
+ 4 // int: Total length of the whole KeyValue.
+ 4 // int: Total length of the key part of the KeyValue.
+ 4 // int: Total length of the value part of the KeyValue.
+ 2 // short: Row key length.
+ key.length // The row key.
+ 1 // byte: Family length.
+ family.length // The family.
+ qualifier.length // The qualifier.
+ 8 // long: The timestamp.
+ 1 // byte: The type of KeyValue.
@shrijeet
shrijeet / hive_no_system_exit.patch
Created May 30, 2013 18:38
Do not run taskCleanup if running hive in sequential mode
if (tsk.ifRetryCmdWhenFail()) {
- if (running.size() != 0) {
+ if (running.size() != 0 && executeTasksInParallel()) {
taskCleanup();
}
// in case we decided to run everything in local mode, restore the
@@ -1183,7 +1183,7 @@ public class Driver implements CommandProcessor {
}
SQLState = "08S01";
console.printError(errorMessage);
@shrijeet
shrijeet / .vimrc
Last active December 17, 2015 20:18 — forked from rocarvaj/.vimrc
" VIM Configuration File
" Description: Optimized for C/C++ development, but useful also for other things.
" Author: Gerhard Gappmeier
"
" set UTF-8 encoding
set enc=utf-8
set fenc=utf-8
set termencoding=utf-8
" disable vi compatibility (emulation of old bugs)
@shrijeet
shrijeet / npe_beeswax_1.java
Created May 15, 2013 20:13
NPE during query result fetch from history
java.lang.NullPointerException
at com.cloudera.beeswax.BeeswaxServiceImpl$RunningQueryState.access$600(BeeswaxServiceImpl.java:124)
at com.cloudera.beeswax.BeeswaxServiceImpl.doWithState(BeeswaxServiceImpl.java:770)
at com.cloudera.beeswax.BeeswaxServiceImpl.fetch(BeeswaxServiceImpl.java:980)
at com.cloudera.beeswax.api.BeeswaxService$Processor$fetch.getResult(BeeswaxService.java:987)
at com.cloudera.beeswax.api.BeeswaxService$Processor$fetch.getResult(BeeswaxService.java:971)
at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:206)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
/etc/init.d/hadoop-hdfs-namenode
/etc/default/hadoop
/etc/default/hadoop-0.20-mapreduce
/etc/default/hadoop-fuse
/etc/default/hadoop-hdfs
/etc/default/hadoop-hdfs-namenode
/etc/default/hadoop-hdfs-secondarynamenode
/usr/lib/hadoop/libexec/hadoop-config.sh
/usr/lib/hadoop/libexec/hadoop-layout.sh
/etc/hadoop/conf/hadoop-env.sh
export HADOOP_COMMON_HOME=/usr/lib/hadoop
export HADOOP_CONF_DIR=/etc/hadoop/conf
export HADOOP_DATANODE_USER=hdfs
export HADOOP_HDFS_HOME=/usr/lib/hadoop-hdfs
export HADOOP_HOME=/usr/lib/hadoop-0.20-mapreduce
export HADOOP_HOME_WARN_SUPPRESS=true
export HADOOP_IDENT_STRING=hadoop
export HADOOP_IDENT_STRING=hdfs
export HADOOP_JOBTRACKERHA_USER=mapred
export HADOOP_JOBTRACKER_USER=mapred