Skip to content

Instantly share code, notes, and snippets.

View tcarland's full-sized avatar

Timothy C. Arland tcarland

View GitHub Profile

Keybase proof

I hereby claim:

  • I am tcarland on github.
  • I am tcarland (https://keybase.io/tcarland) on keybase.
  • I have a public key ASArEJi3ufKa92N7uTJG5w0LXWWI5y-2b5-ariGWBrIMCAo

To claim this, I am signing this object:

@tcarland
tcarland / hadoop-v3-upgrade.md
Last active December 10, 2020 19:27
Upgrading Hadoop v2 to v3

Hadoop v2 to v3

Stop all Application and Services other than HDFS

Run Fsck

hdfs fsck / -files -blocks -locations > dfs-fsck.log
@tcarland
tcarland / vim-tags-config.md
Last active March 4, 2018 19:14
Configuring Vim for exuberant-ctags

Configuring Vim for ctags and Scala

This document configures Vim or GVim for auto-completion using Vim OmniComplete (built-in) and exuberant-ctags.

It should be noted that the exuberant-ctags project seems a bit dated, and potentially Universal Ctags is a better choice ctags.io.

@tcarland
tcarland / mysql-repl.md
Last active June 29, 2018 02:24
Configuring MySQL Replication

Configuring MySQL for Replication

Configure Mysql:

Additional mysql options common to all instances is provided below. The following are the necessary options for enabling bin-log format necessary for replication. 'server-id' should be unique across all mysql instances.

server-id=1
@tcarland
tcarland / hadoop-psdm.md
Last active June 5, 2020 15:19
Running Hadoop in pseudo-distributed mode.

Pseudo-Distributed Hadoop Environment

A document describing the configuration of a local, apache-based hadoop distribution running in pseudo-distributed mode. While there are useful VM's provided by various hadoop vendors, running natively provides better performance and more control over the environment for testing purposes (such as running multiple versions). For developers interested in underlying details of the hadoop stack, having a native version based on compiled apache projects is much more clear versus trying to make sense of Cloudera's internal versions.

@tcarland
tcarland / hadoop-prereq.md
Last active November 26, 2020 16:03
Hadoop cluster prerequisites

Hadoop Node Prerequisites

Configuring root ssh

There are various methods of automation for applying these nodes requisites, including distributing CDH agents, but it is still very useful to have an administration tool that allows interaction with all nodes with proper feedback and diff capabilities. Clustershell works brilliantly for this and is a must for managing clusters without opening too many windows. If not just for clustershell, configuring root ssh is also useful

@tcarland
tcarland / hadoop-build.md
Last active September 11, 2020 15:04
Information for building hadoop and related components from source.

Building Hadoop and Various Ecosystem Components

A guide for building hadoop and other ecosystem components from source.

Building Hadoop (v2.7.4)

Prerequisites:

  • Oracle JDK 1.8