I hereby claim:
- I am tcarland on github.
- I am tcarland (https://keybase.io/tcarland) on keybase.
- I have a public key ASArEJi3ufKa92N7uTJG5w0LXWWI5y-2b5-ariGWBrIMCAo
To claim this, I am signing this object:
I hereby claim:
To claim this, I am signing this object:
A document describing the configuration of a local, apache-based hadoop distribution running in pseudo-distributed mode. While there are useful VM's provided by various hadoop vendors, running natively provides better performance and more control over the environment for testing purposes (such as running multiple versions). For developers interested in underlying details of the hadoop stack, having a native version based on compiled apache projects is much more clear versus trying to make sense of Cloudera's internal versions.
There are various methods of automation for applying these nodes requisites, including distributing CDH agents, but it is still very useful to have an administration tool that allows interaction with all nodes with proper feedback and diff capabilities. Clustershell works brilliantly for this and is a must for managing clusters without opening too many windows. If not just for clustershell, configuring root ssh is also useful