nalin nalingarg2

@nalingarg2
nalingarg2 / gist:139984886140f37a783d
Last active August 29, 2015 14:17
Flume + Hive using Cloudwick LogGenerator
#
# Hadoop:: flumeHive
# Recipe:: Flume and Hive
#
# Copyright (C) 2015 Cloudwick labs
# Contact :: nalin.garg@cloudwick.com
# All rights reserved - Do Not Redistribute
#
#install sbt
# remote metastore, SQL, hive-client
# hive-client
yum install hive
#hive metastore
yum install hive-metastore
#change hive configuration
<?xml version="1.0"?>
# reference: http://transcriptome.ens.fr/eoulsan/hadoopmode.html
core-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!-- Put site-specific property overrides in this file. -->
<configuration>
nalingarg2 / gist:8e98a11609d33cdd5d34
Last active August 29, 2015 14:18
mysql for cloudera manager
[mysqld]
#transaction-isolation = READ-COMMITTED
datadir=/var/lib/mysql
socket=/var/lib/mysql/mysql.sock
user=mysql
# Disabling symbolic-links is recommended to prevent assorted security risks
symbolic-links=0
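As a rough check on the settings above, the `[mysqld]` section can be read back with Python's standard `configparser`. This is only a sketch: the `MY_CNF` string and the `read_mysqld_settings` helper are hypothetical names introduced for illustration, and a real `my.cnf` may contain directives (such as `!includedir`) that `configparser` does not understand.

```python
import configparser

# Sample text mirroring the [mysqld] settings shown above (assumed, not read from disk).
MY_CNF = """\
[mysqld]
#transaction-isolation = READ-COMMITTED
datadir=/var/lib/mysql
socket=/var/lib/mysql/mysql.sock
user=mysql
symbolic-links=0
"""


def read_mysqld_settings(text):
    """Return the [mysqld] section as a plain dict of strings (hypothetical helper)."""
    # allow_no_value lets bare option names without "=value" parse as well.
    parser = configparser.ConfigParser(allow_no_value=True)
    parser.read_string(text)
    return dict(parser["mysqld"])


settings = read_mysqld_settings(MY_CNF)
for key, value in settings.items():
    print(f"{key} = {value}")
```

Lines starting with `#` are treated as comments, so the disabled `transaction-isolation` setting does not appear in the result.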
#https://docs.puppetlabs.com/learning/agent_master_basic.html
┌───────────────┬───────────────────┐
│ Pre-2.6 │ Post-2.6 │
├───────────────┼───────────────────┤
│ puppetmasterd │ puppet master │
│ puppetd │ puppet agent │
│ puppet │ puppet apply │
│ puppetca │ puppet cert │
│ ralsh │ puppet resource │
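The rows listed above can be turned into a small lookup for rewriting old command lines. The sketch below covers only the five rows shown here; `LEGACY_TO_MODERN`, `MODERN_SUBCOMMANDS`, and `modernize` are hypothetical names, and this is not a Puppet tool.

```python
# Mapping copied from the table rows above: pre-2.6 binary -> post-2.6 command.
LEGACY_TO_MODERN = {
    "puppetmasterd": "puppet master",
    "puppetd": "puppet agent",
    "puppet": "puppet apply",
    "puppetca": "puppet cert",
    "ralsh": "puppet resource",
}

# Subcommands from the right-hand column, used to spot already-modern invocations.
MODERN_SUBCOMMANDS = {"master", "agent", "apply", "cert", "resource"}


def modernize(command_line):
    """Rewrite a pre-2.6 command line to its post-2.6 form (hypothetical helper)."""
    parts = command_line.split()
    if not parts:
        return command_line
    # Leave "puppet <subcommand> ..." alone: it is already in the new style.
    if parts[0] == "puppet" and len(parts) > 1 and parts[1] in MODERN_SUBCOMMANDS:
        return command_line
    replacement = LEGACY_TO_MODERN.get(parts[0])
    if replacement is None:
        return command_line
    return " ".join([replacement] + parts[1:])


print(modernize("puppetd --test"))
print(modernize("ralsh user root"))
```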
nalingarg2 / gist:3a295cad33125ab8ce70
Created April 10, 2015 19:09
cloudera manager sql database for amon etc
To create the MySQL databases for Cloudera Manager:
Log in to MySQL as the root user:
$ mysql -u root -p
Enter password:
Create a database for the Activity Monitor. The database name, user name, and password can be anything you want. For example:
mysql> create database amon DEFAULT CHARACTER SET utf8;
Query OK, 1 row affected (0.00 sec)
mysql> grant all on amon.* TO 'amon'@'%' IDENTIFIED BY 'amon_password';
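Because the database name, user name, and password are free choices, the two statements above follow a fixed pattern that can be generated. The following is a minimal sketch, assuming the same `create database` / `grant` form as the example; `cm_database_statements` is a hypothetical helper, and the validation is deliberately strict because the values are pasted into SQL text rather than bound as parameters.

```python
import re

# Only simple identifiers are accepted, since the values are interpolated into SQL text.
_IDENTIFIER = re.compile(r"[A-Za-z0-9_]+")


def cm_database_statements(database, user, password):
    """Build the create/grant statements for one database (hypothetical helper)."""
    for name in (database, user):
        if not _IDENTIFIER.fullmatch(name):
            raise ValueError(f"unsupported identifier: {name!r}")
    if "'" in password or "\\" in password:
        raise ValueError("password must not contain quotes or backslashes")
    return [
        f"create database {database} DEFAULT CHARACTER SET utf8;",
        f"grant all on {database}.* TO '{user}'@'%' IDENTIFIED BY '{password}';",
    ]


# Reproduces the Activity Monitor example from the steps above.
for statement in cm_database_statements("amon", "amon", "amon_password"):
    print(statement)
```

The printed statements can be pasted at the `mysql>` prompt; other databases would reuse the helper with their own names.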
http://blog.cloudera.com/blog/2013/03/how-to-create-a-cdh-cluster-on-amazon-ec2-via-cloudera-manager/
nalingarg2 / gist:eb852f77193325202fa4
Last active January 7, 2020 06:40
kafka with Spark Streaming
#
# Hadoop:: KafkaSpark
# Recipe:: Kafka and Spark
#
# Copyright (C) 2015 Cloudwick labs
# Contact :: nalin.garg@cloudwick.com
# All rights reserved - Do Not Redistribute
#
# Only one machine is required; the steps below are for proof-of-concept (POC) purposes only.
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>df</groupId>
<artifactId>df</artifactId>
<version>1.0-SNAPSHOT</version>
git init .
git add .
git commit -am "my first commit"
git status
git remote add kinesis https://github.com/nalingarg2/Grad101Infrastructure.git
git remote -v