Skip to content

Instantly share code, notes, and snippets.

@abajwa-hw
Last active June 30, 2016 18:12
Show Gist options
  • Save abajwa-hw/e884bcc423cfbadf4a1d to your computer and use it in GitHub Desktop.
Save abajwa-hw/e884bcc423cfbadf4a1d to your computer and use it in GitHub Desktop.
HDF workshop steps

HDF workshop

Setup already done

  • Download and import latest HDP 2.3 sandbox VM from http://hortonworks.com/sandbox and import into VMware Fusion
  • Deployed and install VNC ambari service (to be able to 'remote desktop' in and use eclipse)
    sudo git clone https://github.com/hortonworks-gallery/ambari-vnc-service.git /var/lib/ambari-server/resources/stacks/HDP/2.3/services/VNCSERVER
    
    
  • Deployed Nifi ambari service (but not yet installed)
sudo git clone https://github.com/abajwa-hw/ambari-nifi-service.git   /var/lib/ambari-server/resources/stacks/HDP/2.3/services/NIFI
service ambari restart
  • Banana setup for Twitter demo
    #download prebuilt banana dashboard
    cd /opt/lucidworks-hdpsearch/solr/server/solr-webapp/webapp/banana/app/dashboards/
    mv default.json default.json.orig
    wget https://raw.githubusercontent.com/abajwa-hw/ambari-nifi-service/master/demofiles/default.json
    
    
  • Solr setup for Twitter demo
    #modify solrconfig to handle twitter time format
    vi /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs/conf/solrconfig.xml
    
    #start Solr in cloud mode
    /opt/lucidworks-hdpsearch/solr/bin/solr start -c -z localhost:2181
    
    #create collection called tweets
    /opt/lucidworks-hdpsearch/solr/bin/solr create -c tweets \
     -d data_driven_schema_configs \
     -s 1 \
     -rf 1 
    
  • Download custom network processor
    cd
    sudo git clone https://github.com/abajwa-hw/nifi-network-processor.git  
    
    
  • Eclipse setup: Created run configuration for /root/nifi-network-processor using /usr/share/apache-maven

Steps

Getting started

Start with steps from: https://github.com/abajwa-hw/ambari-nifi-service

  1. Install Nifi via Ambari
  2. Basic overview of OOTB processors
  3. Build simple flow to Read log over UDF and write to HDFS
  4. Import template for more complete example of log ingest flow
  5. Import template for Twitter -> HDFS/Solr to show Nifi interacting with HDP
  6. Nifi features:

Custom processor

Steps available at: https://github.com/abajwa-hw/nifi-network-processor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment