@donaldh
Last active March 30, 2017 15:27
PNDA deployment notes

PNDA Deployment View

           saltmaster
               
         ,- kafka-0, ..., kafka-n
        /   
bastion --- cdh-edge
        \
         `- cdh-mgr1
            cdh-dn-0
            cdh-dn-1, ..., cdh-dn-n

kafka

kafka, kafka_manager, platform_testing_general, elk, zookeeper

cdh-edge

cloudera_edge, console_frontend, console_backend_data_logger, console_backend_data_manager, graphite, gobblin, deployment_manager, package_repository, data_service, impala-shell, yarn-gateway, hbase_opentsdb_tables, hdfs_cleaner, master_dataset, elk, logserver, kibana_dashboard, jupyter, cloudera_manager, platform_testing_cdh, mysql_connector, pnda_restart

gobblin

A cron job runs at minutes 0 and 30 of each hour: gobblin-mapreduce.sh --conf /opt/pnda/gobblin/configs/mr.pull
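
To make the schedule concrete, here is a minimal sketch that computes the next run time. It assumes "0,30 mins" means the cron minute field is `0,30` with every other field left as a wildcard; the function name `next_gobblin_run` is hypothetical and not part of PNDA.

```python
from datetime import datetime, timedelta


def next_gobblin_run(now: datetime) -> datetime:
    """Return the next time strictly after `now` whose minute is 0 or 30.

    Assumption: the job fires at minute 0 and minute 30 of every hour.
    """
    base = now.replace(second=0, microsecond=0)
    if now.minute < 30:
        # Still in the first half of the hour: next run is at :30.
        return base.replace(minute=30)
    # In the second half of the hour: next run is at :00 of the next hour.
    return base.replace(minute=0) + timedelta(hours=1)
```

For example, calling it with 15:27 gives 15:30 the same day, and calling it with 23:45 rolls over to 00:00 the next day.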

KafkaSimpleSource ► PNDAConverter ► [SchemaRowCheckPolicy] ► PNDAKiteWriter, which writes to HDFS via cdh-mgr1

https://github.com/pndaproject/platform-salt/tree/c9bc18f2ee6cccda277d05f3dc072d4216dd90e5/salt/gobblin/templates
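
The source ► converter ► [row check] ► writer chain above can be sketched as a plain function pipeline. This is only an illustration of the data flow, not Gobblin's API: `run_pipeline`, `to_record`, `has_value` and the record shape are all hypothetical, and reading the square brackets as "optional step" is an assumption.

```python
from typing import Callable, Iterable, List, Optional

Record = dict  # hypothetical record shape, for illustration only


def run_pipeline(
    source: Iterable[bytes],
    convert: Callable[[bytes], Record],
    write: Callable[[Record], None],
    row_check: Optional[Callable[[Record], bool]] = None,
) -> int:
    """Pull raw messages, convert each one, optionally filter, then write.

    Returns the number of records handed to the writer.
    """
    written = 0
    for raw in source:
        record = convert(raw)
        # The row check is skipped entirely when no policy is configured.
        if row_check is not None and not row_check(record):
            continue
        write(record)
        written += 1
    return written


# Toy stand-ins for the four stages.
messages = [b"a=1", b"bad", b"b=2"]


def to_record(raw: bytes) -> Record:
    key, _, value = raw.decode().partition("=")
    return {"key": key, "value": value}


def has_value(record: Record) -> bool:
    return record["value"] != ""


sink: List[Record] = []
count = run_pipeline(messages, to_record, sink.append, row_check=has_value)
```

Here the list `messages` stands in for the Kafka source and `sink.append` stands in for the writer; the message `b"bad"` converts to a record with an empty value and is dropped by the row check.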

cdh-mgr1

cloudera_namenode, mysql_connector, oozie_database, hue, opentsdb, grafana
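
The role lists for kafka, cdh-edge and cdh-mgr1 above can be collected into one lookup table, which makes it easy to ask which node type carries a given role. A minimal sketch: the role names are copied from these notes, while `ROLES_BY_NODE` and `nodes_with_role` are hypothetical names and not part of platform-salt.

```python
from typing import Dict, List

# Role names as listed in these notes, keyed by node type.
ROLES_BY_NODE: Dict[str, List[str]] = {
    "kafka": [
        "kafka", "kafka_manager", "platform_testing_general", "elk", "zookeeper",
    ],
    "cdh-edge": [
        "cloudera_edge", "console_frontend", "console_backend_data_logger",
        "console_backend_data_manager", "graphite", "gobblin",
        "deployment_manager", "package_repository", "data_service",
        "impala-shell", "yarn-gateway", "hbase_opentsdb_tables", "hdfs_cleaner",
        "master_dataset", "elk", "logserver", "kibana_dashboard", "jupyter",
        "cloudera_manager", "platform_testing_cdh", "mysql_connector",
        "pnda_restart",
    ],
    "cdh-mgr1": [
        "cloudera_namenode", "mysql_connector", "oozie_database", "hue",
        "opentsdb", "grafana",
    ],
}


def nodes_with_role(role: str) -> List[str]:
    """Return the node types (sorted by name) that list the given role."""
    return sorted(node for node, roles in ROLES_BY_NODE.items() if role in roles)
```

For instance, `nodes_with_role("gobblin")` returns only `cdh-edge`, while `elk` appears on both the kafka and cdh-edge nodes.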

cdh-dn-0


donaldh commented Mar 30, 2017

Kite - http://kitesdk.org/docs/current/ - a dataset API for Hadoop.

Impala - https://impala.incubator.apache.org/ - analytic SQL queries on Hadoop.

Spark - http://spark.apache.org/ - a fast and general engine for large-scale data processing.

Oozie - http://oozie.apache.org/ - workflow scheduler for Hadoop jobs.

YARN - https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html - Hadoop's resource manager.

Hive - https://hive.apache.org/ - SQL access to data in HDFS.

HBase - https://hbase.apache.org/ - a Bigtable-style store on Hadoop HDFS.
