Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save misho-kr/b8dbe9a7bd527ba89c6bf50d150fb96d to your computer and use it in GitHub Desktop.
Save misho-kr/b8dbe9a7bd527ba89c6bf50d150fb96d to your computer and use it in GitHub Desktop.
Summary of "Google Cloud Big Data and Machine Learning Fundamentals" from Coursera.Org

Introduction the Google Cloud big data and machine learning products and services that support the data-to-AI lifecycle. Explore the processes, challenges, and benefits of building a big data pipeline and machine learning models with Vertex AI on Google Cloud.

Big Data and Machine Learning on Google Cloud

Key components of Google Cloud's infrastructure. Big data and machine learning products and services that support the data-to AI lifecycle on Google Cloud.

Learning Objectives:

Identify how elements of the Google Cloud infrastructure have enabled big data and ML capabilities.
Identify the big data and machine learning products available on Google Cloud.
Explore a BigQuery dataset.
  • Google Cloud infrastructure, Compute, Storage
  • The history of big data and ML products
  • Big data and ML product categories

Lab: Exploring a BigQuery Public Dataset

Reading list:

Data Engineering for Streaming Data

Introduction to Google Cloud's solution to managing streaming data. End-to-end pipeline, data ingestion with Pub/Sub, data processing with Dataflow, and data visualization with Looker and Data Studio.

Learning Objectives:

Describe an end-to-end streaming data workflow from ingestion to data visualization.
Identify modern data pipeline challenges and how to solve them at scale with Dataflow
Build collaborative real-time dashboards with data visualization tools.
Create a streaming data pipeline for a real-time dashboard with Dataflow.
  • Big data challenges
  • Message-oriented architecture
  • Designing streaming pipelines with Apache Beam
  • Implementing streaming pipelines on Cloud Dataflow
  • Visualization with Looker Studio

Lab: Creating a Streaming Data Pipeline for a Real-Time Dashboard with Dataflow

Reading list:

Big Data with BigQuery

BigQuery - Google's fully-managed, serverless data warehouse. BigQuery ML, and the processes and key commands that are used to build custom machine learning models.

  • Storage and analytics
  • Querying TB of data in seconds
  • Using BigQuery ML to predict customer lifetime value
  • BigQuery ML project phases and key commands

Lab:

Reading list:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment