Skip to content

Google Cloud Platform Big Data & Machine Learning Fundamentals

This one-day course introduces participants to the big data and machine learning capabilities of Google Cloud Platform (GCP). It provides an overview of the Google Cloud Platform and a more in-depth view of its data processing capabilities. 

  • Duration: 1 day
  • Format: Face-to-face or distance learning
  • Prerequisites: Basic experience in the following areas:
    -Common query language such as SQL
    -ETL tool
    -Data modelling
    -Machine learning and/or statistics
    -Python programming
    Experience in application development.
  • Audience: People who want an overview of Google Cloud Platform products and services for data processing and machine learning.
  • Price: Please contact us
    More information in our training catalogue

The course in detail

 

Module 1: Introduction to the Google Cloud Platform

  • Discover the basic principles of the Google platform.
  • Explore big data products on the Google Cloud Platform.

 

Module 2: Fundamentals of Compute and Storage

  • Discover the power of a CPU on demand (Compute Engine) and a global file system (Cloud Storage).
  • Explore CloudShell.
  • Workshop: Configure an Ingest-Transform-Publish data processing pipeline.

 

Module 3: Data Analytics in the cloud

  • Explore springboards into the cloud.
  • Learn more about Cloud SQL: your SQL database in the cloud.
  • Workshop: Import data into CloudSQL and run queries.
  • Run Spark on Dataproc.
  • Practical: Get machine learning recommendations with Spark on Dataproc.

 

Module 4: Scaling up of data analysis

  • Learn about fast random access.
  • Explore Datalab and BigQuery.
  • Workshop: Create a machine learning dataset.

 

Module 5: Machine learning

  • Explore machine learning with TensorFlow.
  • Laboratory: Perform ML with TensorFlow.
  • Discover pre-built models for common needs.
  • Workshop: Use ML APIs.

 

Module 6: Data processing architectures

  • Discover message-oriented architectures with Pub/Sub.
  • Create pipelines with Dataflow.
  • Learn about reference architecture for real-time and batch data processing.

 

Module 7: Summary

  • Explore the question ‘Why GCP?’
  • Make a plan of where to go from here.
  • Discover additional resources.

Contact us Any questions? Or are you interested in our other Google Cloud services?  Our experts would be happy to help!