Introduction to Data Engineering on Google Cloud

On this one-day course, you'll learn about data engineering on Google Cloud, the roles and responsibilities of data engineers, and how those map to offerings provided by the platform.

google badge
Book this course
Call our sales team today
1 day course
Supporting material
Private
Private
A private training session for your team. Groups can be of any size, at a location of your choice including our training centres.

Course Credits

Select the pre-paid training investment that’s right for you and help your money stretch a little further with our course credits.

Jellyfish is an award-winning Google Cloud Partner. Our trainers work with Google Cloud on a daily basis, so you'll benefit from the years of industry experience they’ll share with you.

On this one-day course, you'll learn about ways to address data engineering challenges. We'll teach you everything you need to know about the role of a data engineer and identifying data engineering tasks and core components used on Google Cloud.

You'll also learn how to create and deploy data pipelines of varying patterns on Google Cloud, as well as how to identify and utilize various automation techniques on the platform.

This Introduction to Data Engineering on Google Cloud course is available as a private session that can be delivered virtually or at a location of your choice in the US.

Course overview

Who should attend:

This course is ideal for data engineers, database administrators and system administrators, as well as any other individuals interested in learning about data engineering techniques on Google Cloud.

What you'll learn:

By the end of this course, you will be able to:

  • Understand the role of a data engineer
  • Identify data engineering tasks and core components used on Google Cloud
  • Understand how to create and deploy data pipelines of varying patterns on Google Cloud
  • Identify and utilise various automation techniques on Google Cloud

Prerequisites

In order to get the most out of this course, you should have prior Google Cloud experience at the fundamental level; especially when it comes to using Cloud Shell and accessing products from the Google Cloud console. Basic proficiency with a common query language such as SQL, experience with data modelling and ETL (extract, transform, load) activities, and experience developing applications using a common programming language such as Python is also recommended.

Course agenda

Module 1: Data Engineering Tasks & Components
  • The role of a data engineer
  • Data sources versus data sinks
  • Data formats
  • Storage solution options on Google Cloud
  • Metadata management options on Google Cloud
  • Sharing datasets using Analytics Hub
Module 2: Data Replication & Migration
  • Replication and migration architecture
  • The gcloud command-line tool
  • Moving datasets
  • Datastream
Module 3: The Extract & Load Data Pipeline Pattern
  • Extract and load architecture
  • The bq command-line tool
  • BigQuery Data Transfer Service
  • BigLake
Module 4: The Extract, Load & Transform Data Pipeline Pattern
  • Extract, load, and transform (ELT) architecture
  • SQL scripting and scheduling with BigQuery
  • Dataform
Module 5: The Extract, Transform, and Load Data Pipeline Pattern
  • Extract, transform, and load (ETL) architecture
  • Google Cloud GUI tools for ETL data pipelines
  • Batch data processing using Dataproc
  • Streaming data processing options
  • Bigtable and data pipelines
Module 6: Automation Techniques
  • Automation patterns and options for pipelines
  • Cloud Scheduler and Workflows
  • Cloud Composer
  • Cloud Run Functions
  • Eventarc
close
Don't miss out
Keep up to date with news, views and offers from Jellyfish Training.
Your data will be handled in accordance with our Privacy Policy