Overview
In this course you will work hands-on through real-world challenges faced when building streaming data pipelines. The primary focus is on managing continuous, unbounded data with Google Cloud products.
Prerequisites
Participants should have:
- Proficiency in a common programming language like Python
- A strong understanding of SQL
- Knowledge of data fundamentals such as data modelling, formats, and ETL/ELT processes
- Familiarity with the Google Cloud Platform (GCP)
Target audience
This course is designed for:
- Data Engineers
- Data Analysts
- Data Architects
Delegates will learn how to
By the end of this course, learners will be able to:
- Ingest and manage streaming data using Pub/Sub and Managed Service for Apache Kafka
- Build and deploy streaming data pipelines with Dataflow
- Implement streaming data solutions for real-time analytics and application serving with BigQuery and Bigtable
Outline
Module 1
Topics
- This module introduces the fundamentals of building streaming data pipelines on Google Cloud, providing a foundation for the entire course. It begins by outlining the course's overall learning objectives and introducing a practical, hands-on scenario that will be used throughout the content and labs to make the concepts tangible.
Objectives
- Introduce the course learning objectives and the scenario that will be used to bring hands-on learning to building streaming data pipelines.
- Describe the concept of streaming data pipelines, the challenges associated with them, and their role within the data engineering process.
Module 2
Topics
- This module provides an introduction to streaming data use cases and architectures. You will learn about the applications and common architectural patterns for real-time data processing across four key scenarios: Streaming ETL, Streaming AI/ML, Streaming Application, and Reverse ETL.
Objectives
- Learn about the various streaming use cases and their applications, including Streaming ETL, Streaming AI/ML, Streaming Application, and Reverse ETL.
- Identify and describe common sample architectures for each of these use cases.
Module 3
Topics
- This module provides a comprehensive overview of building streaming data pipelines on Google Cloud, covering the core services for messaging, processing, and analysis. It's designed to give you a hands-on understanding of how these components work together in a cohesive, real-time architecture.
Objectives
- Define messaging concepts
- Use the console to create various Pub/Sub and Kafka elements
- Know when to use Pub/Sub or Managed Service for Apache Kafka
- Describe the Dataflow service and the challenges of streaming data
- Build and deploy a streaming pipeline
- Explore various methods of ingesting data into BigQuery
- Learn about BigQuery continuous queries and how to use BigQuery for ETL and reverse ETL
- Configure Pub/Sub to BigQuery streaming
- Architect BigQuery into your streaming pipelines
- Describe the big picture of data movement and interaction
- Establish a streaming pipeline from Dataflow to Bigtable
- Analyze the continuous Bigtable data stream for trends using BigQuery
- Synchronize the trends analysis back into the user-facing application
Module 4
Topics
- This module provides a comprehensive wrap-up of the course, summarizing the key concepts you've learned for building resilient and robust streaming data pipelines on Google Cloud.
Objectives
- Summarize the course: what you learned about the various Google Cloud products, what you achieved throughout the course, and what you can do next as a result of completing it.
Exams and assessments
There is no specific certification related to this course.
Hands-on learning
There are four practical labs in this course.
Frequently asked questions
How can I create an account on myQA.com?
There are a number of ways to create an account. If you are a self-funder, simply select the "Create account" option on the login page.
If you have been booked onto a course by your company, you will receive a confirmation email. From this email, select "Sign into myQA" and you will be taken to the "Create account" page. Complete all of the details and select "Create account".
If you have the booking number, you can also select the "I have a booking number" option. Enter the booking reference and your surname. If the details match, you will be taken to the "Create account" page, where you can enter your details and confirm your account.
Find more answers to frequently asked questions in our FAQs: Bookings & Cancellations page.
How do QA’s virtual classroom courses work?
Our virtual classroom courses allow you to access award-winning classroom training, without leaving your home or office. Our learning professionals are specially trained on how to interact with remote attendees and our remote labs ensure all participants can take part in hands-on exercises wherever they are.
We use the WebEx video conferencing platform by Cisco. Before you book, check that you meet the WebEx system requirements and run a test meeting to ensure the software is compatible with your firewall settings. If it doesn’t work, try adjusting your settings or contact your IT department about permitting the website.
How do QA’s online courses work?
QA online courses, also commonly known as distance learning or e-learning courses, take the form of interactive software designed for individual learning. You will also have access to full support from our subject-matter experts for the duration of your course.
Once you have purchased the online course and completed your registration, you will receive the details you need to access it immediately through our e-learning platform, so you can start learning straight away from any compatible device. Access to the online learning platform is valid for one year from the booking date.
All courses are built around case studies and presented in an engaging format, which includes storytelling elements, video, audio and humour. Every case study is supported by sample documents and a collection of Knowledge Nuggets that provide more in-depth detail on the wider processes.
When will I receive my joining instructions?
Joining instructions for QA courses are sent two weeks prior to the course start date, or immediately if the booking is confirmed within this timeframe. For course bookings made via QA but delivered by a third-party supplier, joining instructions are sent to attendees prior to the training course, but timescales vary depending on each supplier’s terms.
When will I receive my certificate?
Certificates of Achievement are issued at the end of the course, either as a hard copy or via email.