Build Data Lakes and Warehouses on Google Cloud

Course 1492

  • Duration: 1 day
  • Language: English
  • Level: Intermediate

While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.

Google Data Lakes and Warehouses Delivery Methods

  • In-Person

  • Online

  • Upskill your whole team by bringing Private Team Training to your facility.

Google Data Lakes and Warehouses Course Information

Important Course Information:

  • Differentiate between a data lake, data warehouse, data lakehouse
  • Explain the data lakehouse concept and how it addresses the limitations of traditional data lakes and data warehouses
  • Identify various data sources that BigQuery can query
  • Recognize the capabilities of using BigQuery to create and access AI models

Prerequisites:

To benefit from this course participants should have familiarity, training, or experience with the principles and activities associated with data engineering, data warehouse or data lake architecture, SQL query language, or data management principles.

Google Data Lakes and Warehouses Course Outline

Module 01

Introduce the learner to the topics that will be covered in the course and the skills they will learn.

Module 02

This module introduces the foundational concepts of data lakes and data warehouses, setting the stage for modern architectures on Google Cloud.

Module 03

This module details the concept of a lakehouse and introduces the Google Cloud products most commonly used to build a modern data lakehouse using open-source formats.

Module  04

This module explores BigQuery as the cornerstone of a modern data warehouse and introduces BigLake for unifying access across the data lake and warehouse.

Module 05

This module focuses on advanced architectural patterns for the lakehouse, including data processing, orchestration, and comprehensive data governance across BigQuery, Cloud Storage, and BigLake.

Module 06

This module provides labs to deepen skills in the tools and technologies used by a lakehouse on Google Cloud and an overview of best practices, common mistakes, and future trends

Module 07

Summarize the architectural and operational capabilities of the BigQuery-centric data lakehouse, covering governance, advanced analytics, and machine learning

Need Help Finding The Right Training Solution?

Our training advisors are here for you.

Google Data Lakes and Warehouses FAQs

Data Engineers, Data Analysts, Database Administrators, Big Data Architects

4 modules, 4 labs, 5 Classroom Activities

The skills taught align with real enterprise use cases such as:

  • Data warehousing and analytics pipelines
  • Large-scale ETL processing
  • Periodic reporting and business intelligence workflows
  • Cost-optimized data processing architectures

  • The course supports enterprise scenarios such as:
  • Building centralized analytics platforms
  • Migrating on-premises data warehouses to the cloud
  • Supporting BI and reporting workloads
  • Designing cost-effective, scalable data architectures