Skip to content
This repository has been archived by the owner on Aug 26, 2024. It is now read-only.

Notes aggregated from various sources in order to prepare the Google Cloud Platform Profesionnal Data Engineer Certification + Labs & Practice, tips, references...

Notifications You must be signed in to change notification settings

obrunet/GCP_Data_Engineering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GCP Data Engineering

A Professional Data Engineer enables data-driven decision making by collecting, transforming, and publishing data. A Data Engineer should be able to design, build, operationalize, secure, and monitor data processing systems with a particular emphasis on security and compliance; scalability and efficiency; reliability and fidelity; and flexibility and portability. A Data Engineer should also be able to leverage, deploy, and continuously train pre-existing machine learning models.

Example

The GCP Professional Data Engineer exam assesses your ability to:

  • Design data processing systems
  • Build and operationalize data processing systems
  • Operationalize machine learning models
  • Ensure solution quality

Introduction

In the Theory chapter are presented my personnal notes that summerize everything one need to know in order to get prepared for this exam/certification. Then in the Practice section, you'll find all the capstone projects i've accomplished to master the different GCP components.

Theory

  1. All Services
  2. Selection of the appropriate Storage Technology
  3. Build a Storage System & Operations
  4. Design Data Pipelines
  5. Data Processing Solutions TO BE CONTINUED
  6. Build an Instrastructure & Operations
  7. Security & Compliance TO BE CONTINUED
  8. DB - Reliability, Scalability & Availability
  9. Flexibility & Portability
  10. ML Pipelines
  11. Choosing the Appropriate Insfrastructure
  12. Measure, Monitore & Troubleshoot ML
  13. Prebuilt ML Models as a Service
  14. Hadoop & Differences with GCP components

The M.L part is not developped here since i'm already familliar with all the Data Science concepts. Anyway you can refer to the Machine Learning cheatsheets for Stanford's CS 229 and a local backup here

Practice

References

About

Notes aggregated from various sources in order to prepare the Google Cloud Platform Profesionnal Data Engineer Certification + Labs & Practice, tips, references...

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published