How to pass the Databricks Data Engineer Professional certification?

Number of questions: 60

Type of questions: Multiple-choice questions

Duration: 120 minutes

Passing score: 70%

Where to register for the certification: https://www.webassessor.com/databricks

Expiration: 2 years

Topics covered:

  • Databricks tooling
  • Data processing
  • Data modeling
  • Security and governance
  • Monitoring and logging
  • Testing and deployment

Practice tests: No practice exams are available yet.

How to prepare for the certification:

Complete the Advanced Data Engineering with Databricks course (Databricks Academy)

Complete the Advanced Data Engineering notebooks (strongly recommended)

Read the Databricks documentation (recommended)

Features you should know before taking the exam:

Delta optimization (OPTIMIZE, Z-Ordering, Auto Optimize)
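
For instance, a minimal sketch of compaction and Z-Ordering, run from a Databricks notebook where `spark` is predefined (the table name `events` and column `event_time` are placeholders):

```python
# Compact small files and co-locate data by a frequently filtered column.
spark.sql("OPTIMIZE events ZORDER BY (event_time)")

# Auto Optimize can be enabled per table through Delta table properties.
spark.sql("""
    ALTER TABLE events SET TBLPROPERTIES (
        'delta.autoOptimize.optimizeWrite' = 'true',
        'delta.autoOptimize.autoCompact'   = 'true'
    )
""")
```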

Delta clones (shallow and deep)
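
A quick sketch of the difference between the two clone types (table names are hypothetical):

```python
# Shallow clone: copies only the metadata; data files are referenced in place.
spark.sql("CREATE TABLE events_dev SHALLOW CLONE events")

# Deep clone: copies metadata and data files, producing a fully independent table.
spark.sql("CREATE TABLE events_backup DEEP CLONE events")
```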

Databricks REST APIs
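
For example, a minimal sketch of triggering an existing job through the Jobs 2.1 endpoint with Python's `requests`; the workspace URL, token, and job ID are placeholders you would substitute:

```python
import requests

host = "https://<your-workspace>.cloud.databricks.com"  # placeholder
token = "<personal-access-token>"                       # placeholder

resp = requests.post(
    f"{host}/api/2.1/jobs/run-now",
    headers={"Authorization": f"Bearer {token}"},
    json={"job_id": 123},  # ID of an existing job
)
resp.raise_for_status()
print(resp.json()["run_id"])  # ID of the run that was just started
```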

Databricks CLI

VACUUM
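
A minimal example, again with a placeholder table name:

```python
# Remove data files no longer referenced by the Delta log and older than
# the retention threshold (the default is 7 days, i.e. 168 hours).
spark.sql("VACUUM events RETAIN 168 HOURS")
```

Keep in mind that once files are vacuumed, time travel to versions that needed them is no longer possible.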

Data object privileges (GRANT/REVOKE)
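
For instance (the group `analysts` and table `sales.events` are made-up names):

```python
# Grant read access on a table to a group, then take it away again.
spark.sql("GRANT SELECT ON TABLE sales.events TO `analysts`")
spark.sql("REVOKE SELECT ON TABLE sales.events FROM `analysts`")
```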

Delta Lake (time travel, MERGE, optimization, CTAS, INSERT)
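
A compact sketch of the operations that come up most often, using hypothetical `customers`/`updates` tables:

```python
# MERGE: upsert a batch of changes into a Delta table.
spark.sql("""
    MERGE INTO customers AS t
    USING updates AS s
    ON t.customer_id = s.customer_id
    WHEN MATCHED THEN UPDATE SET *
    WHEN NOT MATCHED THEN INSERT *
""")

# Time travel: query an earlier version of the table.
old = spark.sql("SELECT * FROM customers VERSION AS OF 3")

# CTAS: create a new Delta table from the result of a query.
spark.sql("CREATE TABLE customers_eu AS SELECT * FROM customers WHERE region = 'EU'")
```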

Managed and External Delta Tables
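
The practical difference shows up when the table is dropped, as this sketch with placeholder names illustrates:

```python
# Managed table: Databricks controls the storage location;
# DROP TABLE removes both the metadata and the underlying data.
spark.sql("CREATE TABLE managed_events (id INT, ts TIMESTAMP)")

# External table: the data lives at a path you control;
# DROP TABLE removes only the metadata, the files stay behind.
spark.sql("""
    CREATE TABLE external_events (id INT, ts TIMESTAMP)
    LOCATION 's3://my-bucket/events'  -- hypothetical path
""")
```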

Widgets 
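
For example, in a notebook (where `dbutils` is available automatically):

```python
dbutils.widgets.text("env", "dev", "Environment")  # create a text widget
env = dbutils.widgets.get("env")                   # read its current value
print(f"Running against: {env}")
```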

Delta Live Tables (DLT + Auto Loader)
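
A minimal two-table pipeline sketch; note that `dlt` code only runs inside a DLT pipeline, and the source path is a placeholder:

```python
import dlt

# Bronze: ingest raw JSON files incrementally with Auto Loader.
@dlt.table(comment="Raw events ingested with Auto Loader")
def events_bronze():
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("/mnt/raw/events")  # placeholder path
    )

# Silver: enforce a data-quality expectation, dropping rows that violate it.
@dlt.table(comment="Cleaned events")
@dlt.expect_or_drop("valid_id", "id IS NOT NULL")
def events_silver():
    return dlt.read_stream("events_bronze").select("id", "ts", "payload")
```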

Structured Streaming (watermarking + windowing + joins)
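
For example, a windowed aggregation with a watermark (the source table and columns are assumptions):

```python
from pyspark.sql.functions import col, window

# Hypothetical source: a streaming read of a Delta table with an `event_time` column.
events = spark.readStream.table("events_bronze")

counts = (
    events
    .withWatermark("event_time", "5 minutes")  # tolerate 5 minutes of late data
    .groupBy(window(col("event_time"), "10 minutes"), col("event_type"))
    .count()
)

(counts.writeStream
    .outputMode("append")  # allowed here because the watermark bounds the state
    .option("checkpointLocation", "/mnt/checkpoints/event_counts")  # placeholder
    .toTable("event_counts"))
```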

Databricks Repos 

Incremental processing (Auto Loader, COPY INTO)
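
COPY INTO is the SQL-first alternative to Auto Loader; it loads each source file at most once, so re-running it is safe. A sketch with placeholder names:

```python
spark.sql("""
    COPY INTO sales_raw
    FROM 's3://my-bucket/landing/sales'  -- hypothetical landing path
    FILEFORMAT = CSV
    FORMAT_OPTIONS ('header' = 'true', 'inferSchema' = 'true')
    COPY_OPTIONS ('mergeSchema' = 'true')
""")
```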

Medallion Architecture
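
In batch form, the bronze → silver → gold flow can be as simple as this sketch (table and column names are illustrative):

```python
from pyspark.sql.functions import col, sum as sum_

# Bronze -> Silver: validate and deduplicate the raw data.
silver = (
    spark.read.table("orders_bronze")
    .filter(col("order_id").isNotNull())
    .dropDuplicates(["order_id"])
)
silver.write.format("delta").mode("overwrite").saveAsTable("orders_silver")

# Silver -> Gold: business-level aggregate ready for reporting.
gold = (
    spark.read.table("orders_silver")
    .groupBy("customer_id")
    .agg(sum_("amount").alias("total_spent"))
)
gold.write.format("delta").mode("overwrite").saveAsTable("orders_gold")
```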

Databricks Workflows
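
Workflows are multi-task jobs with dependencies between tasks. A sketch of creating a two-task job through the Jobs 2.1 API (all identifiers and paths are placeholders):

```python
import requests

host = "https://<your-workspace>.cloud.databricks.com"  # placeholder
token = "<personal-access-token>"                       # placeholder

job_spec = {
    "name": "nightly-etl",
    "tasks": [
        {
            "task_key": "ingest",
            "notebook_task": {"notebook_path": "/Repos/etl/ingest"},
            "existing_cluster_id": "<cluster-id>",
        },
        {
            "task_key": "transform",
            "depends_on": [{"task_key": "ingest"}],  # runs only after ingest succeeds
            "notebook_task": {"notebook_path": "/Repos/etl/transform"},
            "existing_cluster_id": "<cluster-id>",
        },
    ],
}

resp = requests.post(
    f"{host}/api/2.1/jobs/create",
    headers={"Authorization": f"Bearer {token}"},
    json=job_spec,
)
resp.raise_for_status()
print(resp.json()["job_id"])
```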

Slowly Changing Dimensions (SCD)
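
One way to build a Type 2 dimension is `dlt.apply_changes`, which keeps historical row versions for you; the CDC source and columns below are assumptions, and this only runs inside a DLT pipeline:

```python
import dlt

dlt.create_streaming_table("customers_scd2")

# Apply CDC changes as a Type 2 slowly changing dimension:
# earlier versions of each row are retained with validity ranges.
dlt.apply_changes(
    target="customers_scd2",
    source="customers_cdc",          # hypothetical CDC feed
    keys=["customer_id"],
    sequence_by="change_timestamp",  # orders the change events
    stored_as_scd_type=2,
)
```

The same result can also be built by hand with MERGE, which is worth practicing as well.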

Additional resources:

Data Engineering with Databricks Session 1

Minimally Qualified Candidate:

The minimally qualified candidate should be able to:

  • Understand how to use the Databricks platform and its tools, and the benefits of doing so, including:
    • Platform (notebooks, clusters, Jobs, Databricks SQL, relational entities, Repos)
    • Apache Spark (PySpark, DataFrame API, basic architecture)
    • Delta Lake (SQL-based Delta APIs, basic architecture, core functions)
    • Databricks CLI (deploying notebook-based workflows)
    • Databricks REST API (configure and trigger production pipelines)
  • Build data processing pipelines using the Spark and Delta Lake APIs, including:
    • Building batch-processed ETL pipelines
    • Building incrementally processed ETL pipelines
    • Optimizing workloads
    • Deduplicating data
    • Using Change Data Capture (CDC) to propagate changes
  • Model data management solutions, including:
    • Lakehouse (bronze/silver/gold architecture, databases, tables, views, and the physical layout)
    • General data modeling concepts (keys, constraints, lookup tables, slowly changing dimensions)
  • Build production pipelines using best practices around security and governance, including:
    • Managing notebook and jobs permissions with ACLs
    • Creating row- and column-oriented dynamic views to control user/group access (see the sketch after this list)
    • Securely storing personally identifiable information (PII)
    • Securely deleting data as requested under GDPR and CCPA
  • Configure alerting and storage to monitor and log production jobs, including:
    • Setting up notifications
    • Configuring SparkListener
    • Recording logged metrics
    • Navigating and interpreting the Spark UI
    • Debugging errors
  • Follow best practices for managing, testing and deploying code, including:
    • Managing dependencies
    • Creating unit tests
    • Creating integration tests
    • Scheduling Jobs
    • Versioning code/notebooks
    • Orchestrating Jobs
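
For the dynamic-views bullet above, here is a hedged sketch combining column masking and row filtering; `is_member` is a built-in Databricks SQL function, while the table and group names are placeholders:

```python
spark.sql("""
    CREATE OR REPLACE VIEW customers_redacted AS
    SELECT
        customer_id,
        -- Column-level control: mask PII for users outside a privileged group.
        CASE WHEN is_member('pii_readers') THEN email ELSE 'REDACTED' END AS email,
        region
    FROM customers
    -- Row-level control: non-admins only see EU rows.
    WHERE is_member('admins') OR region = 'EU'
""")
```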

Article written by Youssef Mrini