Module 5 Apache Spark Distributed Persistence Deep Dive (Cloudera Data Engineer Certification Exam)

Full Length Training: https://interview.quicktechie.com/training-program/9/cloudera-certification-training-and-big-data-programs/cloudera-data-engineer-certification-training-for-cdp-3002?sessionId=5368

Certification Exam: https://interview.quicktechie.com/certifications

Cloudera Data Engineer
Audience
The exam tests the skills and knowledge that data engineers need to use the Cloudera Data Platform.

It is intended for data engineering professionals who are proficient at designing, developing, and optimizing data workflows using Cloudera tools. Candidates should have a strong grasp of data modeling for efficient storage, including formats, partitioning, schema design, and Apache Iceberg. They should also have expertise in performance optimization, bottleneck identification, query tuning, and resource efficiency, and be proficient in security configuration, monitoring, troubleshooting, and cloud integration for Cloudera clusters, mainly using Spark and Airflow.

Exam Details
Number of questions: 50
Duration: 90 minutes
Pass Score: 55%
Delivery: online, proctored
Please review the system requirements to enable online, proctored testing through QuestionMark.
Allowed resources: none.
You may not use reference materials, white papers, user guides or any other resources during your exam.
Support: if you need help, please email us.
Cloudera Skills & Knowledge Measured
This exam measures the skills and knowledge topics listed in Table 1 below, along with the weighting of each topic.

Topic                                                                     Weight (% of exam)

Spark                                                                     48%
    Fundamentals on Spark over Kubernetes
    Work with DataFrames
    Understand Distributed Processing
    Implement Hive and Spark Integration
    Understand Distributed Persistence

Airflow                                                                   10%
    Implement incremental extraction in Apache Airflow from source system
    Use Apache Airflow to schedule ETL pipelines
    Use Apache Airflow to schedule quality checks
    Work with DAGs

Performance Tuning                                                        22%
    Know Basic tools in (Spark) Performance Tuning
    Understand Optimization Framework and Explain plans
    Understand Inferring Schemas
    Work with Improving Join Performance
    Leverage Caching Data for Reuse
    Work with Partitioned and Bucketed Tables

Deployment                                                                10%
    Use the API and CLI
    Work in the Data Engineering Service

Iceberg                                                                   10%
    Understand Iceberg

Video: Module 5 Apache Spark Distributed Persistence Deep Dive (Cloudera Data Engineer Certification Exam), from the QuickTechie Official channel