Cloudera CDP Data Engineer - Certification (CDP-3002) Free Practice Test
Question 1
In the context of Airflow, what is the purpose of the "XCom" feature?
Correct Answer: D
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 2
In the context of Cloudera's Optimization Framework, what role does data statistics collection play?
Correct Answer: D
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 3
Which Kubernetes tool would you use to access logs from a Spark Driver running in a pod?
Correct Answer: D
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 4
You want to schedule your ETL pipeline to run daily at 5:00 AM. How can you configure the DAG's scheduling?
Correct Answer: B
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 5
How does Hive handle bucketing when the data inserted into a bucketed table does not evenly distribute across the buckets?
Correct Answer: C
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 6
Your Iceberg table has a hidden partition by month(event_timestamp). You frequently query with filters on the event_timestamp column. What potential problem might you encounter, and how would you address it?
Correct Answer: D
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 7
You're building an Airflow DAG that involves multiple data processing tasks. How can you handle task dependencies and ensure the tasks execute in the correct order?
Correct Answer: C
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 8
You need to filter data from a Hive table based on a specific date range. Which approach would be most efficient and maintainable?
Correct Answer: B
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).