What could be the expected output of the query SELECT COUNT(DISTINCT *) FROM user on this table?
Correct Answer: B
Explanation: The answer is 2. COUNT(DISTINCT *) excludes any row in which at least one column is NULL, so only the complete, distinct rows are counted.
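For illustration, here is a minimal PySpark sketch (the table name and rows are hypothetical, since the original table is not shown) that contrasts COUNT(*) with a multi-column distinct count, which is the behaviour the question probes: rows containing a NULL in any counted column are excluded from the distinct total.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical "user" table: 5 rows, two of them containing a NULL
spark.createDataFrame(
    [(1, "adam"), (1, "adam"), (2, "sarah"), (3, None), (None, "john")],
    ["user_id", "username"],
).createOrReplaceTempView("user")

spark.sql("""
    SELECT COUNT(*)                          AS all_rows,      -- 5: every row counted
           COUNT(DISTINCT user_id, username) AS distinct_rows  -- 2: NULL-containing rows excluded
    FROM user
""").show()
```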
Question 27
A data engineering team has a time-consuming data ingestion job with three data sources. Each notebook takes about one hour to load new data. One day, the job fails because a notebook update introduced a new required configuration parameter. The team must quickly fix the issue and load the latest data from the failing source. Which action should the team take?
Correct Answer: A
Explanation: The repair run capability in Databricks Jobs allows re-execution of failed tasks without re-running the tasks that already succeeded. When a job fails because of a missing or incorrect task configuration, engineers can fix the parameter, perform a repair run, and resume from the failed task. This saves time, reduces cost, and keeps the workflow intact by avoiding unnecessary recomputation. Updating the task definition with the newly required parameter also prevents future runs from failing. Running the job manually (B) loses the run context and repeats the successful loads; (C) alone does not prevent the failure from recurring; (D) delays resolution. Thus, A follows the correct operational and recovery practice.
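As an illustration only (the run ID, task key, host, token, and parameter name below are placeholders, not taken from the question), a repair run can be triggered through the Jobs 2.1 REST API so that only the failed ingestion task is re-executed with the newly required parameter, while the two successful one-hour loads are left untouched:

```python
import requests

host = "https://<workspace-host>"       # assumed workspace URL
token = "<personal-access-token>"       # assumed credential

resp = requests.post(
    f"{host}/api/2.1/jobs/runs/repair",
    headers={"Authorization": f"Bearer {token}"},
    json={
        "run_id": 123456,                     # hypothetical failed job run
        "rerun_tasks": ["ingest_source_c"],   # hypothetical key of the failing task
        "notebook_params": {"config_param": "value"},  # the newly required parameter (name assumed)
    },
)
resp.raise_for_status()
print(resp.json())
```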
Question 28
An upstream source writes Parquet data as hourly batches to directories named with the current date. A nightly batch job runs the following code to ingest all data from the previous day, as indicated by the date variable: Assume that the fields customer_id and order_id serve as a composite key to uniquely identify each order. If the upstream system is known to occasionally produce duplicate entries for a single order hours apart, which statement is correct?
Correct Answer: B
Explanation: This is the correct answer because the code uses the dropDuplicates method to remove duplicate records within each batch of data before writing to the orders table. However, this method does not check for duplicates across batches or against records already in the target table, so newly written records may duplicate records that are already present. To avoid this, a better approach is to use Delta Lake and perform an upsert on the composite key with MERGE INTO. Verified References: [Databricks Certified Data Engineer Professional], under "Delta Lake" section; Databricks Documentation, under "DROP DUPLICATES" section.
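A minimal sketch of the recommended upsert (the source path, table name, and batch date below are assumptions): de-duplicate the incoming batch on the composite key, then MERGE it into the Delta table so re-delivered orders from earlier batches never create duplicates.

```python
from pyspark.sql import SparkSession
from delta.tables import DeltaTable

spark = SparkSession.builder.getOrCreate()
date = "2024-01-01"  # hypothetical batch date

# De-duplicate within the batch, as the original code already does
batch_df = (
    spark.read.format("parquet")
    .load(f"/mnt/raw_orders/{date}")          # assumed source path
    .dropDuplicates(["customer_id", "order_id"])
)

# Upsert into the Delta table on the composite key so cross-batch duplicates are skipped
(DeltaTable.forName(spark, "orders")
    .alias("t")
    .merge(batch_df.alias("s"),
           "t.customer_id = s.customer_id AND t.order_id = s.order_id")
    .whenNotMatchedInsertAll()
    .execute())
```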
Question 29
An upstream system has been configured to pass the date for a given batch of data to the Databricks Jobs API as a parameter. The notebook to be scheduled will use this parameter to load data with the following code: df = spark.read.format("parquet").load(f"/mnt/source/{date}") Which code block should be used to create the date Python variable used in the above code block?
Correct Answer: E
The code block that should be used to create the date Python variable is:
dbutils.widgets.text("date", "null")
date = dbutils.widgets.get("date")
This code block uses the dbutils.widgets API to create and read a text widget named "date" that accepts a string value as a parameter [1]. The default value of the widget is "null", which means that if no parameter is passed, the date variable will be "null". However, if a parameter is passed through the Databricks Jobs API, the date variable is assigned that value. For example, if the parameter is "2021-11-01", the date variable will be "2021-11-01", and the notebook can use it to load data from the specified path. The other options are not correct, because:
* Option A is incorrect because spark.conf.get("date") is not a valid way to get a parameter passed through the Databricks Jobs API. The spark.conf API is used to get or set Spark configuration properties, not notebook parameters [2].
* Option B is incorrect because input() is not a valid way to get a parameter passed through the Databricks Jobs API. The input() function reads user input from the standard input stream, not from the API request [3].
* Option C is incorrect because sys.argv[1] is not a valid way to get a parameter passed through the Databricks Jobs API. The sys.argv list holds the command-line arguments passed to a Python script, not parameters passed to a notebook [4].
* Option D is incorrect because dbutils.notebooks.getParam("date") is not a valid way to get a parameter passed through the Databricks Jobs API. The dbutils notebook utilities are used when running one notebook from another, not for reading parameters passed through the API [5].
References: Widgets, Spark Configuration, input(), sys.argv, Notebooks
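A minimal notebook sketch of this parameter flow (it runs only inside Databricks, where dbutils and spark are predefined; the example date is hypothetical): the Jobs API call supplies {"notebook_params": {"date": "2021-11-01"}}, the widget exposes the value, and the notebook uses it to build the load path.

```python
# Register the "date" parameter with a default used for interactive runs
dbutils.widgets.text("date", "null")

# When the job run passes notebook_params={"date": "2021-11-01"}, this returns that value
date = dbutils.widgets.get("date")

# Load the batch for the supplied date
df = spark.read.format("parquet").load(f"/mnt/source/{date}")
```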
Question 30
Two of the most common data locations on Databricks are the DBFS root storage and external object storage mounted with dbutils.fs.mount(). Which of the following statements is correct?
Correct Answer: A
DBFS is a file system protocol that allows users to interact with files stored in object storage using syntax and guarantees similar to Unix file systems [1]. DBFS is not a physical file system, but a layer over the object storage that provides a unified view of data across different data sources [1]. By default, the DBFS root is accessible to all users in the workspace, and access to mounted data sources depends on the permissions of the storage account or container [2]. Mounted storage volumes do not need full public read and write permissions, but they do require a valid connection string or access key to be provided when mounting [3]. Both the DBFS root and mounted storage can be accessed when using %sh in a Databricks notebook, as long as the cluster has FUSE enabled [4]. The DBFS root does not store files in ephemeral block volumes attached to the driver, but in the object storage associated with the workspace [1]. Mounted directories will persist saved data to external storage between sessions, unless they are unmounted or deleted [3]. References: DBFS, Work with files on Azure Databricks, Mounting cloud object storage on Azure Databricks, Access DBFS with FUSE
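A minimal notebook sketch (the storage account, container, and secret names are placeholders) contrasting the two locations: a file written to the DBFS root lives in the workspace's backing object storage, while a container mounted with dbutils.fs.mount() persists data to external storage between sessions until it is unmounted.

```python
# File in the DBFS root (backed by workspace object storage, not driver-local disk)
dbutils.fs.put("/tmp/example.txt", "stored in the DBFS root", overwrite=True)

# Mount an external Azure Blob Storage container (names and secret scope are assumptions)
dbutils.fs.mount(
    source="wasbs://my-container@mystorageacct.blob.core.windows.net",
    mount_point="/mnt/my-container",
    extra_configs={
        "fs.azure.account.key.mystorageacct.blob.core.windows.net":
            dbutils.secrets.get(scope="my-scope", key="storage-key")
    },
)

# Data under /mnt/my-container survives cluster restarts until the mount is removed
display(dbutils.fs.ls("/mnt/my-container"))
```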