Question 21

Which statement describes integration testing?
  • Question 22

    Which statement deletes records from the transactions Delta table where transactionDate is greater than the current timestamp?
  • Question 23

    Which of the following statements is true about Databricks Repos?
  • Question 24

    A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Incremental state information should be maintained for 10 minutes for late-arriving data.
    Streaming DataFrame df has the following schema:
    "device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
    Code block:

    Choose the response that correctly fills in the blank within the code block to complete this task.
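
    To make the phrase "non-overlapping five-minute interval" concrete, here is a minimal plain-Python sketch of tumbling-window averaging over rows shaped like the schema above. It is not the Spark API and not the missing code block: the helper names `window_start` and `tumbling_averages` and the sample rows are hypothetical, and the 10-minute late-data retention is deliberately not modeled.

```python
from collections import defaultdict
from datetime import datetime, timedelta

WINDOW = timedelta(minutes=5)  # tumbling (non-overlapping) window length
EPOCH = datetime(1970, 1, 1)


def window_start(ts: datetime) -> datetime:
    """Floor a timestamp to the start of its five-minute window."""
    seconds = int((ts - EPOCH).total_seconds())
    size = int(WINDOW.total_seconds())
    return EPOCH + timedelta(seconds=seconds - seconds % size)


def tumbling_averages(rows):
    """Average temp and humidity per window.

    rows: iterable of (device_id, event_time, temp, humidity) tuples.
    Returns {window_start: (avg_temp, avg_humidity)}.
    """
    buckets = defaultdict(list)
    for _device_id, event_time, temp, humidity in rows:
        # Each event belongs to exactly one window, so windows never overlap.
        buckets[window_start(event_time)].append((temp, humidity))
    return {
        start: (
            sum(t for t, _ in values) / len(values),
            sum(h for _, h in values) / len(values),
        )
        for start, values in buckets.items()
    }


# Hypothetical sample data for illustration only.
rows = [
    (1, datetime(2024, 1, 1, 12, 1), 20.0, 40.0),
    (1, datetime(2024, 1, 1, 12, 4), 22.0, 50.0),
    (2, datetime(2024, 1, 1, 12, 5), 30.0, 60.0),
]
result = tumbling_averages(rows)
```

    In this sketch the first two events fall into the window starting at 12:00 and the third into the window starting at 12:05. A streaming engine additionally has to decide how long to keep each window's state open for late events, which is the role of the 10-minute requirement in the question.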
  • Question 25

    A data engineer is developing a Lakeflow Declarative Pipeline (LDP) using a Databricks notebook directly connected to their pipeline. After adding new table definitions and transformation logic in their notebook, they want to check for any syntax errors in the pipeline code without actually processing data or running the pipeline.
    How should the data engineer perform this syntax check?