Question 16

A Databricks job has been configured with 3 tasks, each of which is a Databricks notebook. Task A does not depend on other tasks. Tasks B and C run in parallel, with each having a serial dependency on task A.
If tasks A and B complete successfully but task C fails during a scheduled run, which statement describes the resulting state?
  • Question 17

    A data engineering team has created a series of tables using Parquet data stored in an external sys-tem. The
    team is noticing that after appending new rows to the data in the external system, their queries within
    Databricks are not returning the new rows. They identify the caching of the previous data as the cause of this
    issue.
    Which of the following approaches will ensure that the data returned by queries is always up-to-date?
  • Question 18

    A junior data engineer seeks to leverage Delta Lake's Change Data Feed functionality to create a Type 1 table representing all of the values that have ever been valid for all rows in a bronze table created with the property delta.enableChangeDataFeed = true. They plan to execute the following code as a daily job:
    Which statement describes the execution and results of running the above query multiple times?
  • Question 19

    You are currently working with the second team and both teams are looking to modify the same notebook, you noticed that the second member is copying the notebooks to the personal folder to edit and replace the collaboration notebook, which notebook feature do you recommend to make the process easier to collaborate.
  • Question 20

    You are trying to create an object by joining two tables that and it is accessible to data scientist's team, so it does not get dropped if the cluster restarts or if the notebook is detached. What type of object are you trying to create?