Question 86

A Databricks job has been configured with 3 tasks, each of which is a Databricks notebook. Task A does not depend on other tasks. Tasks B and C run in parallel, with each having a serial dependency on task A.
If tasks A and B complete successfully but task C fails during a scheduled run, which statement describes the resulting state?
  • Question 87

    A Spark job is taking longer than expected. Using the Spark UI, a data engineer notes that the Min, Median, and Max Durations for tasks in a particular stage show the minimum and median time to complete a task as roughly the same, but the max duration for a task to be roughly 100 times as long as the minimum.
    Which situation is causing increased duration of the overall job?
  • Question 88

    Although the Databricks Utilities Secrets module provides tools to store sensitive credentials and avoid accidentally displaying them in plain text users should still be careful with which credentials are stored here and which users have access to using these secrets.
    Which statement describes a limitation of Databricks Secrets?
  • Question 89

    A Spark job is taking longer than expected. Using the Spark UI, a data engineer notes that the Min, Median, and Max Durations for tasks in a particular stage show the minimum and median time to complete a task as roughly the same, but the max duration for a task to be roughly 100 times as long as the minimum.
    Which situation is causing increased duration of the overall job?
  • Question 90

    A Delta Lake table representing metadata about content posts from users has the following schema:
    * user_id LONG
    * post_text STRING
    * post_id STRING
    * longitude FLOAT
    * latitude FLOAT
    * post_time TIMESTAMP
    * date DATE
    Based on the above schema, which column is a good candidate for partitioning the Delta Table?