Question 51

You are asked to debug a databricks job that is taking too long to run on Sunday's, what are the steps you are going to take to identify the step that is taking longer to run?
  • Question 52

    A Delta Live Table pipeline includes two datasets defined using STREAMING LIVE TABLE.
    Three datasets are defined against Delta Lake table sources using LIVE TABLE . The table is configured to
    run in Development mode using the Triggered Pipeline Mode.
    Assuming previously unprocessed data exists and all definitions are valid, what is the expected outcome after
    clicking Start to update the pipeline?
  • Question 53

    A Data engineer wants to run unit's tests using common Python testing frameworks on python functions defined across several Databricks notebooks currently used in production.
    How can the data engineer run unit tests against function that work with data in production?
  • Question 54

    Which of the following is true of Delta Lake and the Lakehouse?
  • Question 55

    The data engineering team has configured a Databricks SQL query and alert to monitor the values in a Delta Lake table. The recent_sensor_recordings table contains an identifying sensor_id alongside the timestamp and temperature for the most recent 5 minutes of recordings.
    The below query is used to create the alert:

    The query is set to refresh each minute and always completes in less than 10 seconds. The alert is set to trigger when mean (temperature) > 120. Notifications are triggered to be sent at most every 1 minute.
    If this alert raises notifications for 3 consecutive minutes and then stops, which statement must be true?