  • Question 31

    Which of the following can be used to simplify and unify siloed data architectures that are specialized for specific use cases?
  • Question 32

    A data engineer is designing a data pipeline. The source system generates files in a shared directory that is also used by other processes. As a result, the files should be kept as is and will accumulate in the directory. The data engineer needs to identify which files are new since the previous run in the pipeline, and set up the pipeline to only ingest those new files with each run.
    Which of the following tools can the data engineer use to solve this problem?
  • Question 33

    A data engineer runs a statement every day to copy the previous day's sales into the table transactions. Each day's sales are in their own file in the location "/transactions/raw".
    Today, the data engineer runs the following command to complete this task:

    After running the command today, the data engineer notices that the number of records in table transactions has not changed.
    Which of the following describes why the statement might not have copied any new records into the table?
  • Question 34

    A data engineer is using the following code block as part of a batch ingestion pipeline to read from a composable table:

    Which of the following changes needs to be made so this code block will work when the transactions table is a stream source?
  • Question 35

    Which of the following is a benefit of the Databricks Lakehouse Platform embracing open source technologies?