Question 16

You plan to perform batch processing in Azure Databricks once daily.
Which type of Databricks cluster should you use?
  • Question 17

    You have an Azure event hub named retailhub that has 16 partitions. Transactions are posted to retailhub. Each transaction includes the transaction ID, the individual line items, and the payment details. The transaction ID is used as the partition key.
    You are designing an Azure Stream Analytics job to identify potentially fraudulent transactions at a retail store. The job will use retailhub as the input. The job will output the transaction ID, the individual line items, the payment details, a fraud score, and a fraud indicator.
    You plan to send the output to an Azure event hub named fraudhub.
    You need to ensure that the fraud detection solution is highly scalable and processes transactions as quickly as possible.
    How should you structure the output of the Stream Analytics job? To answer, select the appropriate options in the answer area.
    NOTE: Each correct selection is worth one point.

    Question 18

    You create an Azure Databricks cluster and specify an additional library to install.
    When you attempt to load the library to a notebook, the library in not found.
    You need to identify the cause of the issue.
    What should you review?
  • Question 19

    You are building an Azure Stream Analytics job to identify how much time a user spends interacting with a feature on a webpage.
    The job receives events based on user actions on the webpage. Each row of data represents an event. Each event has a type of either 'start' or 'end'.
    You need to calculate the duration between start and end events.
    How should you complete the query? To answer, select the appropriate options in the answer area.
    NOTE: Each correct selection is worth one point.

    Question 20

    From a website analytics system, you receive data extracts about user interactions such as downloads, link clicks, form submissions, and video plays.
    The data contains the following columns.

    You need to design a star schema to support analytical queries of the data. The star schema will contain four tables including a date dimension.
    To which table should you add each column? To answer, select the appropriate options in the answer area.
    NOTE: Each correct selection is worth one point.