Question 96

Which of the following benefits is provided by the array functions from Spark SQL?
  • Question 97

    A data engineering team has two tables. The first table march_transactions is a collection of all retail transactions in the month of March. The second table april_transactions is a collection of all retail transactions in the month of April. There are no duplicate records between the tables.
    Which of the following commands should be run to create a new table all_transactions that contains all records from march_transactions and april_transactions without duplicate records?
  • Question 98

    A data engineer needs access to a table new_table, but they do not have the correct permissions. They can ask the table owner for permission, but they do not know who the table owner is.
    Which of the following approaches can be used to identify the owner of new_table?
  • Question 99

    Which of the following data workloads will utilize a Gold table as its source?
  • Question 100

    A data engineer needs to apply custom logic to identify employees with more than 5 years of experience in array column employees in table stores. The custom logic should create a new column exp_employees that is an array of all of the employees with more than 5 years of experience for each row. In order to apply this custom logic at scale, the data engineer wants to use the FILTER higher-order function.
    Which of the following code blocks successfully completes this task?