Question 36

Which of the following are valid execution modes?
  • Question 37

    Which of the following code blocks creates a new DataFrame with 3 columns, productId, highest, and lowest, that shows the biggest and smallest values of column value per value in column productId from DataFrame transactionsDf?
    Sample of DataFrame transactionsDf:
    +-------------+---------+-----+-------+---------+----+
    |transactionId|predError|value|storeId|productId|   f|
    +-------------+---------+-----+-------+---------+----+
    |            1|        3|    4|     25|        1|null|
    |            2|        6|    7|      2|        2|null|
    |            3|        3| null|     25|        3|null|
    |            4|     null| null|      3|        2|null|
    |            5|     null| null|   null|        2|null|
    |            6|        3|    2|     25|        2|null|
    +-------------+---------+-----+-------+---------+----+
  • Question 38

    Which of the following statements about Spark's execution hierarchy is correct?
  • Question 39

    Which of the following describes Spark's Adaptive Query Execution?
  • Question 40

    The code block shown below should add a column itemNameBetweenSeparators to DataFrame itemsDf. The column should contain arrays of at most 4 strings. The arrays should be composed of the values in column itemName, split at hyphens (-) or whitespace characters. Choose the answer that correctly fills the blanks in the code block to accomplish this.
    Sample of DataFrame itemsDf:
    +------+----------------------------------+-------------------+
    |itemId|itemName                          |supplier           |
    +------+----------------------------------+-------------------+
    |1     |Thick Coat for Walking in the Snow|Sports Company Inc.|
    |2     |Elegant Outdoors Summer Dress     |YetiX              |
    |3     |Outdoors Backpack                 |Sports Company Inc.|
    +------+----------------------------------+-------------------+
    Code block:
    itemsDf.__1__(__2__, __3__(__4__, "[\s\-]", __5__))
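To make the requirement in Question 40 concrete, the following is a minimal plain-Python sketch (not Spark code, and not the answer to the blanks) of what splitting at `[\s\-]` with a cap of 4 strings produces for the sample itemName values. The names `split_limited`, `SEPARATORS`, and `MAX_PARTS` are hypothetical helpers introduced only for this illustration. Note one assumption about the mapping: Python's `re.split` takes a maximum number of splits, whereas the limit argument of `pyspark.sql.functions.split` (available in Spark 3.0 and later) caps the number of resulting array elements, so a cap of 4 elements corresponds to 3 splits here.

```python
import re

# Sample itemName values copied from the itemsDf sample in Question 40.
item_names = [
    "Thick Coat for Walking in the Snow",
    "Elegant Outdoors Summer Dress",
    "Outdoors Backpack",
]

# Same character class as in the code block: a whitespace character or a hyphen.
SEPARATORS = r"[\s\-]"
# The task asks for arrays of at most 4 strings.
MAX_PARTS = 4


def split_limited(text, max_parts=MAX_PARTS):
    """Split text at the separators into at most max_parts strings.

    re.split counts splits rather than parts, hence max_parts - 1.
    Everything after the last allowed split stays in the final element.
    """
    return re.split(SEPARATORS, text, maxsplit=max_parts - 1)


for name in item_names:
    print(name, "->", split_limited(name))
```

In this sketch the first item name yields exactly 4 strings, with the remainder "Walking in the Snow" kept whole in the last element, while the shorter names yield 4 and 2 strings respectively.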