  • Question 51

    Which of the following describes the role of the cluster manager?
  • Question 52

    Which of the following code blocks creates a new DataFrame with two columns, season and wind_speed_ms, where column season is of data type string and column wind_speed_ms is of data type double?
  • Question 53

    Which of the following code blocks returns about 150 randomly selected rows from the 1000-row DataFrame transactionsDf, assuming that any row can appear more than once in the returned DataFrame?
  • Question 54

    Which of the following statements about broadcast variables is correct?
  • Question 55

    The code block shown below should return a new 2-column DataFrame that shows one attribute from column attributes per row next to the associated itemName, for all suppliers in column supplier whose name includes Sports. Choose the answer that correctly fills the blanks in the code block to accomplish this.
    Sample of DataFrame itemsDf:
    +------+----------------------------------+-----------------------------+-------------------+
    |itemId|itemName                          |attributes                   |supplier           |
    +------+----------------------------------+-----------------------------+-------------------+
    |1     |Thick Coat for Walking in the Snow|[blue, winter, cozy]         |Sports Company Inc.|
    |2     |Elegant Outdoors Summer Dress     |[red, summer, fresh, cooling]|YetiX              |
    |3     |Outdoors Backpack                 |[green, summer, travel]      |Sports Company Inc.|
    +------+----------------------------------+-----------------------------+-------------------+
    Code block:
    itemsDf.__1__(__2__).select(__3__, __4__)
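
To make the expected result shape of Question 55 concrete, here is a minimal plain-Python sketch of the two steps the question describes: keep only rows whose supplier name includes Sports, then emit one row per attribute next to its itemName. It does not use Spark and does not fill in the blanks; the list `items` and the helper `explode_attributes` are hypothetical names introduced only for this illustration. In PySpark, these steps would typically map to `DataFrame.filter` and `pyspark.sql.functions.explode`.

```python
# Plain-Python stand-in for the sample rows of itemsDf (not a Spark DataFrame).
items = [
    {"itemId": 1, "itemName": "Thick Coat for Walking in the Snow",
     "attributes": ["blue", "winter", "cozy"], "supplier": "Sports Company Inc."},
    {"itemId": 2, "itemName": "Elegant Outdoors Summer Dress",
     "attributes": ["red", "summer", "fresh", "cooling"], "supplier": "YetiX"},
    {"itemId": 3, "itemName": "Outdoors Backpack",
     "attributes": ["green", "summer", "travel"], "supplier": "Sports Company Inc."},
]


def explode_attributes(rows, keyword):
    """Hypothetical helper: filter by supplier substring, then return
    one (itemName, attribute) pair per element of the attributes list."""
    return [
        (row["itemName"], attribute)
        for row in rows
        if keyword in row["supplier"]      # supplier name includes the keyword
        for attribute in row["attributes"]  # one output row per attribute
    ]


result = explode_attributes(items, "Sports")
for item_name, attribute in result:
    print(item_name, "|", attribute)
```

With the sample rows above, the two Sports Company Inc. items each contribute three rows, so the result has six two-column rows and the YetiX item is dropped.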