Which of the following describes the characteristics of accumulators?
Correct Answer: E
Explanation

If an action including an accumulator fails during execution and Spark manages to restart the action and complete it successfully, only the successful attempt will be counted in the accumulator.
Correct. When Spark reruns a failed action that includes an accumulator, only the attempt that succeeded updates the accumulator.

Accumulators are immutable.
No. Although accumulators behave like write-only variables towards the executors and can only be read by the driver, they are not immutable.

All accumulators used in a Spark application are listed in the Spark UI.
Incorrect. For Scala, only named accumulators, not unnamed ones, are listed in the Spark UI. For PySpark, no accumulators are listed in the Spark UI - this feature is not yet implemented.

Accumulators are used to pass around lookup tables across the cluster.
Wrong - this is what broadcast variables do.

Accumulators can be instantiated directly via the accumulator(n) method of the pyspark.RDD module.
Wrong. Accumulators are instantiated via the accumulator(n) method of the SparkContext, for example: counter = spark.sparkContext.accumulator(0).

More info: python - In Spark, RDDs are immutable, then how Accumulators are implemented? - Stack Overflow, apache spark - When are accumulators truly reliable? - Stack Overflow, Spark - The Definitive Guide, Chapter 14
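A minimal sketch of this pattern, assuming a hypothetical job that counts rows; only spark.sparkContext.accumulator(0) is taken from the explanation above:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Accumulators are created via the SparkContext, not via pyspark.RDD.
counter = spark.sparkContext.accumulator(0)

def count_row(row):
    # Executors can only add to the accumulator ("write-only" from their side).
    counter.add(1)

# foreach() is an action, so the accumulator update is applied reliably here.
spark.range(100).foreach(count_row)

# Only the driver can read the accumulated value.
print(counter.value)  # 100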
Question 12
Which of the following code blocks returns a new DataFrame in which column attributes of DataFrame itemsDf is renamed to feature0 and column supplier to feature1?
Correct Answer: D
Explanation

itemsDf.withColumnRenamed("attributes", "feature0").withColumnRenamed("supplier", "feature1")
Correct! Spark's DataFrame.withColumnRenamed syntax makes it relatively easy to change the name of a column.

itemsDf.withColumnRenamed(attributes, feature0).withColumnRenamed(supplier, feature1)
Incorrect. In this code block, the Python interpreter will try to use attributes and the other column names as variables. Needless to say, they are undefined, and as a result the block will not run.

itemsDf.withColumnRenamed(col("attributes"), col("feature0"), col("supplier"), col("feature1"))
Wrong. The DataFrame.withColumnRenamed() operator takes exactly two string arguments. So, in this answer, both using col() and passing four arguments are wrong.

itemsDf.withColumnRenamed("attributes", "feature0")
itemsDf.withColumnRenamed("supplier", "feature1")
No. In this answer, the returned DataFrame will only have column supplier renamed, since the result of the first line is not written back to itemsDf.

itemsDf.withColumn("attributes", "feature0").withColumn("supplier", "feature1")
Incorrect. While withColumn works for adding and naming new columns, you cannot use it to rename existing columns.

More info: pyspark.sql.DataFrame.withColumnRenamed - PySpark 3.1.2 documentation
Static notebook | Dynamic notebook: See test 3
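A short, self-contained sketch of the correct answer; the sample rows and schema for itemsDf are hypothetical:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical stand-in for itemsDf, just to make the example runnable.
itemsDf = spark.createDataFrame(
    [(1, ["blue", "winter"], "Sports Company Inc.")],
    ["itemId", "attributes", "supplier"],
)

# Each withColumnRenamed() call returns a new DataFrame; chaining renames both columns.
renamedDf = (itemsDf
             .withColumnRenamed("attributes", "feature0")
             .withColumnRenamed("supplier", "feature1"))

print(renamedDf.columns)  # ['itemId', 'feature0', 'feature1']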
Question 13
The code block displayed below contains an error. The code block should write DataFrame transactionsDf as a parquet file to location filePath after partitioning it on column storeId. Find the error. Code block: transactionsDf.write.partitionOn("storeId").parquet(filePath)
Correct Answer: E
Explanation

No method partitionOn() exists for the DataFrame class; partitionBy() should be used instead.
Correct! Find out more about partitionBy() in the documentation (linked below).

The operator should use the mode() option to configure the DataFrameWriter so that it replaces any existing files at location filePath.
No. The question gives no indication that existing files should be overwritten.

The partitioning column as well as the file path should be passed to the write() method of DataFrame transactionsDf directly and not as appended commands as in the code block.
Incorrect. To write a DataFrame to disk, you need to work with a DataFrameWriter object, which you access through the DataFrame.write property - no parentheses involved.

Column storeId should be wrapped in a col() operator.
No, this is not necessary - the problem is in the partitionOn command (see above).

The partitionOn method should be called before the write method.
Wrong. First of all, partitionOn is not a valid method of DataFrame. Even if partitionOn were replaced by partitionBy (which is a valid method), partitionBy is a method of DataFrameWriter and not of DataFrame. So, you always have to call DataFrame.write first to get access to the DataFrameWriter object and only then call partitionBy.

More info: pyspark.sql.DataFrameWriter.partitionBy - PySpark 3.1.2 documentation
Static notebook | Dynamic notebook: See test 3
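A minimal sketch of the corrected code block, with hypothetical sample data for transactionsDf and a placeholder filePath:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical stand-in for transactionsDf and filePath.
transactionsDf = spark.createDataFrame(
    [(1, 25, 4.99), (2, 25, 19.99), (3, 3, 0.99)],
    ["transactionId", "storeId", "value"],
)
filePath = "/tmp/transactions_parquet"

# DataFrame.write (a property, no parentheses) returns a DataFrameWriter;
# partitionBy() is called on the writer, not on the DataFrame itself.
transactionsDf.write.partitionBy("storeId").parquet(filePath)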
Question 14
Which of the following code blocks returns a 2-column DataFrame that shows the distinct values in column productId and the number of rows with that productId in DataFrame transactionsDf?
Correct Answer: D
Explanation

transactionsDf.groupBy("productId").count()
Correct. This code block first groups DataFrame transactionsDf by column productId and then counts the rows in each group.

transactionsDf.groupBy("productId").select(count("value"))
Incorrect. You cannot call select on a GroupedData object (the output of a groupBy() statement).

transactionsDf.count("productId")
No. DataFrame.count() does not take any arguments.

transactionsDf.count("productId").distinct()
Wrong. Since DataFrame.count() does not take any arguments, this option cannot be right.

transactionsDf.groupBy("productId").agg(col("value").count())
False. A Column object, as returned by col("value"), does not have a count() method. You can see all available methods for the Column object in the Spark documentation linked below.

More info: pyspark.sql.DataFrame.count - PySpark 3.1.2 documentation, pyspark.sql.Column - PySpark 3.1.2 documentation
Static notebook | Dynamic notebook: See test 3
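A short sketch of the correct answer, using hypothetical sample data for transactionsDf:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical stand-in for transactionsDf.
transactionsDf = spark.createDataFrame(
    [(1, "p1"), (2, "p1"), (3, "p2")],
    ["transactionId", "productId"],
)

# groupBy() returns a GroupedData object; count() turns it back into a
# two-column DataFrame with the distinct productId values and their row counts.
transactionsDf.groupBy("productId").count().show()
# +---------+-----+
# |productId|count|
# +---------+-----+
# |       p1|    2|
# |       p2|    1|
# +---------+-----+
# (row order may vary)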
Question 15
Which of the elements in the labeled panels represent the operation performed for broadcast variables?
Correct Answer: C
Explanation

2,3
Correct! Both panels 2 and 3 represent the operation performed for broadcast variables. While a broadcast operation may look like panel 3, with the driver being the bottleneck, it most probably looks like panel 2. This is because the torrent protocol sits behind Spark's broadcast implementation. In the torrent protocol, each executor tries to fetch missing broadcast variables from the driver or from other nodes, preventing the driver from becoming the bottleneck.

1,2
Wrong. While panel 2 may represent broadcasting, panel 1 shows bi-directional communication, which does not occur in broadcast operations.

3
No. While broadcasting may materialize as shown in panel 3, its use of the torrent protocol also enables communication as shown in panel 2 (see the first explanation).

1,3,4
No. While panel 3 shows broadcasting, panel 1 shows bi-directional communication - not a characteristic of broadcasting. Panel 4 shows uni-directional communication, but in the wrong direction; it resembles an accumulator variable more than a broadcast variable.

2,5
Incorrect. While panel 2 shows broadcasting, panel 5 includes bi-directional communication - not a characteristic of broadcasting.

More info: Broadcast Join with Spark - henning.kropponline.de
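To contrast the two shared-variable types in code, a minimal sketch of a broadcast variable used as a lookup table; the data and names are hypothetical:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# A small lookup table broadcast once to every executor (read-only there),
# unlike accumulators, whose updates flow from the executors back to the driver.
store_names = spark.sparkContext.broadcast({25: "Downtown", 3: "Airport"})

df = spark.createDataFrame([(1, 25), (2, 3)], ["transactionId", "storeId"])

def lookup(row):
    # Executors read the broadcast value locally; the lookup table is not shuffled.
    return (row.transactionId, store_names.value.get(row.storeId))

print(df.rdd.map(lookup).collect())  # [(1, 'Downtown'), (2, 'Airport')]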