Explanation
The driver receives data upon request by actions. Correct! Actions trigger the distributed execution of tasks on executors which, upon task completion, transfer result data back to the driver.
Actions are Spark's way of exchanging data between executors. No. In Spark, data is exchanged between executors via shuffles.
Writing data to disk is the primary purpose of actions. No. The primary purpose of actions is to access data that is stored in Spark's RDDs and return the data, often in aggregated form, back to the driver.
Actions are Spark's way of modifying RDDs. Incorrect. Firstly, RDDs are immutable - they cannot be modified. Secondly, Spark generates new RDDs via transformations and not actions.
Stage boundaries are commonly established by actions. Wrong. A stage boundary is commonly established by a shuffle, for example caused by a wide transformation.
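To make the correct answer concrete, here is a minimal sketch (the DataFrame and column names are invented for this illustration): the groupBy transformation only declares a shuffle between executors, while count() and collect() are actions that execute the tasks and return results to the driver.

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("actions-demo").getOrCreate()
df = spark.createDataFrame([(1, "a"), (2, "b"), (3, "a")], ["id", "letter"])

# Wide transformation: declares a shuffle between executors, but nothing runs yet.
counts = df.groupBy("letter").count()

# Actions: tasks are executed on the executors and the (aggregated) results
# are transferred back to the driver.
n_groups = counts.count()
rows_on_driver = counts.collect()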
Question 22
Which of the following code blocks reads in the parquet file stored at location filePath, given that all columns in the parquet file contain only whole numbers and are stored in the most appropriate format for this kind of data?
Correct Answer: D
Explanation
The schema passed into schema should be of type StructType or a string, so all entries in which a list is passed are incorrect. In addition, since all numbers are whole numbers, the IntegerType() data type is the correct option here. NumberType() is not a valid data type and StringType() would fail, since the parquet file is stored in the "most appropriate format for this kind of data", meaning that it is most likely an IntegerType, and Spark does not convert data types if a schema is provided. Also note that StructType accepts only a single argument (a list of StructFields). So, passing multiple arguments is invalid. Finally, Spark needs to know which format the file is in. However, all of the options listed are valid here, since Spark assumes parquet as a default when no file format is specifically passed.
More info: pyspark.sql.DataFrameReader.schema - PySpark 3.1.2 documentation and StructType - PySpark 3.1.2 documentation
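A sketch of the reading pattern described above (the column names and the example path are placeholders, since the question does not specify them):

from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, IntegerType

spark = SparkSession.builder.getOrCreate()
filePath = "/path/to/file.parquet"  # placeholder; the real location comes from the question

# StructType takes a single list of StructFields; IntegerType matches whole numbers.
schema = StructType([
    StructField("value1", IntegerType()),
    StructField("value2", IntegerType()),
])

# No explicit format call is needed: parquet is Spark's default data source.
df = spark.read.schema(schema).parquet(filePath)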
Question 23
The code block displayed below contains an error. The code block should return the average of rows in column value grouped by unique storeId. Find the error.
Code block:
transactionsDf.agg("storeId").avg("value")
Correct Answer: D
Explanation
The aggregation is applied with the wrong method: agg("storeId") does not group the DataFrame, it attempts an aggregation over all rows. To return the average of value per unique storeId, the rows first need to be grouped with groupBy("storeId"), after which the avg("value") aggregation can be applied.
Static notebook | Dynamic notebook: See test 1 (https://flrs.github.io/spark_practice_tests_code/#1/30.html, https://bit.ly/sparkpracticeexams_import_instructions)
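A corrected version of the code block (a sketch; transactionsDf is assumed to exist as described in the question):

from pyspark.sql import functions as F

# Group by storeId first, then aggregate the value column.
transactionsDf.groupBy("storeId").avg("value")

# Equivalent, with an explicit aggregation expression:
transactionsDf.groupBy("storeId").agg(F.avg("value"))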
Question 24
Which of the following statements about executors is correct, assuming that one can consider each of the JVMs working as executors as a pool of task execution slots?
Correct Answer: E
Explanation
Tasks run in parallel via slots. Correct. Given the assumption, an executor then has one or more "slots", defined by the equation spark.executor.cores / spark.task.cpus. With the executor's resources divided into slots, each task takes up a slot and multiple tasks can be executed in parallel.
Slot is another name for executor. No, a slot is part of an executor.
An executor runs on a single core. No, an executor can occupy multiple cores. This is set by the spark.executor.cores option.
There must be more slots than tasks. No. Slots just process tasks. One could imagine a scenario where there is just a single slot for multiple tasks, processing one task at a time. Granted, this is the opposite of what Spark should be used for, which is distributed data processing over multiple cores and machines, performing many tasks in parallel.
There must be fewer executors than tasks. No, there is no such requirement.
More info: Spark Architecture | Distributed Systems Architecture (https://bit.ly/3x4MZZt)
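As an illustration of the slot equation (the configuration values below are made up for this example):

from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("slots-example")
         .config("spark.executor.cores", "4")  # cores available per executor
         .config("spark.task.cpus", "1")       # cores each task occupies
         .getOrCreate())

# Slots per executor = spark.executor.cores / spark.task.cpus = 4 / 1 = 4,
# so up to 4 tasks can run in parallel on each executor.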
Question 25
Which of the following statements about stages is correct?
Correct Answer: D
Explanation
Tasks in a stage may be executed by multiple machines at the same time. This is correct. Within a single stage, tasks do not depend on each other. Executors on multiple machines may execute tasks belonging to the same stage on the respective partitions they are holding at the same time.
Different stages in a job may be executed in parallel. No. Different stages in a job depend on each other and cannot be executed in parallel. The nuance is that every task in a stage may be executed in parallel by multiple machines. For example, if a job consists of Stage A and Stage B, tasks belonging to those stages may not be executed in parallel. However, tasks from Stage A may be executed on multiple machines at the same time, with each machine running them on a different partition of the same dataset. Then, afterwards, tasks from Stage B may be executed on multiple machines at the same time.
Stages may contain multiple actions, narrow, and wide transformations. No, stages may not contain multiple wide transformations. Wide transformations mean that shuffling is required, and shuffling typically terminates a stage because data needs to be exchanged across the cluster. This data exchange often causes partitions to change and rearrange, making it impossible to perform tasks in parallel on the same dataset.
Stages ephemerally store transactions, before they are committed through actions. No, this does not make sense. Stages do not "store" any data, and transactions are not "committed" in Spark.
Stages consist of one or more jobs. No, it is the other way around: jobs consist of one or more stages.
More info: Spark: The Definitive Guide, Chapter 15.
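A small sketch of how a wide transformation splits a job into two stages (the grouping expression is invented for this example):

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

df = spark.range(0, 1000)  # narrow operations so far, no shuffle

# Wide transformation: the groupBy requires a shuffle, which ends the first stage.
grouped = df.groupBy((df.id % 10).alias("bucket")).count()

# The action triggers the job. Tasks of the first stage run in parallel across
# partitions; only after the shuffle does the second stage's final aggregation run.
result = grouped.collect()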