Which of the following code blocks prints the number of rows in which the expression Inc. appears in the string-type column supplier of DataFrame itemsDf?
Correct Answer: E
Explanation
Correct code block:

accum = sc.accumulator(0)

def check_if_inc_in_supplier(row):
    if 'Inc.' in row['supplier']:
        accum.add(1)

itemsDf.foreach(check_if_inc_in_supplier)
print(accum.value)

To answer this question correctly, you need to know about both the DataFrame.foreach() method and accumulators. When Spark runs the code, it executes it on the executors. The executors do not have any information about variables outside of their scope. This is why simply using a Python variable counter, as in the two examples that start with counter = 0, will not work. You need to tell the executors explicitly that counter is a special shared variable, an Accumulator, which is managed by the driver and can be accessed by all executors for the purpose of adding to it. If you have used pandas in the past, you might be familiar with the iterrows() command. Note that there is no such command in PySpark. The two examples that start with print do not work, since DataFrame.foreach() does not have a return value.
More info: pyspark.sql.DataFrame.foreach - PySpark 3.1.2 documentation
Static notebook | Dynamic notebook: See test 3
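To make the difference concrete, here is a minimal sketch contrasting the two approaches; it assumes an existing SparkContext named sc and the itemsDf DataFrame from the question:

counter = 0
def count_locally(row):
    global counter
    if 'Inc.' in row['supplier']:
        counter += 1          # increments a copy on the executor, not the driver's variable
itemsDf.foreach(count_locally)
print(counter)                # still 0 on the driver

accum = sc.accumulator(0)     # shared variable, managed by the driver
def count_with_accumulator(row):
    if 'Inc.' in row['supplier']:
        accum.add(1)          # executors ship their increments back to the driver
itemsDf.foreach(count_with_accumulator)
print(accum.value)            # the actual row count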
Question 27
The code block shown below should return only the average prediction error (column predError) of a random subset, without replacement, of approximately 15% of rows in DataFrame transactionsDf. Choose the answer that correctly fills the blanks in the code block to accomplish this. transactionsDf.__1__(__2__, __3__).__4__(avg('predError'))
Correct Answer: B
Explanation
Correct code block:

transactionsDf.sample(withReplacement=False, fraction=0.15).select(avg('predError'))

You should remember that getting a random subset of rows means sampling. This, in turn, should point you to the DataFrame.sample() method. Once you know this, you can look up the correct order of arguments in the documentation (link below). Lastly, you have to decide whether to use filter(), where(), or select(). where() is just an alias for filter(). filter() is not the correct method to use here, since it would only allow you to filter rows based on some condition, whereas the question asks to return only the average prediction error. You can control the columns that a query returns with the select() method, so this is the correct method to use here.
More info: pyspark.sql.DataFrame.sample - PySpark 3.1.2 documentation
Static notebook | Dynamic notebook: See test 2
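A minimal runnable sketch of the full statement, assuming transactionsDf exists and contains a numeric predError column:

from pyspark.sql.functions import avg

# sample roughly 15% of the rows without replacement, then reduce them to the average prediction error
avg_err = transactionsDf.sample(withReplacement=False, fraction=0.15).select(avg('predError'))
avg_err.show()

Keep in mind that sample() only approximates the requested fraction, so the exact number of sampled rows varies between runs.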
Question 28
Which of the following describes a difference between Spark's cluster and client execution modes?
Correct Answer: C
Explanation
In cluster mode, the driver resides on a worker node, while it resides on an edge node in client mode.
Correct. The idea of Spark's client mode is that workloads can be executed from an edge node, also known as a gateway machine, from outside the cluster. The most common way to execute Spark, however, is in cluster mode, where the driver resides on a worker node. In practice, in client mode, the data transfer speed between the edge node and the cluster is a tight constraint relative to the data transfer speed between worker nodes in the cluster. Also, any job that is executed in client mode will fail if the edge node fails. For these reasons, client mode is usually not used in a production environment.
In cluster mode, the cluster manager resides on a worker node, while it resides on an edge node in client execution mode.
No. In both execution modes, the cluster manager may reside on a worker node, but it does not reside on an edge node in client mode.
In cluster mode, executor processes run on worker nodes, while they run on gateway nodes in client mode.
This is incorrect. Only the driver runs on gateway nodes (also known as "edge nodes") in client mode, but not the executor processes.
In cluster mode, the Spark driver is not co-located with the cluster manager, while it is co-located in client mode.
No. In client mode, the Spark driver is not co-located with the cluster manager. The whole point of client mode is that the driver is outside the cluster and not associated with the resource that manages the cluster (the machine that runs the cluster manager).
In cluster mode, a gateway machine hosts the driver, while it is co-located with the executor in client mode.
No, it is exactly the opposite: there are no gateway machines in cluster mode, but in client mode, they host the driver.
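As a practical illustration, the execution mode is typically selected through spark-submit's --deploy-mode flag; the sketch below uses a placeholder application (my_app.py) and a YARN master, so adjust both to your environment:

spark-submit --master yarn --deploy-mode cluster my_app.py   # driver runs on a worker node inside the cluster
spark-submit --master yarn --deploy-mode client my_app.py    # driver runs on the submitting machine, e.g. an edge node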
Question 29
The code block shown below should convert up to 5 rows in DataFrame transactionsDf that have the value 25 in column storeId into a Python list. Choose the answer that correctly fills the blanks in the code block to accomplish this. Code block: transactionsDf.__1__(__2__).__3__(__4__)
Correct Answer: D
Explanation
The correct code block is:

transactionsDf.filter(col("storeId") == 25).take(5)

Any of the options with collect will not work, because collect does not take any arguments, and in both cases the argument 5 is given. The option with toLocalIterator will not work, because the only argument to toLocalIterator is prefetchPartitions, which is a boolean, so passing 5 here does not make sense. The option using head will not work, because the expression passed to select is not proper syntax; it would work if the expression were col("storeId") == 25.
Static notebook | Dynamic notebook: See test 1 (https://flrs.github.io/spark_practice_tests_code/#1/24.html , https://bit.ly/sparkpracticeexams_import_instructions)
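A minimal sketch of the correct pattern, assuming transactionsDf exists with an integer storeId column:

from pyspark.sql.functions import col

# keep only rows with storeId == 25, then pull at most 5 of them to the driver
rows = transactionsDf.filter(col("storeId") == 25).take(5)
print(rows)   # a Python list of Row objects

take(n) returns a Python list of up to n Row objects, which is exactly the "Python list" the question asks for.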
Question 30
The code block displayed below contains an error. The code block should return a new DataFrame that only contains rows from DataFrame transactionsDf in which the value in column predError is at least 5. Find the error. Code block: transactionsDf.where("col(predError) >= 5")
Correct Answer: A
Explanation
The actual error is that col() is a PySpark function, not a SQL function, so it cannot be used inside a SQL expression string; the string should reference the column directly, as in transactionsDf.where("predError >= 5").
The argument to the where method cannot be a string.
It can be a string, no problem here.
Instead of where(), filter() should be used.
No, that does not matter. In PySpark, where() and filter() are equivalent.
Instead of >=, the SQL operator GEQ should be used.
Incorrect.
The expression returns the original DataFrame transactionsDf and not a new DataFrame. To avoid this, the code block should be transactionsDf.toNewDataFrame().where("col(predError) >= 5").
No, Spark returns a new DataFrame.
Static notebook | Dynamic notebook: See test 1 (https://flrs.github.io/spark_practice_tests_code/#1/27.html , https://bit.ly/sparkpracticeexams_import_instructions)
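A minimal sketch of the two working variants, assuming transactionsDf exists with a numeric predError column:

from pyspark.sql.functions import col

transactionsDf.where("predError >= 5")          # SQL expression string: reference the column by name, no col()
transactionsDf.where(col("predError") >= 5)     # Column expression: col() is used outside the string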