Free Access Google.Professional-Data-Engineer.v2022-07-07.q155 Practice Test (Page 29)

Question 136

Your team is working on a binary classification problem. You have trained a support vector machine (SVM) classifier with default parameters and obtained an area under the curve (AUC) of 0.87 on the validation set.
You want to increase the AUC of the model. What should you do?

A. Perform hyperparameter tuning

B. Deploy the model and measure the real-world AUC; it's always higher because of generalization

C. Scale predictions you get out of the model (tune a scaling factor as a hyperparameter) in order to get the highest AUC

D. Train a classifier with deep neural networks, because neural networks would always beat SVMs
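
When weighing option C, it may help to recall that AUC depends only on how the scores rank positives against negatives, so multiplying every prediction by a positive factor should leave it unchanged. The sketch below illustrates this with a small pairwise AUC function; the labels and scores are made-up illustrative values, not output from any real SVM.

```python
from itertools import product


def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Tied pairs count as half a correct ranking.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical validation labels and classifier scores (illustration only).
labels = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.4, 0.35, 0.8, 0.1, 0.5, 0.7, 0.2]

base = auc(labels, scores)
scaled = auc(labels, [3.0 * s for s in scores])         # positive scaling
shifted = auc(labels, [2.0 * s + 1.0 for s in scores])  # scaling plus offset

print(base, scaled, shifted)
```

All three values come out equal here because the transformed scores keep the same ordering. Changing the ranking itself requires changing the model, for example through its hyperparameters.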

Question 137

You are designing a cloud-native historical data processing system to meet the following conditions:
* The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Cloud Dataproc, BigQuery, and Compute Engine.
* A streaming data pipeline stores new data daily.
* Performance is not a factor in the solution.
* The solution design should maximize availability.
How should you design data storage for this solution?

A. Store the data in a multi-regional Cloud Storage bucket. Access the data directly using Cloud Dataproc, BigQuery, and Compute Engine.

B. Store the data in a regional Cloud Storage bucket. Access the bucket directly using Cloud Dataproc, BigQuery, and Compute Engine.

C. Create a Cloud Dataproc cluster with high availability. Store the data in HDFS, and perform analysis as needed.

D. Store the data in BigQuery. Access the data using the BigQuery Connector on Cloud Dataproc and Compute Engine.

Question 138

Your analytics team wants to build a simple statistical model to determine which customers are most likely to work with your company again, based on a few different metrics. They want to run the model on Apache Spark, using data housed in Google Cloud Storage, and you have recommended using Google Cloud Dataproc to execute this job. Testing has shown that this workload can run in approximately 30 minutes on a 15-node cluster, outputting the results into Google BigQuery. The plan is to run this workload weekly.
How should you optimize the cluster for cost?

A. Migrate the workload to Google Cloud Dataflow

B. Use preemptible virtual machines (VMs) for the cluster

C. Use SSDs on the worker nodes so that the job can run faster

D. Use a higher-memory node so that the job runs faster
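
The figures in the question (15 nodes, about 30 minutes, once a week) can be turned into node-hours to see how the per-node price affects the bill. The sketch below is only rough arithmetic: the hourly rates and the 3/12 split between standard and preemptible nodes are hypothetical placeholders, not real Google Cloud prices or a recommended cluster layout.

```python
def node_hours(nodes, minutes):
    """Node-hours consumed by one run of the job."""
    return nodes * minutes / 60


def run_cost(standard_nodes, preemptible_nodes, minutes,
             standard_rate, preemptible_rate):
    """Cost of one run, given per-node hourly rates for each VM type."""
    return (node_hours(standard_nodes, minutes) * standard_rate
            + node_hours(preemptible_nodes, minutes) * preemptible_rate)


# Hypothetical hourly rates per node (assumptions for illustration only).
STANDARD_RATE = 0.20
PREEMPTIBLE_RATE = 0.05

per_run = node_hours(15, 30)   # node-hours for one weekly run
per_year = per_run * 52        # node-hours for a year of weekly runs

all_standard = run_cost(15, 0, 30, STANDARD_RATE, PREEMPTIBLE_RATE)
mixed = run_cost(3, 12, 30, STANDARD_RATE, PREEMPTIBLE_RATE)

print(per_run, per_year)
print(all_standard, mixed)
```

Under these assumed rates the mixed cluster is cheaper per run, while the total node-hours stay the same. The job is short and runs only weekly, so an interrupted run can simply be repeated.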

Question 139

You work for a mid-sized enterprise that needs to move its operational system transaction data from an on-premises database to GCP. The database is about 20 TB in size. Which database should you choose?

A. Cloud SQL

B. Cloud Bigtable

C. Cloud Spanner

D. Cloud Datastore

Question 140

You have data stored in BigQuery. The data in the BigQuery dataset must be highly available. You need to define a storage, backup, and recovery strategy for this data that minimizes cost. How should you configure the BigQuery table?

A. Set the BigQuery dataset to be regional. In the event of an emergency, use a point-in-time snapshot to recover the data.

B. Set the BigQuery dataset to be multi-regional. In the event of an emergency, use a point-in-time snapshot to recover the data.

C. Set the BigQuery dataset to be regional. Create a scheduled query to make copies of the data to tables suffixed with the time of the backup. In the event of an emergency, use the backup copy of the table.

D. Set the BigQuery dataset to be multi-regional. Create a scheduled query to make copies of the data to tables suffixed with the time of the backup. In the event of an emergency, use the backup copy of the table.
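
Options C and D describe copying data into tables suffixed with the time of the backup. A minimal sketch of how such a table name and copy statement might be built is shown below. The table ID `my_dataset.orders`, the `YYYYMMDD` suffix format, and the helper names are assumptions for illustration; the code only builds a SQL string and does not call BigQuery.

```python
from datetime import datetime, timezone


def backup_table_id(table_id, when):
    """Append a date suffix (YYYYMMDD) to the source table ID."""
    return f"{table_id}_{when:%Y%m%d}"


def backup_statement(table_id, when):
    """Build a CREATE TABLE ... AS SELECT statement that copies the table."""
    target = backup_table_id(table_id, when)
    return f"CREATE TABLE `{target}` AS SELECT * FROM `{table_id}`"


# Hypothetical table and backup date, for illustration only.
when = datetime(2022, 7, 7, tzinfo=timezone.utc)
statement = backup_statement("my_dataset.orders", when)
print(statement)
```

Each copy made this way is a full table that is stored and billed separately, which matters when comparing these options against the snapshot-based ones on cost.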
