Free Access Google.Professional-Data-Engineer.v2024-12-09.q327 Practice Test (Page 59)

Question 286

You want to create a machine learning model using BigQuery ML and create an endpoint foe hosting the model using Vertex Al. This will enable the processing of continuous streaming data in near-real time from multiple vendors. The data may contain invalid values. What should you do?

A.Create a new BigOuery dataset and use streaming inserts to land the data from multiple vendors. Configure your BigQuery ML model to use the "ingestion' dataset as the training data.

B.Use BigQuery streaming inserts to land the data from multiple vendors whore your BigQuery dataset ML model is deployed.

C.Create a Pub'Sub topic and send all vendor data to it Connect a Cloud Function to the topic to process the data and store it in BigQuery.

D.Create a Pub/Sub topic and send all vendor data to it Use Dataflow to process and sanitize the Pub/Sub data and stream it to BigQuery.

Question 287

You are selecting services to write and transform JSON messages from Cloud Pub/Sub to BigQuery for a data pipeline on Google Cloud. You want to minimize service costs. You also want to monitor and accommodate input data volume that will vary in size with minimal manual intervention. What should you do?

A.Use Cloud Dataproc to run your transformations. Use the diagnosecommand to generate an operational output archive. Locate the bottleneck and adjust cluster resources.

B.Use Cloud Dataproc to run your transformations. Monitor CPU utilization for the cluster. Resize the number of worker nodes in your cluster via the command line.

C.Use Cloud Dataflow to run your transformations. Monitor the job system lag with Stackdriver. Use the default autoscaling setting for worker instances.

D.Use Cloud Dataflow to run your transformations. Monitor the total execution time for a sampling of jobs.
Configure the job to use non-default Compute Engine machine types when needed.

Question 288

You work for an advertising company, and you've developed a Spark ML model to predict click-through rates at advertisement blocks. You've been developing everything at your on-premises data center, and now your company is migrating to Google Cloud. Your data center will be closing soon, so a rapid lift-and-shift migration is necessary. However, the data you've been using will be migrated to migrated to BigQuery. You periodically retrain your Spark ML models, so you need to migrate existing training pipelines to Google Cloud. What should you do?

A.Use Cloud Dataproc for training existing Spark ML models, but start reading data directly from BigQuery

B.Spin up a Spark cluster on Compute Engine, and train Spark ML models on the data exported from BigQuery

C.Rewrite your models on TensorFlow, and start using Cloud ML Engine

D.Use Cloud ML Engine for training existing Spark ML models

Question 289

Your financial services company is moving to cloud technology and wants to store 50 TB of financial time- series data in the cloud. This data is updated frequently and new data will be streaming in all the time. Your company also wants to move their existing Apache Hadoop jobs to the cloud to get insights into this data.
Which product should they use to store the data?

A.Cloud Bigtable

B.Google BigQuery

C.Google Cloud Storage

D.Google Cloud Datastore

Question 290

Your company is performing data preprocessing for a learning algorithm in Google Cloud Dataflow.
Numerous data logs are being are being generated during this step, and the team wants to analyze them. Due to the dynamic nature of the campaign, the data is growing exponentially every hour.
The data scientists have written the following code to read the data for a new key features in the logs.
BigQueryIO.Read
.named("ReadLogData")
.from("clouddataflow-readonly:samples.log_data")
You want to improve the performance of this data read. What should you do?

A.Specify the TableReference object in the code.

B.Use of both the Google BigQuery TableSchema and TableFieldSchema classes.

C.Call a transform that returns TableRow objects, where each element in the PCollection represents a single row in the table.

D.Use .fromQuery operation to read specific fields from the table.

Other Version: 1171Google.Professional-Data-Engineer.v2026-02-05.q279; 1451Google.Professional-Data-Engineer.v2025-09-04.q126; 1459Google.Professional-Data-Engineer.v2025-03-18.q108; 2863Google.Professional-Data-Engineer.v2024-02-15.q202; 3242Google.Professional-Data-Engineer.v2023-09-14.q233; 4657Google.Professional-Data-Engineer.v2022-07-07.q155; 6650Google.Professional-Data-Engineer.v2022-02-11.q268; 112Google.Examcollectionpass.Professional-Data-Engineer.v2021-12-20.by.jonathan.161q.pdf

Latest Upload: 201PaloAltoNetworks.NGFW-Engineer.v2026-05-01.q43; 296Nokia.4A0-113.v2026-05-01.q69; 253EC-COUNCIL.312-49v11.v2026-04-30.q214; 228Microsoft.MB-820.v2026-04-30.q101; 209Salesforce.MC-202.v2026-04-30.q57; 204BICSI.INSTC_V8.v2026-04-29.q53; 333NMLS.MLO.v2026-04-28.q82; 241NCARB.Project-Management.v2026-04-28.q27; 458EMC.D-AV-DY-23.v2026-04-27.q184; 1111ServiceNow.CSA.v2026-04-27.q483

Question 286

Question 287

Question 288

Question 289

Question 290

Download PDF File