Free Access Google.Professional-Machine-Learning-Engineer.v2025-05-29.q260 Practice Test (Page 32)

Question 151

You are working on a system log anomaly detection model for a cybersecurity organization. You have developed the model using TensorFlow, and you plan to use it for real-time prediction. You need to create a Dataflow pipeline to ingest data via Pub/Sub and write the results to BigQuery. You want to minimize the serving latency as much as possible. What should you do?

A.Containerize the model prediction logic in Cloud Run, which is invoked by Dataflow.

B.Load the model directly into the Dataflow job as a dependency, and use it for prediction.

C.Deploy the model to a Vertex AI endpoint, and invoke this endpoint in the Dataflow job.

D.Deploy the model in a TFServing container on Google Kubernetes Engine, and invoke it in the Dataflow job.

Correct Answer: B

The best option for creating a Dataflow pipeline for real-time anomaly detection is to load the model directly into the Dataflow job as a dependency, and use it for prediction. This option has the following advantages:
It minimizes the serving latency, as the model prediction logic is executed within the same Dataflow pipeline that ingests and processes the data. There is no need to invoke external services or containers, which can introduce network overhead and latency.
It simplifies the deployment and management of the model, as the model is packaged with the Dataflow job and does not require a separate service or container. The model can be updated by redeploying the Dataflow job with a new model version.
It leverages the scalability and reliability of Dataflow, as the model prediction logic can scale up or down with the data volume and handle failures and retries automatically.
The other options are less optimal for the following reasons:
Option A: Containerizing the model prediction logic in Cloud Run, which is invoked by Dataflow, introduces additional latency and complexity. Cloud Run is a serverless platform that runs stateless containers, so the model must be loaded each time a new container instance starts; when instances scale up or scale from zero, this cold start adds latency and reduces throughput. Every prediction also requires an HTTP request from Dataflow to the service, which adds network overhead. Moreover, Cloud Run limits the number of concurrent requests per container instance, which can affect the scalability of the model prediction logic. Additionally, this option requires managing two separate services: the Dataflow pipeline and the Cloud Run service.
Option C: Deploying the model to a Vertex AI endpoint, and invoking this endpoint in the Dataflow job, also introduces additional latency and complexity. Vertex AI is a managed service that provides various tools and features for machine learning, such as training, tuning, serving, and monitoring. However, invoking a Vertex AI endpoint from a Dataflow job requires making an HTTP request, which can incur network overhead and latency. Moreover, this option requires managing two separate services: the Dataflow pipeline and the Vertex AI endpoint.
Option D: Deploying the model in a TFServing container on Google Kubernetes Engine, and invoking it in the Dataflow job, also introduces additional latency and complexity. TFServing is a high-performance serving system for TensorFlow models, which can handle multiple versions and variants of a model. However, invoking a TFServing container from a Dataflow job requires making a gRPC or REST request, which can incur network overhead and latency. Moreover, this option requires managing two separate services: the Dataflow pipeline and the Google Kubernetes Engine cluster.
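The in-process pattern behind option B can be sketched roughly as follows. This is a minimal illustration, not a Dataflow job: the class only mirrors the shape of an Apache Beam DoFn (a setup step that runs once per worker, then a process step per element) without importing Beam, and `load_model`, its weights, the threshold, and the message fields are hypothetical stand-ins for a real TensorFlow model and real Pub/Sub payloads.

```python
import math
from typing import Callable, Dict, Iterable, Iterator, List


def load_model() -> Callable[[List[float]], float]:
    """Hypothetical stand-in for loading a TensorFlow model from storage."""
    weights = [0.8, -0.5, 1.2]  # made-up weights, for illustration only

    def predict(features: List[float]) -> float:
        z = sum(w * x for w, x in zip(weights, features))
        return 1.0 / (1.0 + math.exp(-z))  # sigmoid score in (0, 1)

    return predict


class InProcessPredictFn:
    """Shaped like a Beam DoFn: setup() once per worker, process() per element."""

    def __init__(self, threshold: float = 0.5) -> None:
        self.threshold = threshold
        self._model = None
        self.load_count = 0  # tracks how often the model is loaded

    def setup(self) -> None:
        # The model is loaded once and then reused for every element.
        self._model = load_model()
        self.load_count += 1

    def process(self, element: Dict) -> Iterator[Dict]:
        # Prediction is a local function call: no HTTP or gRPC round trip.
        score = self._model(element["features"])
        yield {
            "log_id": element["log_id"],
            "score": score,
            "is_anomaly": score >= self.threshold,
        }


def run_locally(fn: InProcessPredictFn, elements: Iterable[Dict]) -> List[Dict]:
    """Drive the setup/process lifecycle by hand, as a runner would."""
    fn.setup()
    rows: List[Dict] = []
    for element in elements:
        rows.extend(fn.process(element))
    return rows


# Made-up messages standing in for log events read from Pub/Sub.
messages = [
    {"log_id": "a", "features": [1.0, 0.0, 1.0]},
    {"log_id": "b", "features": [0.0, 2.0, 0.0]},
]
fn = InProcessPredictFn()
rows = run_locally(fn, messages)
```

The point of the sketch is the lifecycle: the expensive load happens once per worker, and each prediction after that costs only a function call, which is why this option avoids the per-request network overhead of the other three.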

Question 152

You recently developed a wide and deep model in TensorFlow. You generated training datasets using a SQL script that preprocessed raw data in BigQuery by performing instance-level transformations of the data. You need to create a training pipeline to retrain the model on a weekly basis. The trained model will be used to generate daily recommendations. You want to minimize model development and training time. How should you develop the training pipeline?

A.Use the Kubeflow Pipelines SDK to implement the pipeline. Use the BigQueryJobOp component to run the preprocessing script and the CustomTrainingJobOp component to launch a Vertex AI training job.

B.Use the Kubeflow Pipelines SDK to implement the pipeline. Use the DataflowPythonJobOp component to preprocess the data and the CustomTrainingJobOp component to launch a Vertex AI training job.

C.Use the TensorFlow Extended SDK to implement the pipeline. Use the ExampleGen component with the BigQuery executor to ingest the data, the Transform component to preprocess the data, and the Trainer component to launch a Vertex AI training job.

D.Use the TensorFlow Extended SDK to implement the pipeline. Implement the preprocessing steps as part of the input_fn of the model. Use the ExampleGen component with the BigQuery executor to ingest the data and the Trainer component to launch a Vertex AI training job.

Question 153

A company wants to predict the sale prices of houses based on available historical sales data. The target variable in the company's dataset is the sale price. The features include parameters such as the lot size, living area measurements, non-living area measurements, number of bedrooms, number of bathrooms, year built, and postal code. The company wants to use multi-variable linear regression to predict house sale prices.
Which step should a machine learning specialist take to remove features that are irrelevant for the analysis and reduce the model's complexity?

A.Plot a histogram of the features and compute their standard deviation. Remove features with low variance.

B.Plot a histogram of the features and compute their standard deviation. Remove features with high variance.

C.Run a correlation check of all features against the target variable. Remove features with low target variable correlation scores.

D.Build a heatmap showing the correlation of the dataset against itself. Remove features with low mutual correlation scores.
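For illustration, the correlation check against the target variable described in option C can be sketched in plain Python. The data values, the `listing_id_last_digit` feature, and the 0.3 cutoff are made up for this example; a real analysis would use the company's dataset and a cutoff chosen for it.

```python
import math
from typing import Dict, List, Tuple


def pearson(xs: List[float], ys: List[float]) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def select_features(
    features: Dict[str, List[float]],
    target: List[float],
    min_abs_corr: float,
) -> Tuple[List[str], Dict[str, float]]:
    """Keep features whose absolute correlation with the target meets the cutoff."""
    scores = {name: pearson(values, target) for name, values in features.items()}
    kept = [name for name, r in scores.items() if abs(r) >= min_abs_corr]
    return kept, scores


# Made-up sample: five houses, prices in thousands.
sale_price = [200.0, 290.0, 410.0, 500.0, 600.0]
features = {
    "living_area": [100.0, 150.0, 200.0, 250.0, 300.0],
    "listing_id_last_digit": [3.0, 9.0, 1.0, 9.0, 3.0],  # hypothetical noise feature
}
kept, scores = select_features(features, sale_price, min_abs_corr=0.3)
```

Pearson correlation only captures linear relationships, which matches the linear regression setting in the question; a categorical feature such as postal code would need to be encoded before a check like this is meaningful.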

Question 154

You work for a large retailer and you need to build a model to predict customer churn. The company has a dataset of historical customer data, including customer demographics, purchase history, and website activity. You need to create the model in BigQuery ML and thoroughly evaluate its performance. What should you do?

A.Create a logistic regression model in BigQuery ML. Use the ML.CONFUSION_MATRIX function to evaluate the model performance.

B.Create a linear regression model in BigQuery ML and register the model in Vertex AI Model Registry. Evaluate the model performance in Vertex AI.

C.Create a logistic regression model in BigQuery ML and register the model in Vertex AI Model Registry. Evaluate the model performance in Vertex AI.

D.Create a linear regression model in BigQuery ML. Use the ML.EVALUATE function to evaluate the model performance.
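As a rough illustration of what a confusion matrix reports for a binary churn classifier, the counts and two derived metrics can be computed by hand. BigQuery ML produces these figures in SQL; the Python below only shows the quantities involved, and the labels are made up (1 = churned, 0 = retained).

```python
from typing import Dict, List


def confusion_matrix(actual: List[int], predicted: List[int]) -> Dict[str, int]:
    """Count true/false positives and negatives for binary labels (1 = churn)."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for a, p in zip(actual, predicted):
        if a == 1 and p == 1:
            counts["tp"] += 1
        elif a == 0 and p == 1:
            counts["fp"] += 1
        elif a == 0 and p == 0:
            counts["tn"] += 1
        else:
            counts["fn"] += 1
    return counts


# Made-up labels for eight customers.
actual = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 0, 1, 0, 0, 1, 0, 1]

cm = confusion_matrix(actual, predicted)
precision = cm["tp"] / (cm["tp"] + cm["fp"])  # share of predicted churners who churned
recall = cm["tp"] / (cm["tp"] + cm["fn"])     # share of actual churners who were caught
```

A confusion matrix on its own is a single view at one decision threshold, so it is usually read alongside other metrics when a model needs to be evaluated thoroughly.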

Question 155

You work for a retail company. You have been tasked with building a model to determine the probability of churn for each customer. You need the predictions to be interpretable so the results can be used to develop marketing campaigns that target at-risk customers. What should you do?

A.Build a random forest regression model in a Vertex AI Workbench notebook instance. Configure the model to generate feature importances after the model is trained.

B.Build an AutoML tabular regression model. Configure the model to generate explanations when it makes predictions.

C.Build a custom TensorFlow neural network by using Vertex AI custom training. Configure the model to generate explanations when it makes predictions.

D.Build a random forest classification model in a Vertex AI Workbench notebook instance. Configure the model to generate feature importances after the model is trained.
