Free Access Google.Professional-Machine-Learning-Engineer.v2022-03-21.q70 Practice Test (Page 5)

Question 16

You are developing a Kubeflow pipeline on Google Kubernetes Engine. The first step in the pipeline is to issue a query against BigQuery. You plan to use the results of that query as the input to the next step in your pipeline. You want to achieve this in the easiest way possible. What should you do?

A.Use the BigQuery console to execute your query and then save the query results Into a new BigQuery table.

B.Write a Python script that uses the BigQuery API to execute queries against BigQuery Execute this script as the first step in your Kubeflow pipeline

C.Use the Kubeflow Pipelines domain-specific language to create a custom component that uses the Python BigQuery client library to execute queries

D.Locate the Kubeflow Pipelines repository on GitHub Find the BigQuery Query Component, copy that component's URL, and use it to load the component into your pipeline. Use the component to execute queries against BigQuery

Question 17

A Machine Learning Specialist is using an Amazon SageMaker notebook instance in a private subnet of a corporate VPC. The ML Specialist has important data stored on the Amazon SageMaker notebook instance's Amazon EBS volume, and needs to take a snapshot of that EBS volume. However, the ML Specialist cannot find the Amazon SageMaker notebook instance's EBS volume or Amazon EC2 instance within the VPC.
Why is the ML Specialist not seeing the instance visible in the VPC?

A.Amazon SageMaker notebook instances are based on the EC2 instances within the customer account, but they run outside of VPCs.

B.Amazon SageMaker notebook instances are based on the Amazon ECS service within customer accounts.

C.Amazon SageMaker notebook instances are based on EC2 instances running within AWS service accounts.

D.Amazon SageMaker notebook instances are based on AWS ECS instances running within AWS service accounts.

Question 18

A Machine Learning Specialist is implementing a full Bayesian network on a dataset that describes public transit in New York City. One of the random variables is discrete, and represents the number of minutes New Yorkers wait for a bus given that the buses cycle every 10 minutes, with a mean of 3 minutes.
Which prior probability distribution should the ML Specialist use for this variable?

A.Binomial distribution

B.Poisson distribution

C.Normal distribution

D.Uniform distribution

Question 19

A Machine Learning Specialist has completed a proof of concept for a company using a small data sample, and now the Specialist is ready to implement an end-to-end solution in AWS using Amazon SageMaker. The historical training data is stored in Amazon RDS.
Which approach should the Specialist use for training a model using that data?

A.Push the data from Microsoft SQL Server to Amazon S3 using an AWS Data Pipeline and provide the S3 location within the notebook.

B.Write a direct connection to the SQL database within the notebook and pull data in

C.Move the data to Amazon ElastiCache using AWS DMS and set up a connection within the notebook to pull data in for fast access.

D.Move the data to Amazon DynamoDB and set up a connection to DynamoDB within the notebook to pull data in.

Question 20

A web-based company wants to improve its conversion rate on its landing page. Using a large historical dataset of customer visits, the company has repeatedly trained a multi-class deep learning network algorithm on Amazon SageMaker. However, there is an overfitting problem: training data shows 90% accuracy in predictions, while test data shows 70% accuracy only.
The company needs to boost the generalization of its model before deploying it into production to maximize conversions of visits to purchases.
Which action is recommended to provide the HIGHEST accuracy model for the company's test and validation data?

A.Increase the randomization of training data in the mini-batches used in training

B.Apply L1 or L2 regularization and dropouts to the training

C.Reduce the number of layers and units (or neurons) from the deep learning network

D.Allocate a higher proportion of the overall data to the training dataset

Other Version: 955Google.Professional-Machine-Learning-Engineer.v2025-05-29.q260; 1353Google.Professional-Machine-Learning-Engineer.v2024-09-13.q242; 1323Google.Professional-Machine-Learning-Engineer.v2023-12-28.q120; 831Google.Professional-Machine-Learning-Engineer.v2023-02-27.q25; 1839Google.Professional-Machine-Learning-Engineer.v2022-08-07.q66; 62Google.Prepawayete.Professional-Machine-Learning-Engineer.v2021-09-13.by.murray.54q.pdf

Latest Upload: 107Oracle.1Z0-1057-23.v2025-09-10.q47; 150Google.Professional-Cloud-Network-Engineer.v2025-09-09.q179; 131SAP.C-S4EWM-2023.v2025-09-08.q83; 155TheSecOpsGroup.CNSP.v2025-09-08.q20; 206CFAInstitute.ESG-Investing.v2025-09-08.q173; 152PECB.ISO-IEC-27001-Lead-Implementer.v2025-09-06.q132; 140Salesforce.Data-Architect.v2025-09-05.q216; 136Adobe.AD0-E605.v2025-09-05.q50; 176Nutanix.NCP-MCI-6.10.v2025-09-05.q55; 114Oracle.1z0-591.v2025-09-05.q104

Question 16

Question 17

Question 18

Question 19

Question 20

Download PDF File