Free Access Google.Professional-Data-Engineer.v2024-12-09.q327 Practice Test (Page 54)

Question 261

You are migrating a large number of files from a public HTTPS endpoint to Cloud Storage. The files are protected from unauthorized access using signed URLs. You created a TSV file that contains the list of object URLs and started a transfer job by using Storage Transfer Service. You notice that the job has run for a long time and eventually failed Checking the logs of the transfer job reveals that the job was running fine until one point, and then it failed due to HTTP 403 errors on the remaining files You verified that there were no changes to the source system You need to fix the problem to resume the migration process. What should you do?

A.Set up Cloud Storage FUSE, and mount the Cloud Storage bucket on a Compute Engine Instance Remove the completed files from the TSV file Use a shell script to iterate through the TSV file and download the remaining URLs to the FUSE mount point.

B.Update the file checksums in the TSV file from using MD5 to SHA256. Remove the completed files from the TSV file and rerun the Storage Transfer Service job.

C.Renew the TLS certificate of the HTTPS endpoint Remove the completed files from the TSV file and rerun the Storage Transfer Service job.

D.Create a new TSV file for the remaining files by generating signed URLs with a longer validity period.Split the TSV file into multiple smaller files and submit them as separate Storage Transfer Service jobs in parallel.

Question 262

Your software uses a simple JSON format for all messages. These messages are published to Google Cloud Pub/Sub, then processed with Google Cloud Dataflow to create a real-time dashboard for the CFO. During testing, you notice that some messages are missing in the dashboard. You check the logs, and all messages are being published to Cloud Pub/Sub successfully. What should you do next?

A.Run a fixed dataset through the Cloud Dataflow pipeline and analyze the output.

B.Switch Cloud Dataflow to pull messages from Cloud Pub/Sub instead of Cloud Pub/Sub pushing messages to Cloud Dataflow.

C.Check the dashboard application to see if it is not displaying correctly.

D.Use Google Stackdriver Monitoring on Cloud Pub/Sub to find the missing messages.

Question 263

You need to store and analyze social media postings in Google BigQuery at a rate of 10,000 messages per minute in near real-time. Initially, design the application to use streaming inserts for individual postings.
Your application also performs data aggregations right after the streaming inserts. You discover that the queries after streaming inserts do not exhibit strong consistency, and reports from the queries might miss in-flight data. How can you adjust your application design?

A.Re-write the application to load accumulated data every 2 minutes.

B.Convert the streaming insert code to batch load for individual messages.

C.Load the original message to Google Cloud SQL, and export the table every hour to BigQuery via streaming inserts.

D.Estimate the average latency for data availability after streaming inserts, and always run queries after waiting twice as long.

Question 264

Suppose you have a dataset of images that are each labeled as to whether or not they contain a human face. To create a neural network that recognizes human faces in images using this labeled dataset, what approach would likely be the most effective?

A.Use K-means Clustering to detect faces in the pixels.

B.Use feature engineering to add features for eyes, noses, and mouths to the input data.

C.Use deep learning by creating a neural network with multiple hidden layers to automatically detect features of faces.

D.Build a neural network with an input layer of pixels, a hidden layer, and an output layer with two categories.

Question 265

You have a data pipeline with a Cloud Dataflow job that aggregates and writes time series metrics to Cloud Bigtable. This data feeds a dashboard used by thousands of users across the organization. You need to support additional concurrent users and reduce the amount of time required to write the data. Which two actions should you take? (Choose two.)

A.Configure your Cloud Dataflow pipeline to use local execution

B.Increase the maximum number of Cloud Dataflow workers by setting maxNumWorkersin PipelineOptions

C.Increase the number of nodes in the Cloud Bigtable cluster

D.Modify your Cloud Dataflow pipeline to use the Flattentransform before writing to Cloud Bigtable

E.Modify your Cloud Dataflow pipeline to use the CoGroupByKeytransform before writing to Cloud Bigtable

Other Version: 1171Google.Professional-Data-Engineer.v2026-02-05.q279; 1455Google.Professional-Data-Engineer.v2025-09-04.q126; 1462Google.Professional-Data-Engineer.v2025-03-18.q108; 2865Google.Professional-Data-Engineer.v2024-02-15.q202; 3258Google.Professional-Data-Engineer.v2023-09-14.q233; 4657Google.Professional-Data-Engineer.v2022-07-07.q155; 6650Google.Professional-Data-Engineer.v2022-02-11.q268; 112Google.Examcollectionpass.Professional-Data-Engineer.v2021-12-20.by.jonathan.161q.pdf

Latest Upload: 201PaloAltoNetworks.NGFW-Engineer.v2026-05-01.q43; 298Nokia.4A0-113.v2026-05-01.q69; 253EC-COUNCIL.312-49v11.v2026-04-30.q214; 228Microsoft.MB-820.v2026-04-30.q101; 211Salesforce.MC-202.v2026-04-30.q57; 206BICSI.INSTC_V8.v2026-04-29.q53; 335NMLS.MLO.v2026-04-28.q82; 243NCARB.Project-Management.v2026-04-28.q27; 462EMC.D-AV-DY-23.v2026-04-27.q184; 1116ServiceNow.CSA.v2026-04-27.q483

Question 261

Question 262

Question 263

Question 264

Question 265

Download PDF File