Free Access Google.Professional-Data-Engineer.v2022-02-11.q268 Practice Test (Page 53)

Question 256

You are designing a data processing pipeline. The pipeline must be able to scale automatically as load increases. Messages must be processed at least once, and must be ordered within windows of 1 hour. How should you design the solution?

A.Use Cloud Pub/Sub for message ingestion and Cloud Dataflow for streaming analysis.

B.Use Apache Kafka for message ingestion and use Cloud Dataflow for streaming analysis.

C.Use Apache Kafka for message ingestion and use Cloud Dataproc for streaming analysis.

D.Use Cloud Pub/Sub for message ingestion and Cloud Dataproc for streaming analysis.
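
As background for the "ordered within windows of 1 hour" requirement, the following is a minimal sketch of fixed one-hour windowing in plain Python. It is not tied to any of the services named in the options. The function names, the `(timestamp, payload)` message shape, and the sample data are all assumptions made for illustration.

```python
from collections import defaultdict

WINDOW_SECONDS = 3600  # one-hour fixed windows, matching the question


def window_start(ts: float) -> int:
    """Return the epoch second at which the fixed window containing ts begins."""
    return int(ts // WINDOW_SECONDS) * WINDOW_SECONDS


def order_within_windows(messages):
    """Group (timestamp, payload) pairs by hourly window and sort inside each window."""
    windows = defaultdict(list)
    for ts, payload in messages:
        windows[window_start(ts)].append((ts, payload))
    # Sort the windows by start time, and the messages in each window by timestamp.
    return {start: sorted(items) for start, items in sorted(windows.items())}


# Hypothetical messages arriving out of order (timestamps in epoch seconds).
messages = [(7300.0, "c"), (10.0, "a"), (3599.0, "b"), (7200.0, "d")]
ordered = order_within_windows(messages)
```

Ordering is only established once a window's messages have been collected, which is why the requirement is phrased per window rather than globally.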

Question 257

Your company is selecting a system to centralize data ingestion and delivery. You are considering messaging and data integration systems to address the requirements. The key requirements are:
* The ability to seek to a particular offset in a topic, possibly back to the start of all data ever captured
* Support for publish/subscribe semantics on hundreds of topics
* Retain per-key ordering
Which system should you choose?

A.Apache Kafka

B.Cloud Storage

C.Cloud Pub/Sub

D.Firebase Cloud Messaging

Question 258

You have uploaded 5 years of log data to Cloud Storage. A user reported that some data points in the log data are outside of their expected ranges, which indicates errors. You need to address this issue and be able to run the process again in the future while keeping the original data for compliance reasons. What should you do?

A.Create a Cloud Dataflow workflow that reads the data from Cloud Storage, checks for values outside the expected range, sets the value to an appropriate default, and writes the updated records to the same dataset in Cloud Storage.

B.Create a Compute Engine instance and create a new copy of the data in Cloud Storage. Skip the rows with errors.

C.Import the data from Cloud Storage into BigQuery. Create a new BigQuery table, and skip the rows with errors.

D.Create a Cloud Dataflow workflow that reads the data from Cloud Storage, checks for values outside the expected range, sets the value to an appropriate default, and writes the updated records to a new dataset in Cloud Storage.
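
The "check for values outside the expected range and set an appropriate default" step that appears in the options can be sketched as a pure function. This is only an illustration under stated assumptions: the bounds, the default, the `value` field name, and the sample records are hypothetical, and the sketch says nothing about where the output is stored.

```python
EXPECTED_MIN = 0.0     # assumed lower bound of the valid range
EXPECTED_MAX = 100.0   # assumed upper bound of the valid range
DEFAULT_VALUE = 50.0   # assumed replacement for out-of-range values


def clean_record(record: dict) -> dict:
    """Return a copy of record with an out-of-range 'value' replaced by the default."""
    cleaned = dict(record)  # copy, so the input record is never modified
    if not (EXPECTED_MIN <= cleaned["value"] <= EXPECTED_MAX):
        cleaned["value"] = DEFAULT_VALUE
    return cleaned


# Hypothetical log records; ids 2 and 3 fall outside the assumed range.
original = [
    {"id": 1, "value": 42.0},
    {"id": 2, "value": -7.0},
    {"id": 3, "value": 250.0},
]
cleaned = [clean_record(r) for r in original]
```

Because `clean_record` returns a copy, the input records stay unchanged and the same function can be run again later over the same input.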

Question 259

Your company built a TensorFlow neural-network model with a large number of neurons and layers. The model fits well for the training data. However, when tested against new data, it performs poorly. What method can you employ to address this?

A.Threading

B.Serialization

C.Dropout Methods

D.Dimensionality Reduction
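
For readers unfamiliar with the term in option C, dropout randomly zeroes a fraction of activations during training. A minimal sketch of the common "inverted" variant in plain Python follows. It is not TensorFlow code, and the `dropout` function, its parameters, and the sample activations are assumptions made for illustration.

```python
import random


def dropout(activations, rate, rng):
    """Inverted dropout: zero each activation with probability `rate`
    and scale the survivors by 1 / (1 - rate) so the expected sum is unchanged."""
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    keep = 1.0 - rate
    return [a / keep if rng.random() >= rate else 0.0 for a in activations]


rng = random.Random(0)      # seeded generator so the run is repeatable
activations = [1.0] * 10    # hypothetical layer output
dropped = dropout(activations, rate=0.5, rng=rng)
```

Dropout is applied only while training. At inference time the activations are passed through unchanged, which corresponds to `rate=0.0` in this sketch.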

Question 260

You want to build a managed Hadoop system as your data lake. The data transformation process is composed of a series of Hadoop jobs executed in sequence. To accomplish the design of separating storage from compute, you decided to use the Cloud Storage connector to store all input data, output data, and intermediary data. However, you noticed that one Hadoop job runs very slowly with Cloud Dataproc, when compared with the on-premises bare-metal Hadoop environment (8-core nodes with 100-GB RAM).
Analysis shows that this particular Hadoop job is disk I/O intensive. You want to resolve the issue. What should you do?

A.Allocate sufficient memory to the Hadoop cluster, so that the intermediary data of that particular Hadoop job can be held in memory

B.Allocate sufficient persistent disk space to the Hadoop cluster, and store the intermediate data of that particular Hadoop job on native HDFS

C.Allocate more CPU cores of the virtual machine instances of the Hadoop cluster so that the networking bandwidth for each instance can scale up

D.Allocate additional network interface card (NIC), and configure link aggregation in the operating system to use the combined throughput when working with Cloud Storage
