Free Access Google.Professional-Data-Engineer.v2024-12-09.q327 Practice Test (Page 13)

Question 56

You are building a data pipeline on Google Cloud. You need to prepare data using a casual method for a
machine-learning process. You want to support a logistic regression model. You also need to monitor and
adjust for null values, which must remain real-valued and cannot be removed. What should you do?

A.Use Cloud Dataflow to find null values in sample source data. Convert all nulls to 'none' using a Cloud
Dataprep job.

B.Use Cloud Dataflow to find null values in sample source data. Convert all nulls to 0 using a custom script.

C.Use Cloud Dataprep to find null values in sample source data. Convert all nulls to 'none' using a Cloud
Dataproc job.

D.Use Cloud Dataprep to find null values in sample source data. Convert all nulls to 0 using a Cloud
Dataprep job.
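
The requirement in Question 56, that nulls be monitored and replaced with a real-valued stand-in rather than dropped, can be illustrated with a minimal plain-Python sketch. This is not Cloud Dataprep or Cloud Dataflow code; the helper names `count_nulls` and `impute_nulls`, the sample rows, and the fill value of 0.0 are assumptions made purely for illustration.

```python
def count_nulls(rows):
    """Return the number of None entries per column (the monitoring step)."""
    width = len(rows[0])
    return [sum(1 for row in rows if row[c] is None) for c in range(width)]


def impute_nulls(rows, fill=0.0):
    """Replace None with a real-valued fill so that no row is removed."""
    return [[fill if v is None else float(v) for v in row] for row in rows]


# Hypothetical sample of source data with two numeric features.
sample = [[1.0, None], [None, 3.5], [2.0, 4.0]]

null_counts = count_nulls(sample)
cleaned = impute_nulls(sample)

print(null_counts)
print(cleaned)
```

Every row survives the cleaning step, and every feature stays numeric, which is what a logistic regression model needs from its inputs.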

Question 57

You are responsible for writing your company's ETL pipelines to run on an Apache Hadoop cluster. The pipeline will require some checkpointing and splitting pipelines. Which method should you use to write the pipelines?

A.PigLatin using Pig

B.HiveQL using Hive

C.Java using MapReduce

D.Python using MapReduce

Question 58

You architect a system to analyze seismic data. Your extract, transform, and load (ETL) process runs as a series of MapReduce jobs on an Apache Hadoop cluster. The ETL process takes days to process a data set because some steps are computationally expensive. Then you discover that a sensor calibration step has been omitted. How should you change your ETL process to carry out sensor calibration systematically in the future?

A.Introduce a new MapReduce job to apply sensor calibration to raw data, and ensure all other MapReduce jobs are chained after this.

B.Develop an algorithm through simulation to predict variance of data output from the last MapReduce job based on calibration factors, and apply the correction to all data.

C.Add sensor calibration data to the output of the ETL process, and document that all users need to apply sensor calibration themselves.

D.Modify the transform MapReduce jobs to apply sensor calibration before they do anything else.

Question 59

You need to look at BigQuery data from a specific table multiple times a day. The underlying table you are querying is several petabytes in size, but you want to filter your data and provide simple aggregations to downstream users. You want to run queries faster and get up-to-date insights quicker. What should you do?

A.Run a scheduled query to pull the necessary data at specific intervals daily.

B.Create a materialized view based off of the query being run.

C.Use a cached query to accelerate time to results.

D.Limit the query columns being pulled in the final result.

Question 60

You are building a streaming Dataflow pipeline that ingests noise level data from hundreds of sensors placed near construction sites across a city. The sensors measure noise level every ten seconds, and send that data to the pipeline when levels reach above 70 dBA. You need to detect the average noise level from a sensor when data is received for a duration of more than 30 minutes, but the window ends when no data has been received for 15 minutes. What should you do?

A.Use session windows with a 30-minute gap duration.

B.Use tumbling windows with a 15-minute window and a fifteen-minute .withAllowedLateness operator.

C.Use session windows with a 15-minute gap duration.

D.Use hopping windows with a 15-minute window, and a thirty-minute period.
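
The behaviour described in Question 60, a window that stays open while readings keep arriving and closes after 15 minutes of silence, can be sketched in plain Python. This is not Dataflow or Apache Beam code; the function name `session_averages`, the sample readings, and the five-minute spacing are hypothetical and chosen only to keep the example short.

```python
from datetime import datetime, timedelta


def session_averages(readings, gap, min_duration):
    """Group (timestamp, dBA) readings into gap-based sessions and
    average the sessions that last longer than min_duration."""
    sessions = []
    for ts, level in sorted(readings):
        # Extend the current session if the previous reading is within the gap.
        if sessions and ts - sessions[-1][-1][0] < gap:
            sessions[-1].append((ts, level))
        else:
            sessions.append([(ts, level)])

    results = []
    for s in sessions:
        duration = s[-1][0] - s[0][0]
        if duration > min_duration:
            avg = sum(level for _, level in s) / len(s)
            results.append((s[0][0], s[-1][0], avg))
    return results


start = datetime(2024, 1, 1, 9, 0)
# A 35-minute burst of readings, then silence, then a short 5-minute burst.
readings = [(start + timedelta(minutes=5 * i), 72.0 if i % 2 == 0 else 78.0)
            for i in range(8)]
readings += [(start + timedelta(minutes=90), 80.0),
             (start + timedelta(minutes=95), 80.0)]

results = session_averages(readings,
                           gap=timedelta(minutes=15),
                           min_duration=timedelta(minutes=30))
print(results)
```

In this sketch the first burst forms one session that spans 35 minutes and is averaged, while the second burst is split off by the silence and discarded because it lasts only 5 minutes.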
