Free Access Google.Professional-Data-Engineer.v2022-02-11.q268 Practice Test (Page 20)

Question 91

You need to choose a database to store time series CPU and memory usage for millions of computers. You need to store this data in one-second interval samples. Analysts will be performing real-time, ad hoc analytics against the database. You want to avoid being charged for every query executed and ensure that the schema design will allow for future growth of the dataset. Which database and data model should you choose?

A. Create a narrow table in Cloud Bigtable with a row key that combines the Compute Engine computer identifier with the sample time at each second.

B. Create a wide table in BigQuery, create a column for the sample value at each second, and update the row with the interval for each second.

C. Create a table in BigQuery, and append the new samples for CPU and memory to the table.

D. Create a wide table in Cloud Bigtable with a row key that combines the computer identifier with the sample time at each minute, and combine the values for each second as column data.
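
To make the two Cloud Bigtable layouts in the options above concrete, here is a minimal Python sketch of how each row key could be built. It only formats strings and does not call the Bigtable client; the `#` separator, the timestamp format, the function names, and the `vm-0001` identifier are all illustrative assumptions, not part of the question.

```python
from datetime import datetime, timezone
from typing import Tuple


def narrow_row_key(computer_id: str, ts: datetime) -> str:
    """Narrow layout: one row per computer per second.

    The key combines the computer identifier with the sample time
    truncated to the second (hypothetical '#' separator).
    """
    return f"{computer_id}#{ts.strftime('%Y%m%d%H%M%S')}"


def wide_row_key_and_column(computer_id: str, ts: datetime) -> Tuple[str, str]:
    """Wide layout: one row per computer per minute.

    The key is truncated to the minute, and the second within that
    minute is used as the column qualifier holding the sample.
    """
    row_key = f"{computer_id}#{ts.strftime('%Y%m%d%H%M')}"
    column_qualifier = ts.strftime("%S")
    return row_key, column_qualifier


# Hypothetical sample taken at 09:30:07 UTC.
sample_time = datetime(2022, 2, 11, 9, 30, 7, tzinfo=timezone.utc)
print(narrow_row_key("vm-0001", sample_time))
print(wide_row_key_and_column("vm-0001", sample_time))
```

In the narrow layout every sample produces a new row, while in the wide layout sixty samples share one row and differ only in the column qualifier.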

Question 92

Which methods can be used to reduce the number of rows processed by BigQuery?

A. Splitting tables into multiple tables; putting data in partitions

B. Splitting tables into multiple tables; putting data in partitions; using the LIMIT clause

C. Putting data in partitions; using the LIMIT clause

D. Splitting tables into multiple tables; using the LIMIT clause

Question 93

Your financial services company is moving to cloud technology and wants to store 50 TB of financial time-series data in the cloud. This data is updated frequently, and new data will be streaming in all the time. Your company also wants to move its existing Apache Hadoop jobs to the cloud to get insights into this data.
Which product should they use to store the data?

A. Cloud Bigtable

B. Google BigQuery

C. Google Cloud Storage

D. Google Cloud Datastore

Question 94

Your analytics team wants to build a simple statistical model to determine which customers are most likely to work with your company again, based on a few different metrics. They want to run the model on Apache Spark, using data housed in Google Cloud Storage, and you have recommended using Google Cloud Dataproc to execute this job. Testing has shown that this workload can run in approximately 30 minutes on a 15-node cluster, outputting the results into Google BigQuery. The plan is to run this workload weekly. How should you optimize the cluster for cost?

A. Use preemptible virtual machines (VMs) for the cluster

B. Migrate the workload to Google Cloud Dataflow

C. Use a higher-memory node so that the job runs faster

D. Use SSDs on the worker nodes so that the job can run faster
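
For a sense of scale, the workload described in the question (about 30 minutes on 15 nodes, once a week) can be expressed in node-hours. The sketch below is plain arithmetic; the hourly rate is a made-up placeholder and not a real Google Cloud price, and the function names are illustrative.

```python
def node_hours_per_run(nodes: int, minutes: float) -> float:
    """Node-hours consumed by one run of the job."""
    return nodes * minutes / 60.0


def yearly_cost(nodes: int, minutes: float, runs_per_year: int,
                hourly_rate: float) -> float:
    """Yearly cost for a given per-node hourly rate (rate is an assumption)."""
    return node_hours_per_run(nodes, minutes) * runs_per_year * hourly_rate


# Figures from the question: 15 nodes, ~30 minutes, weekly (52 runs a year).
per_run = node_hours_per_run(15, 30)
per_year = per_run * 52

# HYPOTHETICAL_RATE is a placeholder value, not an actual price.
HYPOTHETICAL_RATE = 0.10
print(per_run, per_year, yearly_cost(15, 30, 52, HYPOTHETICAL_RATE))
```

Because the cluster is only needed for a short, infrequent batch run, the per-node rate and the run time are the main levers on the total.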

Question 95

Cloud Dataproc charges you only for what you really use with _____ billing.

A. month-by-month

B. minute-by-minute

C. week-by-week

D. hour-by-hour
