Free Access Google.Professional-Data-Engineer.v2022-07-07.q155 Practice Test (Page 14)

Question 61

A shipping company has live package-tracking data that is sent to an Apache Kafka stream in real time.
This is then loaded into BigQuery. Analysts in your company want to query the tracking data in BigQuery to analyze geospatial trends in the lifecycle of a package. The table was originally created with ingest-date partitioning. Over time, the query processing time has increased. You need to implement a change that would improve query performance in BigQuery. What should you do?

A.Implement clustering in BigQuery on the ingest date column.

B.Implement clustering in BigQuery on the package-tracking ID column.

C.Re-create the table using data partitioning on the package delivery date.

D.Tier older data onto Cloud Storage files, and leverage extended tables.

Question 62

Your company is performing data preprocessing for a learning algorithm in Google Cloud Dataflow.
Numerous data logs are being are being generated during this step, and the team wants to analyze them.
Due to the dynamic nature of the campaign, the data is growing exponentially every hour.
The data scientists have written the following code to read the data for a new key features in the logs.
BigQueryIO.Read
.named("ReadLogData")
.from("clouddataflow-readonly:samples.log_data")
You want to improve the performance of this data read. What should you do?

A.Specify the TableReferenceobject in the code.

B.Call a transform that returns TableRowobjects, where each element in the PCollectionrepresents a single row in the table.

C.Use of both the Google BigQuery TableSchemaand TableFieldSchemaclasses.

D.Use .fromQueryoperation to read specific fields from the table.

Question 63

As your organization expands its usage of GCP, many teams have started to create their own projects.
Projects are further multiplied to accommodate different stages of deployments and target audiences. Each project requires unique access control configurations. The central IT team needs to have access to all projects.
Furthermore, data from Cloud Storage buckets and BigQuery datasets must be shared for use in other projects in an ad hoc way. You want to simplify access control management by minimizing the number of policies.
Which two steps should you take? Choose 2 answers.

A.Introduce resource hierarchy to leverage access control policy inheritance.

B.Only use service accounts when sharing data for Cloud Storage buckets and BigQuery datasets.

C.For each Cloud Storage bucket or BigQuery dataset, decide which projects need access. Find all the active members who have access to these projects, and create a Cloud IAM policy to grant access to all these users.

D.Use Cloud Deployment Manager to automate access provision.

E.Create distinct groups for various teams, and specify groups in Cloud IAM policies.

Question 64

Your company is using WHILECARD tables to query data across multiple tables with similar names. The SQL statement is currently failing with the following error:
# Syntax error : Expected end of statement but got "-" at [4:11] SELECT age FROM bigquery-public-data.noaa_gsod.gsod WHERE age != 99 AND_TABLE_SUFFIX = `1929' ORDER BY age DESC Which table name will make the SQL statement work correctly?

A.bigquery-public-data.noaa_gsod.gsod*

B.`bigquery-public-data.noaa_gsod.gsod`

C.`bigquery-public-data.noaa_gsod.gsod*`

D.`bigquery-public-data.noaa_gsod.gsod'*

Question 65

You are choosing a NoSQL database to handle telemetry data submitted from millions of Internet-of- Things (IoT) devices. The volume of data is growing at 100 TB per year, and each data entry has about
100 attributes. The data processing pipeline does not require atomicity, consistency, isolation, and durability (ACID). However, high availability and low latency are required.
You need to analyze the data by querying against individual fields. Which three databases meet your requirements? (Choose three.)

A.Redis

B.HBase

C.MySQL

D.MongoDB

E.Cassandra

F.HDFS with Hive

Other Version: 153Google.Professional-Data-Engineer.v2025-09-04.q126; 813Google.Professional-Data-Engineer.v2025-03-18.q108; 948Google.Professional-Data-Engineer.v2024-12-09.q327; 1474Google.Professional-Data-Engineer.v2024-02-15.q202; 2261Google.Professional-Data-Engineer.v2023-09-14.q233; 5266Google.Professional-Data-Engineer.v2022-02-11.q268; 112Google.Examcollectionpass.Professional-Data-Engineer.v2021-12-20.by.jonathan.161q.pdf

Latest Upload: 117Oracle.1Z0-1057-23.v2025-09-10.q47; 150Google.Professional-Cloud-Network-Engineer.v2025-09-09.q179; 131SAP.C-S4EWM-2023.v2025-09-08.q83; 162TheSecOpsGroup.CNSP.v2025-09-08.q20; 220CFAInstitute.ESG-Investing.v2025-09-08.q173; 155PECB.ISO-IEC-27001-Lead-Implementer.v2025-09-06.q132; 145Salesforce.Data-Architect.v2025-09-05.q216; 139Adobe.AD0-E605.v2025-09-05.q50; 183Nutanix.NCP-MCI-6.10.v2025-09-05.q55; 114Oracle.1z0-591.v2025-09-05.q104

Question 61

Question 62

Question 63

Question 64

Question 65

Download PDF File