Question 66
You are designing the architecture to process your data from Cloud Storage to BigQuery by using Dataflow.
The network team provided you with the Shared VPC network and subnetwork to be used by your pipelines.
You need to enable the deployment of the pipeline on the Shared VPC network. What should you do?
The network team provided you with the Shared VPC network and subnetwork to be used by your pipelines.
You need to enable the deployment of the pipeline on the Shared VPC network. What should you do?
Question 67
What is the general recommendation when designing your row keys for a Cloud Bigtable schema?
Question 68
You are designing a cloud-native historical data processing system to meet the following conditions:
* The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Cloud Dataproc, BigQuery, and Compute Engine.
* A streaming data pipeline stores new data daily.
* Peformance is not a factor in the solution.
* The solution design should maximize availability.
How should you design data storage for this solution?
* The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Cloud Dataproc, BigQuery, and Compute Engine.
* A streaming data pipeline stores new data daily.
* Peformance is not a factor in the solution.
* The solution design should maximize availability.
How should you design data storage for this solution?
Question 69
You are creating a model to predict housing prices. Due to budget constraints, you must run it on a single resource-constrained virtual machine. Which learning algorithm should you use?
Question 70
Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the data. How should you deduplicate the data most efficiency?
