Question 31
You are deploying a new storage system for your mobile application, which is a media streaming service. You decide the best fit is Google Cloud Datastore. You have entities with multiple properties, some of which can take on multiple values. For example, in the entity 'Movie' the property 'actors' and the property 'tags' have multiple values but the property 'date released' does not. A typical query would ask for all movies with actor=<actorname> ordered by date_released or all movies with tag=Comedy ordered by date_released. How should you avoid a combinatorial explosion in the number of indexes?




Question 32
You are working on a sensitive project involving private user data. You have set up a project on Google
Cloud Platform to house your work internally. An external consultant is going to assist with coding a
complex transformation in a Google Cloud Dataflow pipeline for your project. How should you maintain
users' privacy?
Cloud Platform to house your work internally. An external consultant is going to assist with coding a
complex transformation in a Google Cloud Dataflow pipeline for your project. How should you maintain
users' privacy?
Question 33
Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the dat
Question 34
You've migrated a Hadoop job from an on-prem cluster to dataproc and GCS. Your Spark job is a complicated analytical workload that consists of many shuffing operations and initial data are parquet files (on average 200-400 MB size each). You see some degradation in performance after the migration to Dataproc, so you'd like to optimize for it. You need to keep in mind that your organization is very cost- sensitive, so you'd like to continue using Dataproc on preemptibles (with 2 non-preemptible workers only) for this workload.
What should you do?
What should you do?
Question 35
What are two of the characteristics of using online prediction rather than batch prediction?