Question 191

You are developing an application that uses a recommendation engine on Google Cloud. Your solution should display new videos to customers based on past views. Your solution needs to generate labels for the entities in videos that the customer has viewed. Your design must be able to provide very fast filtering suggestions based on data from other customer preferences on several TB of data. What should you do?
  • Question 192

    Your company receives both batch- and stream-based event data. You want to process the data using
    Google Cloud Dataflow over a predictable time period. However, you realize that in some instances data
    can arrive late or out of order. How should you design your Cloud Dataflow pipeline to handle data that is
    late or out of order?
  • Question 193

    You create an important report for your large team in Google Data Studio 360. The report uses Google BigQuery as its data source. You notice that visualizations are not showing data that is less than 1 hour old. What should you do?
  • Question 194

    Your company built a TensorFlow neutral-network model with a large number of neurons and layers. The model fits well for the training data. However, when tested against new data, it performs poorly.
    What method can you employ to address this?
  • Question 195

    Your company currently runs a large on-premises cluster using Spark, Hive, and HDFS in a colocation facility.
    The cluster is designed to accommodate peak usage on the system; however, many jobs are batch in nature, and usage of the cluster fluctuates quite dramatically. Your company is eager to move to the cloud to reduce the overhead associated with on-premises infrastructure and maintenance and to benefit from the cost savings.
    They are also hoping to modernize their existing infrastructure to use more serverless offerings in order to take advantage of the cloud. Because of the timing of their contract renewal with the colocation facility, they have only 2 months for their initial migration. How would you recommend they approach their upcoming migration strategy so they can maximize their cost savings in the cloud while still executing the migration in time?