Question 171

Your company's on-premises Apache Hadoop servers are approaching end-of-life, and IT has decided to migrate the cluster to Google Cloud Dataproc. A like-for-like migration of the cluster would require 50 TB of Google Persistent Disk per node. The CIO is concerned about the cost of using that much block storage.
You want to minimize the storage cost of the migration. What should you do?
  • Question 172

    Which of the following is NOT one of the three main types of triggers that Dataflow supports?
  • Question 173

    You're using Bigtable for a real-time application, and you have a heavy load that is a mix of read and writes.
    You've recently identified an additional use case and need to perform hourly an analytical job to calculate certain statistics across the whole database. You need to ensure both the reliability of your production application as well as the analytical workload.
    What should you do?
  • Question 174

    Which Java SDK class can you use to run your Dataflow programs locally?
  • Question 175

    You are integrating one of your internal IT applications and Google BigQuery, so users can query BigQuery from the application's interface. You do not want individual users to authenticate to BigQuery and you do not want to give them access to the dataset. You need to securely access BigQuery from your IT application.
    What should you do?