Question 6
You work for a large fast food restaurant chain with over 400,000 employees. You store employee information in Google BigQuery in a Userstable consisting of a FirstNamefield and a LastNamefield. A member of IT is building an application and asks you to modify the schema and data in BigQuery so the application can query a FullNamefield consisting of the value of the FirstNamefield concatenated with a space, followed by the value of the LastNamefield for each employee. How can you make that data available while minimizing cost?
Question 7
An organization maintains a Google BigQuery dataset that contains tables with user-level datA. They want to expose aggregates of this data to other Google Cloud projects, while still controlling access to the user-level data. Additionally, they need to minimize their overall storage cost and ensure the analysis cost for other projects is assigned to those projects. What should they do?
Question 8
You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt.
You need to modify the Cloud Dataflow pipeline to filter out this corrupt data. What should you do?
You need to modify the Cloud Dataflow pipeline to filter out this corrupt data. What should you do?
Question 9
You are managing a Cloud Dataproc cluster. You need to make a job run faster while minimizing costs, without losing work in progress on your clusters. What should you do?
Question 10
How would you query specific partitions in a BigQuery table?
