Question 16

A marketing team is using Snowflake to store customer data including demographics, purchase history, and website activity. They want to perform customer segmentation using hierarchical clustering. Considering performance and scalability with very large datasets, which of the following strategies is the MOST suitable approach?
  • Question 17

    You're analyzing the performance of two different AIB testing variants of an advertisement. You've collected the following data over a period of one week: Variant A: 1000 impressions, 50 conversions Variant B: 1100 impressions, 66 conversions Which of the following statements are TRUE regarding confidence intervals and statistical significance in this scenario?
  • Question 18

    A data scientist is preparing customer churn data for a machine learning model in Snowflake. The dataset contains a 'Contract_Type' column with values 'Month-to-Month', 'One Year', and 'Two Year'. They want to use label encoding to transform this categorical feature into numerical values. Which of the following SQL statements correctly implements label encoding for the 'Contract_Type' column, assigning 'Month-to-Month' to 0, 'One Year' to 1, and 'Two Year' to 2, and creates a new column named 'Contract_Type_Encoded'? Additionally, the data scientist wants to handle potential NULL values in 'Contract_Type' by assigning them the value of -1.
  • Question 19

    You are building a machine learning pipeline that uses data stored in Snowflake. You want to connect a Jupyter Notebook running on your local machine to Snowflake using Snowpark. You need to securely authenticate to Snowflake and ensure that you are using a dedicated compute resource for your Snowpark session. Which of the following approaches is the MOST secure and efficient way to achieve this?
  • Question 20

    You're deploying a pre-built image classification model hosted on a REST API endpoint, and you need to integrate it with Snowflake to classify images stored in cloud storage accessible via an external stage named 'IMAGE STAGE. The API expects image data as a base64 encoded string in the request body. Which SQL query snippet demonstrates the correct approach for calling the external function 'CLASSIFY IMAGE and incorporating the base64 encoding?