Question 71

You are building a product recommendation system using Snowflake Cortex. You have a table 'PRODUCT DESCRIPTIONS' containing product IDs and textual descriptions. You want to generate vector embeddings for these descriptions to perform similarity searches. However, you need to control the cost and latency of the embedding generation process. Which of the following strategies and considerations are MOST important for optimizing performance and cost when generating vector embeddings in Snowflake Cortex using a UDF?
  • Question 72

    Consider you are working on a credit risk scoring model using Snowflake. You have a table 'credit data' with the following schema: 'customer id', 'age', 'income', 'credit_score', 'loan_amount', 'loan_duration', 'defaulted'. You want to create several new features using Snowflake SQL to improve your model. Which combination of the following SQL statements will successfully create features for age groups, income-to-loan ratio, and interaction between credit score and loan amount using SQL in Snowflake? Choose all that apply.
  • Question 73

    You are tasked with optimizing the hyperparameter tuning process for a complex deep learning model within Snowflake using Snowpark Python. The model is trained on a large dataset stored in Snowflake, and you need to efficiently explore a wide range of hyperparameter values to achieve optimal performance. Which of the following approaches would provide the MOST scalable and performant solution for hyperparameter tuning in this scenario, considering the constraints and capabilities of Snowflake?
  • Question 74

    You're tasked with building an image classification model on Snowflake to identify defective components on a manufacturing assembly line using images captured by high-resolution cameras. The images are stored in a Snowflake table named 'ASSEMBLY LINE IMAGES', with columns including 'image_id' (INT), 'image_data' (VARIANT containing binary image data), and 'timestamp' (TIMESTAMP NTZ). You have a pre-trained image classification model (TensorFlow/PyTorch) saved in Snowflake's internal stage. To improve inference speed and reduce data transfer overhead, which approach provides the MOST efficient way to classify these images using Snowpark Python and UDFs?
  • Question 75

    You have trained a fraud detection model using scikit-learn and want to deploy it in Snowflake using the Snowflake Model Registry. You've registered the model as 'fraud _ model' in the registry. You need to create a Snowflake user-defined function (UDF) that loads and executes the model. Which of the following code snippets correctly creates the UDF, assuming the model is a serialized pickle file stored in a stage named 'model_stage'?