Question 56

A financial institution aims to detect fraudulent transactions using a Supervised Learning model deployed in Snowflake. They have a dataset with transaction details, including amount, timestamp, merchant category, and customer ID. The target variable is 'is_fraudulent' (0 or 1). They are considering different Supervised Learning algorithms. Which of the following algorithms would be MOST suitable for this fraud detection task, considering the need for interpretability, scalability, and the potential for imbalanced classes, and what specific strategies can be employed within Snowflake to handle the class imbalance?
  • Question 57

    You are tasked with training a complex machine learning model using scikit-learn and need to leverage Snowflake's data for training outside of Snowflake using an external function. The training data resides in a Snowflake table named 'CUSTOMER DATA'. Due to data governance policies, you must ensure minimal data movement and secure communication. You choose to implement the external function using AWS Lambda'. Which of the following steps are crucial to achieve secure and efficient model training outside of Snowflake?
  • Question 58

    You are developing a Python stored procedure in Snowflake to predict sales for a retail company. You want to incorporate external data (e.g., weather forecasts) into your model. Which of the following methods are valid and efficient ways to access and use external data within your Snowflake Python stored procedure?
  • Question 59

    You have successfully deployed a machine learning model in Snowflake using Snowpark and are generating predictions. You need to implement a robust error handling mechanism to ensure that if the model encounters an issue during prediction (e.g., missing feature, invalid data type), the process doesn't halt and the errors are logged appropriately. You are using a User-Defined Function (UDF) to call the model. Which of the following strategies, when used IN COMBINATION, provides the BEST error handling and monitoring capabilities in this scenario?
  • Question 60

    A financial services company wants to predict loan defaults. They have a table 'LOAN APPLICATIONS' with columns 'application_id', applicant_income', 'applicant_age' , and 'loan_amount'. You need to create several derived features to improve model performance.
    Which of the following derived features, when used in combination, would provide the MOST comprehensive view of an applicant's financial stability and ability to repay the loan? Select all that apply