Free Access Snowflake.DSA-C03.v2025-10-01.q105 Practice Test (Page 4)

Question 11

A Snowflake table named 'SALES DATA contains a 'TRANSACTION DATE column stored as VARCHAR. The data in this column is inconsistent; some rows have dates in 'YYYY-MM-DD' format, others in 'MM/DD/YYYY' format, and some contain invalid date strings like 'N/A'. You need to standardize all dates to 'YYYY-MM-DD' format and store them in a new column called FORMATTED DATE in a new table 'STANDARDIZED_SALES DATA. Which of the following approaches, using Snowpark Python and SQL, most effectively handles these inconsistencies and minimizes errors during data transformation? Select all that apply:

A.Using a Snowpark Python UDF to parse each date string individually, handling different formats with conditional logic, and returning a formatted date string. This provides flexibility in handling diverse date formats.

B.Using a series of DATE" and 'TO_VARCHAR SQL functions in Snowpark to attempt converting the date in different formats and then formatting the result to 'YYYY-MM-DD'. Any conversion failing returns NULL.

C.Employing Snowpark's error handling mechanism (e.g., 'try...except' blocks) within a loop to iteratively convert each date string, catching and logging errors, and storing valid dates in a new column.

D.Creating a view on top of 'SALES_DATA' that implements the conversion logic. This avoids creating a new physical table immediately and allows for experimentation with different conversion strategies before materializing the data.

E.Using a single 'TO_DATE function with format parameter set to 'AUTO' combined with 'TO_VARCHAR to format the date to 'YYYY-MM-DD'.

Question 12

You are analyzing a dataset of website traffic and conversions in Snowflake, aiming to understand the relationship between the number of pages visited CPAGES VISITED) and the conversion rate (CONVERSION_RATE). You perform a simple linear regression using the 'REGR SLOPE and 'REGR INTERCEPT functions. However, after plotting the data and the regression line, you observe significant heteroscedasticity (non-constant variance of errors). Which of the following actions, performed within Snowflake during the data preparation and feature engineering phase, are MOST appropriate to address this heteroscedasticity and improve the validity of your linear regression model? (Select all that apply)

A.Apply a logarithmic transformation to the 'CONVERSION RATE' variable using the 'LN()' function. CREATE OR REPLACE VIEW TRANSFORMED_DATA AS SELECT PAGES VISITED, LN(CONVERSION RATE) AS LOG_CONVERSION RATE FROM ORIGINAL_DATA;

B.Calculate the weighted least squares regression by weighting each observation by the inverse of the squared predicted values from an initial OLS regression. This requires multiple SQL queries.

C.Standardize the 'PAGES_VISITED' and 'CONVERSION_RATE variables using the and functions.Create OR REPLACE VIEW STANDARDIZED_DATA AS SELECT (PAGES_VISITED - OVER()) / OVER() AS Z PAGES_VISITED, (CONVERSION RATE -OVER()) / OVER() AS FROM ORIGINAL_DATA;

D.Remove outlier data points from the dataset based on the Interquartile Range (IQR) of the residuals from the original linear regression model. This requires calculating the residuals first.

E.Apply a Box-Cox transformation to the 'CONVERSION RATE' variable. This transformation will determine the optimal lambda value using some complex SQL statistical operations. This can be approximated to log tranformation in many real life scenarios.

Question 13

You have trained a logistic regression model in Python using scikit-learn and plan to deploy it as a Python stored procedure in Snowflake. You need to serialize the model for deployment. Consider the following code snippet:

B.The code will execute successfully. The model serialization and deserialization using pickle are correctly implemented within the stored procedure.

C.The code will fail because the 'model_bytes' variable is not accessible within the 'predict' function's scope.

D.The code will fail because it does not handle potential security vulnerabilities associated with deserializing pickled objects from untrusted sources.

E.The code will fail because Snowflake stages cannot be used to store model objects.

Question 14

You've built a customer churn prediction model in Snowflake, and are using the AUC as your primary performance metric. You notice that your model consistently performs well (AUC > 0.85) on your validation set but significantly worse (AUC < 0.7) in production. What are the possible reasons for this discrepancy? (Select all that apply)

A.Your training and validation sets are not representative of the real-world production data due to sampling bias.

B.There's a temporal bias: the customer behavior patterns have changed since the training data was collected.

C.Your model is overfitting to the validation data. This causes to give high performance on validation set but less accurate in the real world.

D.The production environment has significantly more missing data compared to the training and validation environments.

E.The AUC metric is inherently unreliable and should not be used for model evaluation.

Question 15

You are building a model to predict loan defaults using data stored in Snowflake. As part of your feature engineering process within a Snowflake Notebook, you need to handle missing values in several columns: 'annual _ income', and You want to use a combination of imputation strategies: replace missing values with the median, 'annual_income' with the mean, and with a constant value of 0.5. You are leveraging the Snowpark DataFrame API. Which of the following code snippets correctly implements this imputation strategy?

A.Option A

B.Option B

C.Option C

D.Option D

E.Option E

Latest Upload: 202PaloAltoNetworks.NGFW-Engineer.v2026-05-01.q43; 299Nokia.4A0-113.v2026-05-01.q69; 255EC-COUNCIL.312-49v11.v2026-04-30.q214; 228Microsoft.MB-820.v2026-04-30.q101; 211Salesforce.MC-202.v2026-04-30.q57; 206BICSI.INSTC_V8.v2026-04-29.q53; 335NMLS.MLO.v2026-04-28.q82; 243NCARB.Project-Management.v2026-04-28.q27; 462EMC.D-AV-DY-23.v2026-04-27.q184; 1116ServiceNow.CSA.v2026-04-27.q483

Question 11

Question 12

Question 13

Question 14

Question 15

Download PDF File