Free Access Snowflake.DSA-C03.v2025-10-01.q105 Practice Test (Page 21)

Question 96

You are working with a Snowflake table named 'CUSTOMER_DATA' containing customer information, including a 'PHONE_NUMBER' column. Due to data entry errors, some phone numbers are stored as NULL, while others are present but in various inconsistent formats (e.g., with or without hyphens, parentheses, or country codes). You want to standardize the 'PHONE_NUMBER' column and replace missing values using Snowpark for Python. You have already created a Snowpark DataFrame called 'customer_df' representing the 'CUSTOMER_DATA' table. Which of the following approaches, used in combination, would be MOST efficient and reliable for both cleaning the existing data and handling future data ingestion, given the need for scalability?

A.Use a UDF (User-Defined Function) written in Python that formats the phone numbers based on a regular expression, and apply it to the DataFrame. For NULL values, replace them with a default value of 'UNKNOWN'.

B.Use a series of built-in methods on the Snowpark DataFrame to handle NULL values and different phone number formats directly within the DataFrame operations.

C.Create a Snowflake Stored Procedure in SQL that uses regular expressions and 'CASE' statements to format the 'PHONE_NUMBER' column and replace NULL values. Call this stored procedure from a Snowpark Python script.

D.Leverage Snowflake's data masking policies to mask any invalid phone number and create a view that replaces NULL values with 'UNKNOWN'. This approach doesn't correct existing data but hides the issue.

E.Create a Snowflake Pipe with a COPY INTO statement and a transformation that uses a SQL function within the COPY INTO statement to format the phone numbers and replace NULL values during data loading. Also, implement a Python UDF for correcting already existing data.
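
The formatting logic that options A and E describe wrapping in a Python UDF can be sketched in plain Python as follows. This is a minimal illustration, not a definitive implementation: the function name `normalize_phone`, the `NNN-NNN-NNNN` target format, the assumption of 10-digit North American numbers, and the choice to map unparseable values to 'UNKNOWN' as well as NULLs are all assumptions that do not come from the question.

```python
import re
from typing import Optional

# Default used for missing values, as described in the question.
DEFAULT_PHONE = "UNKNOWN"


def normalize_phone(raw: Optional[str]) -> str:
    """Return the number as NNN-NNN-NNNN, or DEFAULT_PHONE if it is unusable."""
    if raw is None:
        return DEFAULT_PHONE
    # Keep digits only: drops hyphens, parentheses, spaces, dots and '+'.
    digits = re.sub(r"\D", "", raw)
    # Assumption: an 11-digit value starting with 1 carries a US country code.
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]
    # Assumption: anything that is not 10 digits is treated like a missing value.
    if len(digits) != 10:
        return DEFAULT_PHONE
    return f"{digits[:3]}-{digits[3:6]}-{digits[6:]}"


print(normalize_phone("(555) 123-4567"))
print(normalize_phone("+1 555.123.4567"))
print(normalize_phone(None))
```

Keeping the logic in a pure function like this makes it easy to unit test before registering it as a UDF.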

Question 97

You have a Snowflake table 'PRODUCT_PRICES' with columns 'PRODUCT_ID' (INTEGER) and 'PRICE' (VARCHAR). The 'PRICE' column sometimes contains values like '10.50 USD', '20.00 EUR', or 'Invalid Price'. You need to convert the 'PRICE' column to a NUMERIC(10,2) data type, removing currency symbols and handling invalid price strings by replacing them with NULL. Considering both data preparation and feature engineering, which combination of Snowpark SQL and Python code snippets achieves this accurately and efficiently, preparing the data for further analysis?

A.Option A

B.Option B

C.Option C

D.Option D

E.Option E
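
The cleaning rule the question describes (strip the currency code, keep two decimal places, turn unparseable strings into NULL) can be sketched in plain Python. This is an illustrative sketch only: the helper name `parse_price` and the regular expression are assumptions, `None` stands in for SQL NULL, and the sketch does not enforce the 10-digit precision limit of NUMERIC(10,2).

```python
import re
from decimal import Decimal, ROUND_HALF_UP
from typing import Optional

# Assumption: a price is a non-negative number optionally followed by a
# currency code of up to three letters, e.g. "10.50 USD".
_PRICE_PATTERN = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*[A-Za-z]{0,3}\s*$")


def parse_price(raw: Optional[str]) -> Optional[Decimal]:
    """Return the price rounded to two decimals, or None for invalid input."""
    if raw is None:
        return None
    match = _PRICE_PATTERN.match(raw)
    if match is None:
        # Strings such as "Invalid Price" do not match and become None.
        return None
    return Decimal(match.group(1)).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


for value in ["10.50 USD", "20.00 EUR", "Invalid Price"]:
    print(value, "->", parse_price(value))
```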

Question 98

You are using the Snowflake Python connector from within a Jupyter Notebook running in VS Code to train a model. You have a Snowflake table named 'CUSTOMER_DATA' with columns 'ID', 'FEATURE_1', 'FEATURE_2', and 'TARGET'. You want to efficiently load the data into a Pandas DataFrame for model training, minimizing memory usage. Which of the following code snippets is the MOST efficient way to achieve this, assuming you only need 'FEATURE_1', 'FEATURE_2', and 'TARGET' columns?
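
One common pattern for this kind of load, shown here as a hedged sketch rather than as the answer the question intends, is to select only the needed columns and let the connector build the DataFrame through the cursor's `fetch_pandas_all()` method. The helper name `load_columns` is hypothetical, and the cursor is assumed to come from an already open `snowflake.connector` connection with pandas support installed.

```python
from typing import Any, Sequence


def load_columns(cursor: Any, table: str, columns: Sequence[str]) -> Any:
    """Run a column-restricted SELECT and return the result as a pandas DataFrame.

    `cursor` is assumed to be a Snowflake connector cursor. The table and
    column names are interpolated directly, so they must come from trusted code.
    """
    column_list = ", ".join(columns)
    # Selecting only the needed columns avoids transferring 'ID' at all.
    cursor.execute(f"SELECT {column_list} FROM {table}")
    # fetch_pandas_all() returns the whole result set as one DataFrame.
    return cursor.fetch_pandas_all()
```

With a real connection this might be called as `load_columns(conn.cursor(), "CUSTOMER_DATA", ["FEATURE_1", "FEATURE_2", "TARGET"])`, where `conn` is a hypothetical open connection.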

Question 99

You're building a model to predict whether a user will click on an ad (binary classification: click or no-click) using Snowflake. The data is structured and includes features like user demographics, ad characteristics, and past user interactions. You've trained a logistic regression model using SNOWFLAKE.ML and are now evaluating its performance. You notice that while the overall accuracy is high (around 95%), the model performs poorly at predicting clicks (low recall for the 'click' class). Which of the following steps could you take to diagnose the issue and improve the model's ability to predict clicks, and how would you implement them using Snowflake SQL? SELECT ALL THAT APPLY.

A.Generate a confusion matrix using SQL to visualize the model's performance across both classes. Example SQL:

B.Calculate precision, recall, F1-score, and AUC for the 'click' class using SQL queries to get a more detailed understanding of the model's performance on the minority class. Example:

C.Implement feature engineering by creating interaction terms or polynomial features from existing features using SQL, to capture potentially non-linear relationships between features and the target variable. Example:

D.Increase the complexity of the model by switching to a non-linear algorithm like Random Forest or Gradient Boosting without performing hyperparameter tuning, as more complex models always perform better.

E.Reduce the amount of training data to avoid overfitting. Overfitting is known to produce low recall for the 'click' class.
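
The situation in the question, high accuracy alongside low recall for the 'click' class, is typical of imbalanced data and can be reproduced with a small Python sketch. The data below is made up purely for illustration (5 clicks in 100 rows), and the function name `binary_metrics` is hypothetical; the question itself asks for SQL, so this only illustrates the arithmetic behind options A and B.

```python
from typing import Dict, Sequence


def binary_metrics(actual: Sequence[int], predicted: Sequence[int]) -> Dict[str, float]:
    """Compute confusion-matrix counts and metrics for the positive class (1)."""
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    total = tp + fn + fp + tn
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    accuracy = (tp + tn) / total if total else 0.0
    return {"tp": tp, "fn": fn, "fp": fp, "tn": tn,
            "precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Made-up example: 5 clicks, 95 non-clicks; the model finds only 1 click
# and raises 1 false alarm.
actual = [1] * 5 + [0] * 95
predicted = [1] + [0] * 4 + [1] + [0] * 94
print(binary_metrics(actual, predicted))
```

Here accuracy is 0.95 while recall for the click class is only 0.2, which is why per-class metrics are more informative than accuracy alone on imbalanced data.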

Question 100

You have a binary classification model deployed in Snowflake to predict customer churn. The model outputs a probability score between 0 and 1. You've calculated the following confusion matrix on a holdout set:

| | Predicted Positive | Predicted Negative |
|---|---|---|
| Actual Positive | 80 | 20 |
| Actual Negative | 10 | 90 |

What are the Precision, Recall, and Accuracy for this model, and what do these metrics tell you about the model's performance? The counts for each condition (True Positive, True Negative, False Positive, False Negative) are obtained with a SELECT statement.

A.Precision = 0.89, Recall = 0.80, Accuracy = 0.85. The model has good overall performance with balanced precision and recall.

B.Precision = 0.80, Recall = 0.89, Accuracy = 0.85. The model is slightly better at identifying true positives than avoiding false positives.

C.Precision = 0.89, Recall = 0.80, Accuracy = 0.85. The model is slightly better at avoiding false positives than identifying true positives.

D.Precision = 0.90, Recall = 0.80, Accuracy = 0.80. The model has good overall performance but needs to be adjusted to improve the false negative rate.

E.Precision = 0.80, Recall = 0.90, Accuracy = 0.90. The model is performing poorly, with a high rate of both false positives and false negatives.
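
The three metrics follow directly from the four cells of the confusion matrix in the question. A minimal worked calculation in Python, using the standard definitions (the function name `metrics_from_counts` is hypothetical):

```python
from typing import Tuple


def metrics_from_counts(tp: int, fn: int, fp: int, tn: int) -> Tuple[float, float, float]:
    """Return (precision, recall, accuracy) from confusion-matrix counts."""
    precision = tp / (tp + fp)               # of predicted positives, how many are right
    recall = tp / (tp + fn)                  # of actual positives, how many are found
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    return precision, recall, accuracy


# Counts taken from the question: TP=80, FN=20, FP=10, TN=90.
precision, recall, accuracy = metrics_from_counts(tp=80, fn=20, fp=10, tn=90)
print(f"precision={precision:.2f} recall={recall:.2f} accuracy={accuracy:.2f}")
# prints: precision=0.89 recall=0.80 accuracy=0.85
```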
