The Snowflake SnowPro Advanced Data Scientist validates the ability to apply data science and machine learning techniques within the Snowflake environment. it focuses on using Snowflake's features to prepare data for modeling and integrate with external ML tools for advanced analytics. Professionals with the symbol SNOW_SADS are experts in leveraging Snowflake to drive intelligent business decisions through data.
---------- Question 1
A data scientist needs to interpret the predictions of a complex machine learning model to understand which features are most influential in its decisions for specific instances. Which model explainability technique is designed to show the marginal effect of a feature on the predicted outcome, averaged over all other features?
- Confusion Matrix
- Area Under the Curve (AUC)
- Partial Dependence Plots (PDP)
- Root Mean Squared Error (RMSE)
---------- Question 2
A data scientist has deployed a classification model to predict customer churn. After deployment, they observe that the model frequently predicts non-churn for customers who actually churn, leading to missed intervention opportunities. Which metric primarily indicates this issue and should be prioritized for improvement?
- Accuracy
- Precision
- F1-Score
- Recall
---------- Question 3
During the development of a predictive model in Snowflake, a data scientist finds that the model's performance is highly sensitive to parameters like learning rate and regularization strength. Which technique is essential to systematically search for the optimal combination of these parameters to maximize model performance?
- Cross-validation
- Down-sampling
- Hyperparameter tuning
- Feature scaling
---------- Question 4
Your team wants to implement a semantic search capability on customer feedback text stored in Snowflake. Which Snowflake Cortex function is most relevant for converting textual data into numerical vectors to measure semantic similarity?
- COMPLETE
- SENTIMENT
- EMBED_TEXT
- SUMMARIZE
---------- Question 5
A data science team wants to use Snowflake Cortex to analyze customer reviews stored in a Snowflake table to understand the overall sentiment towards their products. Which Snowflake Cortex capability is most appropriate for this task?
- Vector Embedding for similarity search.
- Prompt Engineering for custom model interactions.
- Task-specific models for sentiment analysis.
- Fine-tuning a base LLM for domain-specific language.
---------- Question 6
A data scientist wants to automate a data transformation step in their Snowpark ML pipeline that involves complex, custom logic. This logic needs to be applied to a DataFrame column by column in a scalable manner, directly within Snowflake. Which Snowpark construct is best suited for this operation?
- External function
- Python User-Defined Function UDF
- SQL Stored Procedure
- Dynamic Table
---------- Question 7
A deployed machine learning model for predicting customer churn starts showing degraded performance in production, even though the input data schema has not changed. What two key concepts related to model effectiveness and retraining should the data scientist investigate to diagnose and address this issue?
- Model explainability and feature impact
- Hyperparameter tuning and cross-validation
- Data drift and model decay
- A/B testing and ensemble methods
---------- Question 8
When performing exploratory data analysis (EDA) on a Snowflake table containing sales transactions, a data scientist wants to calculate the moving average of sales over the past seven days for each product. Which Snowflake native statistical function and SQL construct should be utilized?
- AVG function with GROUP BY clause
- SUM function with a subquery
- Window function AVG OVER PARTITION BY ORDER BY ROWS BETWEEN
- COUNT function with a HAVING clause
---------- Question 9
To effectively manage and track changes to different versions of a machine learning model, including associated training data and performance metrics, throughout its lifecycle within Snowflake, which feature provides robust support for metadata tagging and versioning?
- Snowflake Dynamic Tables
- Python User-Defined Functions (UDFs)
- Snowflake Model Registry
- Snowflake External Stages
---------- Question 10
After deploying several versions of a machine learning model in Snowflake, a data scientist needs a centralized system to log model artifacts, track performance metrics across versions, and retrieve specific model versions for rollback or A/B testing. Which Snowflake feature is designed to provide these capabilities?
- Snowflake Stages
- Dynamic Tables
- Snowflake Model Registry
- Streamlit in Snowflake
Are they useful?
Click here to get 390 more questions to pass this certification at the first try! Explanation for each answer is included!
Follow the below LINKEDIN channel to stay updated about 89+ exams!

Comments
Post a Comment