Question 186

You are solving a classification task.
The dataset is imbalanced.
You need to select an Azure Machine Learning Studio module to improve the classification accuracy.
Which module should you use?
  • Question 187

    You train a classification model by using a decision tree algorithm.
    You create an estimator by running the following Python code. The variable feature_names is a list of all feature names, and class_names is a list of all class names.
    from interpret.ext.blackbox import TabularExplainer

    You need to explain the predictions made by the model for all classes by determining the importance of all features.
    For each of the following statements, select Yes if the statement is true. Otherwise, select No.
    NOTE: Each correct selection is worth one point.

    Question 188

    You plan to explore demographic data for home ownership in various cities. The data is in a CSV file with the following format:
    age,city,income,home_owner
    21,Chicago,50000,0
    35,Seattle,120000,1
    23,Seattle,65000,0
    45,Seattle,130000,1
    18,Chicago,48000,0
    You need to run an experiment in your Azure Machine Learning workspace to explore the data and log the results. The experiment must log the following information:
    * the number of observations in the dataset
    * a box plot of income by home_owner
    * a dictionary containing the city names and the average income for each city You need to use the appropriate logging methods of the experiment's run object to log the required information.
    How should you complete the code? To answer, drag the appropriate code segments to the correct locations. Each code segment may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
    NOTE: Each correct selection is worth one point.

    Question 189

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You create a model to forecast weather conditions based on historical data.
    You need to create a pipeline that runs a processing script to load data from a datastore and pass the processed data to a machine learning model training script.
    Solution: Run the following code:

    Does the solution meet the goal?
  • Question 190

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You are using Azure Machine learning Studio to perform feature engineering on a dataset. You need to normalize values to produce a feature column grouped into bins.
    Solution: Apply an Entropy Minimum Description Length (MDI) binning mode.
    Does the solution meet the goal?