Question 1

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are using Azure Machine Learning to run an experiment that trains a classification model.
You want to use Hyperdrive to find parameters that optimize the AUC metric for the model. You configure a HyperDriveConfig for the experiment by running the following code:

You plan to use this configuration to run a script that trains a random forest model and then tests it with validation data. The label values for the validation data are stored in a variable named y_test variable, and the predicted probabilities from the model are stored in a variable named y_predicted.
You need to add logging to the script to allow Hyperdrive to optimize hyperparameters for the AUC metric.
Solution: Run the following code:

Does the solution meet the goal?
  • Question 2

    You are creating a machine learning model in Python. The provided dataset contains several numerical columns and one text column. The text column represents a product's category. The product category will always be one of the following:
    Bikes
    Cars
    Vans
    Boats
    You are building a regression model using the scikit-learn Python package.
    You need to transform the text data to be compatible with the scikit-learn Python package.
    How should you complete the code segment? To answer, select the appropriate options in the answer area.
    NOTE: Each correct selection is worth one point.

    Question 3

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You plan to use a Python script to run an Azure Machine Learning experiment. The script creates a reference to the experiment run context, loads data from a file, identifies the set of unique values for the label column, and completes the experiment run:
    from azureml.core import Run
    import pandas as pd
    run = Run.get_context()
    data = pd.read_csv('data.csv')
    label_vals = data['label'].unique()
    # Add code to record metrics here
    run.complete()
    The experiment must record the unique labels in the data as metrics for the run that can be reviewed later.
    You must add code to the script to record the unique label values as run metrics at the point indicated by the comment.
    Solution: Replace the comment with the following code:
    run.upload_file('outputs/labels.csv', './data.csv')
    Does the solution meet the goal?
  • Question 4

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You are using Azure Machine Learning Studio to perform feature engineering on a dataset.
    You need to normalize values to produce a feature column grouped into bins.
    Solution: Apply an Entropy Minimum Description Length (MDL) binning mode.
    Does the solution meet the goal?
  • Question 5


    For each of the following statements, select Yes if the statement is true. Otherwise, select No. NOTE: Each correct selection is worth one point.