Question 126

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You create an Azure Machine Learning service datastore in a workspace. The datastore contains the following files:
* /data/2018/Q1 .csv
* /data/2018/Q2.csv
* /data/2018/Q3.csv
* /data/2018/Q4.csv
* /data/2019/Q1.csv
All files store data in the following format:
id,f1,f2,l
1,1,2,0
2,1,1,1
3.2.1.0
You run the following code:

You need to create a dataset named training_data and load the data from all files into a single data frame by using the following code:

Solution: Run the following code:

Does the solution meet the goal?
  • Question 127

    You are performing feature scaling by using the scikit-learn Python library for x.1 x2, and x3 features.
    Original and scaled data is shown in the following image.

    Use the drop-down menus to select the answer choice that answers each question based on the information presented in the graphic.
    NOTE: Each correct selection is worth one point.

    Question 128

    You create a pipeline in designer to train a model that predicts automobile prices.
    Because of non-linear relationships in the data, the pipeline calculates the natural log (Ln) of the prices in the training data, trains a model to predict this natural log of price value, and then calculates the exponential of the scored label to get the predicted price.
    The training pipeline is shown in the exhibit. (Click the
    Training pipeline

    You create a real-time inference pipeline from the training pipeline, as shown in the exhibit. (Click the Real-time pipeline tab.) Real-time pipeline

    You need to modify the inference pipeline to ensure that the web service returns the exponential of the scored label as the predicted automobile price and that client applications are not required to include a price value in the input values.
    Which three modifications must you make to the inference pipeline? Each correct answer presents part of the solution.
    NOTE: Each correct selection is worth one point.
  • Question 129

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You are analyzing a numerical dataset which contains missing values in several columns.
    You must clean the missing values using an appropriate operation without affecting the dimensionality of the feature set.
    You need to analyze a full dataset to include all values.
    Solution: Remove the entire column that contains the missing data point.
    Does the solution meet the goal?
  • Question 130

    You are using a decision tree algorithm. You have trained a model that generalizes well at a tree depth equal to
    10.
    You need to select the bias and variance properties of the model with varying tree depth values.
    Which properties should you select for each tree depth? To answer, select the appropriate options in the answer area.