Question 46

You are working as a data scientist for a healthcare company. They decide to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended stepsto develop a PySpark application in Oracle Cloud Infrastructure (OCI) Data Science?
  • Question 47

    Which OCI Data Science interaction method can function without the need of scripting?
  • Question 48

    As a data scientist, you create models for cancer prediction based on mammographic images. The correct identification is very crucial in this case. After evaluating two models, you arrive at the following confusion matrix. Which model would you prefer and why?
    * Model 1 has Test accuracy is 80% and recall is 70%
    * Model 2 has Test accuracy is 75% and recall is 85%
  • Question 49

    Which of the following analytical and statistical techniques do data scientists commonly use?
  • Question 50

    The Oracle AutoML pipeline automates hyperparameter tuning by training the model with different parameters in parallel. You have created an instance of Oracle AutoML as oracle_automl and now you want an output with all the different trials performed by Oracle AutoML. Which of the following commands gives you the results of all trials?