Question 6

Which of the following could be features?
  • Question 7

    Consider the following two cases for movie ratings:
    1. You recommend a four-star movie to a user, but he really doesn't like it and would rate it two stars.
    2. You recommend a three-star movie, but the user loves it and would rate it five stars.
    Which statement correctly applies?
  • Question 8

    RMSE is a good measure of accuracy, but only to compare forecasting errors of different models for a ______, as it is scale-dependent.
  • Question 9

    If E1 and E2 are two events, how do you represent the conditional probability that E2 occurs given that E1 has occurred?
  • Question 10

    You are building a text classification model for a book written by HadoopExam Learning Resources, to determine whether the book is about Hadoop or cloud computing. To cut down the size of the feature space, you use the mutual information of each word with the label (hadoop or cloud) to select the 1000 best features as input to a Naive Bayes model. When you compare the performance of a model built with the 250 best features to a model built with the 1000 best features, you notice that the model with only 250 features performs slightly better on the test data.
    What would help you choose better features for your model?
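
For reference, the selection step described in Question 10 (ranking words by their mutual information with the label and keeping the top k) can be sketched roughly as follows. This is only an illustration under stated assumptions: the toy corpus, the label strings, and the helper names `mutual_information` and `top_k_features` are hypothetical and do not come from the question, and each document is simplified to a set of words (presence or absence only).

```python
import math
from collections import Counter


def mutual_information(docs, labels, word):
    """Mutual information (in bits) between presence of `word` and the label.

    `docs` is a list of sets of tokens; `labels` is a parallel list of labels.
    """
    n = len(docs)
    # Joint counts of (word present?, label) and the two marginal counts.
    joint = Counter((word in doc, label) for doc, label in zip(docs, labels))
    present = Counter(word in doc for doc in docs)
    label_counts = Counter(labels)

    mi = 0.0
    for (has_word, label), count in joint.items():
        p_joint = count / n
        p_word = present[has_word] / n
        p_label = label_counts[label] / n
        mi += p_joint * math.log2(p_joint / (p_word * p_label))
    return mi


def top_k_features(docs, labels, k):
    """Return the k words with the highest mutual information with the label."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(
        vocab,
        key=lambda w: mutual_information(docs, labels, w),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy corpus: two "hadoop" documents and two "cloud" documents.
docs = [
    {"hadoop", "mapreduce", "cluster"},
    {"hadoop", "hdfs", "cluster"},
    {"cloud", "aws", "cluster"},
    {"cloud", "azure", "cluster"},
]
labels = ["hadoop", "hadoop", "cloud", "cloud"]

best_two = top_k_features(docs, labels, 2)
```

In this toy corpus, "cluster" appears in every document, so it carries no information about the label, while "hadoop" and "cloud" each separate the two classes perfectly. Note that this score is computed for each word independently of the other selected words.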