1.
The most widely used metrics and tools to assess a classification model are:
2.
Which of the following is a good test dataset characteristic?
3.
What is the purpose of performing cross-validation?
4.
Why is second order differencing in time series needed?
5.
Suppose you have trained a logistic regression classifier and it outputs a new example x with a prediction ho(x) = 0.2. This means
6.
You run gradient descent for 15 iterations with a=0.3 and compute J(theta) after each iteration. You find that the value of J(Theta) decreases quickly and then levels off. Based on this, which of the following conclusions seems most plausible?
7.
How can you prevent a clustering algorithm from getting stuck in bad local optima?
8.
Which of the following techniques can be used for normalization in text mining?
9.
What is pca.components_ in Sklearn?
10.
Which of the following is true about Naive Bayes ?