You have designed a model predicting the goals scored for each player in the World Cup and now want to evaluate the model using held-out scoring data. Which node is designed to produce this actual vs. predicted model evaluation?
You are working on a project where the business objective is to increase customer retention. You havecompleted the Data Preparation stage of the CRISP-DM process model. What is the next stage?
An e-retailer conducting a data mining project has limited an initial study to approximately 30,000 customers who have registered on the site. There are still millions of records in the Web logs. The data miner wants to determine the frequency distribution of the age of their customers. The Age column is acontinuous field. Which node would you use to accomplish this task?
You need to produce a variety of data files that will be viewed in external applications outside of IBM SPSS Modeler Professional, such as IBM SPSS Statistics or Microsoft Excel. Which palette tab would be used in this scenario?
You have a poorly performing risk model and are looking for strategies to improve performance. You know that only about one percent of your cases represent risk, and you have over 1 million cases to use for training purposes.
What is the correct approach to test for improving performance?
You have collected data about a set of patients, all of whom suffered from the same illness. During their course of treatment, each patient responded to one of five medications. The column. Drug, is a character field that describes the medication. You need to find out which proportion of the patients responded to each drug. Which node should be used?
You want to obtain a subset of data from a larger data set, with equally represented subgroups within thesubset. Which node would you use to accomplish this task?