Once a logistic regression model has been built, it can be used to “score” new data, that is, to estimate the value of the dependent variable in the model using data for which its value may not be known. Scoring is performed using the values of the b-coefficients in the logistic regression model and the names of the independent variable column names they correspond to. This information resides in the results tables stored in the database by Analytics Library. Other information needed includes the table name in which the data resides, the new table to be created, and primary index information for the new table.
- A new table containing primary index columns
- The probability that the dependent variable is 1 (representing the response value) rather than 0 (representing the non-response value)
- Optionally, an estimate of the dependent variable, either 0 or 1, based on a user-specified threshold value
You can achieve different results based on the threshold value applied to the probability. See Model Evaluation to determine what this threshold value should be.