CALL td_analyze ( 'linear', 'required_parameter_list [ optional_parameter; [...] ]' );
- required_parameter_list
database = input_database_name; tablename = input_table_name; columns = { all | column_name [,...] }; dependent = column_name;
- optional_parameter
{ backward = { true | false } | backwardonly = { true | false } | columnstoexclude = column_name [,...] | conditionindexthreshold = threshold | constant = { true | false } | enter = entry_value | forward = { true | false } | forwardonly = { true | false } | groupby = column_name [,...] | matrixinput = { true | false } | neardependencyreport = { true | false } | outputdatabase = output_database_name | outputtablename = output_table_name | overwrite = { true | false } | remove = removal_value | statstable = { true | false } | stepwise = { true | false } | usefstat = { true | false } | usepvalue = { true | false } | varianceproportionthreshold = threshold }
Syntax Elements
- database
- The database containing the input table.
- tablename
- The input table from which to build a predictive model.
- columns
- The columns to analyze.
keyword Description all All columns. allnumeric All numeric columns. - dependent
- The input table column that represents the dependent variable.
- backward
- [Optional] Whether to start with all independent variables in the model and do the following until no more independent variables can be removed from the model:
- Take one backward step, removing the independent variable that worst explains the variance of the dependent variable (the variable that meets the criterion specified by remove).
- Take one forward step, adding the independent variable that best explains the variance of the dependent variable (the variable that meets the criterion specified by enter).
- backwardonly
- [Optional] Like backward without the forward step.
- columnstoexclude
- [Optional] The columns to exclude when columns specifies a keyword.
- conditionindexthreshold
- [Optional] One of two thresholds for neardependencyreport.
- constant
- [Optional] Whether the linear model includes a constant term.
- enter
- [Optional] The criterion to enter a variable into the model.
Condition Entry Criterion usefstat=true or more than one variable has P-value zero.
Partial F-statistic must be greater than entry_value. Default entry_value: 3.84
usepvalue=true (Ignored if more than one variable has P-value zero.)
TStatistic P-value (ratio of B coefficient of variable to its standard error) must be less than entry_value. Default entry_value: 0.05
- forward
- [Optional] Whether to start with no independent variables in the model and do the following until no more independent variables can be added to the model:
- Take one forward step, adding the independent variable that best explains the variance of the dependent variable (the variable that meets the criterion specified by enter).
- Take one backward step, removing the independent variable that worst explains the variance of the dependent variable (the variable that meets the criterion specified by remove).
- forwardonly
- [Optional] Like forward without the backward step.
- groupby
- [Optional] The input table columns for which to separately analyze each value or combination of values.Do not use the following names for groupby columns. These names are reserved for use by the CALCMATRIX table operator.
- c
- rowname
- rownum
- s
- matrixinput
- [Optional] Whether the input table located by database and tablename represents an ESSCP matrix built by the Matrix Building function and saved to a table.If the input table represents a saved matrix and you do not specify matrixinput=true, the function may interpret the matrix an ordinary table, causing unpredictable results.
- neardependencyreport
- [Optional] Whether to output an XML report showing columns that may be collinear and store it in the XML output table if all these conditions are true:
- You specify outputdatabase and outputtablename.
- The thresholds conditionindexthreshold and varianceproportionthresholdspecify are crossed.
- The function detects collinearity.
- outputdatabase
- [Optional] The database that contains the output table that represents one or more linear models.
- outputtablename
- [Optional] The name of the output table representing one or more linear models (see groupby).
- overwrite
- [Optional] Whether to drop the output tables before creating new ones.
- remove
- [Optional] The criterion to remove a variable from the model.
Condition Entry Criterion usefstat=true Partial F-statistic must be less than removal_value. Default removal_value: 3.84
usepvalue=true TStatistic P-value must be greater than removal_value. Default removal_value: 0.05
- statstable
- [Optional] Whether to include a data quality report in the XML output string. The report includes the mean and standard deviation of each model variable, derived from an ESSCP matrix.
- stepwise
- [Optional] Whether to perform the stepwise procedure (forward, forwardonly, backward, or backwardonly).
- usefstat
- [Optional] Whether to use the partial F-Statistic to decide whether to add or remove a variable.
- usepvalue
- [Optional] Whether to use the T-Statistic P-value to decide whether to add or remove a variable.
- varianceproportionthreshold
- [Optional] One of two thresholds for neardependencyreport.