Statistics analysis helps determine the characteristics of numeric data, assess its quality, and find outlying values and other anomalies.
You can apply Statistics analysis to columns of any numeric type, including date types.
Basic Statistics
Given a table name and the names of one or more numeric columns, Statistics analysis provides the following for each column:
- Count (number of rows with values) *
- Minimum value *
- Maximum value *
- Mean *
- Standard deviation *
- Skewness
- Kurtosis
- Standard error
- Coefficient of variance
- Variance
- Sum
- Uncorrected sum of squares
- Corrected sum of squares
* Selected by default.
For all but count, minimum and maximum, Statistics analysis first converts DATE columns to the number of days since 1900.
Extended Statistics
You can request the following additional statistics:
| Statistic | Description |
|---|---|
| modes | Modal (most frequently occurring value in column). |
| quantiles | Bottom 10 percentiles, deciles, quartiles, tertiles, and top 10 percentiles of values in column. |
| values |
Values analysis provides the same values (see Values). |
| rank | The five smallest values in column, the five largest values in column, and the number of times that each value appears in the column. |