Teradata Warehouse Miner User Guide - Volume 1 Introduction and Profiling

  1. Agrawal, R., Mannila, H., Srikant, R., Toivonen, H. and Verkamo, I., Fast Discovery of Association Rules. In Advances in Knowledge Discovery and Data Mining, 1996, eds. U.M. Fayyad, G. Piatetsky-Shapiro, P. Smyth and R. Uthurusamy. Menlo Park, AAAI Press/The MIT Press.
  2. Agresti, A. (1990) Categorical Data Analysis. Wiley, New York.
  3. Arabie, P., Hubert, L., and DeSoete, G., Clustering and Classification, World Scientific, 1996.
  4. Belsley, D.A., Kuh, E., and Welsch, R.E. (1980) Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. Wiley, New York.
  5. Bradley, P., Fayyad, U. and Reina, C., Scaling EM Clustering to Large Databases, Microsoft Research Technical Report MSR-TR-98-35, 1998.
  6. Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. Classification and Regression Trees. Wadsworth, Belmont, 1984.
  7. Conover, W.J. Practical Nonparametric Statistics, 3rd Edition.
  8. Cox, D.R. and Hinkley, D.V. (1974) Theoretical Statistics. Chapman & Hall/CRC, New York.
  9. D'Agostino, R.B. (1971) An omnibus test of normality for moderate and large size samples, Biometrika, 58, 341-348.
  10. D'Agostino, R. B. and Stephens, M. A., eds. Goodness-of-fit Techniques, 1986. New York: Dekker.
  11. D’Agostino, R., Belanger, A., and D’Agostino, R. Jr., A Suggestion for Using Powerful and Informative Tests of Normality, American Statistician, 1990, Vol. 44, No. 4.
  12. Finn, J.D. (1974) A General Model for Multivariate Analysis. Holt, Rinehart and Winston, New York.
  13. Harman, H.H. (1976) Modern Factor Analysis. University of Chicago Press, Chicago.
  14. Harter, H.L. and Owen, D.B., eds, Selected Tables in Mathematical Statistics, Vol. 1. Providence, Rhode Island: American Mathematical Society.
  15. Hosmer, D.W. and Lemeshow, S. (1989) Applied Logistic Regression. Wiley, New York.
  16. Jennrich, R.I., and Sampson, P.F. (1966) Rotation For Simple Loadings. Psychometrika, Vol. 31, No. 3.
  17. Johnson, R.A. and Wichern, D.W. (1998) Applied Multivariate Statistical Analysis, 4th Edition. Prentice Hall, New Jersey.
  18. Kachigan, S.K. (1991) Multivariate Statistical Analysis. Radius Press, New York.
  19. Kaiser, Henry F. (1958) The Varimax Criterion For Analytic Rotation In Factor Analysis. Psychometrika, Vol. 23, No. 3.
  20. Kass, G.V. (1980) An Exploratory Technique for Investigating Large Quantities of Categorical Data. Applied Statistics, Vol. 29, No. 2, pp. 119-127.
  21. Kaufman, L. and Rousseeuw, P., Finding Groups in Data, J Wiley & Sons, 1990.
  22. Kennedy, W.J. and Gentle, J.E. (1980) Statistical Computing. Marcel Dekker, New York.
  23. Kleinbaum, D.G. and Kupper, L.L. (1978) Applied Regression Analysis and Other Multivariable Methods. Duxbury Press, North Scituate, Massachusetts.
  24. Maddala, G.S. (1983) Limited-Dependent and Qualitative Variables In Econometrics. Cambridge University Press, Cambridge, United Kingdom.
  25. Maindonald, J.H. (1984) Statistical Computation. Wiley, New York.
  26. McCullagh, P.M. and Nelder, J.A. (1989) Generalized Linear Models, 2nd Edition. Chapman & Hall/CRC, New York.
  27. McLachlan, G.J. and Krishnan, T., The EM Algorithm and Extensions, J Wiley & Sons, 1997.
  28. Menard, S. (1995) Applied Logistic Regression Analysis. Sage, Thousand Oaks.
  29. Mulaik, S.A. (1972) The Foundations of Factor Analysis. McGraw-Hill, New York.
  30. Neter, J., Kutner, M.H., Nachtsheim, C.J., and Wasserman, W. (1996) Applied Linear Statistical Models, 4th Edition. WCB/McGraw-Hill, New York.
  31. NIST/SEMATECH e-Handbook of Statistical Methods, 2005.
  32. Nocedal, J. and Wright, S.J. (1999) Numerical Optimization. Springer-Verlag, New York.
  33. Orchestrate/OSH Component User’s Guide Vol II, Analytics Library, Chapter 2: Introduction to Data Mining. Torrent Systems, Inc., 1997.
  34. Ordonez, C. and Cereghini, P. (2000) SQLEM: Fast Clustering in SQL using the EM Algorithm. SIGMOD Conference 2000: 559-570.
  35. Ordonez, C. (2004): Programming the K-means clustering algorithm in SQL. KDD 2004: 823-828.
  36. Ordonez, C. (2004): Horizontal aggregations for building tabular data sets. DMKD 2004: 35-42.
  37. Pagano, M. and Gauvreau, K., Principles of Biostatistics, 2nd Edition.
  38. Peduzzi, P.N., Hardy, R.J., and Holford, T.R. (1980) A Stepwise Variable Selection Procedure for Nonlinear Regression Models. Biometrics 36, 511-516.
  39. Pregibon, D. (1981) Logistic Regression Diagnostics. Annals of Statistics, Vol. 9, No. 4, 705-724.
  40. PROPHET StatGuide, BBN Corporation, 1996.
  41. Quinlan, J.R. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, 1993.
  42. Roweis, S. and Ghahramani, Z., A Unifying Review of Linear Gaussian Models, Neural Computation, 1999.
  43. Royston, JP., An Extension of Shapiro and Wilk’s W Test for Normality to Large Samples, Applied Statistics, 1982, 31, No. 2, pp. 115-124.
  44. Royston, JP., Algorithm AS 177: Expected normal order statistics (exact and approximate), 1982, Applied Statistics, 31, 161-165.
  45. Royston, JP., Algorithm AS 181: The W Test for Normality, 1982, Applied Statistics, 31, 176-180.
  46. Royston, JP., A Remark on Algorithm AS 181: The W Test for Normality, 1995, Applied Statistics, 44, 547-551.
  47. Rubin, Donald B., and Thayer, Dorothy T. (1982) EM Algorithms For ML Factor Analysis. Psychometrika, Vol. 47, No. 1.
  48. Shapiro, SS and Francia, RS (1972). An approximate analysis of variance test for normality, Journal of the American Statistical Association, 67, 215-216.
  49. SPSS 7.5 Statistical Algorithms Manual, SPSS Inc., Chicago.
  50. SYSTAT 9: Statistics I. (1999) SPSS Inc., Chicago.
  51. Takahashi, T. (2005) Getting Started: International Character Sets and the Teradata Database, Teradata Corporation, 541-0004068-C02.
  52. Tatsuoka, M.M. (1971) Multivariate Analysis: Techniques For Educational and Psychological Research. Wiley, New York.
  53. Tatsuoka, M.M. (1974) Selected Topics in Advanced Statistics, Classification Procedures, Institute for Personality and Ability Testing, 1974.
  54. Teradata Database SQL Functions, Operators, Expressions, and Predicates Release, B035-1145.
  55. Teradata Warehouse Miner Model Manager User Guide, B035-2303.
  56. Teradata Warehouse Miner Release Definition, B035-2494.
  57. Teradata Warehouse Miner User Guide, Volume 1 Introduction and Profiling, B035-2300.
  58. Teradata Warehouse Miner User Guide, Volume 2 ADS Generation, B035-2301.
  59. Teradata Warehouse Miner User Guide, Volume 3 Analytic Functions, B035-2302.
  60. Wendorf, Craig A., MANUALS FOR UNIVARIATE AND MULTIVARIATE STATISTICS © 1997, Revised 2004-03-12, 2005.
  61. Wilkinson, L., Blank, G., and Gruber, C. (1996) Desktop Data Analysis With SYSTAT. Prentice Hall, New Jersey.