1.0 - 8.00 - Multiple-Node Data Sets - Teradata Vantage

Teradata® Vantage Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
1.0
8.00
Release Date
May 2019
Content Type
Programming Reference
Publication ID
B700-4003-098K
Language
English (United States)

DistributionMatchReduce version 1.6, DistributionMatchMultiInput version 1.3

SELECT * FROM DistributionMatchReduce (
  ON DistributionMatchMultiInput (
    ON (SELECT RANK() OVER (PARTITION BY col [,...] 
      ORDER BY column) AS rank, *
      FROM input_table 
      WHERE column IS NOT NULL
    ) AS input PARTITION BY ANY
    ON (SELECT col [,...], COUNT(*) AS group_size
      FROM input_table 
      WHERE column IS NOT NULL
      GROUP BY col [,...] 
    ) AS groupstats DIMENSION
    USING
    ValueColumn ('value_column')
    [ Tests ('test' [,...]) ]
    Distributions ('distribution:parameter' [,...])
  [ GroupByColumns
    ({ 'group_by_column' | group_by_column_range }[,...]) ]
    [ MinGroupSize (minGroupSize) ]
    [ NumCell (cell_size) ]
  ) AS alias_1 PARTITION BY col [,...] 
) AS alias_2;

If your input table already includes a rank column, replace this clause:

ON (SELECT RANK()...

with this clause:

ON SELECT * FROM input_table .