Row Partitioning | Optimizer Process | Teradata Vantage - Row Partitioning - Advanced SQL Engine

SQL Request and Transaction Processing

Product

Advanced SQL Engine

Teradata Database

Release Number

17.10

Published

July 2021

Language

English (United States)

Last Update

2021-07-28

dita:id

B035-1142

Product Category

Teradata Vantage™

About Row Partitioning

A table or join index uses row partitioning to assign rows to row partitions on the AMPs based on the row partitioning expressions used to define the object. A row partition consists of zero or more rows of a table or join index that have the same value for the partitioning expressions defined for the database object.

The primary advantage of row partitioning is row partition elimination, which helps optimize queries made on row-partitioned database objects.

See the following topics for more information about row partitioning:

Consumption of Disk Space by Populated and Empty Partitions

With the large number of partitions that can be defined for a table or join index, it is very likely that a high percentage of those partitions are empty at any given time. For example, a table on a 200-AMP system that defines 100,000 combined partitions, with 100 rows per data block and 100 data blocks per combined partition per AMP, would hold 200 billion rows if every combined partition were populated. This is a relatively small number of combined partitions when you consider that the maximum for a table or join index is 9,223,372,036,854,775,807 combined partitions.

If each row were 100 bytes long, the primary data alone would consume 20 terabytes of disk, that is, 20 × 10^12 bytes. It is highly unlikely that every combined partition would be populated. When you consider a multidimensional use of multilevel partitioning, it is easy to see that not all combinations of dimension values actually occur.
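The sizing arithmetic for the fully populated case can be checked with a short calculation. The following is a minimal sketch that uses only the figures from the example; the constant and function names are illustrative and are not part of any Teradata interface.

```python
# Figures from the example in the text; all names here are illustrative only.
AMPS = 200
COMBINED_PARTITIONS = 100_000
DATA_BLOCKS_PER_PARTITION_PER_AMP = 100
ROWS_PER_DATA_BLOCK = 100
ROW_BYTES = 100


def fully_populated_rows() -> int:
    """Row count if every combined partition is populated on every AMP."""
    return (
        AMPS
        * COMBINED_PARTITIONS
        * DATA_BLOCKS_PER_PARTITION_PER_AMP
        * ROWS_PER_DATA_BLOCK
    )


def fully_populated_bytes() -> int:
    """Primary data size in bytes for the fully populated case."""
    return fully_populated_rows() * ROW_BYTES


if __name__ == "__main__":
    print(f"rows:  {fully_populated_rows():,}")
    print(f"bytes: {fully_populated_bytes():,}")
```

The calculation counts primary row data only and ignores row headers, block overhead, and any fallback or index storage.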

In general, a combined partition should have either many data blocks per AMP or none. For the example in the first paragraph, if there are actually only 200 gigabytes of data and each populated combined partition has 100 data blocks per AMP, about 99% of the combined partitions are empty.
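The 99% figure can be reproduced from the numbers in the example. The sketch below assumes decimal gigabytes (10^9 bytes) and assumes that every populated combined partition holds its 100 data blocks on each of the 200 AMPs; the function and parameter names are hypothetical.

```python
def empty_partition_fraction(
    actual_bytes: int,
    amps: int,
    combined_partitions: int,
    blocks_per_partition_per_amp: int,
    rows_per_block: int,
    row_bytes: int,
) -> tuple[int, float]:
    """Return (populated partition count, fraction of partitions that are empty).

    Assumes each populated combined partition is present on every AMP with
    the same number of data blocks, as in the example in the text.
    """
    bytes_per_populated_partition = (
        amps * blocks_per_partition_per_amp * rows_per_block * row_bytes
    )
    # Round up so that a partially filled partition still counts as populated.
    populated = -(-actual_bytes // bytes_per_populated_partition)
    populated = min(populated, combined_partitions)
    empty_fraction = (combined_partitions - populated) / combined_partitions
    return populated, empty_fraction


if __name__ == "__main__":
    populated, empty = empty_partition_fraction(
        actual_bytes=200 * 10**9,  # 200 gigabytes, decimal units assumed
        amps=200,
        combined_partitions=100_000,
        blocks_per_partition_per_amp=100,
        rows_per_block=100,
        row_bytes=100,
    )
    print(f"populated combined partitions: {populated:,}")
    print(f"empty fraction: {empty:.2%}")
```

Under these assumptions each populated combined partition holds 200 megabytes across the system, so 200 gigabytes fills 1,000 of the 100,000 combined partitions.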