Hashing - Teradata Vantage - Analytics Database

Database Introduction

Deployment
VantageCloud
VantageCore
Edition
VMware
Enterprise
IntelliFlex
Product
Analytics Database
Teradata Vantage
Release Number
17.20
Published
June 2022
ft:locale
en-US
ft:lastEdition
2023-09-27
dita:mapPath
gtm1628096154303.ditamap
dita:ditavalPath
qkf1628213546010.ditaval
dita:id
dsm1472253642401
lifecycle
latest
Product Category
Teradata Vantage™

Because the Vantage architecture is massively parallel, it requires an efficient method for distributing and retrieving its data—hashing.

Vantage uses hashing to distribute data for tables with a PI to disk storage and uses indexes to access the data. Most indexes are based on row hash values rather than table column values.

For PIs, Vantage obtains a row hash by hashing the values of the PI columns. The row hash and a sequence number, which is assigned to distinguish between rows with the same row hash within a table, are called a row identifier, and uniquely identify each row in a table. The row identifiers for partitioned tables include a partition identifier. For more information on partitioned tables, see Partitioned Tables.

For SIs, Vantage computes a hash value using the hash of the values of the SI columns. Vantage uses this value when the SQL specifies an SI value. The SI subtable records the hash value for the SI, the actual value of the index columns (for synonym resolution), and a list of primary index row identifiers for the table being indexed.