Designing for Missing Information - Teradata VantageCloud Lake

Lake - Database Reference

Deployment
VantageCloud
Edition
Lake
Product
Teradata VantageCloud Lake
Release Number
Published
February 2025
ft:locale
en-US
ft:lastEdition
2025-11-21
dita:mapPath
ohi1683672393549.ditamap
dita:ditavalPath
pny1626732985837.ditaval
dita:id
ohi1683672393549

How to handle missing information in relational databases is controversial. Independent of the semantic difficulties SQL presents when dealing with nulls, missing information also makes exploratory data analysis far more difficult than the identical data mining operation performed on data free of missing values.

The problems resulting from recording and manipulating missing information are concerned principally with ambiguous, counterintuitive meaning of nulls as reported by database queries.

This section examines the semantic ambiguities with nulls to make designers aware of their potential problems of interpretation. The principal thrust is to make you aware of the inconsistencies in the way SQL handles missing information. Careful use of nulls can provide benefits that cannot otherwise be achieved, but their careless use can present serious problems for the integrity of your databases and the accuracy of the information you retrieve.

The recommendation is to avoid using nulls. Whenever possible, constrain columns to be NOT NULL.

The following content is mainly applicable to the standard nodes of the primary cluster in VantageCloud Lake and may not apply to compute clusters on the primary cluster.

For information about the primary cluster and compute clusters, see Managing Compute Resources.