TD_GetFutileColumns | GetFutileColumns | Teradata Vantage - TD_GetFutileColumns

TD_GetFutileColumns | GetFutileColumns | Teradata Vantage - TD_GetFutileColumns - Teradata Vantage

Teradata® VantageCloud Lake

Deployment

VantageCloud

Edition

Lake

Product

Teradata Vantage

Published

January 2023

Language

English (United States)

Last Update

2024-04-03

dita:mapPath

phg1621910019905.ditamap

dita:ditavalPath

pny1626732985837.ditaval

dita:id

phg1621910019905

Data cleaning involves identifying and handling various types of issues in a dataset to ensure that it is accurate, complete, and consistent. One common issue is the presence of futile columns. These columns contain data that is not useful for the analysis or modeling process. This can include constant columns that contain the same value for all the rows in the dataset, unique identifier columns that do not provide any meaningful insights, redundant columns that provide the same information as other columns, or text columns that contain irrelevant or unstructured data. Removing these columns can help to simplify the analysis process, reduce the computational cost, and improve the accuracy of the analysis.

Removing futile columns is an important step in data cleaning. It helps to ensure the dataset only contains relevant and useful information. By removing these columns, analysts can:

Focus their attention on the columns that contain the most useful information
Avoid wasting time and resources on data that does not contribute to the analysis or modeling process
Analyze and draw meaningful insights quicker

TD_GetFutileColumns function returns the futile column names if any of these conditions is met:

If all values in the columns are unique
If all the values in the columns are the same
If the count of distinct values in the columns divided by the count of the total number of rows in the input table is greater than or equal to the specified threshold value

Function Information