Teradata Package for Python Function Reference on VantageCloud Lake - __init__ - Teradata Package for Python - Look here for syntax, methods and examples for the functions included in the Teradata Package for Python.
Teradata® Package for Python Function Reference on VantageCloud Lake
- Deployment
- VantageCloud
- Edition
- Lake
- Product
- Teradata Package for Python
- Release Number
- 20.00.00.08
- Published
- November 2025
- ft:locale
- en-US
- ft:lastEdition
- 2025-12-05
- dita:id
- TeradataPython_FxRef_Lake_2000
- Product Category
- Teradata Vantage
- teradataml.automl.autodataprep.AutoDataPrep.__init__ = __init__(self, task_type='Default', verbose=0, **kwargs)
- DESCRIPTION:
AutoDataPrep simplifies the data preparation process by automating the different aspects of
data cleaning and transformation, enabling seamless exploration, transformation, and optimization of datasets.
PARAMETERS:
task_type:
Optional Argument.
Specifies the task type for AutoDataPrep, whether to apply regression OR classification
on the provided dataset. If user wants AutoDataPrep() to decide the task type automatically,
then it should be set to "Default".
Default Value: "Default"
Permitted Values: "Regression", "Classification", "Default"
Types: str
verbose:
Optional Argument.
Specifies the detailed execution steps based on verbose level.
Default Value: 0
Permitted Values:
* 0: prints the progress bar.
* 1: prints the execution steps.
* 2: prints the intermediate data between the execution of each step.
Types: int
**kwargs:
Specifies the additional arguments for AutoDataPrep. Below
are the additional arguments:
custom_config_file:
Optional Argument.
Specifies the path of JSON file in case of custom run.
Types: str
volatile:
Optional Argument.
Specifies whether to put the interim results of the
functions in a volatile table or not. When set to
True, results are stored in a volatile table,
otherwise not.
Default Value: False
Types: bool
persist:
Optional Argument.
Specifies whether to persist the interim results of the
functions in a table or not. When set to True,
results are persisted in a table; otherwise,
results are garbage collected at the end of the
session.
Default Value: False
Types: bool
enable_lasso:
Optional Argument.
Specifies whether to use lasso regression for feature selection.
By default, only RFE and PCA are used for feature selection.
Default Value: False
Types: bool
RETURNS:
Instance of AutoDataPrep.
RAISES:
TeradataMlException, TypeError, ValueError
EXAMPLES:
# Notes:
# 1. Get the connection to Vantage to execute the function.
# 2. One must import the required functions mentioned in
# the example from teradataml.
# 3. Function raises error if not supported on the Vantage
# user is connected to.
# Load the example data.
>>> load_example_data("teradataml", "titanic")
# Create teradataml DataFrames.
>>> titanic = DataFrame.from_table("titanic")
# Example 1: Run AutoDataPrep for classification problem.
# Scenario: Titanic dataset is used to predict the survival of passengers.
# Create an instance of AutoDataPrep.
>>> aprep_obj = AutoDataPrep(task_type="Classification", verbose=2)
# Fit the data.
>>> aprep_obj.fit(titanic, titanic.survived)
# Retrieve the data after Auto Data Preparation.
>>> datas = aprep_obj.get_data()