Step 2: Review the HTML Report

Teradata® pyspark2teradataml User Guide

Deployment: VantageCloud, VantageCore
Edition: Enterprise, IntelliFlex, VMware
Product: Teradata Package for Python
Release Number: 20.00
Published: December 2024
Last Edition: 2024-12-18
Product Category: Teradata Vantage

The generated Python script or notebook may or may not run directly on Vantage.

The utility function pyspark2teradataml handles most of the conversion, but in some cases the generated script or notebook requires additional manual changes. Review the generated HTML file to identify those cases.

When the input is a directory containing PySpark scripts, a single HTML file is generated, with a separate notes table for each script.

The generated HTML file lists notes for the script or notebook, each with the line number it applies to. The notes appear in three colors:

  • black: Notes in black do not need any attention.

    These notes give additional information about the APIs used in the script or notebook.

  • blue: Notes in blue need user attention.

    These APIs are available, but their behavior may differ from the PySpark equivalents.

    Each note states the exact difference so that you can manually adjust the calls to those APIs in the script or notebook.

  • red: Notes in red need user attention.

    These APIs have no equivalent functionality in teradatamlspk. You must achieve the same result in some other way.

Act on the blue and red notes in the HTML report before running the generated script or notebook against data in Vantage.
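One way to keep track of the notes while working through the report is a small triage helper like the sketch below. It is only an illustration: the `Note` class, the `ACTION_BY_COLOR` mapping, the `notes_needing_attention` function, and the sample notes are all hypothetical names and data, not part of teradatamlspk, and the sketch assumes you copy the notes out of the report by hand rather than parsing the HTML file.

```python
from dataclasses import dataclass

# Action implied by each note color, following the color descriptions above.
# The action labels themselves are hypothetical, chosen for this sketch.
ACTION_BY_COLOR = {
    "black": "none",     # informational only
    "blue": "adjust",    # API is available but differs from PySpark
    "red": "replace",    # no equivalent functionality in teradatamlspk
}


@dataclass
class Note:
    line: int    # line number in the generated script or notebook
    color: str   # "black", "blue", or "red"
    text: str    # note text copied from the report


def notes_needing_attention(notes):
    """Return the blue and red notes, red first, then ordered by line number."""
    pending = [n for n in notes if ACTION_BY_COLOR[n.color] != "none"]
    return sorted(pending, key=lambda n: (n.color != "red", n.line))


# Hypothetical notes, transcribed by hand from a report.
example_notes = [
    Note(12, "black", "informational note"),
    Note(30, "blue", "behavior differs from PySpark"),
    Note(18, "red", "no equivalent functionality"),
    Note(45, "red", "no equivalent functionality"),
]

for note in notes_needing_attention(example_notes):
    print(f"line {note.line}: {ACTION_BY_COLOR[note.color]} ({note.text})")
```

Listing red notes first is a design choice in this sketch, on the assumption that code with no equivalent functionality usually takes the most rework.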