The generated Python script/notebook may or may not run directly on Vantage.
The utility function pyspark2teradataml handles most of the conversion, but in some instances the generated script/notebook requires additional manual changes. Review the generated HTML file to identify the instances that require manual changes.
When the input is a directory containing PySpark scripts, a single HTML file is generated, which contains a notes table for each script.
The generated HTML file contains notes for the script/notebook, each with its line number. The notes appear in three different colors:
- black: Notes in black do not need any attention.
These notes give the user additional information about the APIs used in the script/notebook.
- blue: Notes in blue need user attention.
These APIs are supported, but their behavior may differ from the PySpark equivalents.
These notes specify the exact differences so that you can manually change the references to those APIs in the script/notebook.
- red: Notes in red need user attention.
These APIs have no equivalent functionality in teradatamlspk. You need to achieve the functionality in some other way.
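
One way to work through the report is to handle the red notes first, then the blue ones, and skip the black ones. The following is a minimal sketch of that triage order. It does not read the generated HTML file or call teradatamlspk; the `Note` type, the `triage` function, and the sample entries are all hypothetical and exist only to illustrate the color scheme described above.

```python
from typing import List, NamedTuple


class Note(NamedTuple):
    """Hypothetical stand-in for one row of the notes table."""
    line: int    # line number in the generated script/notebook
    color: str   # "black", "blue", or "red"
    text: str    # note text (placeholder values below)


# What each color asks of the user, paraphrased from the list above.
ACTION_BY_COLOR = {
    "black": "informational only, no action needed",
    "blue": "review the stated differences and adjust the API reference",
    "red": "no teradatamlspk equivalent, implement another way",
}


def triage(notes: List[Note]) -> List[Note]:
    """Return the notes that need attention: red first, then blue, by line."""
    needs_attention = [n for n in notes if n.color in ("blue", "red")]
    return sorted(needs_attention, key=lambda n: (n.color != "red", n.line))


# Hypothetical sample entries, not taken from a real report.
sample_notes = [
    Note(12, "black", "placeholder informational note"),
    Note(18, "blue", "placeholder note about a behavioral difference"),
    Note(30, "red", "placeholder note about an unsupported API"),
    Note(41, "blue", "placeholder note about a behavioral difference"),
]

for note in triage(sample_notes):
    print(f"line {note.line} [{note.color}]: {ACTION_BY_COLOR[note.color]}")
```

Sorting red before blue is only a suggestion: red notes usually mean rewriting logic, so handling them first shows early how much manual work the conversion needs.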