The generated Python script or notebook may not run on Vantage as is. The pyspark2teradataml utility function handles most of the conversion, but in some cases the generated script or notebook requires additional manual changes. Review the generated HTML report to identify those changes.
PySpark script or Jupyter notebook input
The HTML report has two panes.
- Color-coded bell icons that represent unique alerts are located next to lines in the original PySpark script that require attention.
- No bell icons are shown in the teradatamlspk script, which represents the generated output.
- Black bell icon: Notifications. Alerts under this category require no action from the user; they are informational only.
- Blue bell icon: Partially supported APIs. Alerts under this category may need minor changes to run the corresponding API on Vantage.
- Red bell icon: Unsupported APIs. Alerts under this category require an alternative implementation since the corresponding APIs are not supported.
- Green tick: Alerts under this category signify the file was successfully converted and can be run as is on Vantage.
- Bug icon: Alerts under this category signify the conversion failed, and the teradatamlspk script was not generated.
- File icon: Alerts under this category signify the input file is empty, and the teradatamlspk script was not generated.
- Differences between PySpark and teradatamlspk for that API.
- Examples demonstrating the differences.
- Required user action to run the corresponding API with teradatamlspk.
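The bell icon categories above can be treated as a simple triage order when working through a report by hand. The following is a minimal sketch, not part of teradatamlspk: the `Alert` enum, the `ACTION` mapping, and the `triage` helper are hypothetical names introduced only for illustration, and the line numbers are made up. It assumes that red alerts should be handled before blue ones and that black alerts can be skipped.

```python
from enum import Enum


class Alert(Enum):
    """Line-level alert categories shown as bell icons in the report."""
    NOTIFICATION = "black"         # informational only
    PARTIALLY_SUPPORTED = "blue"   # may need minor changes
    UNSUPPORTED = "red"            # needs an alternative implementation


# Hypothetical mapping from category to the user action described in the text.
ACTION = {
    Alert.NOTIFICATION: "none",
    Alert.PARTIALLY_SUPPORTED: "minor change",
    Alert.UNSUPPORTED: "alternative implementation",
}


def triage(alerts):
    """Return (line, action) pairs for alerts needing manual work, most severe first."""
    severity = {Alert.UNSUPPORTED: 0, Alert.PARTIALLY_SUPPORTED: 1}
    # Notifications require no action, so they are dropped here.
    actionable = [(line, kind) for line, kind in alerts if kind in severity]
    actionable.sort(key=lambda item: (severity[item[1]], item[0]))
    return [(line, ACTION[kind]) for line, kind in actionable]


# Alerts transcribed by hand from a report (illustrative line numbers).
found = [
    (12, Alert.NOTIFICATION),
    (30, Alert.PARTIALLY_SUPPORTED),
    (7, Alert.UNSUPPORTED),
]
todo = triage(found)
print(todo)
```

Working through the resulting list top to bottom addresses the unsupported APIs first, since those need the most rework.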
Directory input
The index file has two panes.
The left pane displays a list of all scripts and notebooks in the provided directory, showing their full paths. Files that were not processed are highlighted in red and preceded by a bug report icon. Select a filename to open the corresponding file report.
- Total number of files converted
- Total number of files not converted
- Total number of empty files
- Total number of files processed
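The relationship between these totals can be sketched as follows. This is an illustration only and does not read the actual index file: the status names and the `summarize` helper are hypothetical, and the sketch assumes that every processed file ends up in exactly one of the three states (converted, not converted, or empty).

```python
from collections import Counter

# Hypothetical per-file statuses, named after the three outcomes listed above.
CONVERTED, NOT_CONVERTED, EMPTY = "converted", "not_converted", "empty"


def summarize(statuses):
    """Build the four summary totals from a {path: status} mapping."""
    counts = Counter(statuses.values())
    return {
        "converted": counts[CONVERTED],
        "not_converted": counts[NOT_CONVERTED],
        "empty": counts[EMPTY],
        # Assumption: every file in the mapping was processed.
        "processed": len(statuses),
    }


# Illustrative directory contents (paths are made up).
files = {
    "/tmp/project/etl.py": CONVERTED,
    "/tmp/project/train.ipynb": CONVERTED,
    "/tmp/project/broken.py": NOT_CONVERTED,
    "/tmp/project/placeholder.py": EMPTY,
}
summary = summarize(files)
print(summary)
```

Under that assumption, the first three totals add up to the number of files processed, which gives a quick consistency check when reading the summary.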
Examples for pyspark2teradataml
Examples can be found in the attachment list associated with this guide. In the left pane, select attachments and download examples_pyspark2teradataml_migration.zip.