colRegex | teradatamlspk | pyspark2teradataml - colRegex - Teradata Vantage

Teradata® VantageCloud Lake

Deployment
VantageCloud
Edition
Lake
Product
Teradata Vantage
Published
January 2023
ft:lastEdition
2024-12-11

For the colRegex function, PySpark returns results based on Scala or Java regular expressions; teradatamlspk returns results based on Python regular expressions.

The following two examples show that the results differ between PySpark and teradatamlspk.

PySpark

>>> df.select(df.colRegex("`(Col.)`")).show()
+----+----+
|Col1|Col2|
+----+----+
|   a|   1|
|   b|   2|
|   c|   3|
+----+----+

teradatamlspk

>>> df.select(df.colRegex("(Col.)")).show()
+----+----+
|Col1|Col2|
+----+----+
|   c|   3|
|   b|   2|
|   a|   1|
+----+----+
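
To make the Python regex behavior concrete, the following is a minimal sketch that uses only the standard `re` module to pick column names with the same pattern as the teradatamlspk example. The helper name `select_columns_by_regex`, the extra column `Other`, and the use of full-string matching are assumptions made for illustration; this is not teradatamlspk's actual implementation, which may match differently.

```python
import re

def select_columns_by_regex(columns, pattern):
    """Return the column names that fully match a Python regex.

    Hypothetical helper for illustration only; full-string matching
    is an assumption, not documented teradatamlspk behavior.
    """
    compiled = re.compile(pattern)
    return [name for name in columns if compiled.fullmatch(name)]

# "Other" is a made-up column added to show a non-matching name.
columns = ["Col1", "Col2", "Other"]
matched = select_columns_by_regex(columns, "(Col.)")
print(matched)  # ['Col1', 'Col2']
```

Under this assumption, the pattern `(Col.)` selects any name made of `Col` followed by exactly one character, so a name such as `Col10` would not be selected.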