16.20 - REGEXP_REPLACE - Advanced SQL Engine - Teradata Database

Teradata Vantage™ - SQL Functions, Expressions, and Predicates

Product
Advanced SQL Engine
Teradata Database
Release Number
16.20
Published
March 2019
Language
English (United States)
Last Update
2020-03-25
dita:mapPath
xzf1512079057909.ditamap
dita:ditavalPath
TD_DBS_16_20_Update1.ditaval
dita:id
kby1472250656485

Purpose

Replaces portions of source_string that match regexp_string with the replace_string.

REGEXP_REPLACE supports 2, 3, 4, 5, or 6 parameters.

Syntax



TD_SYSFNLIB.
Name of the database where the function is located.
source_string
A character argument.
If source_string is NULL, NULL is returned.
regexp_string
A character argument.
If regexp_string is NULL, NULL is returned.
replace_string
A character argument.
If a replace_string is not specified, is NULL or is an empty string, the matches are removed from the result.
The maximum backreference number in a replace_string is 9 (for example, \9). Any backreference in the replace_string that is higher than 9 is considered a backreference.
position_arg
A numeric argument.
position_arg specifies the position in source_string from which to start searching (default is 1).
If position_arg is greater than the input string length, NULL is returned.
If position_arg is NULL, the value NULL is used. If position_arg is not specified, the default (1) is used.
occurrence_arg
A numeric argument.
Specifies the occurrence to replace the match with replace_string.
  • If a value of 0 is specified, all occurrences are replaced.
  • If the value is greater than 1, the search begins for the second occurrence beginning with the first character following the first occurrence of the regexp_string, and so on.
If occurrence_arg is greater than the number of matches found, nothing is replaced and source_string is returned.
If occurrence_arg is NULL, a NULL result is returned. If occurrence_arg is omitted, 0 is the default value.
match_arg

A character argument.

The argument can contain more than one character. If a character in the argument is not valid, then that character is ignored.
If match_arg is not specified, is NULL, or is empty:
  • The match is case-sensitive.
  • A period does not match the newline character.
  • source_string is treated as a single line.
  • 'i' = case-insensitive matching.
  • 'c' = case-sensitive matching.
  • 'n' = the period character (match any character) can match the newline character.
  • 'm' = source_string is treated as multiple lines instead of as a single line. With this option, the '^' and '$' characters apply to each line in source_string instead of the entire source_string.
  • 'l' = For a CLOB data type, if source_string exceeds the current maximum allowed source_string size (currently 16 MB), a NULL is returned instead of an error. This is useful for long-running queries where you do not want long strings causing an error that would make the query fail. You can only specify this option for a CLOB data type.
  • 'x' = ignore whitespace.

REGEXP_REPLACE supports 2, 3, 4, 5, or 6 parameters.

Limitation: NULL inside Input Strings

REGEXP_REPLACE has a limitation in handling ASCII Chr(0), which is NULL. If you concatenate two strings using ASCII Chr(0), REGEXP_REPLACE cannot handle that input string. For example, consider the following code:
sel TD_SYSFNLIB.RegExp_Replace ('a'||chr(0)||'bc', '[b]', 'X', 1, 0) AS 
regex_rep_input_string;
The function is supposed to replace the b in the input string 'a'||chr(0)||'bc' with X. The result should be a Xc. However, because of the limitation, the result is a.