Argument | Category | Description |
---|---|---|
TextColumn | Required | Specifies the name of the input table column that contains the XML documents. The function skips malformed XML documents. |
DocIDColumns | Required | Specifies the names of the input table columns that contain the identifiers of the XML documents. No docid_column can have the same name as an output table column. For output column names, refer to Output. |
MaxDepth | Optional | Specifies the maximum depth in the XML tree at which to process XML documents. The MaxDepth and Output arguments determine the schema of the output table, and the number of columns in the output table must not exceed 1600. The default value is 5. |
ExcludeElements | Optional | Specifies the paths to the nodes to exclude from processing. The function excludes each specified node and its child nodes. Examples of paths to nodes are:
|
AttributeAsNode | Optional | Specifies whether to treat the attributes of a node as its child nodes. The default value is 'false' (attributes of a node are stored in one element of the output tuple). |
AttributeDelimiter | Optional | Specifies the delimiter used to separate multiple attributes of one node in XML documents. The default value is a comma ','. |
Output | Optional | Specifies the output table schema (refer to Example 1: Output Three Different Output Table Schemas). The MaxDepth and Output arguments determine the schema of the output table, and the number of columns in the output table must not exceed 1600. The default value is 'fullpath'. |
ErrorHandler | Optional | Specifies whether the function handles errors that occur when parsing an XML document. The default value is 'false' (the function aborts and throws an exception). If you specify 'true':
|
Accumulate | Optional | Specifies the names of input column names to copy to the output table. No accumulate_column can have the same name as an output table column. For output column names, refer to Output. |