TPump TMSM Integration - Parallel Data Pump

Teradata® Parallel Data Pump Reference

Product
Parallel Data Pump
Release Number
16.20
Published
September 2019
Language
English (United States)
Last Update
2019-10-11
dita:mapPath
dmq1512702641516.ditamap
dita:ditavalPath
Audience_PDF_include.ditaval
dita:id
B035-3021
lifecycle
previous
Product Category
Teradata Tools and Utilities

Teradata Multi-System Manager (TMSM) is the monitoring and control facility for a variety of Dual Active Solutions offered by Teradata. The expected users of this facility are administrators of the Enterprise Data Warehouse (EDW) such as Teradata Database administrators, ETL maintenance staff, systems administrators, or anyone who needs to monitor and control processes that include, but are not limited to, Teradata Load and Unload Utilities, Teradata SQL, ETL tools, and Teradata Database.

To integrate with TMSM, TPump has been enhanced to collect operational metadata and event data, obtain the Unit of Work ID (UOW ID) from TMSM for a job, and send such data to TMSM for monitoring purposes using the “send event” interface as described in the TMSM Event System API Reference. By default, a TPump job will send events to TMSM as long as TMSM is active. If TMSM is not active, the job will continue to run, except that no events will be sent to TMSM.

Simple ETL process monitoring requires the tracking of the “start” and “end” of the process, which can include multiple “steps”, each of which represents an activity or event of the process to be monitored. In terms of TPump, a load job can be regarded as such a process.

TPump follows the flow required by TMSM:

  1. Obtain a system-generated UOW ID from TMSM for a TPump job.
  2. Send a “start” event to TMSM along with the UOW ID.
  3. Optionally send one or more “step” events to TMSM along with the UOW ID.
  4. Send an “end” event to TMSM along with the UOW ID.

Below is an example of how TPump job containing two BEGIN EXPORT/END LOAD tasks and two IMPORT tasks sends event messages to TMSM in order for the job to be monitored:

At job start: "Connecting session(s)"
Task 1 Import 1: Loading begins
Task 1 Import 1: Checkpoint completes
Task 1 Import 1: Loading completes
Task 1 Import 2: Loading begins
Task 1 Import 2: Checkpoint completes
Task 1 Import 2: Loading completes
Task 2 Import 1: Loading begins
Task 2 Import 1: Checkpoint completes
Task 2 Import 1: Loading completes
Task 2 Import 2: Loading begins
Task 2 Import 2: Checkpoint completes
Task 2 Import 2: Loading completes
At job end: "Job terminating”

“Task n” refers to multiple BEGIN LOAD/END LOAD tasks, “Import n” refers to multiple IMPORTs. TPump sends number of rows inserted/updated/deleted while loading completes.

For detailed information on setting up the TMSM environment, refer to the Teradata Multi-System Manager Installation Guide (B035-3202).