Description - Parallel Data Pump

Teradata Parallel Data Pump Reference

Product: Parallel Data Pump
Release Number: 16.10
Published: May 2017
Language: English (United States)
Last Update: 2018-05-22
dita:mapPath: tqa1488824663174.ditamap
dita:ditavalPath: Audience_PDF_include.ditaval
dita:id: B035-3021
lifecycle: previous
Product Category: Teradata Tools and Utilities

Teradata TPump is a data loading utility that maintains the data in Teradata Database through update, delete, insert, and atomic upsert operations. Teradata TPump makes near-real-time data available in the data warehouse.

Teradata TPump uses standard Teradata SQL to achieve moderate to high data loading rates to Teradata Database. Multiple sessions and multistatement requests are typically used to increase throughput.
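
The idea of combining multistatement requests with multiple sessions can be illustrated with a small sketch. This is not how Teradata TPump builds or dispatches requests internally; the function names, the pack factor, and the round-robin assignment are all assumptions made for illustration only.

```python
from itertools import islice
from typing import Iterable, Iterator, List


def pack_statements(statements: Iterable[str], pack: int) -> Iterator[str]:
    """Group single SQL statements into requests of up to `pack` statements.

    `pack` is a hypothetical parameter name used only in this sketch.
    """
    if pack < 1:
        raise ValueError("pack must be at least 1")
    it = iter(statements)
    while True:
        group = list(islice(it, pack))
        if not group:
            return
        # One request carries several statements, separated by semicolons.
        yield "; ".join(group) + ";"


def assign_to_sessions(requests: Iterable[str], sessions: int) -> List[List[str]]:
    """Distribute requests round-robin over `sessions` queues (an assumed policy)."""
    if sessions < 1:
        raise ValueError("sessions must be at least 1")
    queues: List[List[str]] = [[] for _ in range(sessions)]
    for i, request in enumerate(requests):
        queues[i % sessions].append(request)
    return queues


if __name__ == "__main__":
    stmts = [f"INSERT INTO t VALUES ({i})" for i in range(5)]
    for n, queue in enumerate(assign_to_sessions(pack_statements(stmts, 2), 2)):
        print(f"session {n}: {queue}")
```

Packing reduces the number of round trips per row, and spreading the packed requests over several sessions lets them be processed concurrently; both effects are what the paragraph above refers to as increasing throughput.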

Teradata TPump provides an alternative to Teradata MultiLoad for low-volume batch maintenance of large databases under the control of a Teradata system. Instead of updating Teradata Database overnight or in batches throughout the day, Teradata TPump updates information in near real time, acquiring data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse rather than through traditional batch updates. Continuous updates result in more accurate, timely data.

Unlike most load utilities, Teradata TPump uses row hash locks rather than table-level locks, so queries can run while Teradata TPump is running. It also means that Teradata TPump can be stopped instantaneously.
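
The difference in lock granularity can be pictured with a toy model. The sketch below is a deliberate simplification and not Teradata's lock manager: the hash function, the `NUM_HASH_VALUES` constant, and the `RowHashLocks` class are hypothetical, and only a single write-lock mode is modeled.

```python
NUM_HASH_VALUES = 1024  # hypothetical size of the hash space, for illustration only


def row_hash(primary_index_value: int) -> int:
    """Toy stand-in for a row hash; the real hashing algorithm is not modeled."""
    return primary_index_value % NUM_HASH_VALUES


class RowHashLocks:
    """Tracks write locks per row hash instead of one lock for the whole table."""

    def __init__(self) -> None:
        self._write_locked = set()

    def lock_for_write(self, primary_index_value: int) -> None:
        self._write_locked.add(row_hash(primary_index_value))

    def release(self, primary_index_value: int) -> None:
        self._write_locked.discard(row_hash(primary_index_value))

    def can_read(self, primary_index_value: int) -> bool:
        # A reader is blocked only if it needs a row whose hash is write-locked.
        return row_hash(primary_index_value) not in self._write_locked


if __name__ == "__main__":
    locks = RowHashLocks()
    locks.lock_for_write(7)       # the loader is updating the row with key 7
    print(locks.can_read(7))      # same row hash: the reader must wait
    print(locks.can_read(8))      # different row hash: the reader proceeds
```

With a table-level lock, every reader of the table would have to wait in this situation. In the toy model, only readers that touch a locked row hash are affected, and any other row that happens to share that hash is covered by the same lock.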

Teradata TPump provides a dynamic throttling feature that lets it run “all out” during batch windows but within limits when it might affect other business uses of Teradata Database. Operators can specify the number of statements run per minute and can alter the throttling minute by minute if necessary.
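
A statements-per-minute limit that can be changed while a job runs might be sketched as follows. This is a minimal fixed-window throttle written for illustration, not Teradata TPump's actual mechanism; the `StatementThrottle` class, its method names, and the use of `None` for "unthrottled" are assumptions.

```python
import time
from typing import Callable, Optional


class StatementThrottle:
    """Caps the number of statements sent per one-minute window.

    A rate of None means "run all out" in this sketch.
    """

    def __init__(self, per_minute: Optional[int],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock            # injectable so the sketch can be tested
        self._window_start = clock()
        self._sent = 0
        self.per_minute = per_minute

    def set_rate(self, per_minute: Optional[int]) -> None:
        """Change the limit while the job is running."""
        self.per_minute = per_minute

    def try_send(self) -> bool:
        """Return True if one more statement may be sent now."""
        now = self._clock()
        if now - self._window_start >= 60.0:
            # A new one-minute window begins; reset the counter.
            self._window_start = now
            self._sent = 0
        if self.per_minute is not None and self._sent >= self.per_minute:
            return False
        self._sent += 1
        return True


if __name__ == "__main__":
    throttle = StatementThrottle(per_minute=2)
    print([throttle.try_send() for _ in range(3)])  # third call is refused
    throttle.set_rate(None)                         # batch window: no limit
    print(throttle.try_send())
```

A caller that receives `False` would wait and retry later. Raising or removing the limit through `set_rate` takes effect on the next call, which mirrors the idea of an operator adjusting the throttle during a run.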

Teradata TPump’s main attributes are:

  • Simple, hassle-free setup – does not require staging of data, intermediary files, or special hardware.
  • Efficient, time-saving operation – jobs can continue running in spite of database restarts, dirty data, and network slowdowns. Jobs restart without intervention.
  • Flexible data management – accepts a wide variety of data formats from many kinds of data sources, including direct feeds from other databases. Teradata TPump can also transform that data on the fly before sending it to Teradata Database. SQL statements and conditional logic can be used within the utility, making it unnecessary to write wrapper jobs around it.

Full tape support is not available for any Teradata TPump function on network-attached client systems. To import data from tape, write a custom access module that interfaces with the tape device. For information about writing a custom access module, see the Teradata Tools and Utilities Access Module Programmer Guide (B035-2424).