Estimating Space Requirements - Parallel Data Pump

Teradata® Parallel Data Pump Reference

Product
Parallel Data Pump
Release Number
16.20
Published
September 2019
Language
English (United States)
Last Update
2019-10-11
dita:id
B035-3021
Product Category
Teradata Tools and Utilities

This section discusses space requirements for the Teradata TPump log table.

A row of approximately 200 bytes is written to the log table on each of the following events.
  • One row is written at initialization.
  • One row is written for each SQL statement issued through the Teradata TPump support environment.
  • One row is written at the BEGIN LOAD command.
  • One row is written at the END LOAD command.
  • Two rows are written for each IMPORT command.
  • One row is written for each statement used in a load (between the BEGIN LOAD command and the END LOAD command).
  • One row is written for each checkpoint taken.
  • In ROBUST mode, a number of partial checkpoint rows are written to the log for each packed request between checkpoints. These rows are deleted each time a checkpoint is written.

Each partial checkpoint row contains 117 + (12 * packfactor) bytes per transaction. The number of partial checkpoint rows varies, depending on the checkpoint frequency, the power of the loading host, and the power of the target Teradata Database.
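To illustrate how the partial checkpoint row size grows with the pack factor, the following minimal sketch evaluates 117 + (12 * packfactor) for a few values. The function name `partial_checkpoint_row_bytes` and the sample pack factors are hypothetical choices made for illustration; they are not part of Teradata TPump.

```python
def partial_checkpoint_row_bytes(packfactor: int) -> int:
    """Size in bytes of one partial checkpoint row: 117 + (12 * packfactor)."""
    if packfactor < 1:
        raise ValueError("packfactor must be at least 1")
    return 117 + 12 * packfactor


# Illustrative pack factors only; use the value from your own job.
for packfactor in (1, 20, 100):
    print(packfactor, partial_checkpoint_row_bytes(packfactor))
```

With these sample values, the row size ranges from 129 bytes at a pack factor of 1 to 1317 bytes at a pack factor of 100.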

Thus, an equation for the space is:

200 + 200 * the number of statements in the support environment + 400 * the number of BEGIN/END LOAD pairs + 200 * the number of statements issued as DML + 200 * the estimated number of checkpoints + (117 + (12 * packfactor)) * the number of partial checkpoints

A simplified version would be:

R = 200 + 200S + 400L + 200D + 200C + (117 + 12P)N,

where the following is true:

      R = Required space, in bytes, for the Teradata TPump log table

      S = Number of SQL statements issued through the support environment

      L = Number of BEGIN/END LOAD command pairs

      D = Number of DML statements

      C = Estimated number of checkpoints

      P = Packfactor

      N = Number of partial checkpoints
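
As a worked example, the simplified equation can be sketched as a small Python function. This is only an estimate under the assumptions stated in this section. The function and parameter names are hypothetical, and the input values in the usage lines are made up for illustration. The sketch follows the equation exactly as given, so it does not add a separate term for the rows written by IMPORT commands.

```python
def estimate_log_table_bytes(
    support_statements: int,   # S: SQL statements issued through the support environment
    load_pairs: int,           # L: BEGIN/END LOAD command pairs
    dml_statements: int,       # D: DML statements
    checkpoints: int,          # C: estimated number of checkpoints
    packfactor: int,           # P: pack factor
    partial_checkpoints: int,  # N: number of partial checkpoints (ROBUST mode)
) -> int:
    """Return R = 200 + 200S + 400L + 200D + 200C + (117 + 12P)N, in bytes."""
    row_bytes = 200                                  # approximate size of one log row
    partial_row_bytes = 117 + 12 * packfactor        # size of one partial checkpoint row
    return (
        row_bytes                                    # initialization row
        + row_bytes * support_statements
        + 2 * row_bytes * load_pairs                 # one row each for BEGIN LOAD and END LOAD
        + row_bytes * dml_statements
        + row_bytes * checkpoints
        + partial_row_bytes * partial_checkpoints
    )


# Hypothetical job: 5 support statements, 1 load, 3 DML statements,
# 10 checkpoints, pack factor 20, and 100 partial checkpoints.
required = estimate_log_table_bytes(5, 1, 3, 10, 20, 100)
print(required)  # 39900
```

In this hypothetical job, the partial checkpoint term (357 bytes * 100 = 35700 bytes) accounts for most of the estimate, which is why N and P deserve the most care when sizing the log table for ROBUST mode.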