Types of Checkpoints - Parallel Transporter

Teradata® Parallel Transporter User Guide

Product
Parallel Transporter
Release Number
17.10
Published
February 2022
Language
English (United States)
Last Update
2022-02-04
dita:mapPath
kyx1608578396289.ditamap
dita:ditavalPath
tvt1507315030722.ditaval
dita:id
B035-2445
lifecycle
previous
Product Category
Teradata Tools and Utilities

The following table describes the various types of checkpoints taken in Teradata PT jobs.

Checkpoint Types and Functions 
Checkpoint Type Function
Basic (default) If the tbuild command does not specify a checkpoint interval, the job automatically takes just two checkpoints during each job step that has consumer and producer operators:
  • a Start-of-Data checkpoint
  • an End-of-Data checkpoint
These two checkpoints allow Teradata PT jobs to restart automatically without requiring user intervention, if the interruption was caused by a database server restart or deadlock situation. The default checkpoints can also be used for a manual restart. In either case, the job will restart after the last checkpoint written to the files.
  • If the End-of-Data checkpoint was taken, the work accomplished between these two checkpoints will not have to be repeated by the restarted job.
  • If the job failed before the End-of-Data checkpoint was taken whatever work was accomplished after the Start-of-Data checkpoint was taken will have to be repeated by the restarted job.

This default checkpoint protection against redoing work is quite minimal, so Teradata recommends the use of interval checkpointing.

Interval Checkpointing When you specify a checkpoint interval for a Teradata PT job using the tbuild -z command, the job takes a checkpoint for each specified interval (in seconds), for each job step with producer and consumer operators.

If a job with interval checkpointing fails to run to completion and is later restarted, the only work that will have to be performed over again is the work done after the last checkpoint was taken. This option offers increased fault tolerance for long running jobs containing a substantial amount of data to be loaded/exported.

The valid checkpoint interval values are between 0 and 86400.

Direct Command A Teradata PT job can also be directed to take a checkpoint at any time through the twbcmd command, either explicitly with the JOB CHECKPOINT command option, or implicitly with the JOB PAUSE command option, which suspends job execution after the checkpoint is taken.

For more information, see “Teradata PT Utility Commands” in Teradata® Parallel Transporter Reference, B035-2436.

Operator Initiated The DataConnector operator automatically initiates a checkpoint after processing all the input files found during an interval-driven scan of a directory.