Hardware Fault Tolerance - Advanced SQL Engine - Teradata Database

Database Introduction

Product
Advanced SQL Engine
Teradata Database
Release Number
17.05
17.00
Published
June 2020
Language
English (United States)
Last Update
2021-01-23
dita:mapPath
qia1556235689628.ditamap
dita:ditavalPath
lze1555437562152.ditaval
dita:id
B035-1091
lifecycle
previous
Product Category
Teradata Vantageā„¢

Teradata Database provides the following facilities for hardware fault tolerance.

Facility Description
Multiple BYNETs Multinode Teradata Database servers are equipped with at least two BYNETs. Interprocessor traffic is never stopped unless all BYNETs fail. Within a BYNET, traffic can often be rerouted around failed components.
RAID disk units
  • Teradata Database servers use Redundant Arrays of Independent Disks (RAIDs) configured for use as RAID1, RAID5, or RAIDS.

    Non-array storage cannot use RAID technology.

  • RAID1 arrays offer mirroring, the method of maintaining identical copies of data.
  • RAID5 or RAIDS protects data from single-disk failures with a 25% increase in disk storage to provide parity.
  • RAID1 provides better performance and data protection than RAID5/RAIDS, but is more expensive.
Multiple client-server connections In a client-server environment, multiple connections between mainframe and workstation-based clients ensure that most processing continues even if one or several connections between the clients and server are not working.

Vproc migration is a software feature supporting this hardware issue.

Isolation from client hardware defects In a client-server environment, a server is isolated from many client hardware defects and can continue processing in spite of such defects.
Power supplies and fans Each cabinet in a configuration has redundant power supplies and fans to ensure fail-safe operation.
Hot swap capability for node components Teradata Database can allow some components to be removed and replaced while the system is running. This process is known as hot swap. Teradata Database offers hot swap capability for the following:
  • Disks within RAID arrays
  • Fans
  • Power supplies
Cliques
  • A clique is a group of nodes sharing access to the same disk arrays. The nodes and disks are interconnected through FC buses and each node can communicate directly to all disks. This architecture provides and balances data availability in the case of a node failure.
  • A clique supports the migration of vprocs following a node failure. If a node in a clique fails, then its vprocs migrate to another node in the clique and continue to operate while recovery occurs on their home node. Migration minimizes the performance impact on the system.
  • PEs that manage TPA-hosted physical channel connections cannot migrate because they depend on the hardware that is physically attached to the assigned node.
  • PEs for workstation-attached connections do migrate when a node failure occurs, as do all AMP vprocs.
  • To ensure maximum fault tolerance, no more than one node in a clique is placed in the same cabinet. Usually the battery backup feature makes this precaution unnecessary, but if you want maximum fault tolerance, then plan your cliques so the nodes are never in the same cabinet.