NFR of Multi-Node Systems | Teradata VantageCloud Enterprise on AWS (DIY) - Node Failure Recovery - Teradata® VantageCloud Enterprise on AWS

VantageCloud Enterprise on AWS (DIY) Installation and Administration Guide - 2.4

Deployment
VantageCloud
Edition
Enterprise
Product
Teradata® VantageCloud Enterprise on AWS
Release Number
2.4
Published
April 2024
Language
English (United States)
Last Update
2024-07-15
dita:mapPath
kma1662437965174.ditamap
dita:ditavalPath
nat1649317391363.ditaval
dita:id
jnv1467245119674
Product Category
Cloud

Node failure recovery is supported only for systems with three or more nodes.

The replacement node is based on an image recorded when the system is first deployed or an updated image after a software upgrade. The replacement node has the same secondary IP, elastic IP, and identifiers as the replaced node. The primary IP address of the replacement node will be new. However, if you had allocated an elastic IP address to each node when you deployed your instance, the public IP address should be the same after the node recovery. The secondary private IP addresses are the same regardless of the elastic IP address settings you chose when you deployed the instance. At least one free IP address in the subnet is required.

You must create a new system image after a software upgrade. If you do not create this image, the software on the recovered instance will not match and the database cannot start if a node failure occurs.

Node failure recovery is handled differently than on-premises systems. Unless you want failed nodes to continue running for diagnostic purposes, you should terminate the instance. If AWS Support cannot diagnose the issue, contact Teradata Customer Support.

There is a cost associated with keeping this downed node running. You have the option of stopping or terminating the instance after the new node is brought online. You can also configure the instance to stop or terminate before a node failure.

Node failure recovery takes longer than a typical reset. If your instance does not restart automatically after 10 minutes, check the alerts in Teradata Server Management or Teradata Viewpoint. You must set up the alerts before the node failed. If you cannot resolve the node failure after reviewing the alerts, contact Teradata Customer Support.

With the NFR triggered instance, the root volume size increases to accommodate heavy data and lets you increase your instance size to 24xl for a day. Also, the increment in root volume size will not have any functional impact on your system.

Do not test NFR by terminating a node. Terminating a node deletes the attached EBS volumes permanently. NFR relies on detaching EBS volumes from a faulty node and reattaching those EBS volumes to a replacement node.