Server Management Operational and Problem Events - Server Management

Teradata® Server Management Product Guide

Product
Server Management
Release Number
14.00
Published
April 2019
Language
English (United States)
Last Update
2020-10-13
dita:mapPath
tsx1552482809995.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-6112
lifecycle
previous
Product Category
Hardware
Software

The CMIC collects events for the components in the collective it manages, and consolidates and correlates them with other CMICs to provide a summarized view of the impacts to the systems and components in the Server Management domain.

Type Description
Event A change in condition that may require intervention. Events are collected from operating system logs, other device-specific interfaces, and the CMICs themselves.

Events that affect system operation generate alerts.

Alert

Alerts are derived from the collected events based on event signatures that are known to present significant problem conditions. The alert severity indicates the level of impact of the problem to the system or subsystem.

Alerts are generated for both software and hardware, including but not limited to the following:
  • Teradata Database
  • Kubernetes clusters
  • BYNET
  • Disk arrays
  • Node operating systems
  • Platform hardware

Default settings in the Java Client Alerts Viewer now include Summary Alerts

System Problems Summary alerts are generated based on groups of alerts that occur together during known system problems. A recommended action is provided for resolution of problem conditions. Based on their state, system problems can be closed (cleared or deleted) when problem conditions are known to be resolved.

Data bundles are used to diagnose problems. Software subsystems generate data bundles when they detect fault conditions. Data bundles reported during known fault conditions are escalated automatically to the TVI backend as soon as they are available, and they become available as attachments to incidents that are generated.