SLES 12 - HDP Server Nodes - Hortonworks Data Platform

Hortonworks Data Platform for Teradata® Administrator Guide

prodname
Hortonworks Data Platform
vrm_release
2.6.4
2.6.5
SLES 12
created_date
November 2019
category
Administration
featnum
B035-6097-028K

Hortonworks for Teradata® is composed of master, data, and edge nodes.

Master Node for Hadoop
Controls the cluster by storing metadata and running master services, including:
  • HCatalog: Describes the structure of data stored in HDFS
  • Hive: Queries structured data in HDFS
  • JournalNode: Modifies log changes in HDFS from the namenode
  • Namenode: Manages HDFS storage; high availability requires an active and standby namenode
  • YARN: Schedules application jobs and manages and allocates resources
  • ZooKeeper: Synchronizes distributed components as well as monitoring the namenode
Data Node for Hadoop
  • Stores HDFS blocks
  • Answers queries from the namenode for filesystem operations
  • Allows client applications to communicate directly with the data node when the namenode determines the data location
Edge Node for Hadoop
The edge node allows client applications to run independently of the master node, reducing both the risk in testing new applications and the impact on Teradata Database throughput by enhancing load performance, which TASM or Teradata Integrated Workload Management ruleset throttles. Located between the Hadoop cluster and the customer network, the edge node runs client services for the cluster:
  • Allows access for external applications and user access to the Hadoop environment
  • Permits access control
  • Enforces policy oversight
  • Logs metadata
  • Provides fast connections by communicating to the Hadoop cluster over the internal InfiniBand network