HDP Server Nodes - Hortonworks Data Platform

Hortonworks Data Platform for Teradata Administrator Guide

Product
Hortonworks Data Platform
Release Number
2.6
Published
July 2017
Language
English (United States)
Last Update
2018-04-12
dita:mapPath
xfe1499786393149.ditamap
dita:ditavalPath
hdp_Hortonworks.ditaval.ditaval
dita:id
B035-6097
lifecycle
previous
Product Category
Open Source

Hortonworks for Teradata is comprised of master, data, and edge nodes.

Master Node for Hadoop
Controls the cluster by storing metadata and running master services, including:
  • HCatalog: Describes the structure of data stored in HDFS
  • Hive: Queries structured data in HDFS
  • JournalNode: Modifies log changes in HDFS from the namenode
  • Namenode: Manages HDFS storage; high availability requires an active and standby namenode
  • YARN: Schedules application jobs and manages and allocates resources
  • ZooKeeper: Synchronizes distributed components as well as monitoring the namenode
Data Node for Hadoop
  • Stores HDFS blocks
  • Answers queries from the namenode for filesystem operations
  • Allows client applications to communicate directly with the data node when the namenode determines the data location
Edge Node for Hadoop
The edge node allows client applications to run independently of the master node, reducing both the risk in testing new applications and the impact on Teradata Database throughput by enhancing load performance, which TASM or Teradata Integrated Workload Management ruleset throttles. Located between the Hadoop cluster and the customer network, the edge node runs client services for the cluster:
  • Allows access for external applications and user access to the Hadoop environment
  • Permits access control
  • Enforces policy oversight
  • Logs metadata
  • Provides fast connections by communicating to the Hadoop cluster over the internal InfiniBand network