HDP Server Nodes - Hortonworks Data Platform

Hortonworks Data Platform for Teradata® Administrator Guide

Product
Hortonworks Data Platform
Release Number
SLES 12
3.1.4
3.0.1
Published
November 30, 2019
Language
English (United States)
Last Update
2019-11-25
dita:mapPath
eof1534538710605.ditamap
dita:ditavalPath
hdp_Hortonworks.ditaval.ditaval
dita:id
B035-6097
lifecycle
previous
Product Category
Open Source

Hortonworks Data Platform for Teradata® is composed of master, data, and edge nodes.

Master Node for Hadoop
Controls the cluster by storing metadata and running master services, including:
  • Hive: Queries structured data in HDFS
  • JournalNode: Modifies log changes in HDFS from the namenode
  • Namenode: Manages HDFS storage; high availability requires an active and standby namenode
  • YARN: Schedules application jobs and manages and allocates resources
  • ZooKeeper: Synchronizes distributed components as well as monitoring the namenode
Data Node for Hadoop
  • Stores HDFS blocks
  • Answers queries from the namenode for filesystem operations
  • Allows client applications to communicate directly with the data node when the namenode determines the data location
Edge Node for Hadoop
The edge node allows client applications to run independently of the master node, reducing the risk in testing new applications. Located between the Hadoop cluster and the customer network, the edge node runs client services for the cluster:
  • Allows access for external applications and user access to the Hadoop environment
  • Permits access control
  • Enforces policy oversight
  • Provides fast connections by communicating to the Hadoop cluster over the internal InfiniBand network