Creating Hadoop Hortonworks Connection Profiles Using TDCH - Teradata Studio

Teradata Studio User Guide

Product
Teradata Studio
Release Number
15.11
Published
May 2016
Language
English (United States)
Last Update
2018-05-03
dita:mapPath
bpb1467322243715.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2041
lifecycle
previous
Product Category
Teradata Tools and Utilities
Teradata Connector for Hadoop (TDCH) enables you to move data between Teradata Databases and Hadoop systems.
  1. In the Data Source Explorer or Navigator, click to create a new connection.
  2. Select Hadoop Hortonwords from Connection Profile Types.
  3. At Name, type a name to identify the connection profile.
  4. [Optional] At Description, type a description of this Connection Profile.
  5. Click Next.
  6. Select TDCH.
  7. [Optional] Select these additional connection options as needed:
    Connection Option Description
    Knox Gateway Connects to the Hadoop system through a Knox Gateway. If you select this, you are not able to select JDBC also.
    JDBC Connects to the Hadoop system using SQL from the SQL Editor. If you select this, you are not able to select Knox also.
    SQL-H SQL-H enables you to transfer data from the Hortonworks Hadoop System to an Aster Database.
  8. Specify when to connect :
    Option Description
    Connect when the wizard completes Connect to the database when you complete the profile.
    Connect every time the workbench is started Connect to this database each time you launch the workbench.
  9. Click Next.
  10. Enter the following credentials:
    1. At WebHDFS Host Name, type the host name or IP address of the system configured to provide access to the Hadoop systems.
    2. At WebHDFS Port Number, type the port number to use to communicate with the Hadoop system.
    3. At WebHDFS User Name, type the user name with permissions to access the WebHDFS host.
    4. If the Hadoop cluster has High Availability enabled for the namenode (the cluster has an active and standby namenode), select HA Enabled Cluster, then type the host name or IP address of the standby or backup Namenode/WebHDFS host at Secondary WebHDFS Host Name.
  11. [Optional] If you select to connect using TDCH and Knox Gateway, complete the properties for a Knox connection. When you select Knox, the TDCH connection properties are automatically generated because the Knox Gateway handles the TDCH connection.
    1. At Gateway Host, type the host name.
    2. At Gateway Port Number, type the port number for the host.
    3. At Cluster Name, type the cluster name.
    4. At Gateway User Name, type the user name for the Knox Gateway.
    5. [Optional] In Gateway Password, type the password for the Knox Gateway.
    6. Select Save Password to save the password.
  12. [Optional] If you select to connect using TDCH and JDBC, enter the connection credentials:
    1. At WebHCat Host Name, type the host name or IP address of the Apache HCatalog system that manages the metadata services for your Hadoop system.
    2. In WebHCat Port Number, type the port number to use to communicate with the WebHCat host.
    3. In WebHCat User Name, type the user name with permissions to access the WebHCat host.
    4. Click Next.
    5. In JDBC Host, type the host name of the Hadoop System to which to connect.
    6. In JDBC Port Number, type the port number to use to communicate with the host.
    7. In JDBC Database, type the name of the Hadoop database.
    8. In JDBC User Name, type the user name to use to connect to the database.
    9. In JDBC Password, type the password required to access the database using Hive JDBC.
    10. Select Save Password to save the password.
    11. Select HTTP Transport Mode to transfer data using the HTTP secure transfer mode.
    12. In HTTP Path type the HTTP server path or accept the default.
  13. Click Next to view the Summary information.
  14. Click Finish to create the connection profile.