Creating Hadoop Hortonworks Connection Profiles - Teradata Studio

Teradata Studio User Guide

Product
Teradata Studio
Release Number
16.00
Published
March 2017
Language
English (United States)
Last Update
2018-03-29
dita:mapPath
hop1484765174877.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2041
lifecycle
previous
Product Category
Teradata Tools and Utilities
You can create a Hadoop Hortonworks Connection Profile using the connection options in the New Connection Profile wizard.
  1. In the Data Source Explorer or Navigator, click to create a new connection.
  2. Select Hadoop from Connection Profile Types.
  3. At Name, type a name to identify the Connection Profile.
  4. [Optional] At Description, type a description of the Connection Profile.
  5. Click Next.
  6. [Optional] Select Knox Gateway if you currently connect to your Hortonworks Hadoop System through a Knox Gateway. If you select this option, the Smart Load and Hive JDBC options are selected and disabled as no additional information is required for those options when Knox is selected. Kerberos is unselected and disabled since only Knox credentials are used to establish a connection, even on a Kerberized cluster. Presto is also unselected and disabled as it is not supported with Knox.
  7. [Optional] Select TDCH if you plan to use this connection for data transfers between the Hortonworks Hadoop System and a Teradata Database.
    TDCH access is not supported if Kerberos authentication is selected.
  8. [Optional] Select SQL-H if you plan to use this connection for data transfers from the Hortonworks Hadoop System to an Aster Database.
  9. [Optional] Select Smart Loader if you plan to use this connection to import data from text delimited files into a Hortonworks Hadoop System. If you select this option, Hive is automatically selected as the Hortonworks Hive JDBC Driver will be used to access the Hortonworks Hadoop System.
  10. [Optional] Select Hive if you plan to use Hive JDBC to create and run SQL.
  11. [Optional] Select Presto if you plan to use Presto JDBC to create and run SQL. If you select this option, you cannot select the Smart Loader option.
  12. [Optional] Specify when to connect:
    Option Description
    Connect when the wizard completes Connect to the database when you complete the profile.
    Connect every time the workbench is started Connect to this database each time you launch the workbench.
  13. Click Next.
  14. If you selected the Knox Gateway option, specify the properties for the Knox gateway connection:
    1. At Gateway Host, type the host name.
    2. At Gateway Port Number, type the port number for the host.
    3. At Cluster Name, type the cluster name.
    4. At Gateway User Name, type the user name for the Knox Gateway.
    5. [Optional] In Gateway Password, type the password for the Knox Gateway.
    6. Select Save Password to save the password.
    7. Select SSL Enabled to enable Secure Sockets Layer encryption.
    8. Select a driver from Select a driver from the drop-down. The default JDBC driver for the Hadoop provider and connection service is selected. If you want to create a new driver definition, click .
    9. Click Next.
  15. If Kerberos authentication is selected you will need to enter authentication properties.
    1. If Smart Load or Hive are selected, enter the name of the Kerberos Realm.
    2. If Presto is selected, enter the SSL TrustStore Path and SSL TrustStore Password.
    3. Click Next.
  16. If you selected a TDCH and/or Smart Load connection, enter the WebHDFS connection credentials:
    1. At WebHDFS Host Name, type the host name or IP address of the system configured to provide access to the Hadoop systems.
    2. At WebHDFS Port Number, type the port number to use to communicate with the Hadoop system.
    3. At WebHDFS User Name, type the user name with permissions to access the WebHDFS host. For systems configured with Kerberos authentication, this should typically be the principal user name.
    4. If the Hadoop cluster has High Availability enabled for the namenode (the cluster has an active and standby namenode), select HA Enabled Cluster, then type the host name or IP address of the standby or backup Namenode/WebHDFS host at Secondary WebHDFS Host Name.
    5. Click Next.
  17. If you selected a SQL-H and/or Hive or Presto connection, enter the WebHCat connection credentials:
    1. At WebHCat Host Name, type the host name or IP address of the Apache HCatalog system that manages the metadata services for your Hadoop system.
    2. In WebHCat Port Number, type the port number to use to communicate with the WebHCat host.
    3. In WebHCat User Name, type the user name with permissions to access the WebHCat host. For systems configured with Kerberos authentication, this should typically be the principal user name.
    4. Click Next.
  18. If you specified a Smart Loader, Hive, or Presto connection, select a driver from Select a driver from the drop-down. The default JDBC driver for the Hadoop provider and connection service is selected. If you want to create a new driver definition, click .
    1. In JDBC Host, type the host name of the Hadoop System to which to connect.
    2. In JDBC Port Number, type the port number to use to communicate with the host.
    3. In JDBC Database, type the name of the Hadoop database.
    4. In JDBC User Name, type the user name to use to connect to the database. For systems configured with Kerberos authentication, this should typically be the principal user name.
    5. In JDBC Password, type the password required to access the database using Hive JDBC.
    6. Select Save Password to save the password.
    7. Select HTTP Transport Mode to transfer data using the HTTP secure transfer mode.
    8. In HTTP Path type the HTTP server path or accept the default.
    9. For connection services other than Presto, select LDAP Security Enabled to enable LDAP authentication.
    10. Select JDBC Logging to generate JDBC log information.
  19. Click Next to view the Summary information.
  20. Click Finish to create the connection profile.