Create a Hadoop Hortonworks Connection Profile | Teradata Studio/Studio Express - Creating a Hadoop Hortonworks Connection Profile - Teradata Studio

Teradata® Studio™ Express User Guide

Product
Teradata Studio
Release Number
17.00
Published
January 2021
Language
English (United States)
Last Update
2021-01-11
dita:mapPath
ovm1576517377363.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2042
lifecycle
previous
Product Category
Teradata Tools and Utilities
The Impala connection service cannot be used with this connection profile type.
  1. In the Data Source Explorer or Navigator, click "" .
  2. Do the following:
    1. Select Hadoop from Connection Profile Types.
    2. Type a Name to identify the Connection Profile.
    3. [Optional] Type a Description of the Connection Profile.
    4. Click Next.
  3. Do the following to specify profile details:
    1. [Optional] Select Knox Gateway if you currently connect to your Hortonworks Hadoop System through a Knox Gateway.
      If you select this option, the Smart Loader and Hive JDBC connection service options are selected and disabled as no additional information is required for those options. Kerberos is unselected and disabled since only Knox credentials are used to establish a connection, even on a Kerberized cluster.
    2. [Optional] If your Hadoop Hortonworks system is configured within a Kerberos realm, select Kerberos under Security Authentication.
      If both Knox Gateway and Kerberos are selected, only the TDCH and SQL-H connection service options are available.
    3. [Optional] Select the applicable connection service:
      Connection Service Option Description
      TDCH Transfers data between the Hortonworks Hadoop System and Vantage.
      SQL-H Transfers data from the Hortonworks Hadoop System to an Aster Database.
      SQL-H is not supported in Hadoop 3.0.1. Data transfer from Hadoop 3.0.1 to Aster will not work.
      Smart Loader

      Imports data from text delimited files into a Hortonworks Hadoop System.

      If you select this option, Hive is automatically selected as the Hortonworks Hive JDBC Driver to access the Hortonworks Hadoop System.

      Hive Uses Hive JDBC to create and run SQL.
    4. [Optional] Specify when to connect:
      Option Action
      Connect when the wizard completes Connect to the database when you complete the profile.
      Connect every time the workbench is started Connect to this database each time you launch the workbench.
    5. Click Next.
  4. If you selected the Knox Gateway option, specify the properties for the Knox gateway connection:
    1. At Gateway Host, type the host name.
    2. At Gateway Port Number, type the port number for the host.
    3. At Cluster Name, type the cluster name.
    4. At Gateway User Name, type the user name for the Knox Gateway.
    5. [Optional] In Gateway Password, type the password for the Knox Gateway.
    6. Select Save Password to save the password.
    7. Select SSL Enabled to enable Secure Sockets Layer encryption.
    8. Select a driver from Select a driver from the drop-down or create one:
      1. Click "" .
      2. Select the External Hortonworks Hive Driver template for your system type.
      3. In Driver name, enter a unique name for the driver definition.
      4. On the JAR List tab, click Add JAR/Zip to add the list of JDBC driver JARs.
      5. On the Properties tab, edit the JDBC driver properties.
      6. Click OK.
    9. [Optional] To add additional JDBC properties, click Add, enter the Name and Value, and select the variable Type.
    10. Click Next.
  5. If you selected Smart Loader or Hive, enter the name of the Kerberos Realm.
    TDCH and SQL-H do not require additional information.
  6. Do the following based on the connection service you selected then click Next to view Summary information:
    If you selected Complete the following:
    TDCH or Smart Loader Enter the WebHDFS connection credentials:
    1. In WebHDFS Host Name, type the host name or IP address of the system configured to provide access to the Hadoop systems.
    2. In WebHDFS Port Number, type the port number to use to communicate with the Hadoop system.
    3. At WebHDFS User Name, type the user name with permissions to access the WebHDFS host.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    4. If the Hadoop cluster has High Availability enabled for the namenode (the cluster has an active and standby namenode), select HA Enabled Cluster, then type the host name or IP address of the standby or backup Namenode/WebHDFS host at Secondary WebHDFS Host Name.
    SQL-H Enter the WebHCat connection credentials:
    1. In WebHCat Host Name, type the host name or IP address of the Apache HCatalog system that manages the metadata services for your Hadoop system.
    2. In WebHCat Port Number, type the port number to use to communicate with the WebHCat host.
    3. In WebHCat User Name, type the user name with permissions to access the WebHCat host.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    Hive Enter the JDBC connection properties:
    1. Select a driver from Select a driver from the drop-down or create one:
      1. Click "" .
      2. Select the External Hortonworks Hive Driver template for your system type.
      3. In Driver name, enter a unique name for the driver definition.
      4. On the JAR List tab, click Add JAR/Zip to add the list of JDBC driver JARs.
      5. On the Properties tab, edit the JDBC driver properties.
      6. Click OK.
    2. In JDBC Host, type the host name of the Hadoop System to which to connect.
    3. In JDBC Port Number, type the port number to use to communicate with the host.
    4. In JDBC Database, type the name of the Hadoop database.
    5. In JDBC User Name, type the user name to use to connect to the database.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    6. In JDBC Password, type the password required to access the database using Hive JDBC.
    7. Select Save Password to save the password.
    8. Select HTTP Transport Mode to transfer data using the HTTP secure transfer mode.
    9. In HTTP Path type the HTTP server path or accept the default.
    10. For a Hive connection, select LDAP Security Enabled to enable LDAP authentication.
  7. Click Finish to create the connection profile.