Create a Hadoop Cloudera Connection Profile | Teradata Studio/Studio Express - Creating a Hadoop Cloudera Connection Profile - Teradata Studio

Teradata® Studio™ Express User Guide - 17.20

Product
Teradata Studio
Release Number
17.20
Published
September 2022
Language
English (United States)
Last Update
2022-09-23
dita:mapPath
ztp1648839236373.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2042
Product Category
Teradata Tools and Utilities
  1. In the Data Source Explorer or Navigator, click "" .
  2. Do the following:
    1. Select Hadoop from Connection Profile Types.
    2. Type a Name to identify the Connection Profile.
    3. [Optional] Type a Description of the Connection Profile.
    4. Click Next.
  3. Specify profile details:
    1. Select Cloudera.
    2. [Optional] If your Hadoop Cloudera system is configured within a Kerberos realm, select Kerberos under Security Authentication.
    3. [Optional] Select the applicable connection service:
      Connection Service Option Description
      TDCH Transfers data between the Hadoop Cloudera System and Vantage.
      SQL-H Transfers data from the Hadoop Cloudera System to Aster Database.
      Smart Loader Imports data from text delimited files into a Hadoop Cloudera System.
      Hive

      Uses Hive JDBC to create and run SQL.

      If you select this option, you cannot select the Impala option.

      Impala

      Uses Hive Impala JDBC to create and run SQL.

      If you select this option, you cannot select the Hive option.

    4. [Optional] Specify when to connect:
      Option Action
      Connect when the wizard completes Connect to the database when you complete the profile.
      Connect every time the workbench is started Connect to this database each time you launch the workbench.
    5. Click Next.
  4. If you selected Smart Loader, Hive, or Impala, enter the name of the Kerberos Realm.
    TDCH and SQL-H do not require additional information.
  5. Do the following based on the connection service you selected then click Next to view Summary information:
    If you selected Complete the following:
    TDCH or Smart Loader Enter the WebHDFS connection credentials:
    1. In WebHDFS Host Name, type the host name or IP address of the system configured to provide access to the Hadoop systems.
    2. In WebHDFS Port Number, type the port number to use to communicate with the Hadoop system.
    3. At WebHDFS User Name, type the user name with permissions to access the WebHDFS host.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    4. If the Hadoop cluster has High Availability enabled for the namenode (the cluster has an active and standby namenode), select HA Enabled Cluster, then type the host name or IP address of the standby or backup Namenode/WebHDFS host at Secondary WebHDFS Host Name.
    SQL-H Enter the WebHCat connection credentials:
    1. In WebHCat Host Name, type the host name or IP address of the Apache HCatalog system that manages the metadata services for your Hadoop system.
    2. In WebHCat Port Number, type the port number to use to communicate with the WebHCat host.
    3. In WebHCat User Name, type the user name with permissions to access the WebHCat host.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    Hive or Impala Enter the JDBC connection properties:
    1. Select a driver from Select a driver from the drop-down or create one:
      1. Click .
      2. Select the External Cloudera Driver template for your system type.
      3. In Driver name, enter a unique name for the driver definition.
      4. On the JAR List tab, click Add JAR/Zip to add the list of JDBC driver JARs.
      5. On the Properties tab, edit the JDBC driver properties.
      6. Click OK.
    2. In JDBC Host, type the host name of the Hadoop System to which to connect.
    3. In JDBC Port Number, type the port number to use to communicate with the host.
    4. In JDBC Database, type the name of the Hadoop database.
    5. In JDBC User Name, type the user name to use to connect to the database.

      For systems configured with Kerberos authentication, this is typically the principal user name.

    6. In JDBC Password, type the password required to access the database using Hive JDBC.
    7. Select Save Password to save the password.
    8. Select HTTP Transport Mode to transfer data using the HTTP secure transfer mode.
    9. In HTTP Path type the HTTP server path or accept the default.
    10. Select LDAP Security Enabled to enable LDAP authentication.
  6. Click Finish to create the connection profile.