Setting Up a Teradata Connector for Hadoop (TDCH) - Teradata Studio

Teradata® Studio™ and Studio™ Express Installation Guide

Product: Teradata Studio
Release Number: 20.00
Published: October 2023
Last edition: 2026-04-13
Teradata Studio provides an option to transfer data to and from Hadoop systems. The Smart Loader for Hadoop feature uses the Teradata Connector for Hadoop (TDCH) installed on the Hadoop node, and uses Oozie to run the data transfer workflow.
Starting with version 20.00.00.05, Teradata Studio and Studio Express no longer support connections to Hadoop and Aster databases.
  1. Download TDCH from https://downloads.teradata.com/ and install it on your Hadoop system.
  2. Go to the directory that contains the Configure Oozie script (configureOozie.sh), under the TDCH installation path /usr/lib/tdch/<version>/scripts/.
  3. Change the mode so the script is executable:
    chmod +x configureOozie.sh
  4. Run dos2unix on configureOozie.sh to remove any hidden Windows characters from the file:
    dos2unix configureOozie.sh
  5. Run configureOozie.sh as the root user, providing the locations of your Hadoop services:
    Usage: ./configureOozie.sh nn=nameNodeHost [jt=jobTrackerHost] [oozie=oozieHost] [nnPort=nameNodePortNum] [jtPort=jobTrackerPortNum] [ooziePort=ooziePortNum] [webhcatPort=webhcatPortNum] [webhdfsPort=webhdfsPortNum]
    where:
    Parameter            Definition                      Value
    nameNodeHost         The Name Node host name         Required
    jobTrackerHost       The Job Tracker host name       Uses the nn parameter value if omitted
    oozieHost            The Oozie host name             Uses the nn parameter value if omitted
    nameNodePortNum      The Name Node port number       8020 if omitted
    jobTrackerPortNum    The Job Tracker port number     50300 if omitted
    ooziePortNum         The Oozie port number           11000 if omitted
    webhcatPortNum       The WebHCatalog port number     50111 if omitted
    webhdfsPortNum       The WebHDFS port number         50070 if omitted
    The port numbers listed are the defaults for HDP. If all the services are hosted on a single system and use the default ports, only the nn parameter is needed.
    The script exits with an error message if TDCH is not in its expected location. Otherwise, the script displays a message showing the parameter values. For example:
    The following is the specification of the Hadoop services used
    by the Oozie workflows: 
    {
    "Distribution":"HDP",
    "DistributionVersion":"3.2.1",
    "WebHCatalog":"hostname",
    "WebHCatalogPort":50111,
    "WebHDFS":"hostname",
    "WebHDFSPort":50070,
    "JobTracker":"hostname",
    "JobTrackerPort":50300,
    "NameNode":"hostname",
    "NameNodePort":8020,
    "Oozie":"hostname",
    "OoziePort":11000
    }
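The defaulting rules in step 5 can be sketched as a small shell helper. This is an illustrative sketch only, not part of TDCH: the function name build_oozie_cmd and the host name namenode.example.com are hypothetical. It prints the configureOozie.sh command line with every parameter spelled out, so the values can be checked before the script is actually run as root.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with TDCH): prints the configureOozie.sh
# command line with all parameters resolved, using the documented defaults.
# The body runs in a subshell, so its variables do not leak into the caller
# and "exit" only leaves the function.
build_oozie_cmd() (
    nn="" jt="" oozie=""
    # Default ports from the parameter table (HDP defaults).
    nnPort=8020 jtPort=50300 ooziePort=11000 webhcatPort=50111 webhdfsPort=50070

    for arg in "$@"; do
        case "$arg" in
            nn=*)          nn=${arg#nn=} ;;
            jt=*)          jt=${arg#jt=} ;;
            oozie=*)       oozie=${arg#oozie=} ;;
            nnPort=*)      nnPort=${arg#nnPort=} ;;
            jtPort=*)      jtPort=${arg#jtPort=} ;;
            ooziePort=*)   ooziePort=${arg#ooziePort=} ;;
            webhcatPort=*) webhcatPort=${arg#webhcatPort=} ;;
            webhdfsPort=*) webhdfsPort=${arg#webhdfsPort=} ;;
            *) echo "unknown parameter: $arg" >&2; exit 1 ;;
        esac
    done

    # nn is the only required parameter.
    if [ -z "$nn" ]; then
        echo "nn=nameNodeHost is required" >&2
        exit 1
    fi

    # jt and oozie fall back to the nn value when omitted.
    : "${jt:=$nn}" "${oozie:=$nn}"

    echo "./configureOozie.sh nn=$nn jt=$jt oozie=$oozie nnPort=$nnPort jtPort=$jtPort ooziePort=$ooziePort webhcatPort=$webhcatPort webhdfsPort=$webhdfsPort"
)

# Example: all services on one host, default ports.
build_oozie_cmd nn=namenode.example.com
```

With only nn supplied, the helper fills in the same host for jt and oozie and the default port numbers from the table. Passing, for example, jt=otherhost or ooziePort=11443 overrides just that value.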