Job Objective
Extract rows from a database table and write them to a Hadoop table; the Hadoop table's data should be stored in the RCFile format.
Data Flow Diagram
The following figure shows a diagram of the elements for Job Example 18.
Job Example PTS00032 – Extract Rows and Write Them to a Hadoop Table
Sample Script
For the sample script that corresponds to this job, see the following script in the sample/userguide directory:
PTS00032: Extract Rows and Write Them to a Hadoop Table
Rationale
This job uses the operators shown for the following reasons:
- Export operator because it is the fastest way to extract large amounts of data from a database table.
- DataConnector operator along with the TDCH-TPT Interface because it is the only producer operator that writes data to Hadoop tables in the RCFile format.