Data in Parquet Format | Native Object Store | Teradata Python Package - Data in Parquet Format - Teradata Package for Python

Teradata® Package for Python User Guide

Product
Teradata Package for Python
Release Number
17.10
Published
May 2022
Language
English (United States)
Last Update
2022-08-18
dita:mapPath
rsu1641592952675.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
B700-4006
lifecycle
previous
Product Category
Teradata Vantage
Foreign table created in Parquet format usually contains the following columns:
  • Location
  • Several other user specified columns in Parquet format specified while creating foreign table

User can create a teradataml DataFrame on a foreign table using "DataFrame()" or "DataFrame.from_table()", the same way to create a teradataml DataFrame on a regular table. With the created DataFrame, user can easily access the data in these columns and process the data using teradataml DataFrame API or other Python packages.

How to access actual data and path variables

Unlike foreign tables on JSON and CSV format data, teradataml DataFrame on Parquet data provides direct access to the actual data in Parquet files, as described in Accessing Foreign Table Created On Parquet Data.