The Teradata Package for R product combines the benefits of open source R language environment with the massive parallel processing capabilities of Teradata Vantage, which includes the Machine Learning Engine analytic functions and the Analytics Database in-database analytic functions. Teradata Package for R allows users to develop and run R programs that take advantage of the Big Data and Machine Learning analytics capabilities of Vantage.
The Teradata Package for R product is tdplyr, an R library package like other open source R packages. The package interface makes available to R users a collection of functions for analytics that reside on Vantage, so that R users can perform in-database analytics with no SQL coding required. Specifically, the tdplyr package provides functions for data manipulation and transformation, data filtering and sub-setting, and can be used in conjunction with open source R capabilities. Moreover, the tdplyr package conforms and works with the functions of the dbplyr package and most of the verbs of the dplyr package.
The Teradata Package for R works over connections to:
- Teradata Vantage with Analytics Database and ML Engine
- Teradata Vantage with Analytics Database onlyFor the Teradata Vantage with Analytics Database only type of connection, only Analytics Database analytic functions are accessible.
Teradata Vantage Modules for Jupyter
Teradata Package for R is included in the Docker image of Teradata Vantage™ Modules for Jupyter, which also includes JupyterLab and other components to run as a Docker container on a client machine.
Teradata Vantage Modules for Jupyter allows users to access Vantage in Python, R or SQL from JupyterLab notebooks.
See Teradata Vantage™ Modules for Jupyter Installation Guide, B700-4010.