Background - Aster Analytics

Teradata AsterĀ® Analytics Foundation User GuideUpdate 2

Product
Aster Analytics
Release Number
7.00.02
Published
September 2017
Language
English (United States)
Last Update
2018-04-17
dita:mapPath
uce1497542673292.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1022
lifecycle
previous
Product Category
Software

Text classification is the task of choosing the correct class label for a given text input. In basic text classification tasks, each input is considered in isolation from all other inputs, and the set of class labels is defined in advance.

Text classification is a two-stage process:

  1. Train the model:

    Preprocess the text data and produce tokens.

    Use natural language processing (NLP) functionality such as tokenization, stemming, and stop words.

    From the tokens, use statistical measures to select a subset.

    Generate the feature for each word in the subset.

    Use machine learning algorithms to train a classifier.

  2. Classify the text.