Table | Description |
---|---|
InputTable | Contains input documents or tokenized data. |
StopWordsTable | [Optional] Contains stop words (a, an, the, and so on). |
InputTable Schema
Column | Data Type | Description |
---|---|---|
doc_id_column | VARCHAR, INTEGER, or SMALLINT | [Column must appear with ModelType ('Bernoulli').] Identifier of document. |
text_column or token_column | VARCHAR | Document text or classified training tokens. |
doc_category_column | VARCHAR | Category of document. |
StopWordsTable Schema
Column | Data Type | Description |
---|---|---|
stop_words_column | CHARACTER or VARCHAR | Stop word (one in each row). |