Description
The SentenceExtractor function extracts sentences from English input text. A sentence ends with a punctuation mark such as period (.), question mark (?), or exclamation mark (!).
Usage
td_sentence_extractor_mle ( data = NULL, text.column = NULL, accumulate = NULL, data.sequence.column = NULL, data.order.column = NULL )
Arguments
data |
Required Argument. |
data.order.column |
Optional Argument. |
text.column |
Required Argument. |
accumulate |
Optional Argument. |
data.sequence.column |
Optional Argument. |
Value
Function returns an object of class "td_sentence_extractor_mle" which
is a named list containing object of class "tbl_teradata".
Named list member can be referenced directly with the "$" operator
using name: result.
Examples
# Get the current context/connection. con <- td_get_context()$connection # Load example data. loadExampleData("sentenceextractor_example", "paragraphs_input") # Create object(s) of class "tbl_teradata". paragraphs_input <- tbl(con, "paragraphs_input") # Example 1 - Extract sentences in the column named 'paratext'. td_sentence_extractor_out <- td_sentence_extractor_mle(data = paragraphs_input, text.column = "paratext", accumulate = c("paraid", "paratopic") )