Description
The SentenceExtractor function extracts sentences from English input text. A sentence ends with a punctuation mark such as a period (.), a question mark (?), or an exclamation mark (!).
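The sentence-boundary rule described above can be illustrated locally with a small base R sketch. This is not the implementation used by SentenceExtractor: `split_sentences` is a hypothetical helper introduced only for illustration, and it assumes that a sentence boundary is a period, question mark, or exclamation mark followed by whitespace.

```r
# Hypothetical helper, not part of tdplyr or the SentenceExtractor function.
# Assumption: a sentence ends at '.', '?' or '!' followed by whitespace.
split_sentences <- function(text) {
  # Split on whitespace that is preceded by a terminating punctuation mark;
  # the lookbehind keeps the punctuation attached to its sentence.
  parts <- strsplit(text, "(?<=[.?!])\\s+", perl = TRUE)[[1]]
  # Drop any empty pieces.
  parts[nzchar(parts)]
}

sentences <- split_sentences("Is this a test? Yes it is. Great!")
print(sentences)
```

A rule this simple also splits after abbreviations such as "Dr.", so it only conveys the general idea of punctuation-based sentence extraction.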
Usage
td_sentence_extractor_mle (
  data = NULL,
  text.column = NULL,
  accumulate = NULL,
  data.sequence.column = NULL,
  data.order.column = NULL
)
Arguments
data: Required Argument.
data.order.column: Optional Argument.
text.column: Required Argument.
accumulate: Optional Argument.
data.sequence.column: Optional Argument.
Value
The function returns an object of class "td_sentence_extractor_mle", which
is a named list containing an object of class "tbl_teradata".
The named list member can be referenced directly with the "$" operator
using the name: result.
Examples
# Get the current context/connection.
con <- td_get_context()$connection
# Load example data.
loadExampleData("sentenceextractor_example", "paragraphs_input")
# Create object(s) of class "tbl_teradata".
paragraphs_input <- tbl(con, "paragraphs_input")
# Example 1 - Extract sentences in the column named 'paratext'.
td_sentence_extractor_out <- td_sentence_extractor_mle(data = paragraphs_input,
                                                       text.column = "paratext",
                                                       accumulate = c("paraid", "paratopic")
                                                       )
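As the Value section notes, the returned object is a named list whose member is accessed with the "$" operator. A minimal sketch of inspecting the output of Example 1 follows; it assumes the call above succeeded against a live Teradata connection, and the variable name `extracted` is introduced here only for illustration.

```r
# Assumes td_sentence_extractor_out was created by Example 1 above.
# The list member "result" holds an object of class "tbl_teradata".
extracted <- td_sentence_extractor_out$result

# Print the extracted sentences.
print(extracted)
```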