The input table, nermem_sports_train, is a collection of sports news. There are 50 rows of training data with an id column and a content column (containing text information). The function generates a model file location.sports and accepts only one tag ('LOCATION') in the type argument.
id | content |
---|---|
2 | CRICKET - <START:ORG> LEICESTERSHIRE <END> TAKE OVER AT TOP AFTER INNINGS VICTORY . |
3 | <START:LOCATION> LONDON <END> 1996-08-30 |
4 | West Indian all-rounder <START:PER> Phil Simmons <END> took four for 38 on Friday as <START:ORG> Leicestershire <END> beat <START:ORG> Somerset <END> by an innings and 39 runs in two days to take over at the head of the county championship . |
5 | Their stay on top |
6 | After bowling <START:ORG> Somerset <END> out for 83 on the opening morning at <START:LOCATION> Grace Road <END> |
7 | Trailing by 213 |
8 | <START:ORG> Essex <END> |
9 | <START:PER> Hussain <END> |
10 | By the close <START:ORG> Yorkshire <END> had turned that into a 37-run advantage but off-spinner <START:PER> Such <END> had scuttled their hopes |
11 | At the <START:LOCATION> Oval <END> |
12 | He was well backed by <START:LOCATION> England <END> hopeful <START:PER> Mark Butcher <END> who made 70 as <START:ORG> Surrey <END> closed on 429 for seven |
... | ... |