The input data used to train the two models are shown in the following table.
model_id | seq_id | seq_vertex_id | observed_id |
---|---|---|---|
1 | 1 | 0 | 1 |
1 | 1 | 1 | 1 |
1 | 1 | 2 | 1 |
1 | 1 | 3 | 1 |
1 | 1 | 4 | 1 |
1 | 1 | 5 | 1 |
1 | 1 | 6 | 1 |
1 | 1 | 7 | 1 |
1 | 1 | 8 | 1 |
1 | 1 | 9 | 1 |
1 | 1 | 10 | 1 |
1 | 1 | 11 | 1 |
1 | 1 | 12 | 4 |
1 | 1 | 13 | 5 |
1 | 1 | 14 | 6 |
1 | 1 | 15 | 6 |
1 | 1 | 16 | 6 |
1 | 1 | 17 | 7 |
... | |||
2 | 1 | 0 | 1 |
2 | 1 | 1 | 1 |
2 | 1 | 2 | 1 |
2 | 1 | 3 | 1 |
2 | 1 | 4 | 1 |
2 | 1 | 5 | 1 |
2 | 1 | 6 | 1 |
2 | 1 | 7 | 1 |
2 | 1 | 8 | 1 |
2 | 1 | 9 | 1 |
2 | 1 | 10 | 1 |
2 | 1 | 11 | 1 |
2 | 1 | 12 | 8 |
The status of the loan is shown in the model_id column, where 1 identifies a defaulted loan and 2 identifies a paid loan. Rows with the same model id are used to train a single model. The use of two model ids ensures that two different models are trained. Also notice that the defaulted loans end with observed_id=7 and paid loans end with observed_id=8. The seq_vertex_id column provides the ordering of the symbols in the sequences.