The input table, applicant_reference, contains hypothetical information about people applying for employment at a particular company. It serves as the reference table against which external data from various sources is compared for identity matching.
id | firstname | lastname | email | city | zipcode | department | gender |
---|---|---|---|---|---|---|---|
1 | John | Dewey | john.dewey@corp-mark.com | Sugar Land | 77459 | Marketing | Male |
2 | Sarah | Anders | sarah.anders@corp-sales.com | Pearland | 77584 | Sales | Female |
3 | Elizabeth | Hall | elizabeth.hall@corp-eng.com | Galveston | 77550 | Engineering | Female |
4 | James | Nickson | james.nick@corp-it.com | Pasadena | 77501 | IT | Male |
5 | Kim | Lee | kim.lee@corp-sys.com | Clear Lake City | 77058 | Systems | Female |
6 | Jessica | Right | jessica.right@corp-mark.com | Sugar Land | 77459 | Marketing | Female |
The example compares this table with information (including credit scores) from an external source, shown in the following table. As expected with data from different sources, the external table has missing values and incomplete entries, such as truncated ZIP codes and abbreviated department names.
id | firstname | lastname | email | city | zipcode | department | creditscore |
---|---|---|---|---|---|---|---|
1 | John | Dewey | john.dewey@corp-mark.com | Sugar Land | 7774 | market | 700 |
2 |  | Hall |  | Galveston | 77550 | eng | 790 |
3 | Sarah | Anders | sarah.anders@corp-sales.com | pear | 77584 | sales | 650 |
4 | Jessica | right |  | Sugar Land | 77459 | Marketing | 690 |
5 | James | Nickson |  | Pasadena | 7750 | IT | 620 |
6 | Kim |  |  |  | 77058 | system | 570 |
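
To illustrate why approximate comparison is needed for data like this, the following is a minimal Python sketch that scores each external row against every reference row and keeps the best candidate. It is not the matching algorithm used by the function this example documents: the standard-library `difflib.SequenceMatcher` is a stand-in similarity measure, all fields are weighted equally, and the helper names (`similarity`, `record_score`, `best_match`) are hypothetical. The `email` column name and the placement of empty cells in the external rows are assumptions inferred from the values shown in the tables.

```python
from difflib import SequenceMatcher

# Field order: id, firstname, lastname, email, city, zipcode, department.
# gender and creditscore are left out because only one table has each.
REFERENCE = [
    (1, "John", "Dewey", "john.dewey@corp-mark.com", "Sugar Land", "77459", "Marketing"),
    (2, "Sarah", "Anders", "sarah.anders@corp-sales.com", "Pearland", "77584", "Sales"),
    (3, "Elizabeth", "Hall", "elizabeth.hall@corp-eng.com", "Galveston", "77550", "Engineering"),
    (4, "James", "Nickson", "james.nick@corp-it.com", "Pasadena", "77501", "IT"),
    (5, "Kim", "Lee", "kim.lee@corp-sys.com", "Clear Lake City", "77058", "Systems"),
    (6, "Jessica", "Right", "jessica.right@corp-mark.com", "Sugar Land", "77459", "Marketing"),
]

# External rows; "" marks a missing value (cell placement is an assumption).
EXTERNAL = [
    (1, "John", "Dewey", "john.dewey@corp-mark.com", "Sugar Land", "7774", "market"),
    (2, "", "Hall", "", "Galveston", "77550", "eng"),
    (3, "Sarah", "Anders", "sarah.anders@corp-sales.com", "pear", "77584", "sales"),
    (4, "Jessica", "right", "", "Sugar Land", "77459", "Marketing"),
    (5, "James", "Nickson", "", "Pasadena", "7750", "IT"),
    (6, "Kim", "", "", "", "77058", "system"),
]


def similarity(a, b):
    """Case-insensitive similarity in [0, 1] between two strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def record_score(ext, ref):
    """Average field similarity, skipping fields missing from the external row."""
    pairs = [(e, r) for e, r in zip(ext[1:], ref[1:]) if e]
    if not pairs:
        return 0.0
    return sum(similarity(e, r) for e, r in pairs) / len(pairs)


def best_match(ext):
    """Return (reference id, score) for the highest-scoring reference row."""
    score, ref_id = max((record_score(ext, ref), ref[0]) for ref in REFERENCE)
    return ref_id, score


# Map each external id to the id of its best reference candidate.
matches = {ext[0]: best_match(ext)[0] for ext in EXTERNAL}

for ext in EXTERNAL:
    ref_id, score = best_match(ext)
    print(f"external {ext[0]} -> reference {ref_id} (score {score:.2f})")
```

Skipping empty fields keeps a sparse row such as external id 6 from being penalized for what it does not contain, and lowercasing lets `right` match `Right` exactly. A production matcher would typically also apply a minimum score threshold so that weak candidates are rejected rather than paired.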