The input table (salesdata) consists of 341 distinct users and the items they have purchased in an office supplies store. For ease of use, the items are assigned itemids (shown in the following table) which are then used in the input table.
Minhash Example Items and Itemids
item |
itemid |
Storage |
1 |
Appliances |
2 |
Binders |
3 |
Telephones |
4 |
Paper |
5 |
Rubber Bands |
6 |
Computer Peripherals |
7 |
Office Furnishings |
8 |
Office Machines |
9 |
Envelopes |
10 |
Bookcases |
11 |
Tables |
12 |
Pens & Art Supplies |
13 |
Chairs & Chairmats |
14 |
Scissors |
15 |
Rulers & Trimmers |
16 |
Copiers & Fax Storage |
17 |
Labels |
18 |
Minhash Example Input Table salesdata
userid |
itemid |
1 |
1 |
2 |
2 3 |
3 |
4 |
4 |
2 |
5 |
3 1 |
6 |
1 |
7 |
5 |
8 |
5 6 |
9 |
2 1 |
10 |
3 |
11 |
8 |
12 |
10 |
13 |
11 4 |
... |
... |