Link to directory that stores the 300+ documents
README file that tells the entity type and information on where to find the markups
Link to a browsable directory that stores all documents in set I
Link to a browsable directory that stores all documents in set J
Link to a directory that stores all of your code
Link to a compressed file that stores all of the above directories
Link to a pdf document with the report
Link to the directory that stores all of the data (both tables A and B)
Link to the directory that stores all of the code
Link to a pdf document with the report
Link to the directory that stores all of the data (both tables A and B)
- Link to CSV file that lists all tuple pairs that survive the blocking step
- Link to CSV file that lists all tuple pairs in sample taken (i.e. file G)
- Link to file that describes set I
- Link to file that describes set J
Link to the directory that stores all of the code
Link to a pdf document with the report
Link to Jupyter notebook [em_notebook.ipynb]
Link to the directory that stores all of the data
- Link to CSV file storing Table E
- Link to file with the set of matches between tables A and B
- Each match is of the form 'amazon id' '\t' 'walmart id'
- Link to Python script that was used to merge tables A and B