Schedule

Note that the schedule may be subject to change. Please check the course website frequently for the latest schedule.

For your reference: How to read & review a paper? How to give a talk?

Week Date Topics References Notes
1 Th 09/05 Introduction lecture notes-0, notes-1, passjoin paper
2 Th 09/12 Similarity Join 1 lecture notes-2, dossjoin paper, dossjoin code
3 Th 09/19 Similarity Join 2 lecture notes-3, prefix filter paper, partition paper, ed prefix paper, pivotal paper
4 Th 09/26 Similarity Search 1 lecture slides/notes4.pdf R-tree paper
5 Th 10/03 Similarity Search 2 lecture slides/notes5.pdf, PQ scan paper, PQ fast scan paper
6 Th 10/10 Data Transformation paper review transformation 1, transformation 2, transformation 3
7 Th 10/17 Middle Term Exam exam
8 Th 10/24 Data Discovery paper review discovery 1, discovery 2, discovery 3
9 Th 10/31 Data Wrangling paper review wrangling 1, wrangling 2, wrangling 3
10 Th 11/07 Data Cleaning paper review cleaning 1, cleaning 2, cleaning 3
11 Th 11/14 Data Integration - Entity Resolution paper review matching 1, matching 2, matching 3
12 Th 11/21 Data Integration - Entity Consolidation paper review fusion 1, fusion 2, fusion 3
13 Th 11/26 Data Visualization paper review visualization 1, visualization 2, visualization 3
14 Th 12/05 Summary of this class

All the students form groups of three. Each group is in charge of presenting the three papers in a week. The 3 group members can read and discuss the 3 papers together.