Techniques for Data Integration
Department of Computer Science
Winter 2016
Enterprise systems are large-scale software applications to support
business operations; they typically include software systems for data
management, business process/workflow management, information flows,
reporting, and data analytics. Focusing only the data management
aspect, a typical enterprise has to struggle with many data
integration difficulties, since its data are usually spread around
many database systems, workflow systems, file systems, etc. and in a
variety of form possibly with no coherent semantics. In this course,
we plan to discuss some fundamental data modeling and manipulation
techniques that will be useful in tackling these data integration
problems. Topics covered include conjunctive queries, Datalog, data
integration frameworks (GAV, LAV, GLAV), data exchange formalisms,
views and updates.
- Instructor:
Professor Jianwen Su,
Department of Computer Science
- Office:Harold Frank Hall, room 2161
- Telephone: (805) 893-3698
- Email:
su AT cs DOT ucsb DOT edu
- Office hours:
Tuesdays 3-4pm,
Thursdays 1pm-2pm,
or by appointment
- Units: 4
- Lectures:
Mondays and Wednesdays,
9:00am-10:50pm, Harold Frank Hall 1132
- Recommended textbook/references:
- Selected chapters from:
- Foundations of Databases
by Abiteboul, Hull, and Vianu, Addison-Wesley, 1995
- Database Systems: The Complete Book
by Garcia-Monila, Ullman, and Widom, Pearson Prentice Hall, 2009
- Foundations of Data Exchange
by Arenas, Barcelo, Libkin, and Murlak, Cambridge, 2014
- Selected papers
- Requirements:
- Approximately 3-4 homework assignments
- One exam
- A course project
- Active participation
- Course homepage:
http://www.cs.ucsb.edu/~su/cs290d/
- Tentative course outline:
- Conjunctive queries
- Deductive databases (Datalog and optimization)
- Data integration frameworks (GAV, LAV, GLAV)
- Data exchange
- Updates and views
- Homework/Project Assignments
- Lecture Notes
- January 4
- January 6
[Readings: Sections 4.1, 4.2]
- January 11
[Readings: same as last lecture]
- January 13
[Readings: Section 6.2]
- January 18
No lecture
- January 20
[Readings: same as last lecture]
- January 25
[Readings: Sections 12.1, 12.2]
- January 27
[Readings: Sections 12.3, 12.4]
- February 1
[Readings: Sections 13.1, 13.2]
- February 3
[Readings: Section 13.3]
Note: "sub" typos fixed
- February 8
[Readings: Section 13.4]
- February 10
[Readings: Sections 21.1, 21.2,
Chapter 1]
- February 17
[Readings: Sections 21.3-6,
Section 2.2]
- February 22
[Readings: Section 2.3]
- February 24
[Readings: Sections 2.4.1-5]
- February 29
[Readings: Sections 2.4.6-7, 3.3]
- March 2
[Readings: Chapter 1, Sections 2.1, 2.2]
- March 7
[Readings: Sections 2.3]