595J Seminar on Advanced Information Systems (2 units )
Fall 2010. Monday 10-11AM
Code 73171. Room 932 101
Xifeng Yan and Tao Yang
This seminar will study
recent papers and advancement in cyber-enabled information systems.
The topic includes large-scale mining and vertical systems/tools,
cloud-computing platforms and advanced information systems, web
services and applications for search, social, and information
Candidate papers to be studied are listed below and will be
updated. You may also provide us other interesting papers.
Web extraction and vertical systems
- Sources of evidence for vertical selection
by J. Arguello et. al, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information
retrieval table of contents Boston, MA, USA
- "Open Information Extraction from the Web"
Oren Etzioni, Michele Banko, Stephen Soderland, and Daniel S. Weld
Communications of the ACM, 51(12): 68-74, 2008
"WebTables: Exploring the Power of Tables on the Web"
Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang
Proceedings of the 34th International Conference on Very Large Data Bases (VLDB 2008)
- MedSearch: A Specialized Search Engine for Medical
Gang Luo, Chunqiang Tang, Hao Yang, Xing Wei,
Text and Structural Data Mining of Influenza Mentions in Web and Social Media
Courtney D. Corley et. al,
Int. J. Environ. Res. Public Health 2010, 7, 596-615;
- Automatic Extraction of Clickable Structured Web Contents for Name Entity Queries [PDF]
Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu.
Text Mining from the Web
for Medical Intelligence,
in Mining Massive Data Sets for Security
F. Fogelman-Souliét al. (Eds.)
IOS Press, 2008
Data Platforms and Online Serving
Caching search engine results over incremental indices,
R. Blanco et. al,
Proceeding of the 33rd international
ACM SIGIR conference on Research and development in information retrieval table of contents
MalStone: Towards A Benchmark for Analytics on Large Data Clouds
Collin Bennett, Robert L. Grossman, David Locke, Jonathan Seidman, Steve Vejcik.
ParaTimer: A Progress Indicator for MapReduce DAGs
Kristi Morton, University of Washington; Magdalena Balazinska, University of Washington; Dan Grossman, University of Washington
Pregel: A System for Large-Scale Graph Processing
Greg Malewicz, et. al,
Nectar: Automatic Management of Data and Computation in Data Centers
Pradeep Kumar Gunda, Lenin Ravindranath, Chandramohan A. Thekkath, Yuan Yu, and Li Zhuang, Microsoft Research Silicon Valley.
Incremental Processing of Large Data Sets
Daniel Peng and Frank Dabek, Google, Inc.