595J Seminar on Advanced Information Systems (2 units )
Fall 2010. Monday 10-11AM
Code 73171. Room 932 101
Xifeng Yan and Tao Yang
This seminar will study
recent papers and advancement in cyber-enabled information systems.
The topic includes large-scale mining and vertical systems/tools,
cloud-computing platforms and advanced information systems, web
services and applications for search, social, and information
discovery.
Candidate papers to be studied are listed below and will be
updated. You may also provide us other interesting papers.
Web extraction and vertical systems
- Sources of evidence for vertical selection
by J. Arguello et. al, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information
retrieval table of contents Boston, MA, USA
Pages: 315-322
2009.
- "Open Information Extraction from the Web"
Oren Etzioni, Michele Banko, Stephen Soderland, and Daniel S. Weld
Communications of the ACM, 51(12): 68-74, 2008
-
"WebTables: Exploring the Power of Tables on the Web"
Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang
Proceedings of the 34th International Conference on Very Large Data Bases (VLDB 2008)
- MedSearch: A Specialized Search Engine for Medical
Information Retrieval
Gang Luo, Chunqiang Tang, Hao Yang, Xing Wei,
CIKM 2008.
-
Text and Structural Data Mining of Influenza Mentions in Web and Social Media
Courtney D. Corley et. al,
Int. J. Environ. Res. Public Health 2010, 7, 596-615;
- Automatic Extraction of Clickable Structured Web Contents for Name Entity Queries [PDF]
Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu.
WWW 2010.
-
Text Mining from the Web
for Medical Intelligence,
in Mining Massive Data Sets for Security
F. Fogelman-SouliƩt al. (Eds.)
IOS Press, 2008
Data Platforms and Online Serving
-
Caching search engine results over incremental indices,
R. Blanco et. al,
Proceeding of the 33rd international
ACM SIGIR conference on Research and development in information retrieval table of contents
Geneva, Switzerland.
2010.
-
MalStone: Towards A Benchmark for Analytics on Large Data Clouds
Collin Bennett, Robert L. Grossman, David Locke, Jonathan Seidman, Steve Vejcik.
KDD 2010.
-
ParaTimer: A Progress Indicator for MapReduce DAGs
Kristi Morton, University of Washington; Magdalena Balazinska, University of Washington; Dan Grossman, University of Washington
SIGMOD 2010.
-
Pregel: A System for Large-Scale Graph Processing
Greg Malewicz, et. al,
SIGMOD 2010.
-
Nectar: Automatic Management of Data and Computation in Data Centers
Pradeep Kumar Gunda, Lenin Ravindranath, Chandramohan A. Thekkath, Yuan Yu, and Li Zhuang, Microsoft Research Silicon Valley.
OSDI 2010.
-
Incremental Processing of Large Data Sets
Daniel Peng and Frank Dabek, Google, Inc.
OSDI 2010.