Journal Papers
- Graph OLAP: A Multi-Dimensional Framework for
Graph Data Analysis,
By C. Chen, X. Yan, F. Zhu, J. Han, P. S Yu,
KAIS'09, Knowledge
and Information Systems: An International Journal, 2009 [pdf] -
Report on the First International Workshop on
Mining Graphs and Complex Structures,
By L. Holder and X. Yan,
SIGMOD Record 37(1): 53-55, 2008 [pdf] -
Frequent Pattern Mining: Current Status and
Future Directions,
by J. Han, H. Cheng, D. Xin and X. Yan,
DMKD'07 (Data Mining and Knowledge
Discovery, 10th Anniversary Issue), 2007 [pdf] -
On compressing frequent patterns,
by D. Xin, J.
Han, X. Yan, H. Chen,
DKE'07
(Data Knowledge Engineering), 60(1): 5-29,
2007 [pdf] - Integrative Array Analyzer:
A Software Package for Analysis of Cross-platform and Cross-species Microarray
Data,
by F. Pan, K Kamath, K.
Zhang, S. Pulapura, A. Achar, J. Nunez-Iglesias, Y. Huang, X. Yan,
J. Han, H. Hu,
M. Xu, J. Hu, and X. Jasmine Zhou,
Bioinformatics'06,
Vol.22 no.13: 1665-1667, 2006. [pdf]-
Feature-based Substructure Similarity
Search,
by X. Yan, F. Zhu, P. S. Yu, and J. Han,
ACM-TODS'06
(ACM Transactions on Database Systems), Dec. 2006. [pdf]
- Statistical Debugging: A Hypothesis Testing-based Approach,
by C. Liu, L. Fei,
X. Yan, J. Han and S. Midkiff,
IEEE-TSE'06
(IEEE Transaction on Software Engineering),
32(10):831-848, 2006. [pdf] -
Graph Indexing Based on
Discriminative Frequent Structure Analysis,
by X. Yan,
P. S. Yu, and J. Han,
ACM-TODS'05
(ACM Transactions on
Database Systems), Dec. 2005. [pdf]
- TSP: Mining Top-K Closed
Sequential Patterns,
by P. Tzvetkov, X. Yan, and
J. Han,
KAIS'05
(Knowledge
and Information Systems: An International Journal),
7:438-457,
2005. [pdf]
- From Sequential Pattern
Mining to Structured Pattern Mining: A Pattern-Growth Approach,
by J. Han, J. Pei, and X. Yan,
JCST'04
(Journal of Computer Science and Technology), 19(3): 257-279, 2004. [pdf]
Conference Papers
Mining Graph Patterns Efficiently via Randomized Summaries,
C. Chen,
C. Lin, M. Fredrikson, M. Christodorescu, X. Yan, and J. Han,
VLDB'09 (Proc. 2009
Int. Conf. on Very Large Data Bases), Aug. 2009 [pdf]
Identifying Bug Signatures Using Discriminative Graph Mining,
by H.
Cheng, D. Lo, Y. Zhou, X. Wang and X. Yan,
ISSTA'09 (Proc. 2009 Int. Symp. On Software
Testing and Analysis), Jul. 2009 [pdf]
Near-Optimal Supervised Feature Selection among Frequent Subgraphs,
by M. Thoma, H. Cheng, A. Gretton, J. Han, H.-P. Kriegel, A.
Smola, L. Song, P. S. Yu, X. Yan, and K. Borgwardt,
SDM'09
(Proc. 2009 SIAM Int.
Conf. on Data Mining), Apr. 2009 [pdf]
SmallBlue: Social Network Analysis for Expertise Search and
Collective Intelligence,
by C. Lin, N. Cao, S. Liu, S.
Papadimitriou, J. Sun, X. Yan,
ICDE'09
(Proc. of 2009 Int. Conf. on Data Engineering ), Mar. 2009 [pdf]
Graph OLAP: Towards Online Analytical Processing on Graphs,
by C.
Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu,
ICDM'08
(Proc. 2008 Int. Conf. on Data
Mining), Dec. 2008 [pdf]
On Effective Presentation of Graph Patterns: A Structural Representative
Approach,
by C. Chen, X. Lin, X. Yan, and J. Han,
CIKM'08
(Proc. 2008
ACM Conf. on Information and Knowledge Management), Oct. 2008 [pdf]
Efficient Ticket Routing by Resolution Sequence Mining,
by Q. Shao,
Y. Chen, S. Tao, X. Yan, N. Anerousis,
SIGKDD'08
(Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining),
Aug. 2008 [pdf]
Direct Mining of Discriminative and Essential Graphical and Itemset
Features via Model-based Search Tree,
by W. Fan, K. Zhang, H. Cheng, J. Gao,
X. Yan, J. Han, P. S. Yu, O. Verscheure,
SIGKDD'08
(Proc. of 2008 Int. Conf. on
Knowledge Discovery and Data Mining), Aug. 2008 [pdf]
Mining Significant Graph Patterns by Scalable Leap Search,
by X. Yan, H. Cheng, J. Han, and P. S. Yu,
SIGMOD'08 (Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data), Jun. 2008 [pdf][ppt][dataset]Direct Discriminative Pattern Mining for Effective
Classification,
by H. Cheng, X. Yan, J. Han, and P. S. Yu,
ICDE'08 (Proc. of 2008
Int. Conf. on Data Engineering), Apr. 2008. [pdf]
gApprox: Mining Frequent Approximate Patterns from a Massive Network,
by C. Chen, X. Yan, F. Zhu, and J. Han.
ICDM'07a
(Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
Efficient Discovery of Frequent Approximate Sequential Patterns,
by F. Zhu, X. Yan, J. Han, and P. S. Yu.
ICDM'07b
(Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
Towards Graph Containment Search and Indexing,
by C. Chen, X. Yan, P. S. Yu, J. Han, D.-Q. Zhang and X. Gu.
VLDB'07a
(Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
Entity Search: Search Directly and Holistically,
by T. Cheng, X. Yan and K. Chang.
VLDB'07b
(Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
A Graph-Based Approach to Systematically Reconstruct Human Transcriptional Regulatory Modules,
by X. Yan, M. Mehan, Y. Huang, M. S. Waterman, P. S. Yu, and X. Zhou.
ISMB'07a
(the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
Systematic Discovery of Functional Modules and Context-Specific Functional Annotation of Human Genome,
by Y. Huang, H. Li, H. Hu, X. Yan, M. S. Waterman, H. Huang, and X. Zhou.
ISMB'07b
(the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
gPrune: A Constraint Pushing Framework for Graph Pattern Mining,
by F. Zhu, X. Yan, J. Han, and P. S. Yu.
PAKDD'07 (Proc. of 2007
Pacific-Asia Conference on Knowledge Discovery and Data Mining), May 2007.
Best Student Paper.
[pdf]
Mining Colossal Frequent Patterns by Core Pattern Fusion,
by F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng.
ICDE'07a
(Proc. of
2006 Int. Conf. on Data Engineering), Apr. 2007.
Best Student Paper [pdf]
Discriminative Frequent Pattern Analysis for Effective
Classification,
by H. Cheng, X. Yan, J. Han, and C. Hsu.
ICDE'07b (Proc. of 2006 Int. Conf.
on Data Engineering), Apr. 2007. [pdf]
Extracting Redundancy-aware
Top-k Patterns,
by D. Xin, H. Cheng, X. Yan, J. Han,
SIGKDD'06
(Proc. of 2006 Int. Conf. on
Knowledge Discovery and Data Mining). [pdf]
Mining Control Flow
Abnormality for Logic Error Isolation,
by C. Liu, X. Yan, and J.
Han,
SDM'06
(Proc. of 2006 SIAM Int. Conf.
on Data Mining), 2006. [pdf]
Searching Substructures with
Superimposed Distance,
by X. Yan, F.
Zhu, J. Han, and P. S. Yu,
ICDE'06 (Proc. of 2006 Int. Conf. on Data
Engineering), 2006. [pdf]
[ppt_slides]
Community Mining from Multi-Relational
Networks,
by D. Cai, Z. Shao, X. He, X. Yan, J. Han,
PKDD'05
(Proc. of 2005 European Conf. on Principles and Practice of Knowledge
Discovery in Databases), 2005.
[pdf]
SOBER: Statistical Model-based
Bug Localization,
by C. Liu, X. Yan, L. Fei,
J. Han, and S. Midkiff,
FSE'05
(Proc. of 2005
13th ACM SIGSOFT Symp. on the Foundations of Software
Engineering), 2005.
[pdf] [website]
Mining Compressed Frequent-Pattern
Sets,
by D. Xin, J. Han, X. Yan and H. Cheng,
VLDB'05
(Proc. of 2005 Int. Conf. on Very Large Data Bases),
2005. [pdf]
Summarizing Itemset Patterns: A
Profile-Based Approach,
by X. Yan, H.
Cheng, J. Han, and D. Xin,
SIGKDD'05a
(Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining),
2005, Best Student Paper RunnerUp. [pdf]
Mining Closed Relational Graphs
with Connectivity Constraints,
by X. Yan, X. Jasmine
Zhou, and J. Han,
SIGKDD'05b
(Proc. of 2005
Int. Conf. on Knowledge Discovery and Data Mining),
2005. [pdf]
Mining Coherent Dense Subgraphs
Across Massive Biological Networks for Functional Discovery,
by
H. Hu, X. Yan, Y. Huang, J. Han, X. Jasmine Zhou,
ISMB'05
(also Bioinformatics). [pdf]
[website]
Substructure Similarity Search
in Graph Databases,
by X. Yan, P. S. Yu, and J. Han,
SIGMOD'05
(Proc. of 2005
Int. Conf. on Management of Data),
2005. [pdf]
Among top-ranked papers in
SIGMOD'05, Invited to ACM Transactions on Database Systems (TODS).
Mining Behavior Graphs for `Backtrace'
of Noncrashing Bugs,
by C. Liu, X. Yan, H. Yu, J. Han,
and P. S. Yu,
SDM'05a
(Proc. of 2005
SIAM Int. Conf. on Data Mining), 2005.
[pdf]
SeqIndex: Indexing Sequences by
Sequential Pattern Analysis,
by H. Cheng, X.
Yan, and J. Han,
SDM'05b
(Proc. of 2005
SIAM
Int. Conf. on Data Mining), 2005
(short paper). [pdf]
Mining Closed Relational Graphs
with Connectivity Constraints,
by X. Yan, X. Zhou, J.
Han,
ICDE'05
(Proc. of
2005 Int. Conf. on Data Engineering)
(short paper).
[pdf]
Graph Indexing: A Frequent
Structure-based Approach,
by X. Yan, P. S.
Yu, and J. Han,
SIGMOD'04
(Proc. of 2004
Int. Conf. on Management of Data),
2004. [pdf][dataset]
Among top-ranked papers in
SIGMOD'04, Invited to ACM Transactions on Database Systems (TODS).
IncSpan: Incremental Mining of
Sequential Patterns in Large Database,
by H.
Cheng, X. Yan, and J. Han,
SIGKDD'04
(Proc. 2004 of the Int. Conf. on Knowledge Discovery and Data Mining),
2004. [pdf]
CloseGraph: Mining Closed
Frequent Graph Patterns,
by X. Yan and J. Han,
SIGKDD'03
(Proc. of 2003
Int. Conf. Knowledge Discovery and Data Mining),
2003. [pdf]
Google Scholar ranks CloseGraph as #1 for "graph pattern mining",
with 140 citations. (as of Nov 25, 2007)
CloSpan: Mining Closed
Sequential Patterns in Large Datasets,
by X.
Yan, J. Han, and R. Afshar,
SDM'03
(Proc. of 2003
SIAM Int. Conf. Data Mining), 2003.
[pdf]
TSP: Mining Top-K Closed Sequential Patterns,
by P. Tzvetkov,
X. Yan,
and J. Han,
ICDM'03
(Proc.
of 2003 Int. Conf. on Data Mining),
2003. [pdf]
gSpan: Graph-Based Substructure
Pattern Mining,
by X. Yan and J. Han,
ICDM'02
(Proc. of 2002
Int. Conf. on Data Mining) (short
paper), 2002. [pdf]
Expanded Version, UIUC Technical Report, UIUCDCS-R-2002-2296. [pdf]
Google Scholar ranks gSpan as #3 for "graph pattern mining", with
276 citations. (as of Nov 25, 2007)
Accelerating Volume Rendering with
L-Buffer,
by X. Yan, W. Cai and J. Shi,
CAD&Graphics'97, Wuhan, China, 1997.
Book Chapters
- Discovery of Frequent
Substructures
by
X. Yan and J. Han,
Mining Graph Data, D. Cook and L. Holder,
John Wiley & Sons Inc, 2007.
- Discovering evolutionary
classifier over high speed non-static stream,
by J. Yang, X. Yan,
J. Han, and W. Wang,
Advanced Methods for Knowledge
Discovery from Complex Data, S. Bandyopadhyay, U. Maulik, L. Holder, D. Cook
(Eds.), Springer, 2005.
- Mining Frequent Patterns in
Data Streams at Multiple Time Granularities,
by C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu,
Next Generation Data Mining, H.
Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.), AAAI/MIT, 2004.
- Sequential Pattern Mining
by Pattern-Growth: Principles and Extensions,
by J. Han, J. Pei, and X. Yan,
Recent Advances in Data
Mining and Granular Computing (Mathematical Aspects of Knowledge Discovery),
W. Chu and T. Lin (eds.), Springer Verlag, 2004.
Workshop Papers, Demos, and
Technical Reports
- EasyTicket: A Ticket Routing
Recommendation Engine for Enterprise Problem Resolution,
by Q. Shao, Y.
Chen, S. Tao, X. Yan, N. Anerousis,
Proc. of 2008 Int. Conf. on Very Large
Data Bases (VLDB'08),
Auckland, New Zealand, 2008
- Combining near-optimal feature selection with gSpan,
by
K. Borgwardt1, X. Yan, M. Thoma, H. Cheng, A. Gretton, L. Song, A. Smola, J.
Han, P. Yu, H.-P. Kriegel,
6th Int. Workshop on Mining and Learning
with Graph (MLG'08), Helsinki, Finland, 2008
- Entity Search: Search
Directly and Holistically,
by T. Cheng, X. Yan, K. Chang,
Proc. of 2007 Int. Conf. on
Management of Data (SIGMOD'07), Beijing, China, 2007
- BioArrayMine: A Software
Package for Integrative Analysis of Cross-platform and Cross-species
Microarray Data,
by F. Pan, K. Kamath, H. Hu, Y.
Huang, K. Zhang, M. Xu, X. Yan, J. Han, and X. Jasmine Zhou,
Proc. of 2005 Int. Conf.
on Intelligent Systems for Molecular Biology (ISMB'05), Detroit, MI,
2005 (system demo).
- GraphMiner: A Structural
Pattern Mining System for Large Disk-based Graph Databases and Its
Applications,
by W. Wang, C. Wang, Y. Zhu, B.
Shi, J. Pei, X. Yan, and J. Han,
Proc. of 2005 Int. Conf. on
Management of Data (SIGMOD'05), 879-881, Baltimore, MD, 2005
(system demo).
- Mining Hidden Community in
Heterogeneous Social Networks,
by D. Cai, Z. Shao, X. He, X.
Yan, and J. Han,
Technical Report
UIUCDCS-R-2005-2538, Department of Computer Science, University of Illinois
at Urbana-Champaign, 2005.
- Using Data Mining for
Discovering Patterns in Autonomic Storage Systems,
by Z. Li, S. Srinivasan, Z. Chen,
Y. Zhou, P. Tzvetkov, X. Yan, and J. Han,
ACM Workshop on Algorithms and
Architectures for Self-Managing Systems, Proc. of 2003 Federated Computing
Research Conference (FCRC'03), 2003.
- A Framework for Continuous
Quantile Computation over Sensor Networks,
by X. Yan, J. Yang, J. Han,
and W. Wang,
Technical Report UIUCDCS-R-2003-2382, Department of Computer Science,
University of Illinois at Urbana-Champaign, 2003.
- gSpan: Graph-Based
Substructure Pattern Mining,
by X. Yan and J. Han,
Technical Report
UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois
at Urbana-Champaign, 2002.