XIFENG YAN


Journal Papers

  1. Graph OLAP: A Multi-Dimensional Framework for Graph Data Analysis,
    by C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu,
    KAIS'09 (Knowledge and Information Systems: An International Journal), 2009. [pdf]
  2. Report on the First International Workshop on Mining Graphs and Complex Structures,
    by L. Holder and X. Yan,
    SIGMOD Record 37(1): 53-55, 2008 [pdf]
  3. Frequent Pattern Mining: Current Status and Future Directions,
    by J. Han, H. Cheng, D. Xin and X. Yan,
    DMKD'07 (Data Mining and Knowledge Discovery, 10th Anniversary Issue), 2007 [pdf]
  4. On compressing frequent patterns,
    by D. Xin, J. Han, X. Yan, and H. Cheng,
    DKE'07 (Data & Knowledge Engineering), 60(1): 5-29, 2007. [pdf]
  5. Integrative Array Analyzer: A Software Package for Analysis of Cross-platform and Cross-species Microarray Data,
    by F. Pan, K. Kamath, K. Zhang, S. Pulapura, A. Achar, J. Nunez-Iglesias, Y. Huang, X. Yan, J. Han, H. Hu, M. Xu, J. Hu, and X. Jasmine Zhou,
    Bioinformatics'06, 22(13): 1665-1667, 2006. [pdf]
  6. Feature-based Substructure Similarity Search, 
    by X. Yan, F. Zhu, P. S. Yu, and J. Han,
    ACM-TODS'06 (ACM Transactions on Database Systems), Dec. 2006. [pdf]
  7. Statistical Debugging: A Hypothesis Testing-based Approach,
    by C. Liu, L. Fei, X. Yan, J. Han and S. Midkiff,
    IEEE-TSE'06 (IEEE Transactions on Software Engineering), 32(10):831-848, 2006. [pdf]
  8. Graph Indexing Based on Discriminative Frequent Structure Analysis, 
    by X. Yan, P. S. Yu, and J. Han,
    ACM-TODS'05 (ACM Transactions on Database Systems), Dec. 2005. [pdf]
  9. TSP: Mining Top-K Closed Sequential Patterns,  
    by P. Tzvetkov, X. Yan, and J. Han,
    KAIS'05 (Knowledge and Information Systems: An International Journal), 7:438-457, 2005. [pdf]
  10. From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach, 
    by J. Han, J. Pei, and X. Yan,
    JCST'04 (Journal of Computer Science and Technology), 19(3): 257-279, 2004. [pdf]

Conference Papers

  1. Mining Graph Patterns Efficiently via Randomized Summaries,
    by C. Chen, C. Lin, M. Fredrikson, M. Christodorescu, X. Yan, and J. Han,
    VLDB'09 (Proc. 2009 Int. Conf. on Very Large Data Bases), Aug. 2009 [pdf]
  2. Identifying Bug Signatures Using Discriminative Graph Mining,
    by H. Cheng, D. Lo, Y. Zhou, X. Wang and X. Yan,
    ISSTA'09 (Proc. 2009 Int. Symp. on Software Testing and Analysis), Jul. 2009 [pdf]
  3. Near-Optimal Supervised Feature Selection among Frequent Subgraphs,
    by M. Thoma, H. Cheng, A. Gretton, J. Han, H.-P. Kriegel, A. Smola, L. Song, P. S. Yu, X. Yan, and K. Borgwardt,
    SDM'09 (Proc. 2009 SIAM Int. Conf. on Data Mining), Apr. 2009  [pdf]
  4. SmallBlue: Social Network Analysis for Expertise Search and Collective Intelligence,
    by C. Lin, N. Cao, S. Liu, S. Papadimitriou, J. Sun, X. Yan,
    ICDE'09 (Proc. of 2009 Int. Conf. on Data Engineering), Mar. 2009 [pdf]
  5. Graph OLAP: Towards Online Analytical Processing on Graphs,
    by C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDM'08 (Proc. 2008 Int. Conf. on Data Mining), Dec. 2008 [pdf]
  6. On Effective Presentation of Graph Patterns: A Structural Representative Approach,
    by C. Chen, X. Lin, X. Yan, and J. Han,
    CIKM'08 (Proc. 2008 ACM Conf. on Information and Knowledge Management), Oct. 2008 [pdf]
  7. Efficient Ticket Routing by Resolution Sequence Mining,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining), Aug. 2008 [pdf]
  8. Direct Mining of Discriminative and Essential Graphical and Itemset Features via Model-based Search Tree,
    by W. Fan, K. Zhang, H. Cheng, J. Gao, X. Yan, J. Han, P. S. Yu, O. Verscheure,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining),  Aug. 2008 [pdf]
  9. Mining Significant Graph Patterns by Scalable Leap Search,
    by X. Yan, H. Cheng, J. Han, and P. S. Yu,
    SIGMOD'08 (Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data), Jun. 2008 [pdf][ppt][dataset]
  10. Direct Discriminative Pattern Mining for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and P. S. Yu,
    ICDE'08 (Proc. of 2008 Int. Conf. on Data Engineering), Apr. 2008. [pdf]
  11. gApprox: Mining Frequent Approximate Patterns from a Massive Network,
    by C. Chen, X. Yan, F. Zhu, and J. Han.
    ICDM'07a (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  12. Efficient Discovery of Frequent Approximate Sequential Patterns,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    ICDM'07b (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  13. Towards Graph Containment Search and Indexing,
    by C. Chen, X. Yan, P. S. Yu, J. Han, D.-Q. Zhang and X. Gu.
    VLDB'07a (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  14. Entity Search: Search Directly and Holistically,
    by T. Cheng, X. Yan and K. Chang.
    VLDB'07b (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  15. A Graph-Based Approach to Systematically Reconstruct Human Transcriptional Regulatory Modules,
    by X. Yan, M. Mehan, Y. Huang, M. S. Waterman, P. S. Yu, and X. Zhou.
    ISMB'07a (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  16. Systematic Discovery of Functional Modules and Context-Specific Functional Annotation of Human Genome,
    by Y. Huang, H. Li, H. Hu, X. Yan, M. S. Waterman, H. Huang, and X. Zhou.
    ISMB'07b (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  17. gPrune: A Constraint Pushing Framework for Graph Pattern Mining,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    PAKDD'07 (Proc. of 2007 Pacific-Asia Conference on Knowledge Discovery and Data Mining), May 2007. Best Student Paper. [pdf]
  18. Mining Colossal Frequent Patterns by Core Pattern Fusion,
    by F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng.
    ICDE'07a (Proc. of 2007 Int. Conf. on Data Engineering), Apr. 2007. Best Student Paper. [pdf]
  19. Discriminative Frequent Pattern Analysis for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and C. Hsu.
    ICDE'07b (Proc. of 2007 Int. Conf. on Data Engineering), Apr. 2007. [pdf]
  20. Extracting Redundancy-aware Top-k Patterns,
    by D. Xin, H. Cheng, X. Yan, J. Han, 
    SIGKDD'06 (Proc. of 2006 Int. Conf. on Knowledge Discovery and Data Mining). [pdf]
  21. Mining Control Flow Abnormality for Logic Error Isolation,
    by C. Liu, X. Yan, and J. Han,
    SDM'06 (Proc. of 2006 SIAM Int. Conf. on Data Mining), 2006. [pdf]
  22. Searching Substructures with Superimposed Distance, 
    by X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDE'06 (Proc. of 2006 Int. Conf. on Data Engineering), 2006. [pdf] [ppt_slides]
  23. Community Mining from Multi-Relational Networks, 
    by D. Cai, Z. Shao, X. He, X. Yan, J. Han,
    PKDD'05 (Proc. of 2005 European Conf. on Principles and Practice of Knowledge Discovery in Databases), 2005. [pdf]
  24. SOBER: Statistical Model-based Bug Localization, 
    by C. Liu, X. Yan, L. Fei, J. Han, and S. Midkiff,
    FSE'05 (Proc. of 2005 13th ACM SIGSOFT Symp. on the Foundations of Software Engineering), 2005.   [pdf] [website]
  25. Mining Compressed Frequent-Pattern Sets, 
    by D. Xin, J. Han, X. Yan and H. Cheng,
    VLDB'05 (Proc. of 2005 Int. Conf. on Very Large Data Bases), 2005. [pdf]
  26. Summarizing Itemset Patterns: A Profile-Based Approach, 
    by X. Yan, H. Cheng, J. Han, and D. Xin,
    SIGKDD'05a (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005. Best Student Paper Runner-Up. [pdf]
  27. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Jasmine Zhou, and J. Han,
    SIGKDD'05b (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005. [pdf]
  28. Mining Coherent Dense Subgraphs Across Massive Biological Networks for Functional Discovery, 
    by H. Hu, X. Yan, Y. Huang, J. Han, X. Jasmine Zhou,
    ISMB'05 (also Bioinformatics). [pdf] [website]
  29. Substructure Similarity Search in Graph Databases, 
    by X. Yan, P. S. Yu, and J. Han,
    SIGMOD'05 (Proc. of 2005 Int. Conf. on Management of Data), 2005. [pdf]
    Among top-ranked papers in SIGMOD'05, invited to ACM Transactions on Database Systems (TODS).
  30. Mining Behavior Graphs for "Backtrace" of Noncrashing Bugs,
    by C. Liu, X. Yan, H. Yu, J. Han, and P. S. Yu,
    SDM'05a (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005. [pdf]
  31. SeqIndex: Indexing Sequences by Sequential Pattern Analysis, 
    by H. Cheng, X. Yan, and J. Han,
    SDM'05b (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005 (short paper). [pdf]
  32. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Zhou, J. Han,
    ICDE'05 (Proc. of 2005 Int. Conf. on Data Engineering) (short paper). [pdf]
  33. Graph Indexing: A Frequent Structure-based Approach, 
    by X. Yan, P. S. Yu, and J. Han,
    SIGMOD'04 (Proc. of 2004 Int. Conf. on Management of Data), 2004. [pdf][dataset]
    Among top-ranked papers in SIGMOD'04, invited to ACM Transactions on Database Systems (TODS).
  34. IncSpan: Incremental Mining of Sequential Patterns in Large Database, 
    by H. Cheng, X. Yan, and J. Han,
    SIGKDD'04 (Proc. of 2004 Int. Conf. on Knowledge Discovery and Data Mining), 2004. [pdf]
  35. CloseGraph: Mining Closed Frequent Graph Patterns, 
    by X. Yan and J. Han,
    SIGKDD'03 (Proc. of 2003 Int. Conf. on Knowledge Discovery and Data Mining), 2003. [pdf]
    Google Scholar ranks CloseGraph as #1 for "graph pattern mining", with 140 citations. (as of Nov 25, 2007)
  36. CloSpan: Mining Closed Sequential Patterns in Large Datasets,
    by X. Yan, J. Han, and R. Afshar,
    SDM'03 (Proc. of 2003 SIAM Int. Conf. on Data Mining), 2003. [pdf]
  37. TSP: Mining Top-K Closed Sequential Patterns,
    by P. Tzvetkov, X. Yan, and J. Han,
    ICDM'03 (Proc. of 2003 Int. Conf. on Data Mining), 2003. [pdf]
  38. gSpan: Graph-Based Substructure Pattern Mining,
    by X. Yan and J. Han,
    ICDM'02 (Proc. of 2002 Int. Conf. on Data Mining) (short paper), 2002.  [pdf]
    Expanded Version, UIUC Technical Report, UIUCDCS-R-2002-2296. [pdf]
    Google Scholar ranks gSpan as #3 for "graph pattern mining", with 276 citations. (as of Nov 25, 2007)
  39. Accelerating Volume Rendering with L-Buffer,
    by X. Yan, W. Cai and J. Shi,
    CAD&Graphics'97, Wuhan, China, 1997.

Book Chapters

  1. Discovery of Frequent Substructures,
    by X. Yan and J. Han,
    Mining Graph Data, D. Cook and L. Holder (eds.), John Wiley & Sons Inc, 2007.
  2. Discovering evolutionary classifier over high speed non-static stream,  
    by J. Yang, X. Yan, J. Han, and W. Wang,
    Advanced Methods for Knowledge Discovery from Complex Data, S. Bandyopadhyay, U. Maulik, L. Holder, D. Cook (Eds.), Springer, 2005.
  3. Mining Frequent Patterns in Data Streams at Multiple Time Granularities,
    by C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu,
    Next Generation Data Mining, H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.),  AAAI/MIT, 2004.
  4. Sequential Pattern Mining by Pattern-Growth: Principles and Extensions,
    by J. Han, J. Pei, and X. Yan,
    Recent Advances in Data Mining and Granular Computing (Mathematical Aspects of Knowledge Discovery), W. Chu and T. Lin (eds.), Springer Verlag, 2004.

Workshop Papers, Demos, and Technical Reports

  1. EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    Proc. of 2008 Int. Conf. on Very Large Data Bases (VLDB'08), Auckland, New Zealand, 2008
  2. Combining near-optimal feature selection with gSpan,
    by K. Borgwardt, X. Yan, M. Thoma, H. Cheng, A. Gretton, L. Song, A. Smola, J. Han, P. Yu, H.-P. Kriegel,
    6th Int. Workshop on Mining and Learning with Graphs (MLG'08), Helsinki, Finland, 2008
  3. Entity Search: Search Directly and Holistically,
    by T. Cheng, X. Yan, K. Chang,
    Proc. of 2007 Int. Conf. on Management of Data (SIGMOD'07), Beijing, China, 2007
  4. BioArrayMine: A Software Package for Integrative Analysis of Cross-platform and Cross-species Microarray Data,  
    by F. Pan, K. Kamath, H. Hu, Y. Huang, K. Zhang, M. Xu, X. Yan, J. Han, and X. Jasmine Zhou,
    Proc. of 2005 Int. Conf. on Intelligent Systems for Molecular Biology (ISMB'05), Detroit, MI, 2005 (system demo).
  5. GraphMiner: A Structural Pattern Mining System for Large Disk-based Graph Databases and Its Applications,  
    by W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han,
    Proc. of 2005 Int. Conf. on Management of Data (SIGMOD'05), 879-881, Baltimore, MD, 2005 (system demo).
  6. Mining Hidden Community in Heterogeneous Social Networks,  
    by D. Cai, Z. Shao, X. He, X. Yan, and J. Han,
    Technical Report UIUCDCS-R-2005-2538, Department of Computer Science, University of Illinois at Urbana-Champaign, 2005.
  7. Using Data Mining for Discovering Patterns in Autonomic Storage Systems,  
    by Z. Li, S. Srinivasan, Z. Chen, Y. Zhou, P. Tzvetkov, X. Yan, and J. Han,
    ACM Workshop on Algorithms and Architectures for Self-Managing Systems, Proc. of 2003 Federated Computing Research Conference (FCRC'03), 2003.
  8. A Framework for Continuous Quantile Computation over Sensor Networks,  
    by X. Yan, J. Yang, J. Han, and W. Wang,
    Technical Report UIUCDCS-R-2003-2382, Department of Computer Science, University of Illinois at Urbana-Champaign, 2003.
  9. gSpan: Graph-Based Substructure Pattern Mining,  
    by X. Yan and J. Han,
    Technical Report UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois at Urbana-Champaign, 2002.