XIFENG YAN

home | publications | tutorials | software


[dblp][category] Research Papers
Book Chapters
Workshop Papers, Demos, and Technical Reports

Research Papers

  1. Guiding Large Language Models via Directional Stimulus Prompting
    by Z. Li, B. Peng, P. He, M. Galley, J. Gao, X. Yan, 2023 [arxiv]
  2. Explanations from Large Language Models Make Small Reasoners Better
    by S. Li, J. Chen, Y. Shen, Z. Chen, X. Zhang, Z. Li, H. Wang, J. Qian, B. Peng, Y. Mao, W. Chen, X. Yan, 2023 [arxiv]
  3. Visually-augmented language modeling
    by W. Wang, L. Dong, H. Cheng, H. Song, X. Liu, X. Yan, J. Gao, F. Wei
    ICLR'23 (Proceedings of Int. Conf. on Learning Representations) [pdf]
  4. Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling
    by X. Zhang, S. Li, Z. Chen, X. Yan, L. Petzold, 2022 [arxiv]
  5. Limitations of Language Models in Arithmetic and Symbolic Induction
    by J. Qian, H. Wang, Z. Li, S. Li, X. Yan, 2022 [arxiv]
  6. Language Model Detoxification in Dialogue with Contextualized Stance Control
    by J. Qian and X. Yan
    EMNLP'22 (Proceedings of Findings of EMNLP 2022) [pdf]
  7. Controllable Dialogue Simulation with In-context Learning
    Z. Li, W. Chen, S. Li, H. Wang, J. Qian and X. Yan
    EMNLP'22
    (Proceedings of Findings of EMNLP 2022) [pdf]
  8. Explanations from Large Language Models Make Small Reasoners Better
    by S. Li, J. Chen, Y. Shen, Z. Chen, X. Zhang, Z. Li, H. Wang, J. Qian, B. Peng, Y. Mao, W. Chen, X. Yan, 2022 [arxiv]
  9. PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks (VLDB 2022 Test of Time Award)
    by Y. Sun, J. Han, X. Yan, P. S. Yu, T Wu,
    VLDB'11 (Proc. 2011 Int. Conf. on Very Large Data Bases), Aug 2011 [pdf]
  10. Visualization Question Answering Using Introspective Program Synthesis (PLDI'22 Distinguished Paper Award)
    by Y. Chen, X. Yan, Y. Feng. 
    PLDI'22
     (the 43rd ACM SIGPLAN Conference on Programming Language Design and Implementation) [pdf]
  11. Inductive Relation Prediction by BERT
    by H. Zha, Z. Chen, X. Yan,  
    AAAI'22 (Thirty-Sixth AAAI Conference on Artificial Intelligence) [arxiv]
  12. Composite Re-Ranking for Efficient Document Search with BERT
    Y. Yang, Y. Qiao, J. Shao, X. Yan, T. Yang,
    WSDM'22 (ACM International Conference on Web Search and Data Mining) [arxiv]
  13. Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding
    by S. Li, S. Yavuz, W. Chen and X. Yan
    EMNLP'21 (Proceedings of Findings of EMNLP) 2021 [pdf]
  14. Comprehensively Computing Link-based Similarities by Building A Random Surfer Graph
    by M. Zhang, X. Yan, W. Wang
    CIKM'21 (The 2021 ACM Int. Conf. on Information and Knowledge Management), Nov 2021. [pdf]
  15. Attention-based Domain Adaptation for Time Series Forecasting
    by X. Jin, Y. Park, D. Maddix, Y. Wang, X. Yan, 2021 [arxiv]
  16. Semi-Supervised Hypothesis Transfer for Source-Free Domain Adaptation
    by N. Ma, J. Bu, L. Lu, J. Wen, Z. Zhang, S. Zhou, X. Yan, 2021 [arxiv]
  17. Lifelong Learning of Hate Speech Classification on Social Media
    by J. Qian, H. Wang, M. ElSherief and X. Yan
    NAACL-HLT'21
    (Proc. of the 2021 North American Chapter of ACL: Human Language Technologies, 2021) [pdf]
  18. Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases
    by Y. Gu, S. Kase, M. Vanni, B. Sadler, P. Liang, X. Yan, Y. Su
    WWW'21 (The World Wide Web Conf.) 2021. [arxiv] [Dataset: GrailQA]
  19. CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers
    by S. Li*, S. Yavuz*, K. Hashimoto, J. Li, T. Niu, N. Rajani, X. Yan, Y. Zhou and C. Xiong (*Equal Contribution)
    ICLR'21 (International Conference on Learning Representations), 2021. [pdf] Leaderboard No.1Jan 2021-present in Multiwoz
  20. Inter-Series Attention Model for COVID-19 Forecasting,
    by X. Jin, Y-X Wang, X. Yan,
    SDM'21 (SIAM Int. Conf. on Data Mining), 2021 [arxiv] [github]
  21. KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation,
    by W. Chen, Y. Su, X. Yan, W. Wang,
    EMNLP'20 (Proc. of the 2020 Conference on Empirical Methods in Natural Language Processing) [pdf] [data/code]
  22. Adaptive-Step Graph Meta-Learner for Few-Shot Graph Classification,
    by N. Ma, J. Bu, J. Yang, Z. Zhang, C. Yao, Z. Yu, S. Zhou, X. Yan,
    CIKM'20 (The 2020 ACM Int. Conf. on Information and Knowledge Management), Oct 2020. [pdf]
  23. Network Intervention for Mental Disorders with Minimum Small Dense Subgroups,
    by B.-Y. Hsu, C.-Y. Shen, and X. Yan, 
    TKDE'19 (
    IEEE Transactions on Knowledge and Data Engineering) [pdf]
  24. Performance Bounds of Decentralized Search in Expert Networks for Query Answering,
    by L. Ma, M. Srivatsa, D. Cansever, X. Yan, S. Kase, M. Vanni,
    TKDD'19 (ACM Transactions on Knowledge Discovery from Data), 2019 [pdf]
  25. HierCon: Hierarchical Organization of Technical Documents based on Concepts,
    by K. Li, S. Li, S. Yavuz, H. Zha, Y. Su, and X. Yan,
    ICDM'19 (Proc. 2019 IEEE Int. Conf. on Data Mining), Dec 2019. [pdf] (Best of ICDM 2019 selection)
  26. Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting,
    by S. Li, X. Jin, Y. Xuan, X. Zhou, W. Chen, Y.-X. Wang, X. Yan
    NeurIPS'19 (T(The Thirty-third Annual Conference on Neural Information Processing Systems) [pdf]
  27. Mining Algorithm Roadmap in Scientific Publications,
    by H. Zha, W. Chen, K. Li and X. Yan,
    KDD'19 (Proc. of the 25th Int. Conf. on Knowledge Discovery and Data Mining) [pdf]
  28. Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention,
    by W. Chen, J. Chen, P. Qin, X. Yan and W. Wang,
    ACL'19 (Proc. of the Annual Meeting of the Association for Computational Linguistics) [pdf]
  29. Global Textual Relation Embedding for Relational Understanding,
    by Z. Chen, H. Zha, H. Liu, W. Chen, X. Yan and Y. Su,
    ACL'19 (Proc. of the Annual Meeting of the Association for Computational Linguistics) (Short Paper) [pdf]
  30. How Large A Vocabulary Does Text Classification Need? A Variational Approach on Vocabulary Selection
    by W. Chen, Y. Su, Y. Shen, Z. Chen, X. Yan and W. Wang
    NAACL-HLT'19 (Proc. of the 17th North American Chapter of ACL: Human Language Technologies, 2019) [pdf]
  31. The Genome of the Jellyfish Aurelia and the Evolution of Animal Complexity,
    by D. Gold, T. Katsuki, Y. Li, X. Yan, M. Regulski, D. Ibberson, T. Holstein, R. Steele, D. Jacobs, and R. Greenspan,
    Nature Ecology and Evolution, 2018. [pdf]
  32. Concept Mining via Embedding,
    by K. Li, H. Zha, Y. Su, and X. Yan,
    ICDM'18 (Proc. 2018 IEEE Int. Conf. on Data Mining), Dec 2018. [pdf]
  33. What It Takes to Achieve 100% Condition Accuracy on WikiSQL,
    by S. Yavuz, I. Gur, Y. Su, X. Yan,
    EMNLP'18 (Proc. of the 2018 Conference on Empirical Methods in Natural Language Processing) [pdf]
  34. XL-NBT: A Cross-lingual Neural Belief Tracking Framework,
    by W. Chen, J. Chen, Y. Su, X. Wang, D. Yu, X. Yan and W. Wang, 
    EMNLP'18
    (Proc. of the 2018 Conf. on Empirical Methods in Natural Language Processing)  [pdf]
  35. DialSQL: Dialogue Based Structured Query Generation,
    by I. Gur, S. Yavuz, Y. Su, X. Yan,
    ACL'18 (Proc. of the Annual Meeting of the Association for Computational Linguistics, 2018) [pdf]
  36. Variational Knowledge Graph Reasoning, 
    by W. Chen, W. Xiong, X. Yan and W. Wang, 
    NAACL-HLT'18 (Proc. of the 16th North American Chapter of ACL: Human Language Technologies, 2018) [pdf]
  37. Global Relation Embedding for Relation Extraction
    by Yu Su*, Honglei Liu*, Semih Yavuz, Izzeddin Gur, Huan Sun, Xifeng Yan. [pdf] [code] (*: Equal Contribution)  https://arxiv.org/abs/1704.05958, April 2017
    NAACL-HLT'18 (Proc. of the 16th North American Chapter of ACL: Human Language Technologies, 2018)[pdf]
  38. Unsupervised Neural Categorization for Scientific Publications,
    by K. Li, H. Zha, Y. Su, X. Yan,
    SDM'18 (SIAM Int. Conf. on Data Mining), 2018 [pdf]
  39. Cross-domain Semantic Parsing via Paraphrasing,
    by Y. Su, X. Yan,
    EMNLP'17 (Proc. of the 2017 Conf. on Empirical Methods in Natural Language Processing), 2017 [pdf]
  40. Recovering Question Answering Errors via Query Revision,
    by S. Yavuz, I. Gur, Y. Su, X. Yan,
    EMNLP'17 (Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing), 2017 [pdf]
  41. Privacy-Preserving Community-Aware Trending Topic Detection in Online Social Media,
    by T. Georgiou, A. El Abbadi, and X. Yan,
    DBSec'17
    (the 31st Annual IFIP WG 11.3 Conference on Data and Applications Security and Privacy), 2017 [pdf]
  42. Extracting Topics with Focused Communities for Social Content Recommendation,
    by T. Georgiou, A. El Abbadi, and X. Yan,
    CSCW'17 (The 20th ACM Conf. on Computer-Supported Cooperative Work and Social Computing), 2017 [pdf]
  43. On Generating Characteristic-rich Question Sets for QA Evaluation,
    by Y. Su, H. Sun, B. Sadler, M. Srivatsa, I. Gur, Z. Yan, and X. Yan,
    EMNLP'16 (Proc. of the 2016 Conf. on Empirical Methods in Natural Language Processing) 2016 [pdf]
  44. Improving Semantic Parsing via Answer Type Inference,
    by S. Yavuz, I. Gur, Y. Su, M. Srivatsa, X. Yan,
    EMNLP'16 (Proc. of the 2016 Conf. on Empirical Methods in Natural Language Processing), 2016 [pdf]
  45. Analyzing information sharing strategies of users in online social networks,
    by D. Nguyen, S. Tan, R. Ramanathan, X. Yan ,
    ASONAM'16 (Proc. of 2016 International Conference on Social Networks Analysis and Mining), 2016 [pdf]
  46. Semantic SPARQL Similarity Search Over RDF Knowledge Graphs,
    by W. Zheng, L. Zou, W. Peng, X. Yan, S. Song, D. Zhao,
    VLDB'16 (Prof. of the 42nd International Conference on Very Large Data Bases), 2016. [pdf]
  47. Fast Top-K Search in Knowledge Graphs,
    by S. Yang, F. Han, Y. Wu, X. Yan,
    ICDE'16 (Proc. of Int. Conf. on Data Engineering), 2016. [pdf]
  48. Fast Motif Discovery in Short Sequences,
    by H. Liu, F. Han, H. Zhou, X. Yan, K. Kosik,

    ICDE'16 (Proc. of Int. Conf. on Data Engineering), 2016. [pdf]
  49. Distributed Representations of Expertise,
    by F. Han, S. Tan, H. Sun, M. Srivatsa, D. Cai, X. Yan,

    SDM'16 (SIAM Int. Conf. on Data Mining), 2016. [pdf]
  50. A Fast Kernel for Attributed Graphs, r>byby Y. Su, F. Han, R. E. Harang, X. Yan,
    SDM'16 (SIAM Int. Conf. on Data Mining), 2016. [pdf]
  51. Table Cell Search for Question Answering,
    by H. Sun, H. Ma, X. He, W.-T. Yih, Y. Su, and X. Yan,
    WWW'16 (Proc. of the 25th Int. World Wide Web Conference), 2016. [pdf]
  52. Entity Disambiguation with Linkless Knowledge Bases,
    by Y. Li, S. Tan, H. Sun, J. Han, D. Roth and X. Yan,
    WWW'16 (Proc. of the 25th Int. World Wide Web Conference), 2016. [pdf]
  53. Behavior Query Discovery in System-Generated Temporal Graphs,
    by B. Zong, X. Xiao, Z. Li, Z. Wu, Z. Qian, X. Yan, A. Singh, and G. Jiang,
    VLDB'16 (Proc. of the 42th Int. Conf. on Very Large Databases), 2016. [pdf]
  54. Mining Complaints for Traffic-Jam Estimation: A Social Sensor Application,
    by T. Georgiou, A. Abbadi, X. Yan, and J. George,
    ASONAM'15
    (Proc. 2015 International Conference on Social Networks Analysis and Mining), 2015 [pdf]
  55. Exploiting Relevance Feedback in Knowledge Graph Search,
    by Y. Su, S. Yang, H. Sun, M. Srivatsa, S. Kase, M. Vanni and X. Yan,
    KDD'15 (Proc. of Int. Conf. on Knowledge Discovery and Data Mining), 2015 [pdf]
  56. Query-Based Outlier Detection in Heterogeneous Information Networks,
    by H. Zhuang, J. Zhang, G. Brova, J. Tang, H. Cam, X. Yan, and J. Han,
    EDBT'15 (the 18th Int. Conf. on Extending Database Technology), 2015 [pdf]
  57. Expertise-Based Data Access in Content-Centric Mobile Opportunistic Networks,
    by J. Zhao, X. Zhang, G. Cao, M. Srivatsa, and X. Yan,
    MASS'14 (the 11th IEEE Int. Conf. on Mobile Ad hoc and Sensor Systems), 2014 [pdf]
  58. Mining Query-Based Subnetwork Outliers in Heterogeneous Information Networks,
    by H. Zhuang, J. Zhang, G. Brova, J. Tang, H. Cam, X. Yan, and J. Han,
    ICDM'14 (Proc. 2014 Int. Conf. on Data Mining, Dec 2014. [pdf]
  59. Analyzing Expert Behaviors in Collaborative Networks,
    by H. Sun, M. Srivatsa, S. Tan, Y. Li, L. Kaplan, S. Tao and X. Yan,
    KDD'14 (Proc. of the 20th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2014. [pdf]
  60. Towards Scalable Critical Alert Mining,
    by B. Zong, Y. Wu, J. Song, A. Singh, H. Cam, J. Han and X. Yan,
    KDD'14 (Proc. of the 20th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2014. [pdf]
  61. SLQ: A User-friendly Graph Querying System,
    by S. Yang, Y. Xie, Y. Wu, T. Wu, H. Sun, J. Wu, X. Yan,
    SIGMOD'14 (Proc. 2014 Int. Conf. on Management of Data) (demo paper), 2014. [pdf]
  62. Schemaless and Structureless Graph Querying,
    by S. Yang, Y. Wu, H. Sun, X. Yan,
    VLDB'14 (Proc. of the 40th Int. Conf. on Very Large Databases), 2014. [pdf]
  63. A Probabilistic Approach to Uncovering Attributed Graph Anomalies,
    by N. Li, H. Sun, K. Chipman, J. George, X. Yan,
    SDM'14
    (Proc. 2014 SIAM Int. Conf. on Data Mining), 2014. [pdf]
  64. Extracting Probable Command and Control Signatures for Detecting Botnets,
    by A. Zand, G. Vigna, X. Yan and C. Kruegel,
    SAC'14
    (The Security Track of the 2014 ACM Symp. on Applied Computing), 2014. [pdf]
  65. Cloud Service Placement via Subgraph Matching,
    by B. Zong, R. Raghavendra, M. Srivatsa, X. Yan, A. Singh, and K.-W. Lee,
    ICDE'14 (
    Proc. 2014 Int. Conf. on Data Engineering), 2014 [pdf]
  66. Top-K Interesting Subgraph Discovery in Information Networks,
    by M. Gupta, J. Gao, X. Yan, H. Cam, and J. Han,
    ICDE'14 (Proc.  2014 Int. Conf. on Data Engineering), 2014 [pdf]
  67. Summarizing Answer Graphs Induced by Keyword Queries,
    by Y. Wu, S. Yang, M. Srivatsa, A. Iyengar, X. Yan,
    VLDB'14 (
    Proc. of the 40th Int. Conf. on Very Large Databases), 2014.[pdf]
  68. Automated Trauma Incident Cubes Analysis,
    by A. Srivastava, L. Ferrigno, S. Kaminski, X. Yan and J. Su,
    ICHI'14
    (IEEE Int. Conf. on Healthcare Informatics), 2013. [pdf]
  69. Noise-Resistant Bicluster Recognition,
    by H. Sun, G. Miao, X. Yan,
    ICDM'13 (
    Proc. 2013 IEEE Int. Conf. on Data Mining), Dec 2013. [pdf]
  70. On Detecting Association-Based Clique Outliers in Heterogeneous Information Networks,
    by M. Gupta, J. Gao, X. Yan and J. Han,
    ASONAM'13 (Proc. of 2013 Int. Conf. on Social Networks Analysis and Mining), Aug 2013. [pdf]
  71. I act, therefore I judge: Network sentiment dynamics modeling based on user activity,
    by K. Macropol, P. Bogdanov, A. Singh, L. Petzold and X. Yan,
    ASONAM'13 (Proc. of 2013 Int. Conf. on Social Networks Analysis and Mining), Aug 2013. [pdf]
  72. Synthetic Review Spamming and Defense,
    by H. Sun, A. Morales, and X. Yan,
    KDD'13 (Proc. of the 19th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2013. [pdf]
  73. Mining Evidences for Named Entity Disambiguation,
    by Y. Li, C. Wang, F. Han, J. Han, D. Roth, and X. Yan,
    KDD'13 (Proc. of the 19th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2013. [pdf]
  74. Characterizing Tenant Behavior for Placement and Crisis Mitigation in Multitenant DBMSs,
    by A. Elmore, S. Das, A. Pucher, D. Agrawal, A. El Abbadi, and X. Yan,
    SIGMOD'13 (Proc. 2013 Int. Conf. on Management of Data), June 2013. [pdf]
  75. MATRI: a multi-aspect and transitive trust inference model,
    by Y. Yao, H. Tong, X. Yan, F. Xu, J. Lu,
    WWW'13 (Proc. of the 22nd Int. World Wide Web Conference), May 2013. [pdf]
  76. Memory Efficient Minimum Substring Partitioning,
    by Y. Li, P. Kamousi, F. Han, S. Yang, X. Yan, S. Suri,
    VLDB'13 (Proc. of the 39th Int. Conf. on Very Large Databases), Aug 2013. [pdf] [software release]
  77. NeMa: Fast Graph Search with Label Similarity,
    by A. Khan, Y. Wu, C. Aggarwal, X. Yan,
    VLDB'13 (Proc. of the 39th Int. Conf. on Very Large Databases ), Aug 2013. [pdf]
  78. gIceberg: Towards Iceberg Analysis in Large Graphs,
    by N. Li, Z. Guan, L. Ren, J. Wu, J. Han, X. Yan,
    ICDE'13 (
    Proc. 2013 Int. Conf. on Data Engineering), Apr 2013. [pdf] [software release]
  79. Ontology-based Subgraph Querying,
    by Y. Wu, S. Yang, X. Yan,
    ICDE'13 (
    Proc. 2013 Int. Conf. on Data Engineering), Apr 2013. [pdf] [poster](Best Poster Award)
  80. Inferring the Underlying Structure of Information Cascades,
    by B. Zong, Y. Wu, A. Singh, and X. Yan,
    ICDM'12 (Proc. 2012 Int. Conf. on Data Mining, Dec 2012. [pdf]
  81. Workload characterization and prediction in the cloud: A multiple time series approach
    by A. Khan, X. Yan, S. Tao, N. Anerousis
    NOMS'12 (Network Operations and Management Symposium), 2012 [pdf]
  82. A General Framework to Encode Heterogeneous Information Sources for Contextual Pattern Mining,
    by W. Dong, W. Fan, L. Shi, C. Zhou, and X. Yan,
    CIKM'12 (The 21st ACM Int. Conf. on Information and Knowledge Management), Oct 2012. [pdf]
  83. Density Index and Proximity Search in Large Graphs,
    by N. Li, X. Yan, Z. Wen, and A. Khan,
    CIKM'12 (The 21st ACM Int. Conf. on Information and Knowledge Management), Oct 2012. [pdf] [software release]
  84. Measuring Two-Event Structural Correlations on Graphs,
    by Z. Guan, X. Yan, L. M. Kaplan,
    VLDB'12 (Proc. of the 38th Int. Conf. on Very Large Databases), Aug 2012 [pdf]
  85. Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks,
    by Y. Sun, B. Norick, J. Han, X. Yan, P. S. Yu, X. Yu,
    KDD'12 (Proc. of the 18th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2012. [pdf] (Best Student Research Paper)
  86. Latent Association Analysis of Document Pairs,
    byby G. Miao, Z. Guan, L. Moser, X. Yan, S. Tao, N. Anerousis, and J. Sun,
    KDD'12 (Proc. of the 18th Int. Conf. on Knowledge Discovery and Data Mining), Aug 2012 [pdf]
  87. Towards Effective Partition Management for Large Graphs,
    by S. Yang, X. Yan, B. Zong, A. Khan
    SIGMOD'12 (Proc. 2012 Int. Conf. on Management of Data), Jun 2012 [pdf]
  88. [software release][slides]
  89. Understanding Task-driven Information Flow in Collaborative Networks,
    by G. Miao, S. Tao, W. Cheng, J. Moulic, L. Moser and X. Yan,
    WWW'12 (Proc. 2012 Int. World Wide Web Conference), April 2012 [pdf]
  90. Efficient multicasting for delay tolerant networks using graph indexing,
    by M. Mongiovi, A. Singh, X. Yan, B. Zong, K. Psounis,
    INFOCOM'12 (Proc. 2012 Int. Conf. on Computer Communications), March 2012 [pdf]
  91. Mining Top-K Large Structural Patterns in a Massive Network,
    by F. Zhu, Q. Qu, D. Lo, X. Yan, J. Han, and P. Yu,
    VLDB'11 (Proc. 2011 Int. Conf. on Very Large Data Bases), Aug 2011 [pdf]
  92. Neighborhood Based Fast Graph Search in Large Networks,
    by A. Khan, N. Li, Z. Guan, X. Yan, S. Chakraborty, and S. Tao,
    SIGMOD'11 (Proc. 2011 Int. Conf. on Management of Data), June 2011 [pdf]
  93. Assessing and Ranking Structural Correlations in Graphs,
    by Z. Guan, J. Wu, Q. Zhang, A. Singh, and X. Yan,
    SIGMOD'11 (Proc. 2011 Int. Conf. on Management of Data), June 2011 [pdf]
  94. On Flow Authority Discovery in Social Networks,
    by C. Aggarwal, A. Khan and X. Yan,
    SDM'11 (Proc. 2011 SIAM International Conference on Data Mining),  Apr. 2011 [pdf]
  95. Content-Aware Resolution Sequence Mining for Ticket Routing,
    by P. Sun, S. Tao, X. Yan, N. Anerousis, Y. Chen,
    BPM'10 (The 8th Int. Conf. on Business Process Management),  Sep. 2010 [pdf]
  96. Generative Models for Ticket Resolution in Expert Networks
    G. Miao, L. Moser, X. Yan, S. Tao, Y. Chen, and N. Anerousis
    SIGKDD'10 (Proc. of 2010 Int. Conf. on Knowledge Discovery and Data Mining), Jul. 2010 [pdf]
  97. Assessing Expertise Awareness in Resolution Networks
    Y. Chen, S. Tao, X. Yan, N. Anerousis, and Q. Shao
    ASONAM'10 (Proc. 2010 International Conference on Social Networks Analysis and Mining), Aug. 2010 [pdf]
  98. Synthesizing Near-Optimal Malware Specifications from Suspicious Behaviors,
    M. Fredrikson, M. Christodorescu, S. Jha, R. Sailer, and X. Yan,
    Oakland'10 (31st IEEE Symp. on Security & Privacy), May 2010 [pdf]
  99. Towards Proximity Pattern Mining in Large Graphs,
    A. Khan, X. Yan and K.-L. Wu,
    SIGMOD'10 (Proc. 2010 Int. Conf. on Management of Data), June 2010 [pdf]
  100. Mining Diversity on Networks,
    L. Liu, F. Zhu, C. Chen, X. Yan, J. Han, P. S. Yu, and S. Yang,
    DASFAA'10
    (Proc. 2010 Int. Conf. on Database Systems for Advanced Applications), 2010 [pdf]

  101. Cross-Selling Optimization for Customized Product Promotion, r>N. Li, Y. Yang, X. Yan,
    SDM'10 (P(Proc. 2010 SIAM International Conference on Data Mining), April 2010 [pdf]
  102. Top-K Aggregation Queries over Large Networks,
    X. Yan, B. He, F. Zhu, and J. Han,
    ICDE'10 (Proc. 2010 Int. Conf. on Data Engineering), Mar. 2010 [pdf]
  103. Mining Graph Patterns Efficiently via Randomized Summaries,
    C. Chen, C. Lin, M. Fredrikson, M. Christodorescu, X. Yan, and J. Han,
    VLDB'09 (Proc. 2009 Int. Conf. on Very Large Data Bases), Aug. 2009 [pdf]
  104. Identifying Bug Signatures Using Discriminative Graph Mining,
    by H. Cheng, D. Lo, Y. Zhou, X. Wang and X. Yan,
    ISSTA'09 (Proc. 2009 Int. Symp. On Software Testing and Analysis), Jul. 2009 [pdf]
  105. Near-Optimal Supervised Feature Selection among Frequent Subgraphs,
    by M. Thoma, H. Cheng, A. Gretton, J. Han, H.-P. Kriegel, A. Smola, L. Song, P. S. Yu, X. Yan, and K. Borgwardt,
    SDM'09 (Proc. 2009 SIAM Int. Conf. on Data Mining), Apr. 2009  [pdf]
  106. SmallBlue: Social Network Analysis for Expertise Search and Collective Intelligence,
    by C. Lin, N. Cao, S. Liu, S. Papadimitriou, J. Sun, X. Yan,
    ICDE'09 (Proc. of 2009 Int. Conf. on Data Engineering ), Mar. 2009 [pdf]
  107. Graph OLAP: Towards Online Analytical Processing on Graphs,
    by C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDM'08 (Proc. 2008 Int. Conf. on Data Mining), Dec. 2008 [pdf]
  108. On Effective Presentation of Graph Patterns: A Structural Representative Approach,
    by C. Chen, X. Lin, X. Yan, and J. Han,
    CIKM'08 (Proc. 2008 ACM Conf. on Information and Knowledge Management), Oct. 2008 [pdf]
  109. Efficient Ticket Routing by Resolution Sequence Mining,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining), Aug. 2008 [pdf]
  110. Direct Mining of Discriminative and Essential Graphical and Itemset Features via Model-based Search Tree,
    by W. Fan, K. Zhang, H. Cheng, J. Gao, X. Yan, J. Han, P. S. Yu, O. Verscheure,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining),  Aug. 2008 [pdf]
  111. Mining Significant Graph Patterns by Scalable Leap Search,
    by X. Yan, H. Cheng, J. Han, and P. S. Yu,
    SIGMOD'08 (Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data), Jun. 2008 [pdf][ppt][dataset]
  112. Direct Discriminative Pattern Mining for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and P. S. Yu,
    ICDE'08 (Proc. of 2008 Int. Conf. on Data Engineering), Apr. 2008. [pdf]
  113. gApprox: Mining Frequent Approximate Patterns from a Massive Network,
    by C. Chen, X. Yan, F. Zhu, and J. Han.
    ICDM'07a (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  114. Efficient Discovery of Frequent Approximate Sequential Patterns,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    ICDM'07b (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  115. Towards Graph Containment Search and Indexing,
    by C. Chen, X. Yan, P. S. Yu, J. Han, D.-Q. Zhang and X. Gu.
    VLDB'07a (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  116. EntityRank: Searching Entities Directly and Holistically,
    by T. Cheng, X. Yan and K. Chang.
    VLDB'07b (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  117. A Graph-Based Approach to Systematically Reconstruct Human Transcriptional Regulatory Modules,
    by X. Yan, M. Mehan, Y. Huang, M. S. Waterman, P. S. Yu, and X. Zhou.
    ISMB'07a (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  118. Systematic Discovery of Functional Modules and Context-Specific Functional Annotation of Human Genome,
    by Y. Huang, H. Li, H. Hu, X. Yan, M. S. Waterman, H. Huang, and X. Zhou.
    ISMB'07b (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  119. gPrune: A Constraint Pushing Framework for Graph Pattern Mining,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    PAKDD'07 (Proc. of 2007 Pacific-Asia Conference on Knowledge Discovery and Data Mining), May 2007. Best Student Paper. [pdf]
  120. Mining Colossal Frequent Patterns by Core Pattern Fusion,
    by F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng.
    ICDE'07a (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. Best Student Paper [pdf]
  121. Discriminative Frequent Pattern Analysis for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and C. Hsu.
    ICDE'07b (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. [pdf]
  122. Extracting Redundancy-aware Top-k Patterns,
    by D. Xin, H. Cheng, X. Yan, J. Han, 
    SIGKDD'06 (Proc. of 2006 Int. Conf. on Knowledge Discovery and Data Mining). [pdf]
  123. Mining Control Flow Abnormality for Logic Error Isolation,

    by C. Liu, X. Yan, and J. Han,

    SDM'06 (Proc. of 2006 SIAM Int. Conf. on Data Mining), 2006. [pdf]

  124. Searching Substructures with Superimposed Distance, 
    by X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDE'06 (Proc. of 2006 Int. Conf. on Data Engineering), 2006. [pdf] [ppt_slides]
  125. Community Mining from Multi-Relational Networks, 
    by D. Cai, Z. Shao, X. He, X. Yan, J. Han,
    PKDD'05 (Proc. of 2005 European Conf. on Principles and Practice of Knowledge Discovery in Databases), 2005. [pdf]
  126. SOBER: Statistical Model-based Bug Localization, 
    by C. Liu, X. Yan, L. Fei, J. Han, and S. Midkiff,
    FSE'05 (Proc. of 2005 13th ACM SIGSOFT Symp. on the Foundations of Software Engineering), 2005.   [pdf] [website]
  127. Mining Compressed Frequent-Pattern Sets, 
    by D. Xin, J. Han, X. Yan and H. Cheng,
    VLDB'05 (Proc. of 2005 Int. Conf. on Very Large Data Bases), 2005. [pdf]
  128. Summarizing Itemset Patterns: A Profile-Based Approach, 
    by X. Yan, H. Cheng, J. Han, and D. Xin,
    SIGKDD'05a
    (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005, Best Student Paper RunnerUp. [pdf]
  129. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Jasmine Zhou, and J. Han,
    SIGKDD'05b (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005. [pdf]
  130. Mining Coherent Dense Subgraphs Across Massive Biological Networks for Functional Discovery, 
    by H. Hu, X. Yan, Y. Huang, J. Han, X. Jasmine Zhou,
    ISMB'05 (also Bioinformatics). [pdf] [website]
  131. Substructure Similarity Search in Graph Databases, 
    by X. Yan, P. S. Yu, and J. Han,

    SIGMOD'05 (Proc. of 2005 Int. Conf. on Management of Data), 2005. [pdf]
    Among top-ranked papers in SIGMOD'05, Invited to  ACM Transactions on Database Systems (TODS).
  132. Mining Behavior Graphs for `Backtrace' of Noncrashing Bugs, 
    by C. Liu, X. Yan, H. Yu, J. Han, and P. S. Yu,

    SDM'05a (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005. [pdf]
  133. SeqIndex: Indexing Sequences by Sequential Pattern Analysis, 
    by H. Cheng, X. Yan, and J. Han,

    SDM'05b (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005 (short paper). [pdf]
  134. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Zhou, J. Han,
    ICDE'05 (Proc. of 2005 Int. Conf. on Data Engineering) (short paper). [pdf]
  135. Graph Indexing: A Frequent Structure-based Approach, 
    by X. Yan, P. S. Yu, and J. Han,
    SIGMOD'04 (Proc. of 2004 Int. Conf. on Management of Data), 2004. [pdf][dataset]
    Among top-ranked papers in SIGMOD'04, Invited to  ACM Transactions on Database Systems (TODS).
  136. IncSpan: Incremental Mining of Sequential Patterns in Large Database, 
    by H. Cheng, X. Yan, and J. Han,

    SIGKDD'04 (Proc. 2004 of the Int. Conf. on Knowledge Discovery and Data Mining), 2004. [pdf]
  137. CloseGraph: Mining Closed Frequent Graph Patterns, 
    by X. Yan and J. Han,

    SIGKDD'03 (Proc. of 2003 Int. Conf. Knowledge Discovery and Data Mining), 2003. [pdf]

    Google Scholar ranks CloseGraph as #1 for "graph pattern mining", with 140 citations. (as of Nov 25, 2007)
  138. CloSpan: Mining Closed Sequential Patterns in Large Datasets,
    by X. Yan, J. Han, and R. Afshar,

    SDM'03 (Proc. of 2003 SIAM Int. Conf. Data Mining), 2003.  [pdf]
  139. TSP: Mining Top-K Closed Sequential Patterns,
    by P. Tzvetkov, X. Yan, and J. Han,
    ICDM'03 (Proc. of 2003 Int. Conf. on Data Mining), 2003. [pdf]
  140. gSpan: Graph-Based Substructure Pattern Mining,
    by X. Yan and J. Han,
    ICDM'02 (Proc. of 2002 Int. Conf. on Data Mining) (short paper), 2002.  [pdf]
    Expanded Version, UIUC Technical Report, UIUCDCS-R-2002-2296. [pdf]
    Google Scholar ranks gSpan as #3 for "graph pattern mining", with 276 citations. (as of Nov 25, 2007)
  141. Accelerating Volume Rendering with L-Buffer,
    by X. Yan, W. Cai and J. Shi,
    CAD&Graphics'97
    , Wuhan, China, 1997.

Journal Papers (Merged with Research Papers after 2018)

  1. Observability of Lattice Graphs,
    by F. Han, S. Suri, and X. Yan,
    Algorithmica,2015 [pdf]
  2. Querying Knowledge Graphs by Example Entity Tuples,
    By N. Jayaram, A. Khan, C. Li, X. Yan, R. Elmasri,
    TKDE'15, Transactions on Knowledge and Data Engineering, 2015 [pdf]
  3. Fine-Grained Knowledge Sharing in Collaborative Environments,
    By Z. Guan, S. Yang, H. Sun, M. Srivatsa, X. Yan,
    TKDE'15, Transactions on Knowledge and Data Engineering, 2015 [pdf]
  4. Big Data in Online Social Networks: User Interaction Analysis to Model User Behavior in Social Networks,
    By D. Agrawal, C. Budak, A. El Abbadi, T. Georgiou, X. Yan,
    LNCS'14, Databases in Networked Information Systems - Lecture Notes in Computer Science Volume 8381, 2014, pp 1-16. [pdf]
  5. Multi-Aspect + Transitivity + Bias: An Integral Trust Inference Model,
    By Y. Yao, H. Tong, X. Yan, F. Xu, J. Lu,
    TKDE'14, Transactions on Knowledge and Data Engineering, 2014 [pdf]
  6. Interpreting the Public Sentiment Variations on Twitter,
    by S. Tan, Y. Li, H. Sun, Z. Guan, X. Yan, J. Bu, C. Chen, and X. He
    TKDE'13, Transactions on Knowledge and Data Engineering, 2013 [pdf]
  7. PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks,
    by Y. Sun, B. Norick, J. Han, X. Yan, P. S. Yu, X. Yu,
    TKDD'13, ACM Transactions on Knowledge Discovery from Data, 2013 [pdf(not published yet)]
  8. Static and Dynamic Structural Correlations in Graphs,
    by J. Wu, Z. Guan, Z. Yun, A. Singh, X. Yan,
    TKDE'13, Transactions on Knowledge and Data Engineering, 2013 [pdf(not published yet)]
  9. Co-occurrence Based Diffusion for Expert Search On the Web,
    by Z. Guan, G. Miao, R. McLoughlin, X. Yan, D. Cai
    TKDE'12, Transactions on Knowledge and Data Engineering, 2012 [pdf]
  10. Graph OLAP: A Multi-Dimensional Framework for Graph Data Analysis,
    by C. Chen, X. Yan, F. Zhu, J. Han, P. S Yu,
    KAIS'09, Knowledge and Information Systems: An International Journal, 2009 [pdf]
  11. Report on the First International Workshop on Mining Graphs and Complex Structures,
    by L. Holder and X. Yan,
    SIGMOD Record 37(1): 53-55, 2008 [pdf]
  12. Frequent Pattern Mining: Current Status and Future Directions,
    by J. Han, H. Cheng, D. Xin and X. Yan,
    DMKD'07 (Data Mining and Knowledge Discovery, 10th Anniversary Issue), 2007 [pdf]
  13. On compressing frequent patterns,
    by D. Xin, J. Han, X. Yan, H. Chen, 
    DKE'07 (Data Knowledge Engineering), 60(1): 5-29, 2007 [pdf]
  14. Integrative Array Analyzer: A Software Package for Analysis of Cross-platform and Cross-species Microarray Data,
    by F. Pan, K Kamath, K. Zhang, S. Pulapura, A. Achar, J. Nunez-Iglesias, Y. Huang, X. Yan, J. Han, H. Hu, M. Xu, J. Hu, and X. Jasmine Zhou,
    Bioinformatics'06
    , Vol.22 no.13: 1665-1667, 2006. [pdf]
  15. Feature-based Substructure Similarity Search, 
    by X. Yan, F. Zhu, P. S. Yu, and J. Han,
    ACM-TODS'06 (ACM Transactions on Database Systems), Dec. 2006. [pdf]
  16. Statistical Debugging: A Hypothesis Testing-based Approach,
    by  C. Liu, L. Fei, X. Yan, J. Han and S. Midkiff,
    IEEE-TSE'06 (IEEE Transaction on Software Engineering), 32(10):831-848, 2006. [pdf]
  17. Graph Indexing Based on Discriminative Frequent Structure Analysis, 
    by X. Yan, P. S. Yu, and J. Han,
    ACM-TODS'05 (ACM Transactions on Database Systems), Dec. 2005. [pdf]
  18. TSP: Mining Top-K Closed Sequential Patterns,  
    by P. Tzvetkov, X. Yan, and J. Han,
    KAIS'05 (Knowledge and Information Systems: An International Journal), 7:438-457, 2005. [pdf]
  19. From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach, 
    by J. Han, J. Pei, and X. Yan,
    JCST'04 (Journal of Computer Science and Technology), 19(3): 257-279, 2004. [pdf]font>

 

Book Chapters

  1. Discovery of Frequent Substructures
    by X. Yan and J. Han,
    Mining Graph Data, D. Cook and L. Holder, John Wiley & Sons Inc, 2007.
  2. Discovering evolutionary classifier over high speed non-static stream,  
    by J. Yang, X. Yan, J. Han, and W. Wang,
    AdAdvanced Methods for Knowledge Discovery from Complex Data, S. Bandyopadhyay, U. Maulik, L. Holder, D. Cook (Eds.), Springer, 2005.
  3. Mining Frequent Patterns in Data Streams at Multiple Time Granularities,
    by C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu,
    Next Generation Data Mining, H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.),  AAAI/MIT, 2004.
  4. Sequential Pattern Mining by Pattern-Growth: Principles and Extensions,r /> by J. Han, J. Pei, and X. Yan,
    ReRecent Advances in Data Mining and Granular Computing (Mathematical Aspects of Knowledge Discovery), W. Chu and T. Lin (eds.), Springer Verlag, 2004.

Workshop Papers, Demos, and Technical Reports

  1. EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    Proc. of 2008 Int. Conf. on Very Large Data Bases (VLDB'08),  span> Auckland, New Zealand, &n 2008
  2. Combining near-optimal feature selection with gSpan,
    by K. Borgwardt1, X. Yan, M. Thoma, H. Cheng, A. Gretton, L. Song, A. Smola, J. Han, P. Yu, H.-P. Kriegel,
    6th Int. Workshop on Mining and Learning with Graph (MLG'08), Helsinki, Finland, 2008span>
  3. Entity Search: Search Directly and Holistically,
    by T. Cheng, X. Yan, K. Chang,
    PrProc. of 2007 Int. Conf. on Management of Data (SIGMOD'07), Beijing, China, 2007
  4. BioArrayMine: A Software Package for Integrative Analysis of Cross-platform and Cross-species Microarray Data,  
    by F. Pan, K. Kamath, H. Hu, Y. Huang, K. Zhang, M. Xu, X. Yan, J. Han, and X. Jasmine Zhou,
    Proc. of 2005 Int. Conf. on Intelligent Systems for Molecular Biology (ISMB'05), Detroit, MI, 2005 (system demo).
  5. GraphMiner: A Structural Pattern Mining System for Large Disk-based Graph Databases and Its Applications,  
    by W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han,
    Proc. of 2005 Int. Conf. on Management of Data (SIGMOD'05), 879-881, Baltimore, MD, 2005 (system demo).
  6. Mining Hidden Community in Heterogeneous Social Networks,  
    by D. Cai, Z. Shao, X. He, X. Yan, and J. Han,
    Technical Report UIUCDCS-R-2005-2538, Department of Computer Science, University of Illinois at Urbana-Champaign, 2005.font>
  7. Using Data Mining for Discovering Patterns in Autonomic Storage Systems,  
    byby Z. Li, S. Srinivasan, Z. Chen, Y. Zhou, P. Tzvetkov, X. Yan, and J. Han,
    ACM Workshop on Algorithms and Architectures for Self-Managing Systems, Proc. of 2003 Federated Computing Research Conference (FCRC'03), 2003. [pdf]
  8. A Framework for Continuous Quantile Computation over Sensor Networks,  
    by X. Yan, J. Yang, J. Han, and W. Wang,
    Technical Report UIUCDCS-R-2003-2382, Department of Computer Science, University of Illinois at Urbana-Champaign, 2003.
  9. gSpan: Graph-Based Substructure Pattern Mining,  
    by X. Yan and J. Han, r /> Technical Report UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois at Urbana-Champaign, 2002.