Selected Publications by Category and Year

(List of my publications at DBLP Bibliography Server and Google Scholar)

Copyright Notice. The electronic materials in this web pages are presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

 


Textbook


  1. J. Han, M. Kamber, and J. Pei. Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011, ISBN: 978-0-123-81479-1.

Monographs


  1. M. Hua and J. Pei, Ranking Queries on Uncertain Data, Springer, 2011, ISBN: 978-1-441-99379-3.
  2. G. Dong and J. Pei, Sequence Data Mining, Springer, 2007, ISBN: 978-0-387-69936-3.

Publications in Refereed Journals


  1. X. Xu, C. Gao, J. Pei, K. Wang, and A. Al-Barakati. "Continuous Similarity Search for Evolving Queries". To appear in Knowledge and Information Systems: An International Journal, Springer-Verlag.

  2. A. Campbell, X. Mao, J. Pei, and A. Al-Barakati. "Multidimensional Business Benchmarking Analysis on Data Warehouses". To appear in Intelligent Data Analysis: An International Journal, IOS Press.

  3. N. X. Vinh, J. Chan, S. Romano, J. Bailey, C. Leckie, K. Ramamohanarao, and J. Pei. "Discovering Outlying Aspects in Large Datasets". To appear in Data Mining and Knowledge Discovery, Springer-Verlag.

  4. S. Liu, J. Yin, X. Wang, W. Cui, K. Cao, and J. Pei. "Online Visual Analytics of Text Streams". To appear in IEEE Transactions on Visualization and Computer Graphics, IEEE Computer Society.

  5. L. Duan, G. Tang, J. Pei, J. Bailey, G. Dong, V. Nguyen, A. Campbell, and C. Tang. "Efficient Discovery of Contrast Subspaces for Object Explanation and Characterization". Knowledge and Information Systems: An International Journal, Volume 47, Issue 1, pages 99-129, April 2016, Springer-Verlag.

  6. L. Duan, G. Tang, J. Pei, J. Bailey, A. Campbell, and C. Tang. "Mining Outlying Aspects on Numeric Data". Data Mining and Knowledge Discovery, Volume 29, Issue 5, pages 1116-1151, September 2015, Springer-Verlag.

  7. G. Tang, J. Pei, J. Bailey, and G. Dong. "Mining Multidimensional Contextual Outliers from Categorical Relational Data". Intelligent Data Analysis: An International Journal, Volume 19, No. 5, pages 1171-1192, September 2015, IOS Press.

  8. X. Zhang, W. Dou, J. Pei, S. Nepal, C. Yang, C. Liu, and J. Chen. "Proximity-aware Local-recoding Anonymization with MapReduce for Scalable Big Data Privacy Preservation in Cloud". To appear in IEEE Transactions on Computers, IEEE Computer Society.

  9. K. Yu, W. Ding, D.A. Simovici, H. Wang, J. Pei, and X. Wu. "Classification with Streaming Features: An Emerging Pattern Mining Approach". ACM Transactions on Knowledge Discovery from Data, Volume 9, Issue 4, Article No. 30, June 2015, ACM Press.

  10. C. Yang, X. Zhang, C. Zhong, C. Liu, J. Pei, K. Ramamohanarao, and J. Chen. "A spatiotemporal compression based approach for efficient big data processing on Cloud". Journal of Computer and System Sciences, Volume 80, Issue 8, pages 1563-1583, December 2014, Elsevier.

  11. D. Huang, K. Xu, and J. Pei. "Malicious URL Detection by Dynamically Mining Patterns without Pre-defined Elements". World Wide Web Journal, Volume 17, Issue 6, pages 1375-1394, November 2014, Springer-Verlag.

  12. G. Tang, J. Pei, and W-S. Luk. "Email Mining: Tasks, Common Techniques, and Tools". Knowledge and Information Systems: An International Journal, Volume 41, Issue 1, pages 1-31, October 2014, Springer-Verlag.

  13. Y. Yang, J. X. Yu, H. Gao, J. Pei, and J. Li. "Mining Most Frequently Changing Component in Evolving Graphs". World Wide Web Journal, Volume 17, Issue 3, pages 351-376, May 2014, Springer-Verlag.

  14. Y. Zhang, W. Zhang, J. Pei, X. Lin, Q. Lin, and A. Li. "Consensus-based Ranking of Multivalued Objects: A Generalized Borda Count Approach". IEEE Transactions on Knowledge and Data Engineering, Volume 26, Issue 1, pages 83-96, January 2014, IEEE Computer Society.

  15. Y-C. Lo, J-Y. Li, M-Y. Yeh, S-D. Lin, and J. Pei. "What Distinguish One from Its Peers in Social Networks?". Data Mining and Knowledge Discovery: An International Journal, Volume 27, Issue 3, November 2013, Springer-Verlag.

  16. Z. Liao, D. Jiang, J. Pei, Y. Huang, E. Chen, H. Cao, and H. Li. "A vlHMM Approach to Context-Aware Search". ACM Transactions on the Web, Volume 7, Issue 4, October 2013, ACM Press.

  17. D. Jiang, J. Pei, and H. Li. "Mining Search and Browse Logs for Web Search: A Survey". ACM Transactions on Intelligent Systems and Technology, Volume 4, Issue 4, September 2013, ACM Press.

  18. B. Jiang, J. Pei, Y. Tao, and X. Lin. "Clustering Uncertain Data Based on Probability Distribution Similarity". IEEE Transactions on Knowledge and Data Engineering, Volume 25, No. 4, pages 751-763, April 2013, IEEE Computer Society.

  19. Y. Cui, J. Pei, G. Tang, W-S. Luk, D. Jiang, and M. Hua. "Finding Email Correspondents in Online Social Networks". World Wide Web Journal, Volume 6, Issue 2, pages 195-218, March 2013, Springer-Verlag.

  20. J. Huang, B. Jiang, J. Pei, J. Chen, and Y. Tang. "Skyline Distance: A Measure of Multidimensional Competence". Knowledge and Information Systems: An International Journal, Volume 34, Issue 2, pages 373-396, February 2013, Springer-Verlag.

  21. J. Chen, J. Huang, B. Jiang, J. Pei, and J. Yin. "Recommendations for Two-Way Selections Using Skyline View Queries". Knowledge and Information Systems: An International Journal, Volume 34, Issue 2, pages 397-424, February 2013, Springer-Verlag.

  22. Y. Liu, Y. Zhao, L. Chen, J. Pei, and J. Han. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays".  IEEE Transactions on Parallel and Distributed Systems, Volume 23, Issue 11, pages 2138-2149, November 2012, IEEE Computer Society.

  23. L. Li, S. Petschulat, G. Tang, J. Pei, W-S. Luk. "Efficient and Effective Aggregate Keyword Search on Relational Databases". International Journal of Data Warehousing and Mining (IJDWM), Volume 7, Issue 4, pages 41-81, October 2012, Idea Group, Inc.

  24. Q. Jiang, A. Campbell, G. Tang, and J. Pei. "Multi-level Relationship Outlier Detection". the International Journal of Business Intelligence and Data Mining (IJBIDM), Volume 7, Number 4, pages 253-273, October 2012, InderScience Publishers.

  25. M. Hua and J. Pei. "Clustering in Applications with Multiple Data Sources -- A Mutual Subspace Clustering Approach".  Neurocomputing, Volume 92, pages 133-144, September 2012, Elsevier.

  26. Z. Xing, J. Pei, and P.S. Yu. "Early Classification on Time Series". Knowledge and Information Systems: An International Journal, Volume 31, Issue 1, pages 105-127, April 2012, Springer-Verlag.

  27. B. Zhou and J. Pei. "Aggregate Keyword Search on Large Relational Databases". Knowledge and Information Systems: An International Journal, Volume 30, Number 2, pages 283-318, February 2012, Springer-Verlag.

  28. B. Jiang, J. Pei, and X. Lin. "Probabilistic Skylines on Uncertain Data: Model and Bounding-Pruning-Refining Methods". Journal of Intelligent Information Systems, Volume 38, Number 1, pages 1-39, February 2012, Springer-Verlag.

  29. X. Sun, H. Wang, J. Li, and J. Pei. "Publishing Anonymous Survey Rating Data". Data Mining and Knowledge Discovery, Volume 23, pages 379-406, November 2011, Springer-Verlag.

  30. Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, and H. Li. "Mining Concept Sequences from Large-scale Search Logs for Context-aware Query Suggestion".  ACM Transactions on Intelligent Systems and Technology, Volume 3, Issue 1, pages 17:1-40, October 2011, ACM Press.

  31. R.C.W. Wong, A.W.C. Fu, K.Wang, P.S. Yu, and J. Pei. "Can the Utility of Anonymized Data Be Used for Privacy Breaches?". ACM Transactions on Knowledge Discovery in Data, Volume 5, Issue 3, pages 16:1-24, August 2011, ACM Press.

  32. B. Zhou and J. Pei. "k-Anonymity and l-Diversity Approaches for Privacy Preservation in Social Networks against Neighborhood Attacks". Knowledge and Information Systems: An International Journal, Volume 28, Number 1, pages 47-77, July 2011, Springer-Verlag.

  33. Y. Zhang, W. Zhang, X. Lin, B. Jiang, and J. Pei. "Ranking Uncertain Sky: the Probabilistic Top-k Skyline Operator".  The Information System Journal, Volume 36, Issue 5, pages 898-915, July 2011, Wiley.

  34. M. Hua, J. Pei, and X. Lin. "Ranking Queries on Uncertain Data". The VLDB Journal, Volume 20, Number 1, pages 129-153, February 2011, Springer Berlin / Heidelberg.

  35. Z. Xing, J. Pei, and E. Keogh. "A Brief Survey on Sequence Classification".  ACM SIGKDD Explorations, Volume 12, Issue 1, pages 40-48, June 2010, ACM Press.

  36. Z. Lin, B. Jiang, J. Pei, and D. Jiang. "Mining Discriminative Items in Multiple Data Streams".  World Wide Web Journal: Internet and Web Information Systems, Volume 13, Issue 4, pages 497-522, December 2010, Springer-Verlag.

  37. Z. Xing and J. Pei. "Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics". International Journal of Data Warehousing and Mining (IJDWM),  Volume 6, Issue 3, pages 11-27, July-September 2010, Idea Group, Inc.

  38. E. Loekito, J. Bailey, and J. Pei. "Binary Decision Diagram Based Approach for Mining Frequent Subsequences". Knowledge and Information Systems: An International Journal, Volume 24, Number 2, pages 235-268, August, 2010, Springer-Verlag.

  39. S. Yuen, Y. Tao, X. Xiao, J. Pei, D. Zhang. "Superseding Nearest Neighbor Search on Uncertain Spatial Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 7, pages 1041-1055, July, 2010, IEEE Computer Society.

  40. X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". Computer Communication, Volume 33, Issue 15, pages 1404-1413, July 15, 2010, Elsevier.

  41. W. Zhang, X. Lin, Y. Zhang, J. Pei, and W. Wang. "Threshold-based Probabilistic Top-k Dominating Queries". The VLDB Journal, Volume 19, Number 2, pages 283-305, April 2010, Springer Berlin / Heidelberg.

  42. M. A. Cheema, X. Lin, W. Wang, W. Zhang, and J. Pei. "Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data". IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 4, pages 550-564, April 2010, IEEE Computer Society.

  43. B. Aljaber, N. Stokes, J. Bailey, and J. Pei. "Document Clustering of Scientific Texts Using Citations Contexts". Information Retrieval, Volume 13, Number 2, pages 101-131, April 2010, Springer-Verlag.

  44. M. Hua, M. K. Lau, J. Pei, and K. Wu. "Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 12, pages 1679-1691, December 2009, IEEE Computer Society.

  45. X. Zeng, J. Pei, K. Wang, and J. Li. "PADS: A Simple Yet Effective Pattern-Aware Dynamic Search Method for Fast Maximal Frequent Pattern mining". Knowledge and Information Systems: An International Journal, Volume 20, Number 3, pages 375-391, September 2009, Springer-Verlag.

  46. M. Hua and J. Pei. "Continuously Monitoring Top-K Uncertain Data Streams: A Probabilistic Threshold Method".  Distributed and Parallel Databases: An International Journal, Volume 26, Number 1, (special issue on ranking in databases), pages 29-65, August 2009, Springer-Verlag.

  47. B. Zhou and J. Pei. "Link Spam Target Detection Using Page Farms". ACM Transactions on Knowledge Discovery in Data, Volume 3, Number 3, pages 13:1-38, July 2009, ACM Press.

  48. R. She, J. Chu, K. Wang, J. Pei, and J. Chen. "GenBlastA: Enabling BLAST to Identify Homologous Gene Sequences". Genome Research, Volume 19, Number 1, pages 143-149, January 2009, Cold Spring Harbor Laboratory Press.

  49. M. P. Ng, I. A. Vergara, C. Frech, Q. Chen, X. Zeng, J. Pei, and N. Chen. "OrthoClusterDB: a Web Server for Synteny Blocks". BMC Bioinformatics, Volume 10, Article 192, 2009.

  50. M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Top-k Typicality Queries and Efficient Query Answering Methods on Large Databases". The VLDB Journal, Volume 18, Number 3, pages 809-835, June 2009, Springer Berlin / Heidelberg.

  51. R. C.-W. Wong, A. W.-C. Fu, K. Wang, and J. Pei. "Anonymization based Attack in Privacy Preserving Data Publishing". ACM Transactions on Database Systems, Volume 34, Issue 2, pages 8:1-46, June 2009, ACM Press.

  52. D. Jiang and J. Pei. "Mining Frequent Cross-Graph Quasi-Cliques". ACM Transactions on Knowledge Discovery in Data, Volume 2, Number 4, pages 16:1-42, January 2009, ACM Press.

  53. R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Online Skyline Analysis with Dynamic Preferences on Nominal Attributes". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 1, pages 35-49, January 2009, IEEE Computer Society.

  54. B. Zhou, J. Pei, and W.-S. Luk. "A Brief Survey on Anonymization Techniques for Privacy Preserving Publishing of Social Network Data". ACM SIGKDD Explorations, Volume 10, Issue 2, pages 12-22, December 2008, ACM Press.

  55. J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. "Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies". IEEE Transactions on Knowledge and Data Engineering, Volume 20, Number 9, pages 1181-1194, September 2008, IEEE Computer Society.

  56. H. Wang and J. Pei. "Clustering by Pattern Similarity". Journal of Computer Science and Technology, Volume 23, Number 4, pages 481-496, July 2008, Springer.

  57. D. Jiang, J. Pei, M. Ramanathan, C. Lin, C. Tang, and A. Zhang. "Mining Gene-Sample-Time Microarray Data: A Coherent Gene Cluster Discovery Approach". Knowledge and Information Systems: An International Journal, Volume 13, Number 3, pages 305-335, November 2007, Springer-Verlag.

  58. Y. Tao, X. Xiao, and J. Pei. "Efficient Skyline and Top-k Retrieval in Subspaces". IEEE Transactions on Knowledge and Data Engineering, Volume 19, Number 8, pages 1072-1088, August 2007, IEEE Computer Society.

  59. M. Cho, J. Pei, and K. Wang. "Answering Ad Hoc Aggregate Queries from Data Streams Using Prefix Aggregate Trees". Knowledge and Information Systems: An International Journal, Volume 12, Number 3, pages 301-329, August 2007, Springer-Verlag.

  60. C. Liu, K. Wu, and J. Pei. "An Energy Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation". IEEE Transactions on Parallel and Distributed Systems, Volume 18, Number 7, July 2007, pages 1010-1023, IEEE Computer Society.

  61. J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. "H-Mine: Fast and space-preserving frequent pattern mining in large databases". IIE Transactions, Volume 39, Issue 6, pages 593-605, June 2007, Taylor & Francis.

  62. J. Pei, J. Han, and W. Wang. "Constraint-Based Sequential Pattern Mining: The Pattern-Growth Methods". Journal of Intelligent Information Systems, Volume 28, Number 2, pages 133-160, April 2007, Springer-Verlag.

  63. J. Pei, Y. Yuan, X. Lin, W. Jin, M. Ester, Q. Liu, W. Wang, Y. Tao, J. X. Yu, and Q. Zhang. "Towards Multidimensional Subspace Skyline Analysis". ACM Transactions on Database Systems, Volume 31, Number 4, pages 1335-1381, December 2006, ACM Press. (Executable code)

  64. Y. Chen, G. Dong, J. Han, J. Pei, B. W. Wah, and J. Wang. "Regression Cubes with Lossless Compression and Aggregation". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 12, pages 1585-1599, December 2006, IEEE Computer Society.

  65. J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W. Fu. "Utility-Based Anonymization for Privacy Preservation with Less Information Loss". ACM SIGKDD Explorations, Volume 8, Issue 2, pages 21-30, December 2006.

  66. J. Pei, H. Wang, J. Liu, K. Wang, J. Wang, and P. S. Yu. "Discovering Frequent Closed Partial Orders from Strings". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 11, pages 1467-1481, November 2006, IEEE Computer Society. (Software)

  67. I. Pekerskaya, J. Pei, and K. Wang. "Mining Changing Regions from Access-Constrained Snapshots: A Cluster-Embedded Decision Tree Approach". Journal of Intelligent Information Systems (Special Issue on Mining Spatio-Temporal Data), Volume 27, Number 3, pages 215-242, November 2006, Springer-Verlag.

  68. Y. Huang, J. Pei, and H. Xiong. "Mining Co-Location Patterns with Rare Events from Spatial Data Sets''. GeoInformatica, Volume 10, Number 3, pages 239-260, September 2006, Springer Netherlands.

  69. J. Wang, J. Han, and J. Pei. "Closed Constrained-Gradient Mining in Retail Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 6, pages 764-769, June 2006, IEEE Computer Society.

  70. J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J. Wang, and Y. D. Cai. "Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams". Distributed and Parallel Databases, Volume 18, Number 2, pages 173-197, September 2005, Springer Science + Business Media.

  71. D. Jiang, J. Pei, and A. Zhang. "An Interactive Approach to Mining Gene Expression Data". IEEE Transactions on Knowledge and Data Engineering, Volume 17, Number 10, pages 1363-1378, October 2005, IEEE Computer Society.

  72. M. Cho, J. Pei, H. Wang, and W. Wang. "Preference-based Frequent Pattern Mining". International Journal of Data Warehousing and Mining, Volume 1, Number 4, pages 56-77, October-December 2005, Idea Group, Inc.

  73. J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.C. Hsu. "Mining Sequential Patterns by Pattern-growth: The  PrefixSpan Approach". IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 11, pages 1424-1440, November 2004, IEEE Computer Society.

  74. J. Pei, G. Dong, W. Zou, and J. Han. "Mining Condensed Frequent Pattern Bases". Knowledge and Information Systems: An International Journal, Volume 6, Number 5, pages 570-594, September 2004, Springer-Verlag.

  75. G. Dong, J. Han, J. Lam, J. Pei, K. Wang, and W. Zou. "Mining Constrained Gradients in Large Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 8, pages 922-938, August 2004, IEEE Computer Society.

  76. J. Pei, J. Han, and L.V.S. Lakshmanan. "Pushing Convertible Constraints in Frequent Itemset Mining". Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 3, pages 227-252, May 2004, Kluwer Academic Publishers. (Erratum)

  77. J. Han, J. Pei, and X. Yan. "From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach". (Invited paper) Journal of Computer Science and Technology, Vol. 19, No. 3, pages 257-279. May 2004. Allerton Press, Inc.

  78. J. Han, J. Pei, Y. Yin, and R. Mao. "Mining Frequent Patterns without Candidate Generation: A Frequent-pattern Tree Approach". Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 1, pages 53-87, January 2004,  Kluwer Academic Publishers.

  79. D. Jiang, J. Pei, and A. Zhang. "Towards Interactive Exploration of Gene Expression Patterns". ACM SIGKDD Explorations (Special Issue on Microarray Data Analysis), Volume 5, Issue 2, pages 79-90 2003.

  80. Z. Chen, C. Li, J. Pei, Y. Tao, H. Wang, W. Wang, J. Yang, J. Yang, and D. Zhang. "Recent Progress on Selected Topics in Database Research: A Report from Nine Young Chinese Researchers Working in the United States". (Invited paper) Journal of Computer Science and Technology, Vol. 18, No. 5, 538-552. September 2003. Allerton Press, Inc. (A printable version)

  81. J. Pei and J. Han,  "Constrained Frequent Pattern Mining: A Pattern-Growth View", ACM SIGKDD Explorations (Special Issue on Constraints in Data Mining), Volume 4, Issue 1, pages 31-39, June 2002.

  82. J. Han and J. Pei,  "Mining Frequent Patterns by Pattern-Growth: Methodology and Implications", ACM SIGKDD Explorations (Special Issue on Scalable Data Mining Algorithms), Volume 2, Issue 2, pages 14-20, December 2000.


Publications in Refereed Conferences/Workshops


Year 2016

  1. J. Liu, L. Xiong, J. Pei, J. Luo, and H. Zhang. "Finding Pareto Optimal Groups: Group-based Skyline". In Proceedings of the 42nd International Conference on Very Large Data Bases (VLDB'16), New Delhi, India, September 5-9, 2015.

  2. L. Chu, Z. Wang, J. Pei, J. Wang, Z. Zhao, and E. Chen. "Finding Gangs in War from Signed Networks". In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'16), San Francisco, CA, USA, August 13-17, 2016.

  3. M. Ou, P. Cui, J. Pei, and W. Zhu. "Asymmetric Transitivity Preserving Graph Embedding". In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'16), San Francisco, CA, USA, August 13-17, 2016.

  4. H-J. Hung, H-H. Shuai, D-N. Yang, L-H. Huang, W-C. Lee, J. Pei, and M-S. Chen. "When Social Influence Meets Item Inference". In Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'16), San Francisco, CA, USA, August 13-17, 2016.

  5. Y. Yang, X. Mao, J. Pei, and X. He. "Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users?". In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data (SIGMOD'16), San Francisco, CA, USA, June 26-July 1, 2016.

  6. D-W. Choi, J. Pei, and X. Lin. "Finding the Minimum Spatial Keyword Cover". In Proceedings of the Thirty-second IEEE International Conference on Data Engineering (ICDE'16), Helsinki, Finland, May 16-20, 2016.


Year 2015

  1. J. Hu, Q. Qian, J. Pei, R. Jin, and S. Zhu. "Multi-clustering via Clustering Stability". In Proceedings of the Fifteenth IEEE International Conference on Data Mining series (ICDM'15),
    Atlantic City, NJ, USA, November 14-17, 2015.

  2. L. Duan, G. Tang, J. Pei, J. Bailey, A. Campbell, and C. Tang. "Mining Outlying Aspects on Numeric Data". In Proceedings of the 2015 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPAKDD'15), Porto, Portugal, September 7-11, 2015.

  3. L. Chu, S. Wang, S. Liu, Q. Huang, and J. Pei. "ALID: Scalable Dominant Cluster Detection". In Proceedings of the 41st International Conference on Very Large Data Bases (VLDB'15), Kohala Coast, HI, USA, August 31-September 4, 2015.

  4. Y. Zhang, J. Tang, Z. Yang, J. Pei, and P.S. Yu. "COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency". In Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’15), Sydney, NW, Australia, August 10-13, 2015.

  5. K. Yu, D. Wang, W. Ding, J. Pei, D.L. Small, S. Islam, and X. Wu. "Tornado Forcasting with Multiple Markov Boundaries".  In Proceedings of the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’15), Sydney, NW, Australia, August 10-13, 2015.

  6. N. X. Vinh, J. Chan, J. Bailey, C. Leckie, K. Ramamohanarao, and J. Pei. "Scalable Outlying-Inlying Aspects Discovery via Feature Ranking". In Proceedings of the 19th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'15), Ho Chi Minh City, Viet Nam, May 19-22, 2015.

  7. Y.-F. Lin, H.-H. Chen, V. S. Tseng, and J. Pei. "Reliable Early Classification on Multivariate Time Series with Numerical and Categorical Attributes".  In Proceedings of the 19th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'15), Ho Chi Minh City, Viet Nam, May 19-22, 2015.

  8. J. Wang, S. Song, X. Lin, X. Zhu, and J. Pei. "Cleaning Structured Event Log: A Graph Repair Approach". In Proceedings of the 31st IEEE International Conference on Data Engineering (ICDE'15), Seoul, Korea, April 13-17, 2015.

  9. Z. Yu, X. Yu, Y. Liu, W. Li, and J. Pei. "Mining Frequent Co-occurrence Patterns across Multiple Data Streams". In Proceedings of the 18th International Conference on Extending Database Technology (EDBT'15), Brussels, Belgium, March 23-27, 2015.

  10. L. Chang, X. Lin, L. Qin, J.X. Yu, and J. Pei. "Efficiently Computing Top-K Shortest Path Join". In Proceedings of the 18th International Conference on Extending Database Technology (EDBT'15), Brussels, Belgium, March 23-27, 2015.


Year 2014

  1. K. Yu, W. Ding, X. Wu, and J. Pei. "Towards Scalable and Accurate Online Feature Selection for Big Data". In Proceedings of the 14th IEEE International Conference on Data Mining (ICDM’14), Shenzhen, China, December 14-17, 2014.

  2. T. Guo, X. Zhu, J. Pei, and C. Zhang. "SNOC: Streaming Network Node Classification". In Proceedings of the 14th IEEE International Conference on Data Mining (ICDM’14), Shenzhen, China, December 14-17, 2014.

  3. G. Tang, K. Wu, J. Pei, J. Tang, and J. Lei. "An Appliance-driven Approach to Detection of Corrupted Load Curve Data". In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (CIKM’14), Shanghai, China, November 3-7, 2014.

  4. J. Han, J. Wen, and J. Pei. "Within-Network Classification Using Radius-Constrained Neighborhood Patterns". In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (CIKM’14), Shanghai, China, November 3-7, 2014.

  5. X. Hu, J. Pei, and Y. Tao. "Shortest Unique Queries on Strings". In Proceedings of the 21st International Symposium on String Processing and Information Retrieval (SPIRE’14), Ouro Preto, Brazil, October 20-23, 2014.

  6. W. Yu, X. Lin, W. Zhang, L. Chang, and J. Pei. "More is Simpler: Effectively and Efficiently Assessing Node-Pair Similarities Based on Hyperlinks". In Proceedings of the 40th International Conference on Very Large Data Bases (VLDB’14), Hangzhou, China, September 1-5, 2014.

  7. Q. Qian, J. Hu, R. Jin, J. Pei, and S. Zhu. "Distance Metric Learning Using Dropout: A Structured Regularization Approach". In Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14),, New York, NY, USA, August 24-27, 2014.

  8. L. Zhang, J. Pei, Y. Jia, B. Zhou and X. Wang. "Do Neighbor Buddies Make a Difference in Reblog Likelihood? An Analysis on SINA Weibo Data". In Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Network Analysis and Mining (ASONAM’14), Beijing, China, August 17-20, 2014.

  9. L. Duan, G. Tang, J. Pei, J. Bailey, G. Dong, A. Campbell, and C. Tang. "Mining Contrast Subspaces". In Proceedings of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014.

  10. J. Chan, X.V. Nguyen, W. Liu, J. Beiley, C. Leckie, K. Ramamohanarao, and J. Pei. "Structure-aware Distance Measures for Comparing Clusterings in Graphs". In Proceedings of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014.

  11. Y. Wang, J. Pei, X. Lin, and Q. Zhang. "An Iterative Fusion Approach to Graph-based Semi-supervised Learning from Multiple Views". In Proceedings of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014.

  12. J. Hu, J. Pei, and J. Tang. "How Can I Index My Thousands of Photos Effectively and Automatically? An Unsupervised Feature Selection Approach". In Proceedings of the 14th SIAM International Conference on Data Mining (SDM’14), Philadelphia, PA, USA, April 24-26, 2014.

  13. Y. Li, J. Bailey, L. Kulik, and J. Pei. "Efficient Matching of Substrings in Uncertain Sequences". In Proceedings of the 14th SIAM International Conference on Data Mining (SDM’14), Philadelphia, PA, USA, April 24-26, 2014.


Year 2013

  1. G. Tang, Y. Yang, and J. Pei. "Price Information Patterns in Web Search Advertising: An Empirical Case Study on Accommodation Industry". (data sets)  In Proceedings of the 13th IEEE International Conference on Data Mining (ICDM'13), Dallas, TX, USA, December 7-10, 2013.

  2. C.L. Kam, C. Raissi, M. Kaytoue, and J. Pei. "Mining Statistically Significant Sequential Patterns".  In Proceedings of the 13th IEEE International Conference on Data Mining (ICDM'13), Dallas, TX, USA, December 7-10, 2013. (software)

  3. Y. Li, J. Bailey, L. Kulik, and J. Pei. "Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases".  In Proceedings of the 13th IEEE International Conference on Data Mining (ICDM'13), Dallas, TX, USA, December 7-10, 2013.

  4. X. Mao, B. Lin, X. He, D. Cai, and J. Pei. "Parallel Field Alignment for Cross Media Retrieval". In Proceedings of the 21st ACM International Conference on Multimedia (MM'13), Barcelona, Catalunya, Spain, October 21-25, 2013.

  5. Y-C. Lo, J-Y. Li, M-Y. Yeh, S-D. Lin, and J. Pei. "What Distinguish One from Its Peers in Social Networks?". In Proceedings of the 2013 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'13), Prague, Czech Republic, September 23-27, 2013.

  6. Y. Wang, P. Wang, J. Pei, and W. Wang. "A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series". In Proceedings of the 39th International Conference on Very Large Data Bases (VLDB’13), Riva del Garda, Trento, Italy, August 26-30, 2013.

  7. G. Tang, J. Pei, J. Bailey and G. Dong. "Mining Multidimensional Contextual Outliers from Categorical Relational Data". In Proceedings of the 25th International Conference on Scientific and Statistical Database Management (SSDBM’13), Baltimore, MA, USA, July 29-31, 2013.

  8. Y. Xiong, Y. Zhu, J. Pei, and P.S. Yu. "Towards Cohesive Anomalies Mining". In Proceedings of the 27th AAAI Conference on Artificial Intelligence (AAAI'13), Bellevue, WA, USA, July 14-18, 2013.

  9. J. Pei, W. C-H. Wu, and M-Y. Yeh. "On Shortest Unique Substring Queries". In Proceedings of the Twenty-ninth IEEE International Conference on Data Engineering (ICDE’13), Brisbane, Australia, April 8-12, 2013.


Year 2012

  1. H. Maserrat and J. Pei. "Community Preserving Lossy Compression of Social Networks". In Proceedings of the Twelfth IEEE International Conference on Data Mining (ICDM’12), Brussels, Belgium, December 10-13, 2012.

  2. T. Dwyer, A. Fedorova, S. Blagodurov, M. Roth, F. Gaud, and J. Pei, "A Practical Method for Estimating Performance Degradation on Multicore Processors and its Application to HPC Workloads". In Proceedings of the 25th International Conference for High Performance Computing, Networking, Storage and Analysis (SC'12), Salt Lake City, UT, USA. November 10-16, 2012.

  3. W. Liu, A. Kan, J. Chan, J. Bailey, C. Leckie, R. Kotagiri, and J. Pei. "On Compressing Weighted Time-evolving Graphs". In Proceedings of the Twenty-first ACM International Conference on Information and Knowledge Management (CIKM’12), Maui, HI, USA, October 29-November 2, 2012.

  4. Y. Qian, H. Li, D. Jiang, Y. Hu, J. Pei, and Q. Zheng, "Mining Query Subtopics from Search Log Data". In Proceedings of the Thirty- fifth Annual ACM SIGIR Conference (SIGIR'12), Portland, OR, USA, August 12-16, 2012.

  5. W.C.-H. Wu, M.-Y. Yeh, and J. Pei, "Random Error Reduction in Similarity Search on Time Series: A Statistical Approach", In Proceedings of the 28th IEEE International Conference on Data Engineering (ICDE'12), Washington, DC, USA, April 1-5, 2012.

  6. M. Hua and J. Pei, "Aggregate Queries on Probabilistic Record Linkages". In Proceedings of the Fifteenth International Conference on Extending Database Technology (EDBT'12), Berlin, Germany, March 26-30, 2012.


Year 2011

  1. C. Wang, L.Y. Yuan, J-H. You, O.R. Zaiane, and J. Pei. "On Pruning for Top-k Ranking in Uncertain Databases". In Proceedings of the 37th International Conference on Very Large Data Bases (VLDB'11), Seattle, WA, USA, August 29-September 3, 2011.

  2. R. C. Raďssi and J. Pei. "Towards Bounding Sequential Patterns". In Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'11), San Diego, CA, USA, August 21-24, 2011.

  3. Y. Tao, C. Sheng, and J. Pei. "On k-skip Shortest Paths". In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.

  4. Z. Xing, J. Pei, P.S. Yu., and K. Wang. "Extracting Interpretable Features for Early Classification on Time Series". In Proceedings of 11th SIAM International Conference on Data Mining (SDM'11), April 28 - 30, 2011, Phoenix, Arizona, USA.

  5. B. Jiang and J. Pei. "Outlier Detection on Uncertain Data: Objects, Instances, and Inferences". In Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE'11), Hannover, Germany, April 11-16, 2011.

  6. D. Kang, D. Jiang, J. Pei, Z. Liao, X. Sun, and H-J. Choi. "Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.

  7. Q. He, D. Kifer, J. Pei, P. Mitra, and L. Giles. "Citation Recommendation without Author Supervision". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.


Year 2010

  1. R. C-W. Wong, A. W-C. Fu, K. Wang, Y. Xu, J. Pei, and P.S. Yu. "Probabilistic Inference Protection on Anonymized Data". In Proceedings of the 10th IEEE International Conference on Data Mining (ICDM'10), Sydney, Australia, December 14-17, 2010.

  2. R. C. Raďssi, J. Pei, and T. Kister. "Computing Closed Skycubes". In Proceedings of the 36th International Conference on Very Large Data Bases (VLDB'10), Singapore, September 13-17, 2010.

  3. H. Maserrat and J. Pei. "Neighbor Query Friendly Compression of Social Networks". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.

  4. B. Xiang, D. Jiang, J. Pei, X. Sun, E. Chen, and H. Li. "Context-Aware Ranking in Web Search". In Proceedings of the 33rd Annual ACM SIGIR Conference (SIGIR'10), Geneva, Switzerland, July 19-23, 2010.

  5. Y. Tao, K. Yi, C. Sheng, J. Pei, and F. Li. "Logging Every Footstep: Quantile Summaries for the Entire History". In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (SIGMOD'10), Indianapolis, Indiana, USA, June 6-11, 2010.

  6. Q. He, J. Pei, D. Kifer, P. Mitra, and C.L. Giles. "Context-aware Citation Recommendation". In Proceedings of the 19th International World Wide Web Conference (WWW'10), Raleigh, NC, USA, April 26-30, 2010.

  7. M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix).  In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.

  8. Y. Tao, J. Pei, J. Li, X. Xiao, K. Yi, and Z. Xing. "Hiding Correlation by Independence Masking".  In Proceedings of the 26th IEEE International Conference on Data Engineering (ICDE'10), Long Beach, CA, USA, March 1-6, 2010.


Year 2009

  1. Q. He, B. Chen, J. Pei, B. Qiu, P. Mitra, and C. L. Giles. "Detecting Topic Evolution in Scientific Literature: How Can Citations Help?".  In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM'09), Hong Kong, November 2-6, 2009.

  2. X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". In Proceedings of the 6th IEEE International Conference on Mobile Ad Hoc and Sensor Systems (MASS'09), Macau, China, October 12-15, 2009.

  3. Y. Zhao, H. Zhang, L. Cao, J. Pei, and C. Zhang. "Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns".  In Proceedings of the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD'09), Bled, Slovenia, September 7-11, 2009.

  4. Z. Xing, J. Pei, and P. S. Yu. "Early Classification on Time Series: A Nearest Neighbor Approach". In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA, July 14-17, 2009.

  5. H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. "MAPO: Mining and Recommending API Usage Patterns". In Proceedings of the 23rd European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10, 2009.

  6. B. Zhou, D. Jiang, J. Pei, and H. Li. "OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines".  In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.

  7. J. Wang, C. Chen, C. Wang, J. Pei, J. Bu, Z. Guan, W. V. Zhang. "Can We Learn a Template-Independent Wrapper for News Article Extraction from a Single Training Site?".  In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.

  8. B. Zhou and J. Pei. "OSD: An Online Web Spam Detection" (demo paper).  In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.

  9. T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei. "MobileMiner: A Real World Case Study of Data Mining in Mobile Communication" (demo paper).  In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD'09), June 29-July 2, 2009, Providence, Rhode Island, USA.

  10. Y. Han, B. Zhou, J. Pei and Y. Jia. "Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach''. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), April 30 - May 2, 2009, Sparks, Nevada.

  11. H. Cao, D. Jiang, J. Pei, E. Chen, and H. Li. "Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs''. In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Search Track), April 20-24, 2009, Madrid, Spain.

  12. J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, and Z. Guan. "News Article Extraction with Template-Independent Wrapper". In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Poster), April 20-24, 2009, Madrid, Spain.

  13. B. Jiang and J. Pei. "Online Interval Skyline Queries on Time Series". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.

  14. Y. Tao, L. Ding, X. Lin, and J. Pei. "Distance-based Representative Skyline". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China. (software)

  15. J. Pei, Y. Tao, J. Li, X. Xiao. "Privacy Preserving Publishing on Multiple Quasi-Identifiers". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.  (Full version as Technical Report TR 2008-18, School of Computing Science, Simon Fraser University.)

  16. B. Zhou and J. Pei. "Answering Aggregate Keyword Queries on Relational Databases Using Minimal Group-bys". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.

  17. B. Zhou, Y. Han, J. Pei, B. Jiang, Y. Tao, and Y. Jia. "Continuous Privacy Preserving Publishing of Data Streams". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.

  18. Y. Xiao, W. Wu, J. Pei, W. Wang, and Z. He. "Efficiently Indexing Shortest Paths by Exploiting Symmetry in Graphs". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.

  19. K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "Personalizing Entity Detection and Recommendation with a Fusion of Web Log mining Techniques" (Industrial track). In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.


Year 2008

  1. Y. Xu, B. Fang, K. Wang, A. W.-C. Fu, and J. Pei. "Publishing Sensitive Transactions for Itemset Utility". In Proceedings of the 8th IEEE International Conference on Data Mining (ICDM'08), December 15-19, 2008, Pisa, Italy.

  2. B. Jiang, J. Pei, X. Lin, D. W-L Cheung, and J. Han. "Mining Preferences from Superior and Inferior Examples''. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.

  3. H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. "Context-Aware Query Suggestion by Mining Click-Through and Session Data" (Best Application Paper Award). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.

  4. M. Hua and J. Pei. "DiMaC: A Disguised Missing Data Cleaning Tool" (demo paper). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.

  5. R. C.-W. Wong, A. W.-C. Fu, J. Pei, Y. S. Ho, T. Wong, and Y. Liu. "Efficient Skyline Querying with Variable User Preferences on Nominal Attributes". In Proceedings of the 34th International Conference on Very Large Databases (VLDB'08), August 24-30, 2008, Auckland, New Zealand.

  6. K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques" (invited paper). In Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), July 20-22, 2008, Zhangjiajie, China.

  7. W. Zhang, X. Lin, J. Pei, and Y. Zhang. "Managing Uncertain Data: A Probabilistic Approach" (invited paper). In Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), July 20-22, 2008, Zhangjiajie, China.

  8. M. Hua, J. Pei, W. Zhang, and X. Lin. "Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach". In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada. (Implementation, the real data sets, the synthetic data generator and the data sets).

  9. M. Hua and J. Pei. "DiMaC: A System for Cleaning Disguised Missing Data" (demo paper). In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada.

  10. E. Soroush, K. Wu, and J. Pei. "Fast and Quality-Guaranteed Data Streaming in Resource-Constrained Sensor Networks''. In Proceedings of the 9th ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc'08), Hong Kong, China, May 26-30, 2008.

  11. B. Zhou, J. Pei, and Z. Tang. "A Spamicity Approach to Web Spam Detection". In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, April 24-26, 2008.

  12. Z. Xing, J. Pei, G. Dong, and P. S. Yu. "Mining Sequence Classifiers for Early Prediction". In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, April 24-26, 2008.

  13. B. Zhou and J. Pei. "Preserving Privacy in Social Networks against Neighborhood Attacks". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.

  14. M. Hua, J. Pei, W. Zhang, and X. Lin. "Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.

  15. X. Zeng, J. Pei, I. Vergara, M. Nesbitt, K. Wang, and N. Chen. "OrthoCluster: A New Tool for Mining Syntenic Blocks and Applications in Comparative Genomics". In Proceedings of the 11th International Conferences on Extending Database Technology (EDBT'08), Nantes, France, March 25-30, 2008.

  16. B. C. M. Fung, K. Wang, A. W.-C. Fu, and J. Pei. "Anonymity for Continuous Data Publishing". In Proceedings of the 11th International Conferences on Extending Database Technology (EDBT'08), Nantes, France, March 25-30, 2008.


Year 2007

  1. F. M. Jiang, J. Pei, and A. W.-C. Fu. "IX-Cubes: Iceberg Cubes for Data Warehousing and OLAP on XML Data". In Proceedings of the ACM 16th Conference on Information and Knowledge Management (CIKM'07), Lisboa, Portugal, November 6-9, 2007.

  2. J. Pei, B. Jiang, X. Lin, and Y. Yuan. "Probabilistic Skylines on Uncertain Data". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.

  3. M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Efficiently Answering Top-k Typicality Queries on Large Databases". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.

  4. R. C.-W. Wong, A. W.-C. Fu, K. Wang, and J. Pei. "Minimality Attack in Privacy Preserving Data Publishing". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.

  5. M. Acharya, T. Xie, J. Pei, and J. Xu. "Mining API Patterns as Partial Orders from Source Code: From Usage Scenarios to Specifications". In Proceedings of the 6th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE'07), Dubrovnik, Croatia, September 3-7, 2007.

  6. M. Hua and J. Pei. "Cleaning Disguised Missing Data: A Heuristic Approach". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.

  7. R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Mining Favorable Facets". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.

  8. J. Pei, J. Xu, Z. Wang, W. Wang, and K. Wang. "Maintaining K-Anonymity against Incremental Updates". In Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM'07), Banff, Canada, July 9-11, 2007.

  9. R. C.-W. Wong, Y. Liu, J. Yin, Z. Huang, A. W.-C. Fu, and J. Pei. "(alpha, k)-anonymity Based Privacy Preservation by Lossy Join". In Proceedings of the 9th Asia-Pacific Web Conference and the 8th International Conference on Web-Age Information Management (APWEB/WAIM'07), Huangshan, China, June 16-18, 2007.

  10. J. Pei, M. K. Lau, and P. S. Yu. "TS-Trees: A Non-Alterable Search Tree Index for Trustworthy Databases on Write-Once-Read-Many (WORM) Storage" (IEEE Outstanding Paper Award). In Proceedings of the IEEE 21st International Conference on Advanced Information Networking and Applications (AINA'07), Niagara Falls, ON, Canada, May 21-23, 2007.

  11. B. Zhou and J. Pei. "Sketching Landscapes of Page Farms". In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, USA, April 26-28, 2007.

  12. Y. Bu, T-W Leung, A. W.-C. Fu, E. Keogh, J. Pei, and S. Meshkin. "WAT: Finding Top-K Discords in Time Series Database". In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, USA, April 26-28, 2007.

  13. J. Pei, A. W. C. Fu, X. Lin, and H. Wang. "Computing Compressed Skyline Cubes Efficiently". In Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE'07), Istanbul, Turkey, April 16-20, 2007.

  14. Y. Liu, L. Chen, J. Pei, Q. Chen, and Y. Zhao. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays". In Proceedings of the 5th Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07), White Plains, NY, USA, March 19-23, 2007. One of the three best papers.


Year 2006

  1. B.-W. On, E. Elmacioglu, D. Lee, J. Kang, and J. Pei. "Improving Grouped-Entity Resolution using Quasi-Cliques". In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006.

  2. Y. Xu, K. Wang, A. W. C. Fu, R. She, and J. Pei. "Classification Spanning Correlated Data Streams". In Proceedings of the ACM 15th Conference on Information and Knowledge Management (CIKM'06), Arlington, VA, USA, November 6-11, 2006.

  3. X. Zhou, J. Zhang, W. Wang, B. Shi, and J. Pei. "Using High Dimensional Indexes to Support Relevance Feedback Based Interactive Images Retrieval" (demo paper). In Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB'06), Seoul, Korea, September 12-15, 2006.

  4. W. Zhu, J. Pei, J. Yin, Y. Xie. "Granularity Adaptive Density Estimation and on-Demand Clustering of Concept-Drifting Data Streams". In Proceedings of the 8th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'06), Krakow, Poland, September 4-8, 2006.

  5. J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. "Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures". In Proceedings of the 8th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'06), Krakow, Poland, September 4-8, 2006. Selected as one of the 6 best papers invited by the special issue of International Journal of Data Warehousing and Mining.

  6. C. Aggarwal, J. Pei, and B. Zhang. "On Privacy Preservation against Adversarial Data Mining". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.

  7. J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W.-C. Fu. "Utility-Based Anonymization Using Local Recoding". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.

  8. J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W.-C. Fu. "Utility-Based Anonymization for Privacy Preservation with Less Information Loss". In Proceedings of the 2nd ACM SIGKDD Workshop on Utility-Based Data Mining (UBDM'06), in conjunction with the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20, 2006. Best paper in the workshop, to appear in ACM SIGKDD Explorations.

  9. H. Wang, J. Yin, J. Pei, P. S. Yu, and J. X. Yu. "Suppressing Model Overfitting in Mining Concept-Drifting Data Streams". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.

  10. J. Li, H. Li, L. Wong, J. Pei, and G. Dong. "Minimum Description Length (MDL) Principle: Generators Are Preferable to Closed Patterns". In Proceedings of the 21st National Conference on Artificial Intelligence (AAAI'06), Boston, MA, USA, July 16-20, 2006.

  11. T. Xie and J. Pei. "MAPO: Mining API Usages from Open Source Repositories" (short paper). In Proceedings of the 3rd International Workshop on Mining Software Repositories (MSR 2006), Shanghai, China, May 22-23, 2006.

  12. B.-W. On, D. Lee, E. Elmacioglu, J. Kang, and J. Pei. "An Effective Approach to Entity Resolution Problem Using Quasi-Clique and its Application to Digital Libraries" (short paper). In Proceedings of the ACM/IEEE 2006 Joint Conf. on Digital Libraries (JCDL'06), Chapel Hill, NC, USA, June 11-15, 2006.

  13. Y. Tao, X. Xiao, and J. Pei. "SUBSKY: Efficient Computation of Skylines in Subspaces". In Proceedings of the 22nd International Conference on Data Engineering (ICDE'06), Atlanta, GA, USA, April 3-7, 2006.


Year 2005

  1. J. Pei, J. Liu, H. Wang, K. Wang, P. S. Yu, and J. Wang. "Efficiently Mining Frequent Closed Partial Orders", In Proceedings of the 5th IEEE International Conference on Data Mining (ICDM'05), New Orleans, Louisiana, USA, November 27-30 2005. (Software)

  2. H. Wang and J. Pei. "A Random Method for Quantifying Changing Distributions in Data Streams", In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'05), Porto, Portugal, October 3-7, 2005.

  3. J. Ye, X. Zhou, J. Pei, L. Chen, and L. Zhang. "A Stratification-Based Approach to Accurate and Fast Image Annotation". In Proceedings of the 6th International Conference on Web-Age Information Management (WAIM'05), Hangzhou, China, October 11-13, 2005.

  4. C. Liu, K. Wu, and J. Pei. "A Dynamic Clustering and Scheduling Approach to Energy Saving in Data Collection from Wireless Sensor Networks". In Proceedings of the 2nd Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks (SECON'05), Santa Clara, California, USA, September 26-29, 2005.

  5. J. Pei, W. Jin, M. Ester, and Y. Tao. "Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces". In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB'05), Trondheim, Norway, August 30-September 2, 2005.

  6. J. Pei, D. Jiang, and A. Zhang. "On Mining Cross-Graph Quasi-Cliques". In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Chicago, IL, USA, August 21-24, 2005.

  7. H. Wang, J. Pei, and P. S. Yu. "Pattern based Similarity Search for Microarray Data" (Industrial and Government Track poster paper). In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Chicago, IL, USA, August 21-24, 2005.

  8. H. Yu, J. Pei, S. Tang, and D. Yang. "Mining Most General Multidimensional Summarization of Probable Groups in Data Warehouses". In Proceedings of the 17th International Scientific and Statistical Database Management Conference (SSDBM'05), Santa Barbara, California, USA, June 27-29, 2005.

  9. W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han. "GraphMiner: A Structural Pattern Mining System for Large Disk-Based Graph Databases and Its Applications" (demo paper). In Proceedings of the 24th ACM SIGMOD International Conference on Management of Data (SIGMOD'05), Baltimore, Maryland, USA, June 14-16, 2005.

  10. M. Cho, J. Pei and D. Cheung. "Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses" (poster paper). In Proceedings of the 5th SIAM International Conference on Data Mining (SDM'05), Newport Beach, CA, USA,  April 21-23, 2005.

  11. D. Jiang, J. Pei and A. Zhang. "A General Approach to Mining Quality Pattern-based Clusters from Gene Expression Data". In Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA'05), Beijing, China, April 18-20, 2005.

  12. G. Dong, C. Jiang, J. Pei, J. Li and L. Wong. "Mining Succinct Systems of Minimal Generators of Formal Concepts". In Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA'05), Beijing, China, April 18-20, 2005.

  13. J. Pei, D. Jiang and A. Zhang. "Mining Cross-graph Quasi-cliques in Gene Expression and Protein Interaction Data" (research poster paper). In Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005.


Year 2004

  1. C. Wang, W. Wang, J. Pei, Y. Zhu and B. Shi. "Scalable Mining of Large Disk-based Graph Databases" (research full paper). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22 - 25, 2004.

  2. D. Jiang, J. Pei, M. Ramanathan, C. Tang and A. Zhang. "Mining Coherent Gene Clusters from Gene-Sample-Time Microarray Data" (industrial full paper, Runner-up for the best application paper award). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22-25, 2004.

  3. L. Deng, J. Pei, J. Ma and D.L. Lee. "A Rank Sum Test Method for Informative Gene Discovery" (industrial full paper). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22-25, 2004.

  4. D. Jiang, J. Pei and A. Zhang. "GPX: Interactive Mining of Gene Expression Data" (demo paper). In Proceedings of the 30th International Conference on Very Large Data Bases (VLDB'04), Toronto, ON, Canada, August 30-September 3, 2004.

  5. H. Wang, F. Chu, W. Fan, P.S. Yu and J. Pei. "A Fast Algorithm for Subspace Clustering by Pattern Similarity" (full paper). In Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM'04), Santorini Island, Greece, 21-23 June 2004.

  6. C. Wang, M. Hong, J. Pei, H. Zhou, W. Wang and B. Shi. "Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining" (full paper). In Proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'04), Sydney, Australia, May 26-28, 2004.


Year 2003

  1. J. Pei, X. Zhang, M. Cho, H. Wang and P.S. Yu. "MaPle: A Fast Algorithm for Maximal Pattern-based Clustering" (Regular paper). In Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 19-22, 2003.

  2. D. Jiang, J. Pei and A. Zhang. "Interactive Exploration of Coherent Patterns in Time-Series Gene Expression Data". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003. (The poster)

  3. C. Tang, A. Zhang, and J. Pei. "Mining Phenotypes and Informative Genes from Gene Expression Data". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003. (The poster)

  4. J. Wang, J. Han, and J. Pei. "CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003.

  5. L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "Efficacious Data Cube Exploration by Semantic Summarization and Compression" (demo paper). In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03), Berlin, Germany, September 9-12, 2003.

  6. L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "QC-Trees: An Efficient Summary Structure for Semantic OLAP". In Proceedings of the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 9-12, 2003.

  7. L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "SOCQET: Semantic OLAP with Compressed Cube and Summarization" (demo paper). In Proceedings of the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 9-12, 2003.

  8. J. Pei. "A General Model for Online Analytical Processing of Complex Data". In Proceedings of the 22nd International Conference on Conceptual Modeling (ER'03), Chicago, IL, October 13-16, 2003.

  9. G. Dong, J. Han, L.V.S. Lakshmanan, J. Pei, H. Wang and P.S. Yu.  "Online mining of changes from data streams: Research problems and preliminary results",  In Proceedings of the 2003 ACM SIGMOD Workshop on Management and Processing of Data Streams. In cooperation with the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 8, 2003.

  10. H.C. Kum, J. Pei, and W. Wang. "ApproxMAP: Approximate Mining of Consensus Sequential Patterns". In Proceedings of the 2003 SIAM International Conference on Data Mining (SIAM DM '03), San Francisco, CA, May 1-3, 2003.

  11. D. Jiang, J. Pei, and A. Zhang. "DHC: A Density-based Hierarchical Clustering Method for Time Series Gene Expression Data" (Regular paper). In Proceedings of the 3rd IEEE Symposium on Bio-informatics and Bio-engineering (BIB'03), Washington D.C., March 10-12, 2003.

  12. Y. Huang, H. Xiong, S. Shekhar, and J. Pei. "Mining Confident Co-location Rules without A Support Threshold". In Proceedings of the 18th Annual ACM Symposium on Applied Computing (SAC'03), Melbourne, Florida, March 9 - 12, 2003.


Year 2002

  1. J. Pei, G. Dong, W. Zou, and J. Han. "On Computing Condensed Frequent Pattern Bases" (Regular paper). In Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi TERRSA, Maebashi City, Japan, December 9 - 12, 2002.

  2. J. Pei, J. Han, and W. Wang. "Mining Sequential Patterns with Constraints in Large Databases" (Regular paper). In Proceedings of the 11th ACM International Conference on Information and Knowledge Management (CIKM'02), McLean, VA, November 4-9, 2002.

  3. L.V.S. Lakshmanan, J. Pei, J. Han. "Quotient Cube: How to Summarize The Semantics of A Data Cube". In Proceedings of the 28th International Conference on Very Large Databases (VLDB'02), Hong Kong, China, August 20-23, 2002.

  4. G. Dong, J. Han, J. Lam, J. Pei, J. Wang, K. Wang. "CubeExplorer: Online Exploration of Data Cubes" (demo paper). In Proceedings of the 2002 ACM-SIGMOD International Conference on Management of Data (SIGMOD'02), Madison, Wisconsin, June 3-6, 2002.

  5. T. Wang, S. Tang, D. Yang, J. Gao, Y. Wu, J. Pei. "COMMIX: Towards Effective Web Information Extraction. Integration and Query Answering" (demo paper). In Proceedings of the 2002 ACM-SIGMOD International Conference on Management of Data (SIGMOD'02), Madison, Wisconsin, June 3-6, 2002.

  6. Y. Chen, G. Dong, J. Han, J. Pei, B. Wah, J. Wang "Online Analytical Processing Stream Data: Is It Feasible?". In Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD'2002), Madison, Wisconsin, June 2, 2002.


Year 2001

  1. J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. "H-Mine: Hyper-structure Mining of Frequent Patterns in Large Databases" (Regular paper). In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM'01), San Jose, California, November 29-December 2, 2001.

  2. W. Li, J. Han, and J. Pei. "CMAR: Accurate and Efficient Classification Based on Multiple Class-association Rules" (Regular paper). In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM'01), San Jose, California, November 29-December 2, 2001.

  3. H. Pinto, J. Han, J. Pei, K. Wang, Q. Chen, and U. Dayal. "Multi-Dimensional Sequential Pattern Mining" (Regular paper). In Proceedings of the 10th ACM International Conference on Information and Knowledge Management (CIKM'01), Atlanta, Georgia, November 2001. 

  4. G. Dong, J. Han, J. Lam, J. Pei, and K. Wang. "Mining Multi-Dimensional Constrained Gradients in Data Cubes", Proceedings of the 27th International Conference on Very Large Data Base (VLDB'01), Roma, Italy, September 2001.

  5. J. Han and J. Pei. "Pattern Growth Methods for Sequential Pattern Mining: Principles and Extensions" (invited paper). In Proceedings of the 2001 ACM SIGKDD Workshop on Temporal Data Mining, San Francisco, California, USA, August 26, 2001.

  6. J. Han, J. Pei, G. Dong, and K. Wang, "Efficient Computation of Iceberg Cubes with Complex Measures". In Proceedings of the 2001 ACM-SIGMOD International Conference on Management of Data (SIGMOD'01), Santa Barbara, CA, May 2001.

  7. J. Han, H. Jamil, Y. Lu, L. Chen, Y. Liao and J. Pei, "DNA-Miner: A System Prototype for Mining DNA Sequences" (demo paper). In Proceedings of the 2001 ACM-SIGMOD International Conference on Management of Data (SIGMOD'01), Santa Barbara, CA, May 2001.

  8. J. Pei, A.K.H. Tung, and J. Han, "Fault-Tolerant Frequent Pattern Mining: Problems and Challenges". In Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discover (DMKD'01), Santa Barbara, CA, May 2001.

  9. J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu, "PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth". In Proceedings of the 2001 International Conference on Data Engineering (ICDE'01), Heidelberg, Germany, April 2001.

  10. J. Pei, J. Han, and L. V. S. Lakshmanan, "Mining Frequent Itemsets with Convertible Constraints". In Proceedings of the 2001 International Conference on Data Engineering (ICDE'01), Heidelberg, Germany, April 2001.


Year 2000

  1. J. Pei and J. Han. "Can We Push More Constraints into Frequent Pattern Mining?". In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2000), Boston, MA, August 2000.

  2. J. Han, J. Pei, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. Hsu. "FreeSpan: Frequent pattern-projected sequential pattern mining". In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2000), Boston, MA, August 2000.

  3. J. Han, J. Pei, and Y. Yin. "Mining Frequent Patterns without Candidate Generation". In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD'00), Dallas, TX, May 2000. 

  4. J. Pei, R. Mao, K. Hu, and H. Zhu. "Towards Data Mining Benchmarking: A Test Bed for Performance Study of Frequent Pattern Mining" (demo paper). In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD'00), Dallas, TX, May 2000. 

  5. J. Pei, J. Han, and R. Mao. "CLOSET: An efficient algorithm for mining frequent closed itemsets". In Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Dallas, TX, May, 2000.

  6. J. Pei, J. Han, B. Mortazavi-Asl, and H. Zhu. "Mining Access Patterns efficiently from Web logs". In Proceedings of the 2000 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'00), Kyoto, Japan, April 2000.


Tutorials


  1. D. Jiang, J. Pei, and H. Li. "Enhancing Web Search by Mining Search and Browse Logs". In Proceedings of the 34th Annual ACM SIGIR Conference (SIGIR'11), Beijing, China, July 24-28, 2011.

  2. M. Hay, K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Management in Information Networks''. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.

  3. K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Mining in Information Networks''. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.

  4. D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.

  5. D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 33rd Annual ACM SIGIR Conference (SIGIR'10), Geneva, Switzerland, July 19-23, 2010.

  6. D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 19th International World Wide Web Conference (WWW'10), Raleigh, NC, USA, April 26-30, 2010.

  7. J. Pei, Y. Tao, and J. Han. "Preference Queries from OLAP and Data Mining Perspective". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.

  8. J. Pei, M. Hua, Y. Tao, and X. Lin. "Mining Uncertain and Probabilistic Data: Problems, Challenges, Methods and Applications''. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.

  9. J. Pei, M. Hua, Y. Tao, and X. Lin. "Query Answering Techniques on Uncertain and Probabilistic Data'' (slides). In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada.

  10. J. Pei, B. Zhou. Z. Tang, and H. Huang. "Data Mining Techniques for Web Spam Detection". In Proceedings of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'08), May 20-23, 2003, Osaka, Japan.

  11. T. Xie and J. Pei. "Data Mining for Software Engineering". In Proceedings of the 12th ACM SIGKDD International Conference on Data Mining (KDD'06), Philadelphia, USA, August 20-23, 2006.

  12. J. Pei, H. Wang and P.S. Yu. "Online Mining Data Streams: Problems, Applications and Progress". In Proceedings of the 6th International Conference on Web-Age Information Management (WAIM'05), Hangzhou, China, October 11-13, 2005.

  13. H. Wang, J. Pei and P.S. Yu. "Online Mining Data Streams: Problems, Applications and Progress". In Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005.

  14. J. Pei, H. Wang and P.S. Yu. "Online Mining Data Streams: Problems, Applications and Progress". In Proceedings of the 10th ACM SIGKDD International Conference on Data Mining (KDD'04), Seattle, WA, August 22 - 25, 2004.

  15. J. Pei, S.J. Upadhyaya, F. Farooq and V. Govindaraju. "Data Mining for Intrusion Detection: Techniques, Applications and Systems". In Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE'04), Boston, MA, March 30 - April 2, 2004.

  16. J. Han, L. V. S. Lakshmanan and J. Pei. "Scalable Frequent-Pattern Mining Methods: An Overview (MS PowerPoint Slides)". In Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2001), San Francisco, California, USA, August 26 - 29, 2001.

  17. J. Pei and J. Han, "Sequential Pattern Mining: From Shopping History Analysis to Weblog Mining and DNA Mining (MS PowerPoint Slides)", In the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD-01), April 16, 2001, Hong Kong.


Book Chapters


  1. M. Hua and J. Pei, "A Survey of Utility-based Privacy-preserving Data Transformation Methods", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.

  2. Y. Xu, K. Wang, A. W.-C. Fu, R. She, and J. Pei. "Privacy-preserving Data Stream Classification", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.

  3. J. Han, J. Pei, and X. Yan. "Sequential Pattern Mining by Pattern-Growth: Principles and Extensions", in W. Chu and T. Lin (eds.), Foundations and Advances in Data Mining, Studies in Fuzziness and Soft Computing 180, pages 183-220, Springer-Verlag GmbH, 2005.

  4. C. Giannella, J. Han, J. Pei, X. Yan, and P.S. Yu, "Mining Frequent Patterns in Data Streams at Multiple Time Granularities", in H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.), Next Generation Data Mining, AAAI/MIT, 2004.


Edited books and special issues in journals and magazines


  1. Q. Li, L. Feng, J. Pei, X. Sean Wang, X. Zhou, Q.-M. Zhu. Advances in Data and Web Management (Proceedings of the Joint APWeb/WAIM 2009 Conferences, Suzhou, China, April 2-4, 2009), Springer, 2009.

  2. R. Huang, Q. Yang, J. Pei, J. Gama, X. Meng, X. Li. Advanced Data Mining and Applications (Proceedings of the 5th ADMA International Conference,  Beijing, China, August 17-19, 2009), Springer, 2009.

  3. R. T. Ng and J. Pei. Special Issue on Data Mining for Health Informatics. SIGKDD Explorations Volume 9, Issue 1, 2007.


U.S. Patents


  1.  "Systems and Methods to Automatically Generate Enhanced Information Associated with a Selected Web Table", with K. Tsoukalas and D. Cubranic, filed on February 5, 2009

  2. "Methods and System for Mining Frequent Patterns'', with J. Han, Y. Yin, and R. Mao, U.S. Patent, No. 6665669, awarded on December 16, 2003.


Ph.D. Thesis


Jian Pei, "Pattern-growth Methods for Frequent Pattern Mining", School of Computing Science, Simon Fraser University, Canada, May 2002.


This pages last updated on June 10, 2016. Copyright 2002 - 2016, Jian Pei. All rights reserved.