(List of my publications by category and year, my lists at DBLP Bibliography Server and Google Scholar)
Copyright Notice. The electronic materials in this web pages are presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Z. Xing and J. Pei. "Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics". International Journal of Data Warehousing and Mining (IJDWM), Volume 6, Issue 3, pages 11-27, July-September 2010, Idea Group, Inc.
D. Jiang and J. Pei. "Mining Frequent Cross-Graph Quasi-Cliques". ACM Transactions on Knowledge Discovery in Data, Volume 2, Number 4, pages 16:1-42, January 2009, ACM Press.
M. P. Ng, I. A. Vergara, C. Frech, Q. Chen, X. Zeng, J. Pei, and N. Chen. "OrthoClusterDB: a Web Server for Synteny Blocks". BMC Bioinformatics, Volume 10, Article 192, 2009.
R. She, J. Chu, K. Wang, J. Pei, and J. Chen. "GenBlastA: Enabling BLAST to Identify Homologous Gene Sequences". Genome Research, Volume 19, Number 1, pages 143-149, January 2009, Cold Spring Harbor Laboratory Press.
X. Zeng, J. Pei, I. Vergara, M. Nesbitt, K. Wang, and N. Chen. "OrthoCluster: A New Tool for Mining Syntenic Blocks and Applications in Comparative Genomics". In Proceedings of the 11th International Conferences on Extending Database Technology (EDBT'08), Nantes, France, March 25-30, 2008.
D. Jiang, J. Pei, M. Ramanathan, C. Lin, C. Tang, and A. Zhang. "Mining Gene-Sample-Time Microarray Data: A Coherent Gene Cluster Discovery Approach". Knowledge and Information Systems: An International Journal, Volume 13, Number 3, pages 305-335, November 2007, Springer-Verlag.
D. Jiang, J. Pei, and A. Zhang. "An Interactive Approach to Mining Gene Expression Data". IEEE Transactions on Knowledge and Data Engineering, Volume 17, Number 10, pages 1363-1378, October 2005, IEEE Computer Society.
D. Jiang, J. Pei, and A. Zhang. "Towards Interactive Exploration of Gene Expression Patterns". ACM SIGKDD Explorations (Special Issue on Microarray Data Analysis), Volume 5, Issue 2, pages 79-90, 2003.
J. Pei, D. Jiang, and A. Zhang. "On Mining Cross-Graph Quasi-Cliques". In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Chicago, IL, USA, August 21-24, 2005.
H. Wang, J. Pei, and P. S. Yu. "Pattern based Similarity Search for Microarray Data" (Industrial and Government Track poster paper). In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Chicago, IL, USA, August 21-24, 2005.
D. Jiang, J. Pei and A. Zhang. "A General Approach to Mining Quality Pattern-based Clusters from Gene Expression Data". In Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA'05), Beijing, China, April 18-20, 2005.
J. Pei, D. Jiang and A. Zhang. "Mining Cross-graph Quasi-cliques in Gene Expression and Protein Interaction Data" (research poster paper). In Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005.
D. Jiang, J. Pei, M. Ramanathan, C. Tang and A. Zhang. "Mining Coherent Gene Clusters from Gene-Sample-Time Microarray Data" (industrial full paper, Runner-up for the best application paper award). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22-25, 2004.
L. Deng, J. Pei, J. Ma and D.L. Lee. "A Rank Sum Test Method for Informative Gene Discovery" (industrial full paper). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22-25, 2004.
(Demo) D. Jiang, J. Pei and A. Zhang. "GPX: Interactive Mining of Gene Expression Data". In Proceedings of the 30th International Conference on Very Large Data Bases (VLDB'04), Toronto, ON, Canada, August 30-September 3, 2004.
J. Pei, X. Zhang, M. Cho, H. Wang and P.S. Yu. "MaPle: A Fast Algorithm for Maximal Pattern-based Clustering" (Regular paper). In Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 19-22, 2003.
D. Jiang, J. Pei and A. Zhang. "Interactive Exploration of Coherent Patterns in Time-Series Gene Expression Data". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003. (The poster)
C. Tang, A. Zhang, and J. Pei. "Mining Phenotypes and Informative Genes from Gene Expression Data". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003. (The poster)
D. Jiang, J. Pei, and A. Zhang. "DHC: A Density-based Hierarchical Clustering Method for Time Series Gene Expression Data" (Regular paper). In Proceedings of the 3rd IEEE Symposium on Bio-informatics and Bio-engineering (BIB'03), Washington D.C., March 10-12, 2003.
(Demo) J. Han, H. Jamil, Y. Lu, L. Chen, Y. Liao and J. Pei, "DNA-Miner: A System Prototype for Mining DNA Sequences". In Proceedings of the 2001 ACM-SIGMOD International Conference on Management of Data (SIGMOD'01), Santa Barbara, CA, May 2001.
D. Jiang, J. Pei, and H. Li. "Enhancing Web Search by Mining Search and Browse Logs". In Proceedings of the 34th Annual ACM SIGIR Conference (SIGIR'11), Beijing, China, July 24-28, 2011.
M. Hay, K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Management in Information Networks''. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.
Z. Xing and J. Pei. "Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics". International Journal of Data Warehousing and Mining (IJDWM), Volume 6, Issue 3, pages 11-27, July-September 2010, Idea Group, Inc.
X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". Computer Communication, Volume 33, Issue 15, pages 1404-1413, July 15, 2010, Elsevier.
K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Mining in Information Networks''. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 33rd Annual ACM SIGIR Conference (SIGIR'10), Geneva, Switzerland, July 19-23, 2010.
D. Jiang, J. Pei, and H. Li. "Web Search/Browse Log Mining: Challenges, Methods, and Applications". In Proceedings of the 19th International World Wide Web Conference (WWW'10), Raleigh, NC, USA, April 26-30, 2010.
M. Hua, M. K. Lau, J. Pei, and K. Wu. "Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 12, pages 1679-1691, December, 2009, IEEE Computer Society.
B. Aljaber, N. Stokes, J. Bailey, and J. Pei. "Document Clustering of Scientific Texts Using Citations Contexts". Information Retrieval, Volume 13, Number 2, pages 101-131, April, 2010, Springer-Verlag.
M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix). In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.
X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". In Proceedings of the 6th IEEE International Conference on Mobile Ad Hoc and Sensor Systems (MASS'09), Macau, China, October 12-15, 2009.
M. P. Ng, I. A. Vergara, C. Frech, Q. Chen, X. Zeng, J. Pei, and N. Chen. "OrthoClusterDB: a Web Server for Synteny Blocks". BMC Bioinformatics, Volume 10, Article 192, 2009.
Y. Zhao, H. Zhang, L. Cao, J. Pei, and C. Zhang. "Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns". In Proceedings of the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD'09), Bled, Slovenia, September 7-11, 2009.
B. Zhou and J. Pei. "OSD: An Online Web Spam Detection" (demo paper). In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. "MAPO: Mining and Recommending API Usage Patterns". In Proceedings of the 23rd European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10, 2009.
T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei. "MobileMiner: A Real World Case Study of Data Mining in Mobile Communication" (demo paper). In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD'09), June 29-July 2, 2009, Providence, Rhode Island, USA.
J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, and Z. Guan. "News Article Extraction with Template-Independent Wrapper". In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Poster), April 20-24, 2009, Madrid, Spain.
D. Jiang and J. Pei. "Mining Frequent Cross-Graph Quasi-Cliques". ACM Transactions on Knowledge Discovery in Data, Volume 2, Number 4, pages 16:1-42, January 2009, ACM Press.
K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "Personalizing Entity Detection and Recommendation with a Fusion of Web Log mining Techniques" (Industrial track). In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.
J. Pei, M. Hua, Y. Tao, and X. Lin. "Mining Uncertain and Probabilistic Data: Problems, Challenges, Methods and Applications''. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques" (invited paper). In Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), July 20-22, 2008, Zhangjiajie, China.
M. Hua and J. Pei. "DiMaC: A Disguised Missing Data Cleaning Tool" (demo paper). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
M. Hua and J. Pei. "DiMaC: A System for Cleaning Disguised Missing Data" (demo paper). In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada.
(Tutorial) J. Pei, B. Zhou. Z. Tang, and H. Huang. "Data Mining Techniques for Web Spam Detection". In Proceedings of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'08), May 20-23, 2003, Osaka, Japan.
X. Zeng, J. Pei, I. Vergara, M. Nesbitt, K. Wang, and N. Chen. "OrthoCluster: A New Tool for Mining Syntenic Blocks and Applications in Comparative Genomics". In Proceedings of the 11th International Conferences on Extending Database Technology (EDBT'08), Nantes, France, March 25-30, 2008.
M. Acharya, T. Xie, J. Pei, and J. Xu. "Mining API Patterns as Partial Orders from Source Code: From Usage Scenarios to Specifications". In Proceedings of the 6th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE'07), Dubrovnik, Croatia, September 3-7, 2007.
M. Hua and J. Pei. "Cleaning Disguised Missing Data: A Heuristic Approach". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.
Y. Liu, L. Chen, J. Pei, Q. Chen, and Y. Zhao. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays". In Proceedings of the 5th Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07), White Plains, NY, USA, March 19-23, 2007.
B.-W. On, E. Elmacioglu, D. Lee, J. Kang, and J. Pei. "Improving Grouped-Entity Resolution using Quasi-Cliques". In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006.
C. Liu, K. Wu, and J. Pei. "An Energy Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation". IEEE Transactions on Parallel and Distributed Systems, Volume 18, Number 7, July 2007, pages 1010-1023, IEEE Computer Society.
T. Xie and J. Pei. "MAPO: Mining API Usages from Open Source Repositories" (short paper). In Proceedings of the 3rd International Workshop on Mining Software Repositories (MSR 2006), Shanghai, China, May 22-23, 2006.
B.-W. On, D. Lee, E. Elmacioglu, J. Kang, and J. Pei. "An Effective Approach to Entity Resolution Problem Using Quasi-Clique and its Application to Digital Libraries" (short paper). In Proceedings of the ACM/IEEE 2006 Joint Conf. on Digital Libraries (JCDL'06), Chapel Hill, NC, USA, June 11-15, 2006.
C. Liu, K. Wu, and J. Pei. "A Dynamic Clustering and Scheduling Approach to Energy Saving in Data Collection from Wireless Sensor Networks". In Proceedings of the 2nd Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks (SECON'05), Santa Clara, California, USA, September 26-29, 2005.
(Tutorial) T. Xie and J. Pei, "Data Mining for Software Engineering". In Proceedings of the 12th ACM SIGKDD International Conference on Data Mining (KDD'06), Philadelphia, USA, August 20-23, 2006.
(Tutorial) J. Pei, H. Wang and P.S. Yu, "Online Mining Data Streams: Problems, Applications and Progress". In Proceedings of the 10th ACM SIGKDD International Conference on Data Mining (KDD'04), Seattle, WA, August 22 - 25, 2004, Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005, and Proceedings of the 6th International Conference on Web-Age Information Management (WAIM'05), Hangzhou, China, October 11-13, 2005.
(Tutorial) J. Pei, S.J. Upadhyaya, F. Farooq and V. Govindaraju, "Data Mining for Intrusion Detection: Techniques, Applications and Systems". In Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE'04), Boston, MA, March 30 - April 2, 2004.
T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei. "MobileMiner: A Real World Case Study of Data Mining in Mobile Communication" (demo paper). In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD'09), June 29-July 2, 2009, Providence, Rhode Island, USA.
I. Pekerskaya, J. Pei, and K. Wang. "Mining Changing Regions from Access-Constrained Snapshots: A Cluster-Embedded Decision Tree Approach". Journal of Intelligent Information Systems (Special Issue on Mining Spatio-Temporal Data), Volume 27, Number 3, pages 215-242, November 2006, Springer-Verlag.
J. Wang, J. Han, and J. Pei. "Closed Constrained-Gradient Mining in Retail Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 6, pages 764-769, June 2006, IEEE Computer Society.
G. Dong, J. Han, J. Lam, J. Pei, K. Wang, and W. Zou. "Mining Constrained Gradients in Large Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 8, pages 922-938, August 2004, IEEE Computer Society.
H. Wang and J. Pei. "A Random Method for Quantifying Changing Distributions in Data Streams", In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'05), Porto, Portugal, October 3-7, 2005.
G. Dong, J. Han, L.V.S. Lakshmanan, J. Pei, H. Wang and P.S. Yu. "Online mining of changes from data streams: Research problems and preliminary results", In Proceedings of the 2003 ACM SIGMOD Workshop on Management and Processing of Data Streams. In cooperation with the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 8, 2003.
G. Dong, J. Han, J. Lam, J. Pei, and K. Wang. "Mining Multi-Dimensional Constrained Gradients in Data Cubes", Proceedings of the 27th International Conference on Very Large Data Base (VLDB'01), Roma, Italy, September 2001.
Z. Xing, J. Pei, and P.S. Yu. "Early Classification on Time Series". To appear in Knowledge and Information Systems: An International Journal, Springer-Verlag.
Z. Xing, J. Pei, P.S. Yu., and K. Wang. "Extracting Interpretable Features for Early Classification on Time Series". In Proceedings of 11th SIAM International Conference on Data Mining (SDM'11), April 28 - 30, 2011, Phoenix, Arizona, USA.
X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". Computer Communication, Volume 33, Issue 15, pages 1404-1413, July 15, 2010, Elsevier.
Z. Xing, J. Pei, and E. Keogh. "A Brief Survey on Sequence Classification". ACM SIGKDD Explorations, Volume 12, Issue 1, pages 40-48, June 2010, ACM Press.
X. Cheng, J. Xu, J. Pei, and J. Liu. "Hierarchical Distributed Data Classification in Wireless Sensor Networks". In Proceedings of the 6th IEEE International Conference on Mobile Ad Hoc and Sensor Systems (MASS'09), Macau, China, October 12-15, 2009.
Y. Zhao, H. Zhang, L. Cao, J. Pei, and C. Zhang. "Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns". In Proceedings of the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD'09), Bled, Slovenia, September 7-11, 2009.
Z. Xing, J. Pei, and P. S. Yu. "Early Classification on Time Series: A Nearest Neighbor Approach". In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA, July 14-17, 2009.
H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. "MAPO: Mining and Recommending API Usage Patterns". In Proceedings of the 23rd European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10,
J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, and Z. Guan. "News Article Extraction with Template-Independent Wrapper". In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Poster), April 20-24, 2009, Madrid, Spain.
Z. Xing, J. Pei, G. Dong, and P. S. Yu. "Mining Sequence Classifiers for Early Prediction". In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, April 24-26, 2008.
Y. Xu, K. Wang, A. W.-C. Fu, R. She, and J. Pei. "Privacy-preserving Data Stream Classification", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.
Y. Xu, K. Wang, A. W. C. Fu, R. She, and J. Pei. "Classification Spanning Correlated Data Streams". In Proceedings of the ACM 15th Conference on Information and Knowledge Management (CIKM'06), Arlington, VA, USA, November 6-11, 2006.
H. Wang, J. Yin, J. Pei, P. S. Yu, and J. X. Yu. "Suppressing Model Overfitting in Mining Concept-Drifting Data Streams". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.
H. Wang and J. Pei. "A Random Method for Quantifying Changing Distributions in Data Streams", In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'05), Porto, Portugal, October 3-7, 2005.
W. Li, J. Han, and J. Pei. "CMAR: Accurate and Efficient Classification Based on Multiple Class-association Rules" (Regular paper). In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM'01), San Jose, California, November 29-December 2, 2001.
B. Jiang, J. Pei, Y. Tao, and X. Lin. "Clustering Uncertain Data Based on Probability Distribution Similarity". To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
M. Hua and J. Pei. "Clustering in Applications with Multiple Data Sources -- A Mutual Subspace Clustering Approach". To appear in Neurocomputing, Elsevier.
Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, and H. Li. "Mining Concept Sequences from Large-scale Search Logs for Context-aware Query Suggestion". To appear in ACM Transactions on Intelligent Systems and Technology, ACM Press.
B. Jiang and J. Pei. "Outlier Detection on Uncertain Data: Objects, Instances, and Inferences". In Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE'11), Hannover, Germany, April 11-16, 2011.
D. Kang, D. Jiang, J. Pei, Z. Liao, X. Sun, and H-J. Choi. "Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.
Z. Xing and J. Pei. "Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics". International Journal of Data Warehousing and Mining (IJDWM), Volume 6, Issue 3, pages 11-27, July-September 2010, Idea Group, Inc.
M. Hua, M. K. Lau, J. Pei, and K. Wu. "Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 12, pages 1679-1691, December, 2009, IEEE Computer Society.
B. Aljaber, N. Stokes, J. Bailey, and J. Pei. "Document Clustering of Scientific Texts Using Citations Contexts". Information Retrieval, Volume 13, Number 2, pages 101-131, April, 2010, Springer-Verlag.
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. "Context-Aware Query Suggestion by Mining Click-Through and Session Data" (Best Application Paper Award). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
H. Wang and J. Pei. "Clustering by Pattern Similarity". Journal of Computer Science and Technology, Volume 23, Number 4, pages 481-496, July 2008, Springer.
D. Jiang, J. Pei, M. Ramanathan, C. Lin, C. Tang, and A. Zhang. "Mining Gene-Sample-Time Microarray Data: A Coherent Gene Cluster Discovery Approach". Knowledge and Information Systems: An International Journal, Volume 13, Number 3, pages 305-335, November 2007, Springer-Verlag.
C. Liu, K. Wu, and J. Pei. "An Energy Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation". IEEE Transactions on Parallel and Distributed Systems, Volume 18, Number 7, July 2007, pages 1010-1023, IEEE Computer Society.
D. Jiang, J. Pei, and A. Zhang. "An Interactive Approach to Mining Gene Expression Data". IEEE Transactions on Knowledge and Data Engineering, Volume 17, Number 10, pages 1363-1378, October 2005, IEEE Computer Society.
W. Zhu, J. Pei, J. Yin, Y. Xie. "Granularity Adaptive Density Estimation and on-Demand Clustering of Concept-Drifting Data Streams". In Proceedings of the 8th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'06), Krakow, Poland, September 4-8, 2006.
C. Liu, K. Wu, and J. Pei. "A Dynamic Clustering and Scheduling Approach to Energy Saving in Data Collection from Wireless Sensor Networks". In Proceedings of the 2nd Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks (SECON'05), Santa Clara, California, USA, September 26-29, 2005.
D. Jiang, J. Pei and A. Zhang. "A General Approach to Mining Quality Pattern-based Clusters from Gene Expression Data". In Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA'05), Beijing, China, April 18-20, 2005.
D. Jiang, J. Pei, M. Ramanathan, C. Tang and A. Zhang. "Mining Coherent Gene Clusters from Gene-Sample-Time Microarray Data" (industrial full paper, Runner-up for the best application paper award). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22-25, 2004.
H. Wang, F. Chu, W. Fan, P.S. Yu and J. Pei. "A Fast Algorithm for Subspace Clustering by Pattern Similarity" (full paper). In Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM'04), Santorini Island, Greece, 21-23 June 2004.
J. Pei, X. Zhang, M. Cho, H. Wang and P.S. Yu. "MaPle: A Fast Algorithm for Maximal Pattern-based Clustering" (Regular paper). In Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM'03), Melbourne, Florida, USA, November 19-22, 2003.
C. Tang, A. Zhang, and J. Pei. "Mining Phenotypes and Informative Genes from Gene Expression Data". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003. (The poster)
D. Jiang, J. Pei, and A. Zhang. "DHC: A Density-based Hierarchical Clustering Method for Time Series Gene Expression Data" (Regular paper). In Proceedings of the 3rd IEEE Symposium on Bio-informatics and Bio-engineering (BIB'03), Washington D.C., March 10-12, 2003.
Z. Lin, B. Jiang, J. Pei, and D. Jiang. "Mining Discriminative Items in Multiple Data Streams". World Wide Web Journal: Internet and Web Information Systems, Volume 13, Issue 4, pages 497-522, December 2010, Springer-Verlag.
X. Zeng, J. Pei, K. Wang, and J. Li. "PADS: A Simple Yet Effective Pattern-Aware Dynamic Search Method for Fast Maximal Frequent Pattern mining". Knowledge and Information Systems: An International Journal, Volume 20, Number 3, pages 375-391, September, 2009, Springer-Verlag.
J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. "H-Mine: Fast and space-preserving frequent pattern mining in large databases". IIE Transactions, Volume 39, Issue 6, pages 593-605, June 2007, Taylor & Francis.
Y. Huang, J. Pei, and H. Xiong. "Mining Co-Location Patterns with Rare Events from Spatial Data Sets''. GeoInformatica, Volume 10, Number 3, pages 239-260, September 2006, Springer Netherlands.
M. Cho, J. Pei, H. Wang, and W. Wang. "Preference-based Frequent Pattern Mining". International Journal of Data Warehousing and Mining, Volume 1, Number 4, pages 56-77, October-December 2005, Idea Group, Inc.
J. Pei, G. Dong, W. Zou, and J. Han. "Mining Condensed Frequent Pattern Bases". Knowledge and Information Systems: An International Journal, Volume 6, Number 5, pages 570-594, September 2004, Springer-Verlag.
J. Pei, J. Han, and L.V.S. Lakshmanan. "Pushing Convertible Constraints in Frequent Itemset Mining". Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 3, pages 227-252, May 2004, Kluwer Academic Publishers. (Erratum)
J. Han, J. Pei, Y. Yin, and R. Mao. "Mining Frequent Patterns without Candidate Generation: A Frequent-pattern Tree Approach". Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 1, pages 53-87, January 2004, Kluwer Academic Publishers.
J. Pei and J. Han, "Constrained Frequent Pattern Mining: A Pattern-Growth View", ACM SIGKDD Explorations (Special Issue on Constraints in Data Mining), Volume 4, Issue 1, pages 31-39, June 2002.
J. Han and J. Pei, "Mining Frequent Patterns by Pattern-Growth: Methodology and Implications", ACM SIGKDD Explorations (Special Issue on Scalable Data Mining Algorithms), Volume 2, Issue 2, pages 14-20, December 2000.
J. Li, H. Li, L. Wong, J. Pei, and G. Dong. "Minimum Description Length (MDL) Principle: Generators Are Preferable to Closed Patterns". In Proceedings of the 21st National Conference on Artificial Intelligence (AAAI'06), Boston, MA, USA, July 16-20, 2006.
G. Dong, C. Jiang, J. Pei, J. Li and L. Wong. "Mining Succinct Systems of Minimal Generators of Formal Concepts". In Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA'05), Beijing, China, April 18-20, 2005.
J. Wang, J. Han, and J. Pei. "CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets". In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03), Washington, DC, USA, August 24-27, 2003.
Y. Huang, H. Xiong, S. Shekhar, and J. Pei. "Mining Confident Co-location Rules without A Support Threshold". In Proceedings of the 18th Annual ACM Symposium on Applied Computing (SAC'03), Melbourne, Florida, March 9 - 12, 2003.
J. Pei, G. Dong, W. Zou, and J. Han. "On Computing Condensed Frequent Pattern Bases" (Regular paper). In Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi TERRSA, Maebashi City, Japan, December 9 - 12, 2002.
J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. "H-Mine: Hyper-structure Mining of Frequent Patterns in Large Databases" (Regular paper). In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM'01), San Jose, California, November 29-December 2, 2001.
J. Pei, A.K.H. Tung, and J. Han, "Fault-Tolerant Frequent Pattern Mining: Problems and Challenges". In Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discover (DMKD'01), Santa Barbara, CA, May 2001.
J. Pei, J. Han, and L. V. S. Lakshmanan, "Mining Frequent Itemsets with Convertible Constraints". In Proceedings of the 2001 International Conference on Data Engineering (ICDE'01), Heidelberg, Germany, April 2001.
J. Pei and J. Han. "Can We Push More Constraints into Frequent Pattern Mining?". In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2000), Boston, MA, August 2000.
J. Han, J. Pei, and Y. Yin. "Mining Frequent Patterns without Candidate Generation". In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD'00), Dallas, TX, May 2000.
(Demo) J. Pei, R. Mao, K. Hu, and H. Zhu. "Towards Data Mining Benchmarking: A Test Bed for Performance Study of Frequent Pattern Mining". In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD'00), Dallas, TX, May 2000.
J. Pei, J. Han, and R. Mao. "CLOSET: An efficient algorithm for mining frequent closed itemsets". In Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, Dallas, TX, May 2000.
J. Pei, J. Han, B. Mortazavi-Asl, and H. Zhu. "Mining Access Patterns efficiently from Web logs". In Proceedings of the 2000 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'00), Kyoto, Japan, April 2000.
C. Giannella, J. Han, J. Pei, X. Yan, and P.S. Yu, "Mining Frequent Patterns in Data Streams at Multiple Time Granularities", in H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.), Next Generation Data Mining, AAAI/MIT, 2004.
(Tutorial) J. Han, L. V. S. Lakshmanan and J. Pei, "Scalable Frequent-Pattern Mining Methods: An Overview (MS PowerPoint Slides)". In Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2001), San Francisco, California, USA, August 26 - 29, 2001.
Y. Tao, C. Sheng, and J. Pei. "On k-skip Shortest Paths". In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.
H. Maserrat and J. Pei. "Neighbor Query Friendly Compression of Social Networks". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix). In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.
D. Jiang and J. Pei. "Mining Frequent Cross-Graph Quasi-Cliques". ACM Transactions on Knowledge Discovery in Data, Volume 2, Number 4, pages 16:1-42, January 2009, ACM Press.
Y. Han, B. Zhou, J. Pei and Y. Jia. "Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach''. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), April 30 - May 2, 2009, Sparks, Nevada.
Y. Xiao, W. Wu, J. Pei, W. Wang, and Z. He. "Efficiently Indexing Shortest Paths by Exploiting Symmetry in Graphs". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.
B.-W. On, E. Elmacioglu, D. Lee, J. Kang, and J. Pei. "Improving Grouped-Entity Resolution using Quasi-Cliques". In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006.
J. Pei, H. Wang, J. Liu, K. Wang, J. Wang, and P. S. Yu. "Discovering Frequent Closed Partial Orders from Strings". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 11, pages 1467-1481, November 2006, IEEE Computer Society. (Software)
B.-W. On, D. Lee, E. Elmacioglu, J. Kang, and J. Pei. "An Effective Approach to Entity Resolution Problem Using Quasi-Clique and its Application to Digital Libraries" (short paper). In Proceedings of the ACM/IEEE 2006 Joint Conf. on Digital Libraries (JCDL'06), Chapel Hill, NC, USA, June 11-15, 2006.
J. Pei, J. Liu, H. Wang, K. Wang, P. S. Yu, and J. Wang. "Efficiently Mining Frequent Closed Partial Orders", In Proceedings of the 5th IEEE International Conference on Data Mining (ICDM'05), New Orleans, Louisiana, USA, November 27-30 2005. (Software)
J. Pei, D. Jiang, and A. Zhang. "On Mining Cross-Graph Quasi-Cliques". In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05), Chicago, IL, USA, August 21-24, 2005.
(Demo) W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han. "GraphMiner: A Structural Pattern Mining System for Large Disk-Based Graph Databases and Its Applications". In Proceedings of the 24th ACM SIGMOD International Conference on Management of Data (SIGMOD'05), Baltimore, Maryland, USA, June 14-16, 2005.
J. Pei, D. Jiang and A. Zhang. "Mining Cross-graph Quasi-cliques in Gene Expression and Protein Interaction Data" (research poster paper). In Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005.
C. Wang, W. Wang, J. Pei, Y. Zhu and B. Shi. "Scalable Mining of Large Disk-based Graph Databases" (research full paper). In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04), Seattle, WA, USA, August 22 - 25, 2004.
C. Wang, M. Hong, J. Pei, H. Zhou, W. Wang and B. Shi. "Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining" (full paper). In Proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'04), Sydney, Australia, May 26-28, 2004.
R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Online Skyline Analysis with Dynamic Preferences on Nominal Attributes". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 1, pages 35-49, January 2009, IEEE Computer Society.
J. Pei, Y. Tao, and J. Han. "Preference Queries from OLAP and Data Mining Perspective". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.
B. Jiang, J. Pei, X. Lin, D. W.-L. Cheung, and J. Han. "Mining Preferences from Superior and Inferior Examples''. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Mining Favorable Facets". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.
Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, and H. Li. "Mining Concept Sequences from Large-scale Search Logs for Context-aware Query Suggestion". To appear in ACM Transactions on Intelligent Systems and Technology, ACM Press.
Z. Xing, J. Pei, and P.S. Yu. "Early Classification on Time Series". To appear in Knowledge and Information Systems: An International Journal, Springer-Verlag.
Y. Liu, Y. Zhao, L. Chen, J. Pei, and J. Han. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays". To appear in IEEE Transactions on Parallel and Distributed Systems, IEEE Computer Society.
Z. Xing, J. Pei, P.S. Yu., and K. Wang. "Extracting Interpretable Features for Early Classification on Time Series". In Proceedings of 11th SIAM International Conference on Data Mining (SDM'11), April 28 - 30, 2011, Phoenix, Arizona, USA.
E. Loekito, J. Bailey, and J. Pei. "Binary Decision Diagram Based Approach for Mining Frequent Subsequences". Knowledge and Information Systems: An International Journal, Volume 24, Number 2, pages 235-268, August, 2010, Springer-Verlag.
Z. Xing, J. Pei, and E. Keogh. "A Brief Survey on Sequence Classification". ACM SIGKDD Explorations, Volume 12, Issue 1, pages 40-48, June 2010, ACM Press.
D. Kang, D. Jiang, J. Pei, Z. Liao, X. Sun, and H-J. Choi. "Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.
Y. Zhao, H. Zhang, L. Cao, J. Pei, and C. Zhang. "Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns". In Proceedings of the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD'09), Bled, Slovenia, September 7-11, 2009.
Z. Xing, J. Pei, and P. S. Yu. "Early Classification on Time Series: A Nearest Neighbor Approach". In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA, July 14-17, 2009.
H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. "MAPO: Mining and Recommending API Usage Patterns". In Proceedings of the 23rd European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10,
H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. "MAPO: Mining and Recommending API Usage Patterns". In Proceedings of the 23rd European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10, 2009.
B. Zhou, D. Jiang, J. Pei, and H. Li. "OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines". In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
H. Cao, D. Jiang, J. Pei, E. Chen, and H. Li. "Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs''. In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Search Track), April 20-24, 2009, Madrid, Spain.
Z. Xing, J. Pei, G. Dong, and P. S. Yu. "Mining Sequence Classifiers for Early Prediction". In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, April 24-26, 2008.
G. Dong and J. Pei, Sequence Data Mining (monograph), Springer, 2007, ISBN: 978-0-387-69936-3.
J. Pei, J. Han, and W. Wang. "Constraint-Based Sequential Pattern Mining: The Pattern-Growth Methods". Journal of Intelligent Information Systems, Volume 28, Number 2, pages 133-160, April 2007, Springer-Verlag.
Y. Bu, T-W Leung, A. W.-C. Fu, E. Keogh, J. Pei, and S. Meshkin. "WAT: Finding Top-K Discords in Time Series Database". In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, USA, April 26-28, 2007.
J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.C. Hsu. "Mining Sequential Patterns by Pattern-growth: The PrefixSpan Approach". IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 11, pages 1424-1440, November 2004, IEEE Computer Society.
J. Han, J. Pei, and X. Yan. "From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach". (Invited paper) Journal of Computer Science and Technology, Vol. 19, No. 3, pages 257-279. May 2004. Allerton Press, Inc.
Y. Liu, L. Chen, J. Pei, Q. Chen, and Y. Zhao. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays". In Proceedings of the 5th Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07), White Plains, NY, USA, March 19-23, 2007.
T. Xie and J. Pei. "MAPO: Mining API Usages from Open Source Repositories" (short paper). In Proceedings of the 3rd International Workshop on Mining Software Repositories (MSR 2006), Shanghai, China, May 22-23, 2006.
H.C. Kum, J. Pei, and W. Wang. "ApproxMAP: Approximate Mining of Consensus Sequential Patterns". In Proceedings of the 2003 SIAM International Conference on Data Mining (SIAM DM '03), San Francisco, CA, May 1-3, 2003.
J. Pei, J. Han, and W. Wang. "Mining Sequential Patterns with Constraints in Large Databases" (Regular paper). In Proceedings of the 11th ACM International Conference on Information and Knowledge Management (CIKM'02), McLean, VA, November 4-9, 2002.
H. Pinto, J. Han, J. Pei, K. Wang, Q. Chen, and U. Dayal. "Multi-Dimensional Sequential Pattern Mining" (Regular paper). In Proceedings of the 10th ACM International Conference on Information and Knowledge Management (CIKM'01), Atlanta, Georgia, November 2001.
J. Han and J. Pei. "Pattern Growth Methods for Sequential Pattern Mining: Principles and Extensions" (invited paper). In Proceedings of the 2001 ACM SIGKDD Workshop on Temporal Data Mining, San Francisco, California, USA, August 26, 2001.
J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu, "PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth". In Proceedings of the 2001 International Conference on Data Engineering (ICDE'01), Heidelberg, Germany, April 2001.
J. Han, J. Pei, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. Hsu. "FreeSpan: Frequent pattern-projected sequential pattern mining". In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'2000), Boston, MA, August 2000.
(Tutorial) J. Pei and J. Han, "Sequential Pattern Mining: From Shopping History Analysis to Weblog Mining and DNA Mining (MS PowerPoint Slides)", In the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD-01), April 16, 2001, Hong Kong.
J. Han, J. Pei, and X. Yan. "Sequential Pattern Mining by Pattern-Growth: Principles and Extensions", in W. Chu and T. Lin (eds.), Foundations and Advances in Data Mining, Studies in Fuzziness and Soft Computing 180, pages 183-220, Springer-Verlag GmbH, 2005.
C. Wang, L.Y. Yuan, J-H. You, O.R. Zaiane, and J. Pei. "On Pruning for Top-k Ranking in Uncertain Databases". In Proceedings of the 37th International Conference on Very Large Data Bases (VLDB'11), Seattle, WA, USA, August 29-September 3, 2011.
Y. Zhang, W. Zhang, X. Lin, B. Jiang, and J. Pei. "Ranking Uncertain Sky: the Probabilistic Top-k Skyline Operator". The Information System Journal, Volume 36, Issue 5, pages 898-915, July 2011, Wiley.
M. Hua, J. Pei, and X. Lin. "Ranking Queries on Uncertain Data". The VLDB Journal, Volume 20, Number 1, pages 129-153, February 2011, Springer Berlin / Heidelberg.
H. Maserrat and J. Pei. "Neighbor Query Friendly Compression of Social Networks". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
W. Zhang, X. Lin, Y. Zhang, J. Pei, and W. Wang. "Threshold-based Probabilistic Top-k Dominating Queries". The VLDB Journal, Volume 19, Number 2, pages 283-305, April, 2010, Springer Berlin / Heidelberg.
Y. Tao, K. Yi, C. Sheng, J. Pei, and F. Li. "Logging Every Footstep: Quantile Summaries for the Entire History". In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (SIGMOD'10), Indianapolis, Indiana, USA.
M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix). In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.
M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Top-k Typicality Queries and Efficient Query Answering Methods on Large Databases". The VLDB Journal, Volume 18, Number 3, pages 809-835, June 2009Springer Berlin / Heidelberg.
M. Hua and J. Pei. "Continuously Monitoring Top-K Uncertain Data Streams: A Probabilistic Threshold Method". Distributed and Parallel Databases: An International Journal, Volume 26, Number 1, (special issue on ranking in databases), pages 29-65, August, 2009, Springer-Verlag.
M. Hua, J. Pei, W. Zhang, and X. Lin. "Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach". In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada. (Implementation, the real data sets, the synthetic data generator and the data sets).
M. Hua, J. Pei, W. Zhang, and X. Lin. "Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.
M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Efficiently Answering Top-k Typicality Queries on Large Databases". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.
Z. Lin, B. Jiang, J. Pei, and D. Jiang. "Mining Discriminative Items in Multiple Data Streams". World Wide Web Journal: Internet and Web Information Systems, Volume 13, Issue 4, pages 497-522, December 2010, Springer-Verlag.
Y. Tao, K. Yi, C. Sheng, J. Pei, and F. Li. "Logging Every Footstep: Quantile Summaries for the Entire History". In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (SIGMOD'10), Indianapolis, Indiana, USA.
M. Hua, M. K. Lau, J. Pei, and K. Wu. "Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 12, pages 1679-1691, December, 2009, IEEE Computer Society.
M. Hua and J. Pei. "Continuously Monitoring Top-K Uncertain Data Streams: A Probabilistic Threshold Method". Distributed and Parallel Databases: An International Journal, Volume 26, Number 1, (special issue on ranking in databases), pages 29-65, August, 2009, Springer-Verlag.
E. Soroush, K. Wu, and J. Pei. "Fast and Quality-Guaranteed Data Streaming in Resource-Constrained Sensor Networks''. In Proceedings of the 9th ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc'08), Hong Kong, China, May 26-30, 2008.
M. Cho, J. Pei, and K. Wang. "Answering Ad Hoc Aggregate Queries from Data Streams Using Prefix Aggregate Trees". Knowledge and Information Systems: An International Journal, Volume 12, Number 3, pages 301-329, August 2007, Springer-Verlag.
Y. Xu, K. Wang, A. W.-C. Fu, R. She, and J. Pei. "Privacy-preserving Data Stream Classification", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.
J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J. Wang, and Y. D. Cai. "Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams". Distributed and Parallel Databases, Volume 18, Number 2, pages 173-197, September 2005, Springer Science + Business Media.
Y. Xu, K. Wang, A. W. C. Fu, R. She, and J. Pei. "Classification Spanning Correlated Data Streams". In Proceedings of the ACM 15th Conference on Information and Knowledge Management (CIKM'06), Arlington, VA, USA, November 6-11, 2006.
W. Zhu, J. Pei, J. Yin, Y. Xie. "Granularity Adaptive Density Estimation and on-Demand Clustering of Concept-Drifting Data Streams". In Proceedings of the 8th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'06), Krakow, Poland, September 4-8, 2006.
H. Wang, J. Yin, J. Pei, P. S. Yu, and J. X. Yu. "Suppressing Model Overfitting in Mining Concept-Drifting Data Streams". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.
H. Wang and J. Pei. "A Random Method for Quantifying Changing Distributions in Data Streams", In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD'05), Porto, Portugal, October 3-7, 2005.
(Tutorial) J. Pei, H. Wang and P.S. Yu, "Online Mining Data Streams: Problems, Applications and Progress". In Proceedings of the 10th ACM SIGKDD International Conference on Data Mining (KDD'04), Seattle, WA, August 22 - 25, 2004, Proceedings of the 21st International Conference on Data Engineering (ICDE'05), Tokyo, Japan, April 5-8, 2005, and Proceedings of the 6th International Conference on Web-Age Information Management (WAIM'05), Hangzhou, China, October 11-13, 2005.
C. Giannella, J. Han, J. Pei, X. Yan, and P.S. Yu, "Mining Frequent Patterns in Data Streams at Multiple Time Granularities", in H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.), Next Generation Data Mining, AAAI/MIT, 2004.
B. Zhou and J. Pei. "Aggregate Keyword Search on Large Relational Databases". To appear in Knowledge and Information Systems: An International Journal, Springer-Verlag.
D. Kang, D. Jiang, J. Pei, Z. Liao, X. Sun, and H-J. Choi. "Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.
R. C. Raïssi, J. Pei, and T. Kister. "Computing Closed Skycubes". In Proceedings of the 36th International Conference on Very Large Data Bases (VLDB'10), Singapore, September 13-17, 2010.
B. Zhou, D. Jiang, J. Pei, and H. Li. "OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines". In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
B. Zhou and J. Pei. "Answering Aggregate Keyword Queries on Relational Databases Using Minimal Group-bys". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.
F. M. Jiang, J. Pei, and A. W.-C. Fu. "IX-Cubes: Iceberg Cubes for Data Warehousing and OLAP on XML Data". In Proceedings of the ACM 16th Conference on Information and Knowledge Management (CIKM 2007), Lisboa, Portugal, November 6-9, 2007.
R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Mining Favorable Facets". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.
J. Pei, A. W. C. Fu, X. Lin, and H. Wang. "Computing Compressed Skyline Cubes Efficiently". In Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDE'07), Istanbul, Turkey, April 16-20, 2007.
J. Pei, Y. Yuan, X. Lin, W. Jin, M. Ester, Q. Liu, W. Wang, Y. Tao, J. X. Yu, and Q. Zhang. "Towards Multidimensional Subspace Skyline Analysis". ACM Transactions on Database Systems, Volumn 31, Number 4, pages 1335-1381, December 2006, ACM Press.
M. Cho, J. Pei, and K. Wang. "Answering Ad Hoc Aggregate Queries from Data Streams Using Prefix Aggregate Trees". Knowledge and Information Systems: An International Journal, Volume 12, Number 3, pages 301-329, August 2007, Springer-Verlag.
Y. Chen, G. Dong, J. Han, J. Pei, B. W. Wah, and J. Wang. "Regression Cubes with Lossless Compression and Aggregation". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 12, pages 1585-1599, December 2006, IEEE Computer Society.
J. Wang, J. Han, and J. Pei. "Closed Constrained-Gradient Mining in Retail Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 6, pages 764-769, June 2006, IEEE Computer Society.
J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J. Wang, and Y. D. Cai. "Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams". Distributed and Parallel Databases, Volume 18, Number 2, pages 173-197, September 2005, Springer Science + Business Media.
G. Dong, J. Han, J. Lam, J. Pei, K. Wang, and W. Zou. "Mining Constrained Gradients in Large Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 8, pages 922-938, August 2004, IEEE Computer Society.
J. Pei, W. Jin, M. Ester, and Y. Tao. "Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces". In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB'05), Trondheim, Norway, August 30-September 2, 2005.
H. Yu, J. Pei, S. Tang, and D. Yang. "Mining Most General Multidimensional Summarization of Probable Groups in Data Warehouses". In Proceedings of the 17th International Scientific and Statistical Database Management Conference (SSDBM'05), Santa Barbara, California, USA, June 27-29, 2005.
M. Cho, J. Pei and D. Cheung. "Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses" (poster paper). In Proceedings of the 5th SIAM International Conference on Data Mining (SDM'05), Newport Beach, CA, USA, April 21-23, 2005.
(Demo) L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "Efficacious Data Cube Exploration by Semantic Summarization and Compression". In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03), Berlin, Germany, September 9-12, 2003.
L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "QC-Trees: An Efficient Summary Structure for Semantic OLAP". In Proceedings of the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 9-12, 2003.
(Demo) L.V.S. Lakshmanan, J. Pei, and Y. Zhao. "SOCQET: Semantic OLAP with Compressed Cube and Summarization". In Proceedings of the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD'03), San Diego, CA, June 9-12, 2003.
J. Pei. "A General Model for Online Analytical Processing of Complex Data". In Proceedings of the 22nd International Conference on Conceptual Modeling (ER'03), Chicago, IL, October 13-16, 2003.
L.V.S. Lakshmanan, J. Pei, J. Han. "Quotient Cube: How to Summarize The Semantics of A Data Cube". In Proceedings of the 28th International Conference on Very Large Databases (VLDB'02), Hong Kong, China, August 20-23, 2002.
(Demo) G. Dong, J. Han, J. Lam, J. Pei, J. Wang, K. Wang. "CubeExplorer: Online Exploration of Data Cubes". In Proceedings of the 2002 ACM-SIGMOD International Conference on Management of Data (SIGMOD'02), Madison, Wisconsin, June 3-6, 2002.
Y. Chen, G. Dong, J. Han, J. Pei, B. Wah, J. Wang "Online Analytical Processing Stream Data: Is It Feasible?". In Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD'2002), Madison, Wisconsin, June 2, 2002.
H. Pinto, J. Han, J. Pei, K. Wang, Q. Chen, and U. Dayal. "Multi-Dimensional Sequential Pattern Mining" (Regular paper). In Proceedings of the 10th ACM International Conference on Information and Knowledge Management (CIKM'01), Atlanta, Georgia, November 2001.
G. Dong, J. Han, J. Lam, J. Pei, and K. Wang. "Mining Multi-Dimensional Constrained Gradients in Data Cubes", Proceedings of the 27th International Conference on Very Large Data Base (VLDB'01), Roma, Italy, September 2001.
J. Han, J. Pei, G. Dong, and K. Wang, "Efficient Computation of Iceberg Cubes with Complex Measures". In Proceedings of the 2001 ACM-SIGMOD International Conference on Management of Data (SIGMOD'01), Santa Barbara, CA, May 2001.
Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, and H. Li. "Mining Concept Sequences from Large-scale Search Logs for Context-aware Query Suggestion". To appear in ACM Transactions on Intelligent Systems and Technology, ACM Press.
Q. He, D. Kifer, J. Pei, P. Mitra, and L. Giles. "Citation Recommendation without Author Supervision". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.
B. Xiang, D. Jiang, J. Pei, X. Sun, E. Chen, and H. Li. "Context-Aware Ranking in Web Search". In Proceedings of the 33rd Annual ACM SIGIR Conference (SIGIR'10), Geneva, Switzerland, July 19-23, 2010.
Q. He, J. Pei, D. Kifer, P. Mitra, and C.L. Giles. "Context-aware Citation Recommendation". In Proceedings of the 19th International World Wide Web Conference (WWW'10), Raleigh, NC, USA, April 26-30, 2010.
B. Aljaber, N. Stokes, J. Bailey, and J. Pei. "Document Clustering of Scientific Texts Using Citations Contexts". Information Retrieval, Volume 13, Number 2, pages 101-131, April, 2010, Springer-Verlag.
Q. He, B. Chen, J. Pei, B. Qiu, P. Mitra, and C. L. Giles. "Detecting Topic Evolution in Scientific Literature: How Can Citations Help?". In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM'09), Hong Kong, November 2-6, 2009.
H. Cao, D. Jiang, J. Pei, E. Chen, and H. Li. "Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs''. In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Search Track), April 20-24, 2009, Madrid, Spain.
J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, and Z. Guan. "News Article Extraction with Template-Independent Wrapper". In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Poster), April 20-24, 2009, Madrid, Spain.
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. "Context-Aware Query Suggestion by Mining Click-Through and Session Data" (Best Application Paper Award). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
X. Sun, H. Wang, J. Li, and J. Pei. "Publishing Anonymous Survey Rating Data". Data Mining and Knowledge Discovery, Volume 23, pages 379-406, November 2011, Springer-Verlag.
B. Zhou and J. Pei. "k-Anonymity and l-Diversity Approaches for Privacy Preservation in Social Networks against Neighborhood Attacks". Knowledge and Information Systems: An International Journal, Volume 28, Number 1, pages 47-77, July 2011, Springer-Verlag.
R.C.W. Wong, A.W.C. Fu, K.Wang, P.S. Yu, and J. Pei. "Can the Utility of Anonymized Data Be Used for Privacy Breaches?". ACM Transactions on Knowledge Discovery in Data, Volume 5, Issue 3, pages 16:1-24, August 2011, ACM Press.
M. Hay, K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Management in Information Networks''. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.
R. C-W. Wong, A. W-C. Fu, K. Wang, Y. Xu, J. Pei, and P.S. Yu. "Probabilistic Inference Protection on Anonymized Data". In Proceedings of the 10th IEEE International Conference on Data Mining (ICDM'10), Sydney, Australia, December 14-17, 2010.
K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Mining in Information Networks''. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
Y. Tao, J. Pei, J. Li, X. Xiao, K. Yi, and Z. Xing. "Hiding Correlation by Independence Masking". In Proceedings of the 26th IEEE International Conference on Data Engineering (ICDE'10), Long Beach, CA, USA, March 1-6, 2010.
R. C.-W. Wong, A. W.-C. Fu, K. Wang, and J. Pei. "Anonymization based Attack in Privacy Preserving Data Publishing". ACM Transactions on Database Systems, Volume 34, Issue 2, pages 8:1-46, June 2009, ACM Press.
B. Zhou, J. Pei, and W.-S. Luk. "A Brief Survey on Anonymization Techniques for Privacy Preserving Publishing of Social Network Data". ACM SIGKDD Explorations, Volume 10, Issue 2, pages 12-22, December 2008, ACM Press.
B. Zhou, Y. Han, J. Pei, B. Jiang, Y. Tao, and Y. Jia. "Continuous Privacy Preserving Publishing of Data Streams". In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.
J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. "Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies". IEEE Transactions on Knowledge and Data Engineering, Volume 20, Number 9, pages 1181-1194, September 2008, IEEE Computer Society.
J. Pei, Y. Tao, J. Li, X. Xiao. "Privacy Preserving Publishing on Multiple Quasi-Identifiers". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China. (Full version as Technical Report TR 2008-18, School of Computing Science, Simon Fraser University.)
Y. Xu, B. Fang, K. Wang, A. W.-C. Fu, and J. Pei. "Publishing Sensitive Transactions for Itemset Utility". In Proceedings of the 8th IEEE International Conference on Data Mining (ICDM'08), December 15-19, 2008, Pisa, Italy.
B. Zhou and J. Pei. "Preserving Privacy in Social Networks against Neighborhood Attacks". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.
B. C. M. Fung, K. Wang, A. W.-C. Fu, and J. Pei. "Anonymity for Continuous Data Publishing". In Proceedings of the 11th International Conferences on Extending Database Technology (EDBT'08), Nantes, France, March 25-30, 2008.
R. C.-W. Wong, A. W.-C. Fu, K. Wang, and J. Pei. "Minimality Attack in Privacy Preserving Data Publishing". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.
J. Pei, J. Xu, Z. Wang, W. Wang, and K. Wang. "Maintaining K-Anonymity against Incremental Updates". In Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM'07), Banff, Canada, July 9-11, 2007.
M. Hua and J. Pei, "A Survey of Utility-based Privacy-preserving Data Transformation Methods", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.
Y. Xu, K. Wang, A. W.-C. Fu, R. She, and J. Pei. "Privacy-preserving Data Stream Classification", in C. Aggarwal and P. S. Yu (eds.), Privacy-Preserving Data Mining: Models and Algorithms, Springer-Verlag, 2007.
R. C.-W. Wong, Y. Liu, J. Yin, Z. Huang, A. W.-C. Fu, and J. Pei. "(alpha, k)-anonymity Based Privacy Preservation by Lossy Join". In Proceedings of the 9th Asia-Pacific Web Conference and the 8th International Conference on Web-Age Information Management (APWEB/WAIM'07), Huangshan, China, June 16-18, 2007.
J. Pei, M. K. Lau, and P. S. Yu. "TS-Trees: A Non-Alterable Search Tree Index for Trustworthy Databases on Write-Once-Read-Many (WORM) Storage" (IEEE Outstanding Paper Award). In Proceedings of the IEEE 21st International Conference on Advanced Information Networking and Applications (AINA'07), Niagara Falls, ON, Canada, May 21-23, 2007.
J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W. Fu. "Utility-Based Anonymization for Privacy Preservation with Less Information Loss". ACM SIGKDD Explorations, Volume 8, Issue 2, pages 21-30, December 2006.
J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. "Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures". In Proceedings of the 8th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'06), Krakow, Poland, September 4-8, 2006.
C. Aggarwal, J. Pei, and B. Zhang. "On Privacy Preservation against Adversarial Data Mining". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.
J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W.-C. Fu. "Utility-Based Anonymization Using Local Recoding". In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, USA, August 20-23, 2006.
J. Xu, W. Wang, J. Pei,
X. Wang, B. Shi, and A. W.-C. Fu. "Utility-Based Anonymization for
Privacy Preservation with Less Information Loss". In
Proceedings of
the
2nd ACM SIGKDD Workshop on Utility-Based Data Mining (UBDM'06),
in conjunction with the
12th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining (KDD'06), Philadelphia, PA, USA, August 20, 2006.
B. Zhou and J. Pei. "k-Anonymity and l-Diversity Approaches for Privacy Preservation in Social Networks against Neighborhood Attacks". Knowledge and Information Systems: An International Journal, Volume 28, Number 1, pages 47-77, July 2011, Springer-Verlag.
M. Hay, K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Management in Information Networks''. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD'11), Athens, Greece, June 12-16, 2011.
K. Liu, G. Miklau, J. Pei, and E. Terzi. "Privacy-aware Data Mining in Information Networks''. In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
B. Zhou, J. Pei, and W.-S. Luk. "A Brief Survey on Anonymization Techniques for Privacy Preserving Publishing of Social Network Data". ACM SIGKDD Explorations, Volume 10, Issue 2, pages 12-22, December 2008, ACM Press.
H. Maserrat and J. Pei. "Neighbor Query Friendly Compression of Social Networks". In Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'10), Washington, DC, USA, July 25-28, 2010.
Y. Han, B. Zhou, J. Pei and Y. Jia. "Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach''. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), April 30 - May 2, 2009, Sparks, Nevada.
B. Zhou and J. Pei. "Preserving Privacy in Social Networks against Neighborhood Attacks". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.
Y. Zhang, W. Zhang, X. Lin, B. Jiang, and J. Pei. "Ranking Uncertain Sky: the Probabilistic Top-k Skyline Operator". The Information System Journal, Volume 36, Issue 5, pages 898-915, July 2011, Wiley.
B. Jiang, J. Pei, and X. Lin. "Probabilistic Skylines on Uncertain Data: Model and Bounding-Pruning-Refining Methods". To appear in Journal of Intelligent Information Systems, Springer-Verlag.
Y. Liu, Y. Zhao, L. Chen, J. Pei, and J. Han. "Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays". To appear in IEEE Transactions on Parallel and Distributed Systems, IEEE Computer Society.
R. C. Raïssi, J. Pei, and T. Kister. "Computing Closed Skycubes". In Proceedings of the 36th International Conference on Very Large Data Bases (VLDB'10), Singapore, September 13-17, 2010.
S. Yuen, Y. Tao, X. Xiao, J. Pei, D. Zhang. "Superseding Nearest Neighbor Search on Uncertain Spatial Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 7, pages 1041-1055, July, 2010, IEEE Computer Society.
M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix). In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.
T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei. "MobileMiner: A Real World Case Study of Data Mining in Mobile Communication" (demo paper). In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD'09), June 29-July 2, 2009, Providence, Rhode Island, USA.
R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Online Skyline Analysis with Dynamic Preferences on Nominal Attributes". IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 1, pages 35-49, January, 2009, IEEE Computer Society.
B. Jiang and J. Pei. "Online Interval Skyline Queries on Time Series". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.
Y. Tao, L. Ding, X. Lin, and J. Pei. "Distance-based Representative Skyline". In Proceedings of the 25th IEEE International Conference on Data Engineering (ICDE'09), March 29 - April 4, 2009, Shanghai, China.
R. C.-W. Wong, A. W.-C. Fu, J. Pei, Y. S. Ho, T. Wong, and Y. Liu. "Efficient Skyline Querying with Variable User Preferences on Nominal Attributes". In Proceedings of the 34th International Conference on Very Large Databases (VLDB'08), August 24-30, 2008, Auckland, New Zealand.
J. Pei, B. Jiang, X. Lin, and Y. Yuan. "Probabilistic Skylines on Uncertain Data". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.
Y. Tao, X. Xiao, and
J. Pei. "Efficient Skyline
and Top-k Retrieval in Subspaces". IEEE Transactions on Knowledge and Data Engineering,
Volume 19, Number 8, pages
1072-1088, August 2007, IEEE Computer Society.
R. C.-W. Wong, J. Pei, A. W.-C. Fu, and K. Wang. "Mining Favorable Facets". In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07), San Jose, California, USA, August 12-15, 2007.
J. Pei, A. W. C. Fu, X. Lin, and H. Wang. "Computing Compressed Skyline Cubes Efficiently". In Proceedings of the 23rd IEEE International Conference on Data Engineering (ICDM'07), Istanbul, Turkey, April 16-20, 2007.
J. Pei, Y. Yuan, X. Lin, W. Jin, M. Ester, Q. Liu, W. Wang, Y. Tao, J. X. Yu, and Q. Zhang. "Towards Multidimensional Subspace Skyline Analysis". ACM Transactions on Database Systems, Volumn 31, Number 4, pages 1335-1381, December 2006, ACM Press.
Y. Huang, J. Pei, and H. Xiong. "Mining Co-Location Patterns with Rare Events from Spatial Data Sets''. GeoInformatica, Volume 10, Number 3, pages 239-260, September 2006, Springer Netherlands.
(Demo) X. Zhou, J. Zhang, W. Wang, B. Shi, and J. Pei. "Using High Dimensional Indexes to Support Relevance Feedback Based Interactive Images Retrieval". In Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB'06), Seoul, Korea, September 12-15, 2006.
Y. Tao, X. Xiao, and J. Pei. "SUBSKY: Efficient Computation of Skylines in Subspaces". In Proceedings of the 22nd International Conference on Data Engineering (ICDE'06), Atlanta, GA, USA, April 3-7, 2006.
J. Ye, X. Zhou, J. Pei, L. Chen, and L. Zhang. "A Stratification-Based Approach to Accurate and Fast Image Annotation". In Proceedings of the 6th International Conference on Web-Age Information Management (WAIM'05), Hangzhou, China, October 11-13, 2005.
J. Pei, W. Jin, M. Ester, and Y. Tao. "Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces". In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB'05), Trondheim, Norway, August 30-September 2, 2005.
Y. Huang, H. Xiong, S. Shekhar, and J. Pei. "Mining Confident Co-location Rules without A Support Threshold". In Proceedings of the 18th Annual ACM Symposium on Applied Computing (SAC'03), Melbourne, Florida, March 9 - 12, 2003.
B. Jiang, J. Pei, Y. Tao, and X. Lin. "Clustering Uncertain Data Based on Probability Distribution Similarity". To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
C. Wang, L.Y. Yuan, J-H. You, O.R. Zaiane, and J. Pei. "On Pruning for Top-k Ranking in Uncertain Databases". In Proceedings of the 37th International Conference on Very Large Data Bases (VLDB'11), Seattle, WA, USA, August 29-September 3, 2011.
Y. Zhang, W. Zhang, X. Lin, B. Jiang, and J. Pei. "Ranking Uncertain Sky: the Probabilistic Top-k Skyline Operator". The Information System Journal, Volume 36, Issue 5, pages 898-915, July 2011, Wiley.
M. Hua, J. Pei, and X. Lin. "Ranking Queries on Uncertain Data". The VLDB Journal, Volume 20, Number 1, pages 129-153, February 2011, Springer Berlin / Heidelberg.
B. Jiang, J. Pei, and X. Lin. "Probabilistic Skylines on Uncertain Data: Model and Bounding-Pruning-Refining Methods". To appear in Journal of Intelligent Information Systems, Springer-Verlag.
B. Jiang and J. Pei. "Outlier Detection on Uncertain Data: Objects, Instances, and Inferences". In Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE'11), Hannover, Germany, April 11-16, 2011.
W. Zhang, X. Lin, Y. Zhang, J. Pei, and W. Wang. "Threshold-based Probabilistic Top-k Dominating Queries". The VLDB Journal, Volume 19, Number 2, pages 283-305, April, 2010, Springer Berlin / Heidelberg.
M. Hua and J. Pei. "Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection" (appendix). In Proceedings of the 13th International Conference on Extending Database Technology (EDBT'10), Lausanne, Switzerland, March 22-26, 2010.
M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Top-k Typicality Queries and Efficient Query Answering Methods on Large Databases". The VLDB Journal, Volume 18, Number 3, pages 809-835, June 2009Springer Berlin / Heidelberg.
S. Yuen, Y. Tao, X. Xiao, J. Pei, D. Zhang. "Superseding Nearest Neighbor Search on Uncertain Spatial Databases". IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 7, pages 1041-1055, July, 2010, IEEE Computer Society.
M. A. Cheema, X. Lin, W. Wang, W. Zhang, and J. Pei. "Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data". IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 4, pages 550-564, April, 2010, IEEE Computer Society.
M. Hua and J. Pei. "Continuously Monitoring Top-K Uncertain Data Streams: A Probabilistic Threshold Method". Distributed and Parallel Databases: An International Journal, Volume 26, Number 1, (special issue on ranking in databases), pages 29-65, August, 2009, Springer-Verlag.
J. Pei, M. Hua, Y. Tao, and X. Lin. "Mining Uncertain and Probabilistic Data: Problems, Challenges, Methods and Applications''. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
W. Zhang, X. Lin, J. Pei, and Y. Zhang. "Managing Uncertain Data: A Probabilistic Approach" (invited paper). In Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), July 20-22, 2008, Zhangjiajie, China.
(Tutorial) J. Pei, M. Hua, Y. Tao, and X. Lin. "Query Answering Techniques on Uncertain and Probabilistic Data'' (slides). In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada.
M. Hua, J. Pei, W. Zhang, and X. Lin. "Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach". In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD'08), June 11-14, 2008, Vancouver, Canada. (Implementation, the real data sets, the synthetic data generator and the data sets).
M. Hua, J. Pei, W. Zhang, and X. Lin. "Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.
J. Pei, B. Jiang, X. Lin, and Y. Yuan. "Probabilistic Skylines on Data". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.
M. Hua, J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. "Efficiently Answering Top-k Typicality Queries on Large Databases". In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07), Vienna, Austria, September 23-28 2007.
Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, and H. Li. "Mining Concept Sequences from Large-scale Search Logs for Context-aware Query Suggestion". To appear in ACM Transactions on Intelligent Systems and Technology, ACM Press.
D. Kang, D. Jiang, J. Pei, Z. Liao, X. Sun, and H-J. Choi. "Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach". In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11), Hong Kong, China, February 9-12, 2011.
B. Zhou and J. Pei. "Link Spam Target Detection Using Page Farms". ACM Transactions on Knowledge Discovery in Data, Volume 3, Number 3, pages 13:1-38, ACM Press.
B. Zhou, D. Jiang, J. Pei, and H. Li. "OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines". In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
J. Wang, C. Chen, C. Wang, J. Pei, J. Bu, Z. Guan, W. V. Zhang. "Can We Learn a Template-Independent Wrapper for News Article Extraction from a Single Training Site?". In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
B. Zhou and J. Pei. "OSD: An Online Web Spam Detection" (demo paper). In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), Paris, France, June 28 - July 1, 2009.
Y. Han, B. Zhou, J. Pei and Y. Jia. "Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach''. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), April 30 - May 2, 2009, Sparks, Nevada.
H. Cao, D. Jiang, J. Pei, E. Chen, and H. Li. "Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs''. In Proceedings of the 18th International World Wide Web Conference (WWW'09) (Search Track), April 20-24, 2009, Madrid, Spain.
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. "Context-Aware Query Suggestion by Mining Click-Through and Session Data" (Best Application Paper Award). In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), August 24-27, 2008, Las Vegas, NV, USA.
K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "Personalizing Entity Detection and Recommendation with a Fusion of Web Log mining Techniques" (Industrial track). In Proceedings of the 12th International Conference on Extending Database Technology (EDBT'09), March 23-26, 2009, Saint-Petersburg, Russia.
K. Tsoukalas, B. Zhou, J. Pei, and D. Cubranic. "PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques" (invited paper). In Proceedings of the 9th International Conference on Web-Age Information Management (WAIM'08), July 20-22, 2008, Zhangjiajie, China.
(Tutorial) J. Pei, B. Zhou. Z. Tang, and H. Huang. "Data Mining Techniques for Web Spam Detection". In Proceedings of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'08), May 20-23, 2003, Osaka, Japan.
B. Zhou, J. Pei, and Z. Tang. "A Spamicity Approach to Web Spam Detection". In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM'08), Atlanta, GA, April 24-26, 2008.
B. Zhou and J. Pei. "Preserving Privacy in Social Networks against Neighborhood Attacks". In Proceedings of the 24th International Conference on Data Engineering (ICDE'08), Cancún, México, April 7-12, 2008.
B. Zhou and J. Pei. "Sketching Landscapes of Page Farms". In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM'07), Minneapolis, MN, USA, April 26-28, 2007.
(Demo) T. Wang, S. Tang, D. Yang, J. Gao, Y. Wu, J. Pei. "COMMIX: Towards Effective Web Information Extraction. Integration and Query Answering". In Proceedings of the 2002 ACM-SIGMOD International Conference on Management of Data (SIGMOD'02), Madison, Wisconsin, June 3-6, 2002.
J. Pei, J. Han, B. Mortazavi-Asl, and H. Zhu. "Mining Access Patterns efficiently from Web logs". In Proceedings of the 2000 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'00), Kyoto, Japan, April 2000.
Z. Chen, C. Li, J. Pei, Y. Tao, H. Wang, W. Wang, J. Yang, J. Yang, and D. Zhang. "Recent Progress on Selected Topics in Database Research: A Report from Nine Young Chinese Researchers Working in the United States". (Invited paper) Journal of Computer Science and Technology, Vol. 18, No. 5, 538-552. September 2003. Allerton Press, Inc. (A printable version)
This pages last updated on September 20, 2011. Copyright 2002 - 2011, Jian Pei. All rights reserved.