IR research in LTI    

My Ph.D. thesis: Bayesian Graphical Models for Adaptive Information Filtering

Selected Publications:

Y. Zhang Using Bayesian Priors to Combine Classifiers for Adaptive Filtering In Proceedings of the 27st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, 2004

Y. Zhang, W. Xu, J. Callan  "Exploration and Exploitation in Adaptive Filtering Based on Bayesian Active Learning International Conference on Machine Learning (ICML 2003) slides

Y. Zhang, J. Callan and T. Minka Novelty and Redundancy Detection in Adaptive Filtering. In Proceedings of the 25st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, 2002. (Best Paper Award in SIGIR 2002) Slides Data

Other publications:

Byron Dom, Iris Eiron, Alex Cozzi, Yi Zhang "Graph-Based Ranking Algorithms for E-mail Expertise Analysis Data Mining and Knowledge Discovery Workshop(DMKD2003) in ACM SIGMOD Conference (SIGMOD2003). 

K. Collins-Thompson, P. Ogilvie, Y. Zhang, and J. Callan. "Information filtering, novelty detection, and named-page finding." In Proceedings of the 2002 Text REtrieval Conference (TREC 2002). National Institute of Standards and Technology, special publication

Y. Zhang, W. Xu and J. Callan Exact Maximum Likelihood Estimation for Word Mixtures Text Learning Workshop in International Conference on Machine Learning (ICML), Sydney, Australia, 2002. Slides

Y. Zhang and J. Callan. "The bias problem and language models in adaptive filtering." In Proceedings of the 2001 Text REtrieval Conference (TREC 2001). National Institute of Standards and Technology, special publication.

Y. Zhang and J. Callan "Maximum Likelihood Estimation for Filtering Thresholds". In Proceedings of the 24st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 2001. Slides

Y. Zhang and J. Callan. "YFilter at TREC-9". In Proceedings of the Ninth Text REtrieval Conference (TREC-9). National Institute of Standards and Technology, special publication.

DataSet: Dataset for Novelty and Redundancy Detection while Filtering

Other old presentations:

Linear Method for Regression  Aadaboost1 (2) Boosting and Additive Trees Based on "Elementary of Statistical Learning" by Trevor Hastie

Previous research work:

Data Mining from Emails and Web Pages: IBM Almaden Reserach Lab

Information Extraction from Hospital Diagnosis: Medical Archival Systems, Inc

Natural language Processing Lab Project: Open Domain Question and Answering System

Advanced Information Retrieval Seminar Project: A generative Model for generic Text Summarization

Information Theory Course Project w/ Wei Xu: Building Language Model Using ECOC coding

Machine Learning Course Project: Class and feature based language modeling by neural network

Statistics for Natural Language Processing Course Project: Comparision of Query Expansion Methods

State Key Laboratory of Intelligent Technology and Systems (B.S. thesis) : A general Purpose Virtual Reality Toolkit: ROBVR