IR research in LTI
My Ph.D. thesis: Bayesian Graphical Models for Adaptive Information Filtering
Y. Zhang Using Bayesian Priors to Combine Classifiers for Adaptive Filtering In Proceedings of the 27st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, 2004
Y. Zhang, W. Xu, J. Callan "Exploration and Exploitation in Adaptive Filtering Based on Bayesian Active Learning International Conference on Machine Learning (ICML 2003) slides
Y. Zhang, J. Callan and T. Minka Novelty and Redundancy Detection in Adaptive Filtering. In Proceedings of the 25st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, 2002. (Best Paper Award in SIGIR 2002) Slides Data
Byron Dom, Iris Eiron, Alex Cozzi, Yi Zhang "Graph-Based Ranking Algorithms for E-mail Expertise Analysis Data Mining and Knowledge Discovery Workshop(DMKD2003) in ACM SIGMOD Conference (SIGMOD2003).
K. Collins-Thompson, P. Ogilvie, Y. Zhang, and J. Callan. "Information filtering, novelty detection, and named-page finding." In Proceedings of the 2002 Text REtrieval Conference (TREC 2002). National Institute of Standards and Technology, special publication
Y. Zhang, W. Xu and J. Callan Exact Maximum Likelihood Estimation for Word Mixtures Text Learning Workshop in International Conference on Machine Learning (ICML), Sydney, Australia, 2002. Slides
Y. Zhang and J. Callan. "The bias problem and language models in adaptive filtering." In Proceedings of the 2001 Text REtrieval Conference (TREC 2001). National Institute of Standards and Technology, special publication.
Y. Zhang and J. Callan "Maximum Likelihood Estimation for Filtering Thresholds". In Proceedings of the 24st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 2001. Slides
Y. Zhang and J. Callan. "YFilter at TREC-9". In Proceedings of the Ninth Text REtrieval Conference (TREC-9). National Institute of Standards and Technology, special publication.
DataSet: Dataset for Novelty and Redundancy Detection while Filtering
Other old presentations:
Linear Method for Regression Aadaboost1 (2) Boosting and Additive Trees Based on "Elementary of Statistical Learning" by Trevor Hastie
Previous research work:
Data Mining from Emails and Web Pages: IBM Almaden Reserach Lab
Information Extraction from Hospital Diagnosis: Medical Archival Systems, Inc
Natural language Processing Lab Project: Open Domain Question and Answering System
Advanced Information Retrieval Seminar Project: A generative Model for generic Text Summarization
Information Theory Course Project w/ Wei Xu: Building Language Model Using ECOC coding
Machine Learning Course Project: Class and feature based language modeling by neural network
Statistics for Natural Language Processing Course Project: Comparision of Query Expansion Methods
State Key Laboratory of Intelligent Technology and Systems (B.S. thesis) : A general Purpose Virtual Reality Toolkit: ROBVR