Selected and/or recent papers by William W. Cohen

Recent papers: 2008

  1. Andrew Arnold, Ramesh Nallapati and William W. Cohen (2008): Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition in ACL-2008.
  2. Ramnath Balasubramanyan, Vitor Carvalho, and William W. Cohen (2008): CutOnce - Recipient Recommendation and Leak Detection in Action in AAAI-2008 Workshop on Enhanced Messaging.
  3. Noboru Matsuda, William W. Cohen, Jonathan Sewall, Gustavo Lacerda, Kenneth Koedinger (2008): SimStudent: Building an Intelligent Tutoring System by Tutoring a Synthetic Student in preparation.
  4. Einat Minkov and William W. Cohen (2008): Learning to Walk Structured Text Networks in CMU SCS Technical Report Series (CMU-LTI-08-02).
  5. Ramesh Nallapati, Amr Ahmed, Eric Xing, and William W. Cohen (2008): Joint Latent Topic Models for Text and Citations in KDD-2008.
  6. Frank Lin and William W. Cohen (2008): Accurate Semi-supervised Classification for Graph Data in preparation.
  7. Ramesh Nallapati and William W. Cohen (2008): Link-PLSA-LDA: A New Unsupervised Model for Topics and Influence of Blogs in ICWSM-2008.
  8. Yi-Chia Wang, Mahesh Joshi, William Cohen, and Carolyn Rosé (2008): Recovering Implicit Thread Structure in Newsgroup Style Conversations in ICWSM-2008.
  9. Frank Lin and William W. Cohen (2008): The MultiRank Bootstrap Algorithm: SemiSupervised Political Blog Classification and Ranking Using SemiSupervised Link Classification (2-page abstract) in ICWSM-2008.
  10. Frank Lin and William W. Cohen (2008): The MultiRank Bootstrap Algorithm: SemiSupervised Political Blog Classification and Ranking Using SemiSupervised Link Classification in CMU SCS Technical Report Series (CMU-LTI-08-03).
  11. Noboru Matsuda, William W. Cohen, Jonathan Sewall, Gustavo Lacerda, and Kenneth R. Koedinger (2008): Why Tutored Problem Solving may be better than Example Study: Theoretical Implications from a Simulated-Student Study in ITS-2008.
  12. Vitor Carvalho and William W. Cohen (2008): Ranking Users for Intelligent Message Addressing in ECIR-2008.

Recent papers: 2007

  1. Andrew Arnold, Ramesh Nallapati and William W. Cohen (2007): A Comparative Study of Methods for Transductive Transfer Learning in ICDM Workshop on Mining and Management of Biological Data.
  2. Ramesh Nallapati, William W. Cohen, and John Lafferty (2007): Parallelized Variational EM for Latent Dirichlet Allocation: An Experimental Evaluation of Speed and Scalability in ICDM Workshop on High Performance Data Mining.
  3. Ramesh Nallapati, Amr Ahmed, William Cohen and Eric Xing (2007): Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models in ICDM Workshop on High Performance Data Mining.
  4. William Cohen (2007): Graph Walks and Graphical Models in preparation.
  5. Richard Wang and William Cohen (2007): Language-Independent Set Expansion of Named Entities using the Web in ICDM-2007.
  6. Einat Minkov and William Cohen (2007): Learning to Rank Typed Graph Walks: Local and Global Approaches in WebKDD-2007.
  7. Sarah Zelikovitz, William Cohen, and Haym Hirsh (2007): Extending WHIRL with background knowledge for improved text classification in Information Retrieval 10(1) pp 35-67.
  8. Vitor Carvalho, Wen Wu and William Cohen (2007): Discovering Leadership Roles in Email Workgroups in CEAS-2007.
  9. Zhenzhen Kou, Vitor Carvalho and William Cohen (2007): Online Stacked Graphical Learning in NIPS-07 Workshop on Efficient Machine Learning.
  10. Vitor Carvalho and William Cohen (2007): Recommending Recipients in the Enron Corpus in preparation.
  11. Ramesh Nallapati, William Cohen, Susan Ditmore, John Lafferty and Kin Ung (2007): Multiscale Topic Tomography in KDD-2007.
  12. Noboru Matsuda, William Cohen, Jonathan Sewall, Gustavo Lacerda and Ken Koedinger (2007): Predicting students performance with a SimStudent that learns cognitive skills from observation in AIED-2007.
  13. Noboru Matsuda, William Cohen, Jonathan Sewall, Gustavo Lacerda and Ken Koedinger (2007): Evaluating a simulated student using real students data for training and testing in UM-2007.
  14. Juchang Hua, Orhan Ayasli, William Cohen and Robert Murphy (2007): Identifying Fluorescence Microscope Images in Online Journal Publications using Both Image and Text Features in ISBI-2007.
  15. Vitor Carvalho and William W. Cohen (2007): Preventing Information Leaks in Email in SDM-2007.
  16. Zhenzhen Kou and William W. Cohen (2007): Stacked Graphical Models for Efficient Inference in Markov Random Fields in SDM-2007.
  17. Zhenzhen Kou, William W. Cohen, and Robert F. Murphy (2007): A Stacked Graphical Model for Associating Information from Text And Images In Figures in PSB-2007.

Recent papers: 2006

  1. Richard C. Wang, Anthony Tomasic, Robert E. Frederking, William W. Cohen (2006): Learning to Extract Gene-Protein Names from Weakly-Labeled Text in CMU SCS Technical Report Series (CMU-LTI-08-04).
  2. Noboru Matsuda, William Cohen & Ken Koedinger (2006): What characterizes a better demonstration for cognitive modeling by demonstration? in CMU SCS Technical Report Series (CMU-ML-06-106).
  3. Noboru Matsuda, William W. Cohen, Jonathan Sewall, Kenneth R. Koedinger (2006): Applying Machine Learning to Cognitive Modeling for Cognitive Tutors in CMU SCS Technical Report Series (CMU-ML-06-105).
  4. Einat Minkov and William W. Cohen (2006): An Email and Meeting Assistant using Graph Walks in CEAS-2006.
  5. Einat Minkov, Andrew Ng and William W. Cohen (2006): Contextual Search and Name Disambiguation in Email using Graphs in SIGIR-2006.
  6. Vitor Carvalho and William W. Cohen (2006): Single-Pass Online Learning: Performance, Voting Schemes and Online Feature Selection in KDD-2006.
  7. Vitor Carvalho and William W. Cohen (2006): Improving Email Speech Act Analysis via N-gram Selection in HLT/NAACL ACTS Workshop 2006.
  8. Einat Minkov, Richard C. Wang, Anthony Tomasic and William W. Cohen (2006): NER Systems that Suit Users Preferences: Adjusting the Recall-Precision Trade-off for Entity Extraction in HLT/NAACL-2006 (short paper).
  9. William W. Cohen (2006): A Graph-Search Framework for GeneId Ranking (Extended Abstract) in BioNLP'06.
  10. William W. Cohen & Einat Minkov (2006): A Graph-Search Framework for Associating Gene Identifiers with Documents in BMC Bioinformatics.

Selected other papers

  1. William W. Cohen & Vitor Carvalho (2005): Stacked Sequential Learning in IJCAI-2005.
  2. Vitor Carvalho & William W. Cohen (2005): On the Collective Classification of Email Speech Acts in SIGIR 2005.
  3. Zhenzhen Kou, William W. Cohen & Robert F. Murphy (2005): High-Recall Protein Entity Recognition Using a Dictionary in ISMB-2005.
  4. Sunita Sarawagi & William W. Cohen (2004): Semi-Markov Conditional Random Fields for Information Extraction in NIPS 2004.
  5. William W. Cohen, Vitor R. Carvalho & Tom Mitchell (2004): Learning to Classify Email into "Speech Acts" in EMNLP 2004.
  6. Pradeep Ravikumar & William W. Cohen (2004): A Hierarchical Graphical Model for Record Linkage in UAI 2004.
  7. William W. Cohen & Sunita Sarawagi (2004): Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods in KDD 2004: 89-98.
  8. William W. Cohen (2003): Learning and Discovering Structure in Web Pages in IEEE Data Eng. Bull. 26(3): 3-10 (2003).
  9. Mikael Bilenko, Ray Mooney, William W. Cohen, Pradeep Ravikumar & Steve Fienberg (2003): Adaptive Name-Matching in Information Integration in IEEE Intelligent Systems 18(5): 16-23 (2003).
  10. William W. Cohen (2003): Infrastructure Components for Large-Scale Information Extraction Systems in IAAI 2003: 71-78.
  11. Cheng Zhai, William W. Cohen & John Lafferty (2003): Beyond Independent Topical Relevance: Methods and Evaluation Metrics for Subtopic Retrieval in SIGIR 2003: 10-17.
  12. William W. Cohen, Matthew Hurst & Lee S. Jensen (2003): A Flexible Learning System for Wrapping Tables and Lists in HTML Documents in Web Document Analysis: Challenges and Opportunities, ed. Antonacopoulos & Hu, World Scientific Publishing. (Originally published as: William W. Cohen, Matthew Hurst & Lee S. Jensen (2002): A Flexible Learning System for Wrapping Tables and Lists in HTML Documents in WWW 2002: 232-241; Lee S. Jensen & William W. Cohen (2001): A Structured Wrapper Induction System for Extracting Information from Semi-Structured Documents in Proc. of the IJCAI-2001 Workshop on Adaptive Text Extraction and Mining).
  13. Chumki Basu, Haym Hirsh, William W. Cohen & Craig Nevill-Manning (2001): Technical Paper Recommendation: A Study in Combining Multiple Information Sources in J. Artif. Intell. Res. (JAIR) 14: 231-252 (2001). (Originally published as: Chumki Basu, Haym Hirsh, William W. Cohen (1998): Recommendation as Classification: Using Social and Content-Based Information in Recommendation in AAAI/IAAI 1998: 714-720).
  14. William W. Cohen, David McAllester, and Henry Kautz (2000): Hardening Soft Information Sources in KDD 2000: 255-259.
  15. William W. Cohen (2000): Automatically extracting features for concept learning from the Web in ICML 2000: 159-166.
  16. William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in Computer Networks 33(1-6): 685-698 (2000). (Originally published as: William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in WWW 2000).
  17. William W. Cohen (2000): Data Integration using Similarity Joins and a Word-based Information Representation Language in ACM Trans. Inf. Syst. 18(3): 288-321 (2000). (Originally published as: William W. Cohen (1998): Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity in SIGMOD Conference 1998: 201-212; William W. Cohen (1997): Knowledge Integration for Structured Information Sources Containing Text (Extended Abstract) in SIGIR Workshop on Networked IR (informal proceedings)).
  18. William W. Cohen and Yoram Singer (1999): A Simple, Fast, and Effective Rule Learner in AAAI/IAAI 1999: 335-342.
  19. William W. Cohen, Rob Schapire, Yoram Singer (1999): Learning to Order Things in J. Artif. Intell. Res. (JAIR) 10: 243-270 (1999). (Originally published as: William W. Cohen, Robert E. Schapire, Yoram Singer (1997): Learning to Order Things in NIPS 1997).
  20. William W. Cohen (1996): Learning Trees and Rules with Set-valued Features in AAAI/IAAI, Vol. 1 1996: 709-716.
  21. William W. Cohen (1996): Learning Rules that Classify E-Mail in AAAI Spring Symposium on ML and IR 1996.
  22. William W. Cohen (1995): Fast effective rule induction in ICML 1995: 115-123.
  23. William W. Cohen and Haym Hirsh (1994): Learning the CLASSIC description logic: Theoretical and experimental results in KR 1994: 121-133.
