Picture of William Cohen

William W. Cohen

Professor, Machine Learning Department and Language Technology Institute, Carnegie Mellon University

Director of the Undergraduate Minor in Machine Learning and Co-Director of the Master's of Science in Machine Learning program

[ Bio | Announcements | Teaching | Projects | Publications (recent, all) | Software | Datasets | Talks | Students & Colleagues | Blog | Contact Info | Other Stuff ]

Biography

William Cohen received his bachelor's degree in Computer Science from Duke University in 1984, and a PhD in Computer Science from Rutgers University in 1990. From 1990 to 2000 Dr. Cohen worked at AT&T Bell Labs and later AT&T Labs-Research, and from April 2000 to May 2002 Dr. Cohen worked at Whizbang Labs, a company specializing in extracting information from the web. Dr. Cohen is a past president of the International Machine Learning Society, an Action Editor for the Journal of Machine Learning Research, and an Action Editor for the journal ACM Transactions on Knowledge Discovery from Data. He is also an editor, with Ron Brachman, of the AI and Machine Learning series of books published by Morgan Claypool. In the past he has also served as an action editor for the journal Machine Learning, the journal Artificial Intelligence, and the Journal of Artificial Intelligence Research. He was General Chair for the 2008 International Machine Learning Conference, held July 6-9 at the University of Helsinki, in Finland; Program Co-Chair of the 2006 International Machine Learning Conference; and Co-Chair of the 1994 International Machine Learning Conference. Dr. Cohen was also the co-Chair for the 3rd Int'l AAAI Conference on Weblogs and Social Media, which was held May 17-20, 2009 in San Jose, and was the co-Program Chair for the 4rd Int'l AAAI Conference on Weblogs and Social Media. He is a AAAI Fellow, and was a winner of the 2008 the SIGMOD "Test of Time" Award for the most influential SIGMOD paper of 1998, and the 2014 SIGIR "Test of Time" Award for the most influential SIGIR paper of 2002-2004.

Dr. Cohen's research interests include information integration and machine learning, particularly information extraction, text categorization and learning from large datasets. He holds seven patents related to learning, discovery, information retrieval, and data integration, and is the author of more than 200 publications.

Announcements

Projects

Projects I'm currently (or recently) involved with include:

Software and demos

Demos: Software:

Datasets

The following datasets are available for anyone to use for research purposes:

Talks and presentations

Teaching

Current/upcoming: Past courses:

Publications

Recent papers I'm keeping in HTML or PDF (which requires Adobe Acrobat Reader to view). Older papers are mostly in Postscript. For Windows, I use the GSView reader for postscript. Most of these papers are viewable in several formats in ResearchIndex.

Students and other colleagues