Lemur Search   
Language Technologies Institute
Carnegie Mellon University
School of Computer Science

LTI Colloquium Spring 2012

Course Information

The LTI colloquium is a series of talks related to language technologies. The topics include but are not restricted to Computational Linguistics, Machine Translation, Speech Recognition and Synthesis, Information Retrieval, Computational Biology, Machine Learning, Text Mining, Knowledge Representation, Computer-Assisted Language Learning and Intelligent Language Tutoring. To get credit of the course, students are required to write either a short critique of one of the presentations or a comparison of two.

Time: Fridays 2:30-4:00pm
Location: Baker Hall A51 (The Giant Eagle Auditorium)
Instructor: Roni Rosenfeld,roni (at) cs.cmu.edu
TA: Reyyan Yeniterzi, reyyan (at) cs.cmu.edu
Course Secretary: Corinne Meloni, cmeloni (at) cs.cmu.edu

Upcoming Talk

May 4, Friday, 2:30pm

Baker Hall A51 (The Giant Eagle Auditorium)

Eric Xing

CMU

Jointly Maximum Margin and Maximum Entropy Learning of Graphical Models

Graphical models (GMs) offer a powerful language to elegantly define expressive distributions, and a generic computational framework to support reasoning under uncertainty in a wide range of problems. Popular paradigms for training GMs include the maximum likelihood estimation, and more recently the max-margin learning, each enjoys some advantages, as well as weaknesses. For example, the maximum margin structured prediction model such as M3N lacks a straightforward probabilistic interpretation of the learning scheme and the prediction rule. Therefore its unique advantages such as support vector sparsity and kernel tricks cannot be easily conjoined with the merits of a probabilistic model such as Bayesian regularization, model averaging, and ability to model hidden variables.

In this talk, I present a new general framework called Maximum Entropy Discrimination Markov Networks (MEDN), which integrates the margin-based and likelihood-based approaches and combines and extends their merits. This new learning paradigm naturally facilitates integration of the generative and discriminative principles under a unified framework, and the basic strategies can be generalized to learn arbitrary GMs, such as the generative Bayesian networks, models with structured hidden variables, and even nonparametric Bayesian models, with a desirable maximum margin effect on structured or unstructured predictions. I will discuss a number of theoretical properties of this approach, and show applications of MEDN to learning a wide range of GMs including: fully supervised structured i/o model, max-margin structured i/o models with hidden variables, a max-margin LDA-style model for jointly discovering 'discriminative' latent topics and predicting document label/score of text documents, or total scene and objective categories in natural images, etc. Our empirical results strongly suggest that, for any GM with structured or unstructured labels, MEDN always leads to a more accurate predictive GM than the one trained under either MLE or Max Margin.

Joint work with Jun Zhu.

Bio: Dr. Eric Xing is an associate professor in the School of Computer Science at Carnegie Mellon University. His principal research interests lie in the development of machine learning and statistical methodology; especially for solving problems involving automated learning, reasoning, and decision-making in high-dimensional and dynamic possible worlds; and for building quantitative models and predictive understandings of biological systems. Professor Xing received a Ph.D. in Molecular Biology from Rutgers University, and another Ph.D. in Computer Science from UC Berkeley. His current work involves, 1) foundations of statistical learning, including theory and algorithms for estimating time/space varying-coefficient models, sparse structured input/output models, and nonparametric Bayesian models; 2) computational and statistical analysis of gene regulation, genetic variation, and disease associations; and 3) application of statistical learning in social networks, computer vision, and natural language processing. Professor Xing has published over 140 peer-reviewed papers, and is an associate editor of the Annals of Applied Statistics, the IEEE Transaction of Pattern Analysis and Machine Intelligence (PAMI), the PLoS Journal of Computational Biology, an Action Editor of the Machine Learning journal, and a member of the DARPA Information Science and Technology (ISAT) Advisory Group. He is a recipient of the NSF Career Award, the Alfred P. Sloan Research Fellowship in Computer Science, and the United States Air Force Young Investigator Award, and best paper awards in a number of premier conferences including UAI, ACL, SDM, and ISMB.

Schedule

Jan 20 Dipanjan Das, CMU Multilingual Guidance for Unsupervised Linguistic Structure Prediction
Jan 27 No colloquium - LTI Admissions meeting
Feb 3 Pedro Moreno, Google Google's Speech Internationalization Project: From 1 to 300 Languages and Beyond
Feb 10 William Cohen, CMU Fast Effective Clustering for Graphs and Documents
Feb 17 Ben Snyder, U. Wisc Harnessing Dozens of Languages for Robust Language Technology
Feb 24 No colloquium - LTI Open House
Mar 2 No colloquium - LTI Faculty Retreat
Mar 9 No colloquium - Mid-semester Break
Mar 16 No colloquium - Spring Break
Mar 23

1:00 pm
Hammerschlag Hall B131
Tie-Yan Liu, Microsoft

2:30 pm
Baker Hall A51
Hans Uszkoreit, German Research Ctr. for AI


Computational Advertising: Challenges and Opportunities



Learning Relation Extraction Rules from Massive Data

Mar 30 Joe Reisinger, U. Texas Latent Variable Models of Distributional Lexical Semantics
Apr 6 John McDonough Distant Speech Recognition: No Black Boxes Allowed
Apr 13 Gerald Friedland, ICSI Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online Privacy
Apr 20 No colloquium - Spring Carnival
Apr 27 Kevyn Collins-Thompson, Microsoft Research (Redmond) Not Just for Kids: Enriching Information Retrieval with Reading Level Metadata
May 4 Eric Xing, CMU Jointly Maximum Margin and Maximum Entropy Learning of Graphical Models

Past Colloquia

Language Technologies Institute • 5000 Forbes Ave • Pittsburgh, PA 15213-3891 • (412) 268-6591