School of Computer Science

Carnegie Mellon University


I am a Ph.D. student at the Language Technologies Institute (LTI) in the School of Computer Science at Carnegie Mellon University. My thesis adviser is Dr. Eric Nyberg.

My thesis topic is “Minimizing Costs in Generalized Interactive Annotation Learning,” where the goal is to minimize the total annotation cost of learning the target concept in a multi-annotator environment with multiple annotation strategies. I also investigate how to automatically derive linguistic features from annotation graphs to minimize the cost of feature engineering.

As a Graduate Research Assistant at LTI, I have worked on several research projects, including DARPA’s Machine Reading Program (MRP) with IBM Research, the Javelin multilingual QA system, and ROSETTA: Multilingual & Multi-Modal Information Management (part of the DARPA-sponsored GALE program).

My research interests include Information Extraction, Active Learning, Sentiment Analysis, and, more generally, applying Machine Learning to Natural Language Processing and Text Mining.


Education:

· Ph.D. Candidate in Language Technologies, School of Computer Science, Carnegie Mellon University (2008-Present)

· M.S. in Language Technologies, School of Computer Science, Carnegie Mellon University (2006-2008)

· M.S. in Computer Science, Singapore-MIT Alliance, National University of Singapore, Singapore (2004-2005)

· B.E. in Computer Engineering, Nanyang Technological University, Singapore (2000-2004)

Work Experience:

· Research Assistant at Carnegie Mellon University (2006-Present)

· Software Engineering Intern at Google Inc. (May-Aug 2011)

· Global Technology Consultant at Accenture Pvt. Ltd, Singapore (2005-2006)

· Research Intern at Singapore Institute of Manufacturing Technology (SIMTech), Singapore (2003)



For more information, view my resume.


Shilpa Arora

Ph.D. Student

Language Technologies Institute

Carnegie Mellon University

Pittsburgh, PA



E-Mail: shilpaa  AT