Contact Information

Email:    shomir -at-
Office:GHC 6607
Mailing Address:   Shomir Wilson
Language Technologies Institute
Carnegie Mellon University
5000 Forbes Avenue
Pittsburgh, PA 15213

Who I Am

I am a NSF International Research Fellow affiliated with Carnegie Mellon University's Language Technologies Institute and the University of Edinburgh's School of Informatics. My hosts at these two universities are Alan W Black and Jon Oberlander, respectively. Previously I was a postdoctoral fellow with the Mobile Commerce Lab in Carnegie Mellon University's Institute for Software Research, and my supervisor was Norman Sadeh.

My research spans natural language processing, usable privacy, and artificial intelligence. Read about my interests here.

I am currently (in spring 2015) co-teaching 11-411/611 Natural Language Processing with Chris Dyer and Alan Black.

I received my Ph.D. (2011) and M.S. (2008) from the Department of Computer Science at the University of Maryland, where I was a student with the ALMECOM Lab. My supervisor there was Donald Perlis.

I received my undergraduate education at Virginia Tech, where in 2005 I graduated with a B.S. in Computer Science, a B.S. in Mathematics, and a B.A. in Philosophy.

Curriculum Vitae

Have a look here.


2015-02-20: I had a digitally-manipulated photograph in the art gallery of Carnegie Mellon University's SCS Day.

2014-10-07: I've been selected to be a Grand Awards Judge in Computer Science at the Intel International Science and Engineering Fair in May 2015.

2014-08-20: I have moved from the University of Edinburgh to the Language Technologies Institute at Carnegie Mellon University for the second part of my NSF IRFP fellowship. It's been great meeting new colleagues and reconnecting with old colleagues from my previous stay here.

2014-05-09: I gave a talk today for the NLIP Seminar Series at the University of Cambridge. Here are my slides. My recent work at the University of Edinburgh is described in the second half of the presentation.

2014-05-01: My short paper submission to ACL in Baltimore was accepted. The dataset it describes is here.

2013-11-04: Here's a belated link to a press release on the usable privacy policies project that I'm involved with. Also, here's the project website.

2013-10-28: The overview of my research is now up to date.

2013-08-05: I recently arrived at the University of Edinburgh to begin the first part of my NSF International Research Fellowship. It's been great meeting several new colleagues, and I look forward to working here for the next twelve months. Also, in coming months I will be attending Ubicomp in Zurich and IJCNLP in Nagoya to present papers at both.

2013-04-30: I recently gave a talk for the CL+NLP Lunch at Carnegie Mellon. You can take a look at my slides.

2012-11-18: I've updated the overview of my research.

2012-07-02: I've created a stub page to host the metalanguage corpus described in my recent ACL paper.

2012-03-16: My paper "The Creation of a Corpus of English Metalanguage" has been accepted for oral presentation at ACL this July in Jeju, South Korea. I will attend to present it.

2012-02-24: A profile of some of my Ph.D. research is up on

2012-02-12: I've been adding content to the EAPSI page over the past few months, and it is now complete.

2011-10-23: Prompted by my move to Carnegie Mellon last month, I've assembled this long-overdue renovation of my website. I'll add more content in the coming months.