Lexical Semantics Resources for English
This page provides resources for computational analysis of English lexical semantics,
- annotations of comprehensive multiword expressions and noun, verb, and preposition supersenses for a 55,000-token corpus of web reviews,
- a tool (trained on the corpus) that identifies multiword expressions and noun/verb supersenses in context. (It does not currently predict preposition supersenses.)
These were developed by
, and others primarily at
Carnegie Mellon University
The multiword expression annotations are described in:
The original MWE identification system is described in:
The noun+verb supersense annotations and the combined MWE+supersense tagger are described in:
The preposition supersense annotation scheme and corpus for STREUSLE 3.0 are respectively described in the following (but note that the preposition annotation scheme has changed effective STREUSLE 4.0):
The DiMSUM 2016 shared task, which extended the STREUSLE 2.1 annotations to additional domains, is described in:
Additional details may be found in Nathan Schneider's dissertation (2014).
A Hierarchy with, of, and for Preposition Supersenses
Vivek Srikumar, Jena D. Hwang, and Martha Palmer.
In Proceedings of the 9th Linguistic Annotation Workshop, Denver, Colorado, June 5, 2015.
A Corpus of Preposition Supersenses
Jena D. Hwang,
Vivek Srikumar, Meredith Green, Abhijit Suresh, Kathryn Conger, Tim O’Gorman, and Martha Palmer.
In Proceedings of the 10th Linguistic Annotation Workshop, Berlin, Germany, August 11, 2016.
This research was supported in part by NSF CAREER grant IIS-1054319, Google through the Reading is Believing project at CMU, and DARPA grant FA8750-12-2-0342 funded under the DEFT program.
We are grateful to Google and LDC for permission to redistribute their data along with our annotations.
Please e-mail nathan.schneider [strudel] georgetown.edu with questions.