AQMAR Arabic Wikipedia Supersense Corpus

This is a 65,000-token corpus of 28 Arabic Wikipedia articles hand-annotated for nominal supersenses. It extends the Named Entity Corpus and was developed by Nathan Schneider, Behrang Mohit, Kemal Oflazer, and Noah Smith as part of the AQMAR project.


Further Reading

Please cite the following if you write any papers involving the use of the data above:


This research was supported by Qatar National Research Fund grant NPRP 08-485-1-083.


Please e-mail nschneid [strudel] or behrang [strudel] with questions.