Llab (read [ɛɫ læb], [ʎɑβ̞] or [ɬɑb])

Llab is led by Lori Levin. Current members include:

Linguistics Lab Course and Directed Study

Spring 2020

Llab news

Datasets and Software

  1. AlloVera Allophone Database
  2. URIEL Typological Database
  3. URIEL lang2vec tool
  4. PanPhon
  5. Epitran
  6. Swahili Morphological Parser
  7. Hmong Usenet Corpus (soc.culture.hmong)
  8. TRC dataset
  9. African code-switching corpus
  10. Chuvash Morphological Analyzer
  11. Inuktitut Morphology

Publications

  1. Mortensen, David R. (2019). Hmong. In Justin Watkins and Alice Vittrant (eds.), The Mainland Southeast Asia Linguistic Area. Berlin: De Gruyter Mouton.
  2. Chaudhary, Aditi, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime Carbonell, and Yulia Tsvetkov (2019). CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology. Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology.
  3. Chaudhary, Aditi, Chunting Zhou, Lori Levin, Graham Neubig, Jaime G. Carbonell (2018). Adapting word embeddings to new languages with morphological and phonological subword representations. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP).
  4. Mortensen, David R., Siddharth Dalmia, and Patrick Littell (2018). Epitran: Precision G2P for many languages. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris, France. European Language Resources Association (ELRA).
  5. Littell, Patrick, Tom McCoy, Na-Rae Han, Shruti Rijhwani, Zaid Sheikh, David R. Mortensen, Teruko Mitamura and Lori Levin (2018). Parser combinators for Tigrinya and Oromo morphology. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris, France. European Language Resources Association (ELRA).
  6. Dunietz, Jesse, Lori Levin, and Jaime Carbonell (2017). “Automatically Tagging Constructions of Causation and Their Slot-Fillers.” Transactions of the Association for Computational Linguistics.
  7. Dunietz, Jesse, Lori Levin, and Jaime Carbonell (2017). “The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations.” Proceedings of LAW XI – The 11th Linguistic Annotation Workshop.
  8. Patrick Littell, David Mortensen, Ke Lin, Katherine Kairis, Carlisle Turner, and Lori Levin (2017) “URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors,” Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 8–14, Valencia, Spain, April 3-7, 2017 (EACL 2017).
  9. Patrick Littell, Kartik Goyal, David Mortensen, Alexa Little, Chris Dyer, Lori Levin (2016) “Named Entity Recognition for Linguistic Rapid Response in Low-Resource Languages: Sorani Kurdish and Tajik”, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 998–1006, Osaka, Japan, December 11-17 2016 (COLING 2016).
  10. David R. Mortensen, Patrick Littell, Akash Bharadwaj, Kartik Goyal, Chris Dyer, Lori Levin (2016) “PanPhon: A Resource for Mapping IPA Segments to Articulatory Feature Vectors”, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 3475–3484, Osaka, Japan, December 11-17 2016 (COLING 2016).
  11. Littell, Patrick, David R. Mortensen, Kartik Goyal, Chris Dyer, Lori Levin (2016) “Bridge-language capitalization inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik”, in Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2016).
  12. Archna Bhatia, Mandy Simons, Lori Levin, Yulia Tsvetkov, Chris Dyer, and Jordan Bender (2014) “A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness”, in Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), pages 910–916.
  13. Lori Levin, Teruko Mitamura, Brian MacWhinney, Davida Fromm, Jaime Carbonell, Weston Feely, Robert Frederking, Anatole Gershman, Carlos Ramirez (2014) “Resources for the Detection of Conventionalized Metaphors in Four Languages”, in Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), pages 498–501.
  14. Weston Feely and Mehdi Manshadi and Robert Frederking and Lori Levin (2014) “The CMU METAL Farsi NLP Approach”, in Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), pages 4052–4055.
  15. Patrick Littell, Kaitlyn Price, and Lori Levin (2014) “Morphological parsing of Swahili using crowdsourced lexical resources”, in the Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), pages 3333-3339.
  16. Archna Bhatia, Chu-Cheng Lin, Nathan Schneider, Yulia Tsvetkov, Fatima Talib Al-Raisi, Laleh Roostapour, Jordan Bender, Abhimanu Kumar, Lori Levin, Mandy Simons, and Chris Dyer (2014) “Automatic Classification of Communicative Functions of Definiteness” in Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pages 1059–1070.
  17. Yulia Tsvetkov, Chris Dyer, Lori Levin, and Archna Bhatia (2013) “Generating English Determiners in Phrase-Based Translation with Synthetic Translation Options” in Proceedings of the Eighth Workshop on Statistical Machine Translation, Association for Computational Linguistics, Sofia, Bulgaria (ACL 2013).
  18. Patrick Littell, Lori Levin, Jason Eisner, and Dragomir Radev (2013) “Introducing Computational Concepts in a Linguistics Olympiad” in Proceedings of the Fourth Workshop on Teaching NLP and CL, Association for Computational Linguistics, Sofia, Bulgaria.
  19. Jesse Dunietz, Lori Levin, and Jaime Carbonell (2013) “The effects of lexical resource quality on preference violation detection” in Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Sofia, Bulgaria.
  20. Archna Bhatia, Michael Deeringer, Matthew Gardner, Carlos Ramírez, Lori Levin, and Owen Rambow (2013) “Repurposing Treebanks” in Proceedings of the Twelfth Workshop on Treebanks and Linguistic Theories, Sofia, Bulgaria.
  21. Vinodkumar Prabhakaran; Michael Bloodgood; Mona Diab; Bonnie Dorr; Lori Levin; Christine D. Piatko; Owen Rambow; Benjamin Van Durme (2012) “Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing”, Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, Association for Computational Linguistics.
  22. Kathrin Baker, Michael Bloodgood, Bonnie Dorr, Nathaniel W. Filardo, Lori Levin and Christine Piatko (2010) “A Modality Lexicon and its use in Automatic Tagging”, Proceedings of the Language Resources and Evaluation Conference (LREC).
  23. Kathy Baker, Chris Callison-Burch, Bonnie Dorr, Nathaniel Filardo, Scott Miller, Christine Piatko (2010) “Semantically-Informed Machine Translation: A Tree-Grafting Approach” in the Proceedings of the Association for Machine Translation in the Americas (AMTA).
  24. Aric Bills, Lori S. Levin, Lawrence D. Kaplan, and Edna Agheak MacLean (2010) “Finite-State Morphology for Iñupiaq” 7th SaLTMiL Workshop on “Creation and use of basic lexical resources for less-resourced languages.”