URIEL Typological Database

Introduction

The URIEL knowledge base is a structured compendium of information on language typology and language universals that is being developed as part of DARPA's LORELEI project.

For the most recent version of URIEL and lang2vec, use pip to install the lang2vec package from PyPI.

Download the newest non-PyPI release here. Read the README in Markdown.

Releases

URIEL is currently released with lang2vec, a tool for querying it. The development versions of URIEL and lang2vec are available in a GitHub repository. For most users, it will be more convenient to install the lang2vec package from PyPI using pip. The latest stand-alone release of lang2vec is available here.
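
As a rough illustration of what querying URIEL through lang2vec can look like once the package has been installed with pip (pip install lang2vec), here is a minimal sketch. It assumes the get_features function and the "syntax_knn" feature set name as described in the lang2vec README; the helper names lang2vec_installed and query_features are hypothetical and exist only for this example. Languages are given as ISO 639-3 codes.

```python
import importlib.util


def lang2vec_installed() -> bool:
    """Return True if the lang2vec package can be found by the importer."""
    return importlib.util.find_spec("lang2vec") is not None


def query_features(languages, feature_set="syntax_knn"):
    """Look up URIEL feature vectors for one or more ISO 639-3 codes.

    Hypothetical helper: a thin wrapper around lang2vec's get_features,
    assumed here to return a dict mapping each language code to a list
    of feature values.
    """
    # Imported lazily so this module still loads when lang2vec is absent.
    import lang2vec.lang2vec as l2v

    return l2v.get_features(languages, feature_set)


if __name__ == "__main__":
    if lang2vec_installed():
        vectors = query_features(["eng", "fra"])
        for code, values in vectors.items():
            # Report only the vector length; the values depend on the release.
            print(code, len(values))
    else:
        print("lang2vec is not installed; run: pip install lang2vec")
```

The import is deferred to inside query_features so that the script can report a missing installation instead of failing at load time. Other feature set names should be checked against the README of the installed release before use.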

How to Cite

Patrick Littell, David Mortensen, Ke Lin, Katherine Kairis, Carlisle Turner, and Lori Levin (2017). URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 8–14, Valencia, Spain, April 3–7.


@InProceedings{Littel-et-al:2017,
  author = "Littell, Patrick
           and Mortensen, David R.
           and Lin, Ke
           and Kairis, Katherine
           and Turner, Carlisle
           and Levin, Lori",
  title = "URIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors",
  booktitle = "Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers",
  year = "2017",
  publisher = "Association for Computational Linguistics",
  pages = "8--14",
  location = "Valencia, Spain",
  url = "http://aclweb.org/anthology/E17-2002"
}

Contributors