This page provides a link to a corpus of Wikipedia articles, manually-generated factoid questions
from them, and manually-generated answers to these questions, for use in
These data were collected by
many students at Carnegie Mellon University and the University of Pittsburgh between 2008 and 2010.
Manually-generated factoid question/answer pairs with difficulty ratings from Wikipedia articles. Dataset includes articles, questions, and answers.
Please cite this paper if you write any papers involving the use of the data above:
This research project was supported by NSF IIS-0713265 (to Smith),
an NSF Graduate Research Fellowship (to Heilman),
NSF IIS-0712810 and IIS-0745914 (to Hwa), and Institute of
Education Sciences, U.S. Department of Education
R305B040063 (to Carnegie Mellon).