Two text learning data sets