This is the annotated dataset used in:

William Yang Wang and Diyi Yang, "That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets", in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), short paper, Lisbon, Portugal, Sept. 17-21, ACL. 

-------------------------------------------------------
train.data: tab-separated training data.
test.data: tab-separated test data.

The first column is the label, and the second column is the text.
-------------------------------------------------------
The original source retains the copyright of the data.

Note that there are absolutely no guarantees with this data,
and we provide this dataset "as is",
but you are welcome to report the issues of the preliminary version
of this data.

You are allowed to use this dataset for research purposes only.
You may re-distribute the dataset, but you must retain this readme file in the re-distribution.

For more question about the dataset, please contact:
William Wang, yww@cs.cmu.edu

v1.0 08/14/2015
