Essay Thesis Clarity Dataset
This page is a distribution site of essay data for the tasks of Essay Thesis Clarity Scoring and Error Identification. Data available on this page include annotated thesis clarity scores and errors for 830 essays from the International Corpus of Learner English (ICLE).
The problems of Essay Thesis Clarity Scoring and Error Identification for which this dataset is intended are described in:
Essay Thesis Clarity Dataset
- Cross Validation Folds Includes lists of the essays in each of the five folds used for cross validation in the experiments described in Modeling Thesis Clarity in Student Essays.
- Keyword Features Describes the keyword features discussed in section 4.3 of Modeling Thesis Clarity in Student Essays.
- Note that we only own the annotations on the ICLE essay dataset. The essays themselves must be purchased here.
The creation of this website is based upon work supported in part by National Science Foundation (NSF) Grants IIS-1147644 and IIS-1219142. Any opinions, findings, and conclusions or
recommendations expressed above are those of the authors and do not
necessarily reflect the views of NSF and should
not be interpreted as representing the official policies, either expressed
or implied, of any sponsoring institution, the U.S. government or any other
entity.