Health-related CQA quality assessment (Otvety@Mail.Ru data)

Data: health_cqa_assessment.txt

The dataset and annotation scheme is described in the paper

Beloborodov, A., Braslavski, P., & Driker, M. (2014). Towards Automatic Evaluation of Health-Related CQA Data. In Information Access Evaluation. Multilinguality, Multimodality, and Interaction (pp. 7-18). Springer International Publishing. (BibTeX, SpringerLink)

Each question-answer pair is annotated with a line of tab-delimited values of the following format:

AssessorId MailRuQuestionId MailRuAnswerId Label AssessmentDate

Based on MailRuQuestionId you can compose an URL and access the actual question, e.g. 30001000 → http://otvet.mail.ru/question/30001000/
MailRuAnswerId will help accessing the actual question-answer pair: 189528773 → http://otvet.mail.ru/answer/189528773
Also question info is available through Mail.Ru API, e.g. http://otvet.mail.ru/api/v2/question?qid=30001000

Each question-answer pair evaluated on a three-grade scale:

Please refer to the above mentioned publication when using the data.

Please send questions and suggestions to xander-beloborodov àÒ yandex dîÒ ru