Data: health_cqa_assessment.txt
The dataset and annotation scheme is described in the paper
Beloborodov, A., Braslavski, P., & Driker, M. (2014). Towards Automatic Evaluation of Health-Related CQA Data. In Information Access Evaluation. Multilinguality, Multimodality, and Interaction (pp. 7-18). Springer International Publishing. (BibTeX, SpringerLink)
Each question-answer pair is annotated with a line of tab-delimited values of the following format:
AssessorId MailRuQuestionId MailRuAnswerId Label AssessmentDate
Based on MailRuQuestionId
you can compose an URL and access the actual question, e.g. 30001000 → http://otvet.mail.ru/question/30001000/
MailRuAnswerId
will help accessing the actual question-answer pair: 189528773 → http://otvet.mail.ru/answer/189528773
Also question info is available through Mail.Ru API, e.g. http://otvet.mail.ru/api/v2/question?qid=30001000
Each question-answer pair evaluated on a three-grade scale:
0
- low quality,
1
- potentially useful answer,
2
- high-quality answer
Please refer to the above mentioned publication when using the data.
Please send questions and suggestions to xander-beloborodov àÒ yandex dîÒ ru