SleepQA: A health coaching dataset on sleep for extractive question answering

I Bojic, QC Ong, M Thakkar, E Kamran, IY Le Shua, JRE Pang, J Chen, V Nayak, S Joty, J Car

Research output: Other contribution

Abstract

Question Answering (QA) systems can support health coaches in facilitating clients' lifestyle behavior changes (e.g., in adopting healthy sleep habits). In this paper, we design a domain-specific QA pipeline for sleep coaching. To this end, we release SleepQA, a dataset created from 7,005 passages comprising 4,250 training examples with single annotations and 750 examples with 5-way annotations1. We fine-Tuned different domain-specific BERT models on our dataset and perform extensive automatic and human evaluation of the resulting end-To-end QA pipeline. Comparisons of our pipeline with baseline show improvements in domain-specific natural language processing on realworld questions. We hope that this dataset will lead to wider research interest in this important health domain.
Original languageUndefined/Unknown
Number of pages19
Volume193
Publication statusPublished - 1 Jan 2022

Cite this