This hybrid dataset was developed for multi-label lexical answer type (LAT) prediction. It combines the strengths of two sources: the BioMedLAT corpus previously developed by Neves et al. and the automated corpus generation process of the OAQA system. To cite our work, please use the following BibTeX entry:
@article{wasim2019multi,
title={Multi-label Biomedical Question Classification for Lexical Answer Type Prediction},
author={Wasim, Muhammad and Asim, Muhammad Nabeel and Khan, Muhammad Usman Ghani and Mahmood, Waqar},
journal={Journal of Biomedical Informatics},
pages={103143},
year={2019},
publisher={Elsevier}
}