A Dataset for Informal Query underStanding in E-commerce with Machine Reading Comprehension
IQuS is a new machine reading comprehension (MRC) dataset in SQuAD style. It aims to reduce the vocabulary gap between informal user queries and formal product titles in e-commerce. In total, the dataset contains 65k informal queries as questions, 420k product descriptions as contents, and 95k item attributes as answers.
IQuS has already been split into train/dev/test sets. For now we release only the test set as a preview; we will release the whole dataset once our paper is accepted.
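
Since the dataset is described as SQuAD-style, the following is a minimal sketch of how the test set might be read. It assumes the file follows the SQuAD v1.1 JSON layout (data -> paragraphs -> context / qas -> question / answers with text and answer_start); the exact IQuS schema is not confirmed here. The function names and the sample record are hypothetical and invented purely for illustration; they are not taken from IQuS.

```python
import json
from typing import Iterator, List, Tuple


def load_squad_file(path: str) -> dict:
    """Load a SQuAD-style JSON file. The path is supplied by the caller."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def iter_examples(dataset: dict) -> Iterator[Tuple[str, str, List[str]]]:
    """Yield (question, context, answer_texts) from a SQuAD v1.1-style dict."""
    for article in dataset["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                answers = [a["text"] for a in qa["answers"]]
                yield qa["question"], context, answers


def spans_are_consistent(dataset: dict) -> bool:
    """Return True if every answer_start offset points at its answer text."""
    for article in dataset["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                for a in qa["answers"]:
                    start = a["answer_start"]
                    if context[start:start + len(a["text"])] != a["text"]:
                        return False
    return True


# Invented sample record (NOT real IQuS data), assuming the SQuAD v1.1 layout:
# an informal query as the question, a product description as the context,
# and an item attribute as the answer span.
_CONTEXT = "Brand X wireless noise-cancelling over-ear headphones, black"
_ANSWER = "noise-cancelling"
SAMPLE = {
    "version": "sample",
    "data": [{
        "title": "sample-item",
        "paragraphs": [{
            "context": _CONTEXT,
            "qas": [{
                "id": "sample-0",
                "question": "headphones that block noise",
                "answers": [{
                    "text": _ANSWER,
                    # Offset computed from the context rather than hard-coded.
                    "answer_start": _CONTEXT.index(_ANSWER),
                }],
            }],
        }],
    }],
}

if __name__ == "__main__":
    for question, context, answers in iter_examples(SAMPLE):
        print(question, "|", context, "|", answers)
    print("spans consistent:", spans_are_consistent(SAMPLE))
```

To try this on the released test set, pass the result of load_squad_file to iter_examples in place of SAMPLE, and adjust the key names if the actual files differ from the assumed layout.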