The data set contains 302 North American speakers' speech data. The recording contents include phrases and sentences with rich scenes. The valid time is 201 hours. The recording environment is quiet indoor. The recording device includes PC, android cellphone, and iPhone. This data can be used in speech recognition research in North American area.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/33?source=Github
16kHz/44.1kHz, 16bit, uncompressed wav, mono channel
quiet indoor environment, without echo
words, common sentences
302 people from North America, 59% of which are male
Android mobile phone, iPhone and PC
English
text
speech recognition, voiceprint recognition
Commercial License