The data volumn is 227 hours. It is recorded by Spanish native speakers from Spain, Mexico and Venezuela. It is recorded in quiet environment. The recording contents cover various fields like economy, entertainment, news and spoken language. All texts are manually transcribed. The sentence accurate is 95%.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/116?source=Github
16kHz, 16bit, uncompressed wav, mono channel
quiet indoor environment, without echo
economy, entertainment, news, oral language, numbers, letters
352 people from Spain, Mexico and Venezuela etc., 55% of which are male
Android mobile phone: iPhone=3.5:1
Spanish
text, time point of speech data, 5 noise symbols, special identifiers
95% (the accuracy rate of noise symbols and other identifiers is not included)
speech recognition, voiceprint recognition
Commercial License