The 100 Hours - Indonesian Child's Spontaneous Speech Data, manually screened and processed. Annotation contains transcription text, speaker identification, gender and other informantion. This dataset can be applied in speech recognition (acoustic model or language model training), caption generation, voice content moderation and other AI algorithm research.
For more details, please refer to the link:https://www.nexdata.ai/datasets/speechrecog/1332?source=Github
16k Hz, 16 bit, wav, mono channel;
12 years old and younger children;
including self-media, conversation, live, lecture, variety show;
Indonesian
annotation for the transcription text, speaker identification, gender;
Word Accuracy Rate (WAR) at least 98%.
Commercial License