100-Hours-Indonesian-Children-Spontaneous-Speech-Data

Description

100 hours of Indonesian children's spontaneous speech data, manually screened and processed. Annotations include transcription text, speaker identification, gender, and other information. The dataset can be used for speech recognition (acoustic model or language model training), caption generation, voice content moderation, and other AI algorithm research.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1332?source=Github

Specifications

Format

16 kHz, 16-bit, WAV, mono channel;
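
The stated format can be checked against a downloaded file. The following is a minimal sketch using Python's standard `wave` module; the function names `read_wav_format` and `matches_spec` are hypothetical helpers introduced here, and the sketch assumes the files are uncompressed PCM WAV, which the source does not state explicitly.

```python
import wave

# Expected values taken from the Format line above:
# 16 kHz sample rate, 16 bit (2 bytes per sample), mono (1 channel).
EXPECTED_FORMAT = {
    "sample_rate": 16000,
    "sample_width_bytes": 2,
    "channels": 1,
}


def read_wav_format(path):
    """Return the basic format parameters of a PCM WAV file (hypothetical helper)."""
    with wave.open(path, "rb") as wav_file:
        return {
            "sample_rate": wav_file.getframerate(),
            "sample_width_bytes": wav_file.getsampwidth(),
            "channels": wav_file.getnchannels(),
        }


def matches_spec(path):
    """Return True if the file has the sample rate, bit depth, and channel count listed above."""
    return read_wav_format(path) == EXPECTED_FORMAT
```

For example, `matches_spec("000114_1.wav")` would be expected to return `True` for a sample file that follows the specification.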

Age

Children aged 12 and younger;

Content category

including self-media, conversation, live, lecture, variety show;

Language

Indonesian

Annotation

Annotation of transcription text, speaker identification, and gender;

Accuracy

Word Accuracy Rate (WAR) of at least 98%.
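
The source does not define how WAR is computed. As an illustration only, the sketch below assumes the common convention that word accuracy is one minus the word error rate, where the error count is the word-level edit distance (substitutions, insertions, deletions) between a reference transcript and a second transcript. The function names and the example sentences are hypothetical.

```python
def word_edit_distance(reference_words, hypothesis_words):
    """Levenshtein distance between two word lists (dynamic programming, one row at a time)."""
    previous_row = list(range(len(hypothesis_words) + 1))
    for i, ref_word in enumerate(reference_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hypothesis_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(
                min(
                    previous_row[j] + 1,                      # deletion
                    current_row[j - 1] + 1,                   # insertion
                    previous_row[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous_row = current_row
    return previous_row[-1]


def word_accuracy(reference, hypothesis):
    """Assumed definition: 1 - (word edit distance / number of reference words)."""
    reference_words = reference.split()
    hypothesis_words = hypothesis.split()
    if not reference_words:
        raise ValueError("reference transcript must contain at least one word")
    errors = word_edit_distance(reference_words, hypothesis_words)
    return 1.0 - errors / len(reference_words)


# Hypothetical example: one substituted word out of four.
print(word_accuracy("saya suka makan nasi", "saya suka minum nasi"))
```

Under this assumed definition, a 98% threshold corresponds to at most two word-level errors per 100 reference words.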

Licensing Information

Commercial License

Folders and files

000114_1.txt
000114_1.wav
000114_2.txt
000114_2.wav
000114_3.txt
000114_3.wav
000114_4.txt
000114_4.wav
000114_5.txt
000114_5.wav
000114_6.txt
000114_6.wav
000114_7.txt
000114_7.wav
000114_8.txt
000114_8.wav
000114_9.txt
000114_9.wav
000114_10.txt
000114_10.wav
000114_11.txt
000114_11.wav
README.md
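
Each `.wav` file in the listing has a `.txt` file with the same stem. The sketch below pairs them up for loading, assuming (the source does not confirm this) that each `.txt` file holds the annotation for the recording of the same name. The function name `pair_recordings` and the directory argument are hypothetical.

```python
from pathlib import Path


def pair_recordings(directory):
    """Return (wav_path, txt_path) tuples for every .wav that has a same-named .txt.

    Assumption: the .txt file with the same stem is the annotation for that recording.
    """
    root = Path(directory)
    pairs = []
    for wav_path in sorted(root.glob("*.wav")):
        txt_path = wav_path.with_suffix(".txt")
        if txt_path.exists():
            pairs.append((wav_path, txt_path))
    return pairs
```

A call such as `pair_recordings(".")` from the repository root would return one tuple per recording; `.wav` files without a matching `.txt` are skipped rather than raising an error.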
