Different Task Guides for Audio Data
The goal of this task is to categorize audio input into different types such as Music, Speech, or sounds from Nature.
This task involves coverting spoken words(Speech) into text. The use cases are Communicating with computer-machines, Voice activated commands, Live-transcription, Live-Translation etc..
This task is opposite of ASR i.e. converting input text to synthetic speech. Different use cases are helping visually impaired people, Live communication in foreign language etc..