Transcription of spoken words into text and alerts for risky situations
- Transcription of spoken words
- Saving text for future reference
- Sound loudness (dB) meter
- Bluetooth connection
- Sound recorder
- Splash Screen Activity
- Home Activity
- Conference Activity
- Files Activity
- Outdoor/Indoor Activity
- Bluetooth Activity
- Files Activity
- Record Activity
All the code
required to get started
- Clone this repo to your local machine using
https://github.com/sauravmittal001/VoxCo
update and install git
$ ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"
$ brew install git
$ git --version
config account
$ git config --global user.name "Your Name"
$ git config --global user.email "Your Email"
$ git config --list
clone repo
$ git init
$ git clone https://github.com/sauravmittal001/VoxCo
I originally made this app as a part of a bigger research project which eventually subsided due to some reasons.
Idea: Using microphones to receiving words from two persons simultaneously, separating those mixed speech signals and converting it into text will enable a person with hearing impairment to engage in a group conversation comfortably. Additionally, saving the conversation if he/she misses something for future reference would also help.
The project went through multiple phases, and the solution got re-shaped as it proceeded. The PPT and pdf of the initial proposal can be found here and here. TL;DR, we were trying to build a device for hearing impaired persons by using Blind Source Separation. The full project setup and theory can be understood here. The project involved:
- Separating two independent mixed sound signals into original sound signals by maximising non-Gaussianity.
- Implementing Fast ICA algorithm for component extraction in MATLAB through fixed-point iteration scheme.
- Involved Performing pre-processing and pre-whitening of centred data by eigenvalue decomposition of the covariance matrix.
The app's function was to receive those processed and separated speech signals into the hearing impaired person's device and displaying it as text. (Along with other features)