This repository (Audio_Transcript) is a Flask web app that transcribes audio recordings to text. It uses the IBM Watson Developer Cloud speech recognition API to transcribe the audio files.
git clone https://github.com/ave12345/Audio_Transcription.git
cd Audio_Transcription
[optional] mkvirtualenv Audio_Transcription
pip install -r requirements.txt
To call any IBM Watson cloud service, you must sign up for an [IBM Watson cloud developer account](https://console.bluemix.net/). After registering, request access to the speech recognition service; credentials are then generated for you and stored in a JSON file.
In the JSON file, copy the username value and paste it into the username attribute of the app.config.update method found in the Audio.py file. Do the same for the password.
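If you would rather not paste the values by hand, the copy step above can be sketched in Python. This is only an illustration under assumptions: the helper name load_watson_credentials and the file name credentials.json are hypothetical, and the JSON file is assumed to hold username and password as top-level keys. Adjust the keys if your downloaded file is laid out differently.

```python
import json


def load_watson_credentials(path):
    """Read the username and password from a credentials JSON file.

    Assumption: the file is a flat object such as
    {"username": "...", "password": "..."}; other keys are ignored.
    """
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    return {"username": data["username"], "password": data["password"]}


# Hypothetical usage inside Audio.py (file name is an assumption):
# app.config.update(**load_watson_credentials("credentials.json"))
```

Loading the values at startup also keeps the credentials out of the source file, which helps if the repository is shared.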
In your terminal, execute the following (on macOS or Linux, use export instead of set):
set FLASK_APP=Audio.py
flask run
- Open your web browser.
- Enter "localhost:5000" in the address bar (without the quotes).
- Select a file and click Transcribe. There are sample audio files in the project directory for testing purposes.

NB: the audio file must be in one of these formats: .wav, .flac, .mp3, .basic, .l16, .mpeg, .mulaw, .ogg, .webm.
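To check a file against the supported formats before uploading it, a small helper along these lines may be useful. It is a sketch, not the app's own validation: the names ALLOWED_EXTENSIONS and is_supported_audio are hypothetical, and it looks only at the file extension, not at the file contents.

```python
import os

# Extensions taken from the list of supported formats above.
ALLOWED_EXTENSIONS = {
    ".wav", ".flac", ".mp3", ".basic", ".l16",
    ".mpeg", ".mulaw", ".ogg", ".webm",
}


def is_supported_audio(filename):
    """Return True if the file name ends in a supported extension.

    The comparison is case-insensitive, so "clip.WAV" is accepted.
    """
    extension = os.path.splitext(filename)[1].lower()
    return extension in ALLOWED_EXTENSIONS
```

For example, is_supported_audio("interview.flac") returns True, while is_supported_audio("notes.txt") returns False.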