A Python app for making custom audiograms with built-in captioning engine
Barry.Cant.Swim.1.mp4
data:image/s3,"s3://crabby-images/c8c8c/c8c8c5cfda6838fa2e9dd352694df781e71d3ac7" alt=""
data:image/s3,"s3://crabby-images/ebd96/ebd96fa5cf81b7334d77e84ecdc3ae04e8f5d2df" alt=""
Follow these instructions to run locally. Either install locally or if you have docker installed using the docker commands.
- Make sure ffmpeg and imagemagic are installed
//Linux
sudo apt install ffmpeg imagemagick
//Mac
brew install ffmpeg imagemagick
- Install pip packages
pip install -r requirements.txt
- Install speech-to-text model
wget https://alphacephei.com/vosk/models/vosk-model-en-us-0.42-gigaspeech.zip -O model/model_files.zip && unzip model/model_files.zip -d model && mv model/vosk-model-en-us-0.42-gigaspeech/* model/ && rm -r model/vosk-model-en-us-0.42-gigaspeech
- Run the server
python app.py
If you install this way make sure it is a clean copy of the folder structure (model and uploads should be empty to avoid issues)
- Get base image
docker pull python:3.9-buster
- Build
docker-compose build
- Run
docker-compose up