Latency improvements with the new manual VAD algorithm #368

clemlesne · 2024-12-07T12:02:20Z

Done:

Todo:

Parallelize more TTS and database calls (study OTEL traces for opportunity confirmation)
Reduce dependency calls before sending call to the LLM or defer them
Compress the prompt (LLMlingua?)
Use a LLM with a lower latency (Phi 4?)
Trace the code executions with local debugger to pin points unseen optimizations

clemlesne · 2025-01-09T18:59:11Z

OTEL gauges and counters has been added to follow technical and functional metrics.

Notably:

call.aec.droped, number of times the echo cancellation dropped the voice completely.
call.aec.missed, number of times the echo cancellation failed to remove the echo in time.
call.answer.latency, time between the end of the user voice and the start of the bot voice.

clemlesne added the enhancement New feature or request label Dec 7, 2024

Provide feedback