Closed
Description
We have a VAD + STT + LLM + TTS setup.
We have set the VAD start_secs to 0.6, but when the VAD does not detect an interruption for short utterances such as "hmm", "ok", or "fine", the STT still transcribes them and the bot is interrupted.
Ideally, either the interruption should not be detected, or the audio should not be passed to the STT at all.
Because of this, if the bot is currently speaking and the user says something shorter than start_secs, the bot completes its answer and then immediately responds to what the user said, as if it had been an interruption.
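
To illustrate the behavior we are asking for, here is a minimal sketch of a gate placed between the VAD and the STT. It is not taken from any framework: `SpeechGate`, `push`, `start_ms`, and `frame_ms` are hypothetical names, the 20 ms frame size is an assumption, and the per-frame `is_speech` flag is assumed to come from the VAD. Audio is buffered until continuous speech reaches the threshold, and anything shorter is dropped instead of being forwarded to the STT.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SpeechGate:
    """Hypothetical gate between VAD and STT (illustrative only)."""

    start_ms: int = 600  # mirrors start_secs = 0.6
    frame_ms: int = 20   # assumed duration of one audio frame
    _pending: List[bytes] = field(default_factory=list)
    _open: bool = False

    def push(self, frame: bytes, is_speech: bool) -> List[bytes]:
        """Return the frames that should be forwarded to the STT."""
        if not is_speech:
            # Speech ended: discard any short burst and close the gate.
            self._pending.clear()
            self._open = False
            return []
        if self._open:
            # Threshold already reached: pass audio straight through.
            return [frame]
        self._pending.append(frame)
        if len(self._pending) * self.frame_ms >= self.start_ms:
            # Continuous speech reached start_ms: flush the buffered audio.
            self._open = True
            flushed, self._pending = self._pending, []
            return flushed
        return []
```

With these defaults, a 200 ms "ok" (10 speech frames followed by silence) forwards nothing, while a longer utterance is forwarded in full once it crosses 600 ms. A real pipeline would also need a stop timeout so that brief pauses inside an utterance do not reset the gate; that is left out here for brevity.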