Elevenlabs use previous_text to improve generation #1399

Closed
@danthegoodman1

Description

Pipecat splits LLM generations sent to TTS at punctuation boundaries to reduce latency. This creates problems when it feeds the TTS a short utterance like "Ok.", which often causes it to SCREAM randomly or, at best, produce a completely different tonality. Short TTS messages give highly variable results and destroy immersion.

If the TTS service could optionally access the conversation context, it could take the most recent message(s) it had generated and pass them in the previous_text parameter to enable far smoother TTS generations.
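A minimal sketch of the idea: keep a rolling window of recently synthesized chunks and attach it as previous_text on each request. The class name, the max_chars window size, and the builder structure are illustrative; only the text and previous_text fields mirror the ElevenLabs text-to-speech request body.

```python
from collections import deque


class ContextualTTSRequestBuilder:
    """Builds TTS request payloads that carry recently spoken text in
    previous_text, so short chunks like "Ok." keep a consistent tone.

    Hypothetical helper for illustration; not part of Pipecat.
    """

    def __init__(self, max_chars: int = 300):
        self.max_chars = max_chars
        self._history: deque[str] = deque()
        self._history_len = 0

    def build_payload(self, chunk: str) -> dict:
        payload = {"text": chunk}
        if self._history:
            # Give the TTS the preceding speech as prosody context.
            payload["previous_text"] = " ".join(self._history)
        # Record this chunk so the next request can reference it.
        self._history.append(chunk)
        self._history_len += len(chunk) + 1
        # Trim the oldest chunks once the rolling window grows too large.
        while self._history_len > self.max_chars and len(self._history) > 1:
            dropped = self._history.popleft()
            self._history_len -= len(dropped) + 1
        return payload
```

The first chunk of a turn goes out without previous_text; every later chunk, however short, carries the preceding speech, which is exactly the case where tonality currently breaks.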
