
Add support for previous text in elevenlabs http processor #1590


Closed
wants to merge 12 commits into from

Conversation

danthegoodman1
Contributor

Please describe the changes in your PR. If it is addressing an issue, please reference that as well.

Allows the user to optionally provide the context array for the previous n messages that the assistant has said to pass to the previous_text parameter to enable more natural sounding speech with elevenlabs.

Only for the HTTP processor atm.

Closes #1399
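
The idea in this PR can be sketched as a small helper (hypothetical, not the actual pipecat code): pull the last N assistant messages out of the conversation context and pass them as `previous_text` in the ElevenLabs HTTP request body. The `previous_text` field is a real parameter of the ElevenLabs text-to-speech API; the helper name and the `max_previous_text` parameter are illustrative.

```python
def build_tts_payload(text, context_messages, max_previous_text=2):
    """Build an ElevenLabs /v1/text-to-speech request body (sketch).

    `context_messages` is a list of {"role": ..., "content": ...} dicts,
    as in an OpenAI-style context. Only the most recent
    `max_previous_text` assistant messages are folded into previous_text,
    so the accumulated text cannot grow without bound.
    """
    assistant_texts = [
        m["content"] for m in context_messages if m["role"] == "assistant"
    ]
    payload = {"text": text}
    previous = " ".join(assistant_texts[-max_previous_text:])
    if previous:
        payload["previous_text"] = previous
    return payload
```

The resulting dict would be sent as the JSON body of the POST to the text-to-speech endpoint alongside the model and voice settings.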


codecov bot commented Apr 14, 2025

Codecov Report

Attention: Patch coverage is 0% with 11 lines in your changes missing coverage. Please review.

Files with missing lines                | Patch % | Lines
src/pipecat/services/elevenlabs/tts.py  | 0.00%   | 11 Missing ⚠️

Files with missing lines                | Coverage Δ
src/pipecat/services/elevenlabs/tts.py  | 0.00% <0.00%> (ø)

@markbackman
Contributor

@danthegoodman1 in reading the docs, it seems like this feature is intended to provide audio continuity when a single response is split into segments. This does make sense to add and seems like something that should happen automatically.

I'm curious about your proposal. It seems to take previous turns from the context to provide continuity to the output. This seems unexpected as the previous turns may have had different context.

Anyway, I reached out to the 11Labs team to get a better understanding of how this feature should be used.

@danthegoodman1
Contributor Author

danthegoodman1 commented Apr 15, 2025

How would they have different context? It’s captured at generation time.

I can tell you that this massively improves audio quality. You don’t get any more random screaming for short sentences and such.

@markbackman
Contributor

I heard back from the 11Labs team. They're confirming that this feature is intended to ensure audio continuity when a single response is split into segments.

For example, Pipecat breaks streaming LLM responses into sentences and sends each sentence to TTS individually. previous_text is intended to include earlier sentences from the current response (i.e. turn) to maintain speech continuity.

Given this, I think it makes sense to provide previous sentences from the same turn with subsequent generations. But, it seems like providing previous context messages from previous turns isn't the intended use case for this feature.

@markbackman markbackman requested a review from jamsea April 16, 2025 02:42
@markbackman
Contributor

I spent a bunch of time on ElevenLabsHttpTTSService tonight. At the tail end, I added: #1600, which I think is how we want previous_text to be implemented. I'm inclined to close this PR @danthegoodman1 but am interested in your POV first.

@danthegoodman1
Contributor Author

This solution just uses the last N messages from the context; I don’t think you want to infinitely accumulate

@markbackman
Contributor

This solution just uses the last N messages from the context; I don’t think you want to infinitely accumulate

Which solution is that?

@danthegoodman1
Contributor Author

This PR, controlled with context_max_previous_text

@markbackman
Contributor

Right, but context messages are not what should be added to previous_text. It should be previous sentences from the current generation (i.e. bot's turn).

For example, "Hello. I'm chatbot, your assistant. How can I help you today?" would be:

  1. Input: "Hello.", previous_text: ""
  2. Input: "I'm chatbot, your assistant.", previous_text: "Hello."
  3. Input: "How can I help you today?", previous_text: "Hello. I'm chatbot, your assistant."

The previous_text is cleared after the turn ends.

Providing messages from previous turns would skew the response as that information isn't being spoken contextually with the words produced by the TTS.
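
The intra-turn behavior described above can be sketched as a small tracker (a sketch of the idea, not pipecat's actual implementation): each TTS request carries the sentences already spoken in the current turn as previous_text, and the new sentence is then folded into the accumulated text.

```python
class PreviousTextTracker:
    """Accumulates sentences spoken so far in the current bot turn."""

    def __init__(self):
        self._previous_text = ""

    def next_request(self, sentence):
        """Return (text, previous_text) for the next TTS call, then
        append the sentence to the accumulated turn text."""
        previous = self._previous_text
        self._previous_text = (previous + " " + sentence).strip()
        return sentence, previous

    def reset(self):
        """Clear accumulated text when the turn ends."""
        self._previous_text = ""
```

Running the three-sentence example through it reproduces the sequence above: the first call gets an empty previous_text, the second gets "Hello.", and the third gets "Hello. I'm chatbot, your assistant."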

@danthegoodman1
Contributor Author

Ah I see what you mean, I think your solution is better if you can trim the tail to some max length

@markbackman
Contributor

Ah I see what you mean, I think your solution is better if you can trim the tail to some max length

Turns are self-limiting, so I'm really not concerned about that. previous_text resets after an interruption (StartInterruptionFrame, TTSStoppedFrame) or end of turn (LLMFullResponseEndFrame). So, this will prevent the text length from getting out of control.

With that, I'll close this out. I'll leave your issue open (#1399). I'm hoping to get some tips from the 11Labs team on the single word case, as that's something we can still improve on.
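
The reset-on-frame logic described above can be sketched like this. The frame class names match the pipecat frames mentioned in the comment, but here they are stand-in classes defined locally so the example is self-contained; the real pipecat frames live in pipecat.frames.frames.

```python
# Stand-ins for the pipecat frame types referenced above.
class StartInterruptionFrame: pass
class TTSStoppedFrame: pass
class LLMFullResponseEndFrame: pass

class TTSTextFrame:
    def __init__(self, text):
        self.text = text

# Any of these frames means the turn ended or was interrupted,
# so previous_text must start fresh.
RESET_FRAMES = (StartInterruptionFrame, TTSStoppedFrame, LLMFullResponseEndFrame)

class PreviousTextState:
    def __init__(self):
        self.previous_text = ""

    def process(self, frame):
        if isinstance(frame, RESET_FRAMES):
            self.previous_text = ""
        elif isinstance(frame, TTSTextFrame):
            # This frame's text goes to TTS with the current
            # previous_text, then is appended for later requests.
            self.previous_text = (self.previous_text + " " + frame.text).strip()
```

Because every interruption and end-of-turn frame clears the state, the accumulated text is bounded by the length of a single turn, which is the "self-limiting" property noted above.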

@danthegoodman1
Contributor Author

In favor of #1600? Want to get this feature merged one way or another

Development

Successfully merging this pull request may close these issues.

Elevenlabs use previous_text to improve generation
3 participants