Add RecallObservations for retrieval of prompt extensions #6909

enyst · 2025-02-24T02:31:31Z

This change is worth documenting at https://docs.all-hands.dev/
Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Make microagents available in the event stream as recall observations (accessible from the UI, LLM)

This PR proposes to refactor prompt extensions into recalled observations:

take the retrieval of information out of the PromptManager, into a Memory component
PromptManager remains responsible with loading templates and rendering them (just a 'view manager')
Memory subscribes to the stream
- on user messages, it may retrieve extensions and add them to the stream as RecallObservations
- on the first user message, it may retrieve repo and runtime info
- on recall actions (not yet in use, source=agent), it may retrieve e.g. library docs
some logic is separated from the agent (like Move PromptManager to agent controller level #6526)
since the information is in the stream, session restore / refresh / runtime reconnect / etc will not lose it.

Cc: @xingyaoww
I'd love your opinion about this idea. It's basically an alternative to 6526, I was curious to see roughly how it looks like, so we can maybe see if it makes sense.

Cc: @csmith49

MAIN TASKS

define and create RecallObservations and RecallActions
memory component, on_event, initialization
conversation_memory - move context/prompt manager; handle RecallObs, event->message
- ~~RecallObs should be before the user message in the messages to the LLM?~~
adapt agent controller flow
agent config options (enable extensions, disabled microagents)

Other expected outcomes:

no dependency from Runtime or Memory to agent or prompt manager (ref comment)
- Runtime prompt manager
- ~~Runtime plugins~~
- Memory
no access from agent to the stream, direct or indirect

Link of any specific issues this addresses
Fix #6535
Fix #7265
Probably fix #6191
Also fix:

fix first message multiple-insert in context, on session restore
fix microagents multi-insert on any user message, anytime

…eval

…etrieve-prompt

csmith49 · 2025-02-25T18:04:33Z

This looks clean so far. I'm a fan of moving what we can to the event stream: much better visibility than modifying messages in-place, and I think leaning into the pub-sub approach is a good way to add extra functionality without having to tip-toe around the agent/controller/system control flow.

xingyaoww · 2025-02-25T18:50:38Z

I like this idea better than what I did in #6526! Happy to close that PR in favor of this one

…etrieve-prompt

csmith49

🔥 🔥 🔥

This is a really cool change, I'm excited to get some extra visibility in the event stream.

Well done!

enyst · 2025-03-13T20:46:16Z

@xingyaoww I think I addressed everything, I'd love to know what you think. There are also more tests that I separated out, that I keep an eye on, and they went well!

(~~I'll make another pass renaming the new events as Robert suggested on slack, but that won't change anything of substance.~~ Done.)

I'd love to get this in, and then Calvin and I are looking into maybe doing something similar for the condenser summaries, or at least, it has become clear to us that it is a similar problem: just like repo.md gets "lost" from context sometimes after runtime errors or something, the condenser summaries can get lost, and that's why we see that strange agent behavior doing again what it did before.

xingyaoww

LGTM! Thanks so much for this! (Though I kinda still like the more generalized "RecallAction" better -- because environment info fits a bit awkward in "MicroagentObservation" and can be confusing to display in the UI -- I would consider it is more like "PromptExtensionAction/Observation". But happy to go ahead if you think this is good!

xingyaoww · 2025-03-14T15:20:19Z

openhands/memory/conversation_memory.py

+            isinstance(obs, MicroagentObservation)
+            and self.agent_config.enable_prompt_extensions
+        ):
+            if obs.info_type == MicroagentInfoType.ENVIRONMENT:


I think we might eventually name this to other things -- MicroAgentInfo = ENVIRONMENT seems a bit confusing. But let's leave this as is for now :)

I'll give it more thought 🤔

Regardless whether in this PR or not, we will probably want to make some change soon: because this PR doesn't include the system prompt, but you would like to do something similar with it, right?

So we still need Recall.

I think the system prompt just became more important to show to users, with these ContextWindowExceeded right at the start on OpenAI API: ultimately, users able to see and maybe edit it themselves would be good IMHO.

enyst · 2025-03-14T17:02:30Z

(Though I kinda still like the more generalized "RecallAction" better -- because environment info fits a bit awkward in "MicroagentObservation" and can be confusing to display in the UI -- I would consider it is more like "PromptExtensionAction/Observation"

It is a bit strange! I almost stopped at half in the renaming process to think it over, because the LLM interpreted the MicroagentObservation as "the output of a MicroagentAction", which sounds like the microagent acted... That's the same confusion that existed because of the old microagents when people expected the new ones to "act" like the old ones.

These events, IMHO, are about information retrieval: we found the content of a microagent or more, and published it in the stream. Or runtime info, yes!

Microagents can be interpreted as a particular format in which a RecallObservation can find/bring information.

Or, as you said, we could look at the end result (we do this so that eventually they're added to the context), so PromptExtensions.

On the other hand, I can see maybe arguments for it:

we have a similar pattern with DelegateAction and DelegateObservation: the events belong to the parent. That works. So maybe it's perfectly fine, not confusing in that way
the events already include attributes that say "microagent", so it was RecallObservation with RecallType MICROAGENT_KNOWLEDGE, and a list of little dataclasses MicroagentKnowledge, which is maybe... half this style, half that style? I feel like, it wasn't actually generic yet, and it seems like it would affect a PromptExtension name too...
Robert got positive feedback on the name "microagent", maybe it will be good to double down on it for now 😄

(The 'environment' type is an enum value, maybe the UI can pick it up and do something a bit special about that type...)

Idk, the near future does seem to bring more about microagents (e.g. we can maybe handle TASK in a follow-up, custom global directory, retrieving library docs from the internet could be mapped to microagents - I suppose?), so I think maybe it's okay to test it and see where it leads us.

Cc: @rbren

rbren · 2025-03-14T17:31:09Z

I trust your judgement here :)

* Revert "move out new tests" This reverts commit cb3edc0 * move two tests * adapt to MicroagentKnowledge dataclass * adapt deduplication tests for reversed logic * move tests of the new flow from old prompt manager * adapt tests to the latest changes * rename recall -> microagent * add test for first user message * adapt to serialization change

…etrieve-prompt

…owledge when empty

…utput

…tensions (#6909)" This reverts commit cc45f5d.

enyst added 21 commits February 22, 2025 18:48

track used tokens

d80c376

add response_id

bd9fc55

test accumulation

c59abb5

clean up

dba25f5

fix not initialized

38b5198

retrieve tokens usage for an event

b1a18d5

add tests

5b063cc

fix tests

801b134

add recall action and observation

c21ddaf

refactor prompt extensions

16da353

dont want to fight o1 right now, will revisit

c26185d

fix logic

143293d

fix subscriber

66781fc

create memory

956b3b4

rename memory to long term memory

f109a2a

rename main module to memory

bb5817c

rename to memory

0e54bab

refactor prompt manager to manage the view, memory manages info retri…

b95d540

…eval

fix disabled microagents

21c2253

refactor info in the first user message to a recalled observation

d596fd2

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/r…

2c5018f

…etrieve-prompt

enyst marked this pull request as draft February 24, 2025 02:31

enyst mentioned this pull request Feb 24, 2025

Move PromptManager to agent controller level #6526

Closed

enyst added 2 commits February 24, 2025 17:32

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/r…

bec0594

…etrieve-prompt

add memory.py

c25701f

enyst added 3 commits February 25, 2025 21:15

tweak name

88ae5d2

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/r…

83fc613

…etrieve-prompt

add selected_repo command line arg

6cd9ece

enyst added 2 commits March 13, 2025 02:34

adapt template for the dataclass; test only prompt manager in its file

87b65f1

clean up path attribute

50519e7

enyst force-pushed the enyst/retrieve-prompt branch from a103532 to 50519e7 Compare March 13, 2025 14:48

enyst requested review from xingyaoww and csmith49 March 13, 2025 17:29

enyst mentioned this pull request Mar 13, 2025

The context window seems to "reset" to earlier points, rather than later points in time #7175

Closed

csmith49 approved these changes Mar 13, 2025

View reviewed changes

enyst added 2 commits March 14, 2025 03:38

rename recall -> microagent

db94e6d

fix serialization, comparison, small clean up

5377280

xingyaoww approved these changes Mar 14, 2025

View reviewed changes

enyst mentioned this pull request Mar 14, 2025

[Bug]: Poor repository instruction adherence #7265

Closed

1 task

enyst and others added 7 commits March 15, 2025 00:56

tweak

81c152e

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/r…

2cf7a4e

…etrieve-prompt

Fix MicroagentObservation __str__ method to not include microagent_kn…

8e98782

…owledge when empty

Fix test_prompt_manager_template_rendering to match actual template o…

f5554f2

…utput

Fix test_prompt_manager_template_rendering to match the real template

5b9f10d

rename action to recall

438cf47

fix enum, this one is generic

d9405c0

enyst merged commit cc45f5d into All-Hands-AI:main Mar 15, 2025
13 checks passed

enyst mentioned this pull request Mar 15, 2025

Add WorkspaceContext observation for the initial user message #7275

Closed

2 tasks

enyst pushed a commit that referenced this pull request Mar 16, 2025

Revert "Add RecallActions and observations for retrieval of prompt ex…

51d1a99

…tensions (#6909)" This reverts commit cc45f5d.

This was referenced Mar 16, 2025

Debug: Revert last commit to test visual browsing integration test #7279

Closed

Fix visual browsing #7278

Merged

[Bug]: The selected_repo is not saved, so not reloaded #7286

Open

RecallObservations #7292

Merged

'TASK' microagents #7290

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add RecallObservations for retrieval of prompt extensions #6909

Add RecallObservations for retrieval of prompt extensions #6909

enyst commented Feb 24, 2025 •

edited

Loading

csmith49 commented Feb 25, 2025

xingyaoww commented Feb 25, 2025

csmith49 left a comment

enyst commented Mar 13, 2025 •

edited

Loading

xingyaoww left a comment

xingyaoww Mar 14, 2025

enyst Mar 14, 2025

enyst Mar 14, 2025

enyst commented Mar 14, 2025 •

edited

Loading

rbren commented Mar 14, 2025

Add RecallObservations for retrieval of prompt extensions #6909

Add RecallObservations for retrieval of prompt extensions #6909

Conversation

enyst commented Feb 24, 2025 • edited Loading

csmith49 commented Feb 25, 2025

xingyaoww commented Feb 25, 2025

csmith49 left a comment

Choose a reason for hiding this comment

enyst commented Mar 13, 2025 • edited Loading

xingyaoww left a comment

Choose a reason for hiding this comment

xingyaoww Mar 14, 2025

Choose a reason for hiding this comment

enyst Mar 14, 2025

Choose a reason for hiding this comment

enyst Mar 14, 2025

Choose a reason for hiding this comment

enyst commented Mar 14, 2025 • edited Loading

rbren commented Mar 14, 2025

enyst commented Feb 24, 2025 •

edited

Loading

enyst commented Mar 13, 2025 •

edited

Loading

enyst commented Mar 14, 2025 •

edited

Loading