[discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes #2207

tobitege · 2024-06-02T22:15:58Z

This PR adds a new method append_file(content: str) to the agent skills.

In testing OD with some file operations, it is a noticable issue for repeated errors by the model (used Gemini Pro 1.5) that adding content to a file often causes exceptions due to an invalid "start" line number for the edit_file command.

It seems very hard for the model to identify or keep track the total number of lines in a file or it is just not good enough in counting (ask an LLM to count the words of its answer and you know what I mean).
This issue can cause extra cost when it shouldn't.

Such an error then looks like this:

An example prompt to demonstrate the use of it could be like:
"Write the numbers 5 to 10 into a new file named test.txt"
A followup prompt could then just say:
"Append the numbers 20 to 25 to the same file."
If it were to use the edit_file command, chances would be high that it would use the wrong line number(s) again.

Also: fixed some more markdown lints in readme's, added several test methods for append_file.
Pre-commit was clean.

I need help with this PR! I am not a Python developer and not a pro-Linux user! /shamebell
How to run these tests - I haven't figured that out yet.
It isn't clear to me, whether I'm to produce some of the log outputs for demonstration
or I missed some place to add the new method name.
Would be great if someone could assist me or provide advice, thanks a lot in advance! :)

li-boxuan · 2024-06-02T22:27:01Z

(this is not a review) Thank you! I think this could potentially be helpful! Given that we are running a few benchmarks on the current CodeActAgent, I propose we don't merge this (or other PRs attempting to tweak CodeActAgent or edit tools) in a week.

tobitege · 2024-06-02T22:29:12Z

(this is not a review) Thank you! I think this could potentially be helpful! Given that we are running a few benchmarks on the current CodeActAgent, I propose we don't merge this (or other PRs attempting to tweak CodeActAgent or edit tools) in a week.

Absolutely fine with me. Good luck with the benching! :D

li-boxuan · 2024-06-03T05:13:03Z

A counter-argument: agent skills should only accept skills that are non-trivial to implement. Appending could be easily done via python or bash commands, and we probably shouldn't prompt LLM to use it.

tobitege · 2024-06-04T10:39:52Z

A counter-argument: agent skills should only accept skills that are non-trivial to implement. Appending could be easily done via python or bash commands, and we probably shouldn't prompt LLM to use it.

There are several logistical, technical and economic advantages of an explicit append_file command:

An "append" is easier to understand and does not require the LLM to first determine the last line number of a file.
We cannot require or presume, that the targeted file has a trailing blank line (unlike linted Python source files) that could easily be "replaced". But even that blank line might be intended and needed to be preserved.
The LLM may then intransparently decide to replace that last content line ("empty" or not) or wrongly come up with a line number higher than the current line count (as in above screenshot).
It may itself need to create an extra Python script to determine the correct line number, requiring extra turns, raising time and cost. And that potentially for any subsequent part of a repetitive task (e.g. iterate files in folders for summarizations).

Above is "trivial" for us, but with the current implementation it is not a cost-effective and seemingly "harder" way for the LLM to achieve results.

Major question:
Alternatively, could edit_file's behavior be slightly changed, that IF the LLM's start line number is higher then the actual file's line count do an append automatically?
Is the behavior more determined due to CodeActAgent needing Python file generation being strict?

Potential PR changes
When I started this PR, I intentionally stayed away from changing the core edit_file, just to be safe.
However, I could easily imagine the edit_file and potential append_file methods use a protected common method with "append" just being a boolean parameter, which certainly would reduce the agentskills codebase.

li-boxuan · 2024-06-05T04:47:02Z

There are several logistical, technical and economic advantages of an explicit append_file command:

Your points are valid and I agree it is hard to LLM to use edit_file correctly to realize append, but... it could simply do echo "xxx" >> file, right?

tobitege · 2024-06-05T07:35:08Z

There are several logistical, technical and economic advantages of an explicit append_file command:

Your points are valid and I agree it is hard to LLM to use edit_file correctly to realize append, but... it could simply do echo "xxx" >> file, right?

I'll look into it, but we do have the optional linting around it, so not sure if we can cut it down to a simple echo

neubig

I think this is something that's good to experiment with. @tobitege when you're happy with the code we can try to run a benchmark to see the effect on accuracy/cost. Please ping me then!

(blocked by #2085 as well)

tobitege · 2024-06-08T13:22:40Z

Thanks, I'll try to get the code cleaned up a little more first and let you know when ready. 👍

tobitege · 2024-06-08T15:33:04Z

@neubig got my branch updated, make lint'ed, agentskills tests successfull, good for review.

Note: I did add append_file to prompt.py but not with specific examples.

tobitege · 2024-06-08T21:57:42Z

Need to check why integration tests fail, might be merge issue because of outdated mock log files.

tobitege · 2024-06-09T07:36:04Z

My branch is in a bad state. Will re-issue this PR with fresh branch off of main.

Add append_file to CodeActAgent, incl. tests; some markdown fixes

1f94b8a

tobitege added 2 commits June 5, 2024 06:50

Merge branch 'main' into tobi-append

6b3f5cb

Merge branch 'main' into tobi-append

0907ae3

tobitege added 2 commits June 5, 2024 18:09

fix again append tests

4cdf0fe

Merge branch 'main' into tobi-append

f73cb1f

neubig reviewed Jun 8, 2024

View reviewed changes

neubig assigned tobitege Jun 8, 2024

tobitege added 2 commits June 8, 2024 16:41

Merge branch 'main' into tobi-append

2142cdd

small fix in append; docs fixes

0f6147a

neubig assigned neubig and unassigned tobitege Jun 8, 2024

Merge branch 'main' into tobi-append

acb9c48

tobitege requested a review from neubig June 8, 2024 21:57

tobitege marked this pull request as draft June 8, 2024 22:20

tobitege changed the title ~~feat: Add append_file to CodeActAgent, incl. tests; some markdown fixes~~ feat: Add append_file to CodeActAgent, incl. tests; some markdown fixes ON HOLD Jun 8, 2024

tobitege added 3 commits June 9, 2024 01:14

integration tests regen 1

56e205c

integration tests regen 2

bd916c2

Merge branch 'main' into tobi-append

8e96767

tobitege changed the title ~~feat: Add append_file to CodeActAgent, incl. tests; some markdown fixes ON HOLD~~ [discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes Jun 9, 2024

tobitege closed this Jun 9, 2024

tobitege mentioned this pull request Jun 9, 2024

feat: append_file incl. all tests [agentskills] #2346

Merged

tobitege deleted the tobi-append branch June 22, 2024 06:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes #2207

[discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes #2207

Uh oh!

tobitege commented Jun 2, 2024 •

edited

Loading

Uh oh!

li-boxuan commented Jun 2, 2024

Uh oh!

tobitege commented Jun 2, 2024

Uh oh!

li-boxuan commented Jun 3, 2024

Uh oh!

tobitege commented Jun 4, 2024

Uh oh!

li-boxuan commented Jun 5, 2024

Uh oh!

tobitege commented Jun 5, 2024

Uh oh!

neubig left a comment •

edited

Loading

Uh oh!

tobitege commented Jun 8, 2024

Uh oh!

tobitege commented Jun 8, 2024

Uh oh!

tobitege commented Jun 8, 2024 •

edited

Loading

Uh oh!

tobitege commented Jun 9, 2024

Uh oh!

Uh oh!

[discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes #2207

[discarded] Add append_file to CodeActAgent, incl. tests; some markdown fixes #2207

Uh oh!

Conversation

tobitege commented Jun 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

li-boxuan commented Jun 2, 2024

Uh oh!

tobitege commented Jun 2, 2024

Uh oh!

li-boxuan commented Jun 3, 2024

Uh oh!

tobitege commented Jun 4, 2024

Uh oh!

li-boxuan commented Jun 5, 2024

Uh oh!

tobitege commented Jun 5, 2024

Uh oh!

neubig left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tobitege commented Jun 8, 2024

Uh oh!

tobitege commented Jun 8, 2024

Uh oh!

tobitege commented Jun 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tobitege commented Jun 9, 2024

Uh oh!

Uh oh!

tobitege commented Jun 2, 2024 •

edited

Loading

neubig left a comment •

edited

Loading

tobitege commented Jun 8, 2024 •

edited

Loading