Sketch code generator #10824

Merged

merged 16 commits into main from sketch_function_code on Mar 18, 2025
Conversation

aliabid94
Collaborator

Now we can use code generation directly in gradio sketch.

(screen recording: Recording 2025-03-17 at 13 14 14)

I added a "_value_description" property to components so they can describe what values they can have, which the LLM code generator needs to know. This was necessary because just reading a component's docstring is often not enough: the expected value often depends on other kwargs (especially type=).
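For context, here is a minimal sketch of how a code generator could consume this property, falling back to the value's type when no _value_description is set (the helper name and fallback are illustrative; the PR's actual get_value_description takes different arguments and may behave differently):

def get_value_description(component):
    # Hypothetical sketch: prefer the component's explicit description if present,
    # otherwise fall back to the type of its current value.
    description = getattr(component, "_value_description", None)
    if description:
        return description
    return type(component.value).__name__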

@gradio-pr-bot
Collaborator

gradio-pr-bot commented Mar 17, 2025

🪼 branch checks and previews

Name | Status | URL
Spaces | ready! | Spaces preview
Website | ready! | Website preview
🦄 Changes detected! Details

Install Gradio from this PR

pip install https://gradio-pypi-previews.s3.amazonaws.com/8408294762bce28ea5aa66145dfeee681f76e18b/gradio-5.21.0-py3-none-any.whl

Install Gradio Python Client from this PR

pip install "gradio-client @ git+https://github.com/gradio-app/gradio@8408294762bce28ea5aa66145dfeee681f76e18b#subdirectory=client/python"

Install Gradio JS Client from this PR

npm install https://gradio-npm-previews.s3.amazonaws.com/8408294762bce28ea5aa66145dfeee681f76e18b/gradio-client-1.13.1.tgz

Use Lite from this PR

<script type="module" src="https://gradio-lite-previews.s3.amazonaws.com/8408294762bce28ea5aa66145dfeee681f76e18b/dist/lite.js"></script>

@gradio-pr-bot
Collaborator

gradio-pr-bot commented Mar 17, 2025

🦄 change detected

This Pull Request includes changes to the following packages.

Package | Version
gradio | minor
  • Maintainers can select this checkbox to manually select packages to update.

With the following changelog entry.

Sketch code generator

Maintainers or the PR author can modify the PR title to modify this entry.

Something isn't right?

  • Maintainers can change the version label to modify the version bump.
  • If the bot has failed to detect any changes, or if this pull request needs to update multiple packages to different versions or requires a more comprehensive changelog entry, maintainers can update the changelog file directly.

@abidlabs
Member

@aliabid94 works great! Just a few nits on the usage:

  1. Could we lower the duration of the success modals? It feels very long after seeing a few of them. (screenshot)

  2. If you are logged out, you have a nice way to have a user input their HF token. Perhaps mention that the token needs to have permissions to call inference providers. (screenshots)

Likewise, if the token doesn't have permissions to call inference providers and you get an error, it might be a good idea to catch and explain that error (see the sketch after this list).

  3. My main frustration is that sometimes the generated code uses external libraries that are not in the context, which makes the rendered app unusable. Perhaps we should encourage the LLM not to use external libraries that are not gradio dependencies.
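For illustration, a minimal sketch of what catching that permissions error could look like (not this PR's implementation; the helper name, 403 check, and error wording are assumptions, and it relies on huggingface_hub's HfHubHTTPError):

import gradio as gr
from huggingface_hub import InferenceClient
from huggingface_hub.utils import HfHubHTTPError

def generate_code(prompt, token):
    # Hypothetical helper: call the inference provider and surface a clearer
    # message if the token lacks inference-provider permissions.
    client = InferenceClient(token=token)
    try:
        response = client.chat_completion([{"role": "user", "content": prompt}])
        return response.choices[0].message.content
    except HfHubHTTPError as e:
        if e.response is not None and e.response.status_code == 403:
            raise gr.Error(
                "Your Hugging Face token does not have permission to call "
                "inference providers. Please use a token with inference access."
            ) from e
        raise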

@@ -102,6 +102,7 @@ def __init__(
self.width = width
self.color_map = color_map
self.show_fullscreen_button = show_fullscreen_button
self._value_description = "a tuple of type [image: str, annotations: list[tuple[mask: str, label: str]]] where 'image' is the path to the base image and 'annotations' is a list of tuples where each tuple has a 'mask' image filepath and a corresponding label."
Member

Not a huge fan of adding ._value_description, because of the increased maintenance involved. Particularly as the usage of this parameter is quite decoupled from its place in the code. i.e. we are adding these to component classes but we are using these in a completely different part of the code (Sketch). Is it truly necessary -- can't the LLM understand how to combine the docstring with the type of the component?

Suggested change
self._value_description = "a tuple of type [image: str, annotations: list[tuple[mask: str, label: str]]] where 'image' is the path to the base image and 'annotations' is a list of tuples where each tuple has a 'mask' image filepath and a corresponding label."

full_prompt += f"""- index {index} should be: {get_value_description(o[0], o[1])}.\n"""
full_prompt += f"""The function should perform the following task: {prompt}\n"""
full_prompt += "Return only the python code of the function in your response. Do not wrap the code in backticks or include any description before the response. Return ONLY the function code. Start your response with the header provided. Include any imports inside the function.\n"
full_prompt += """If using an LLM would help with the task, use the huggingface_hub library. For example:
Member

I think it's worth adding the other supported tasks here: https://huggingface.co/docs/huggingface_hub/en/guides/inference#supported-providers-and-tasks

I'm sure people will try things like image generation, which right now causes the generated functions to use obscure dependencies:

(screenshot: generated function pulling in obscure dependencies)
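For instance, a sketch of the kind of image-generation example that could go into the prompt (assuming huggingface_hub's InferenceClient.text_to_image, which returns a PIL image that gr.Image accepts directly; not code from this PR):

def generate_image(prompt):
    from huggingface_hub import InferenceClient

    # text_to_image returns a PIL.Image, which can be returned straight to a gr.Image output.
    client = InferenceClient()
    return client.text_to_image(prompt)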

@aliabid94
Collaborator Author

aliabid94 commented Mar 18, 2025

Could we lower the duration of the success modals? Feels very long after seeing a few of them:

Done

If you are logged out, you have a nice way to have a user input their HF token. Perhaps mention that the token needs to have permissions to call inference providers

Done.

Likewise, if the token doesn't have permissions to call inference providers and you get an error, it might be a good idea to catch and explain that error

Done.

My main frustration is that sometimes the generated code uses external libraries that are not in the context, which makes the rendered app unusable. Perhaps we should encourage the LLM not to use external libraries that are not gradio dependencies

Done.

Not a huge fan of adding ._value_description, because of the increased maintenance involved. Particularly as the usage of this parameter is quite decoupled from its place in the code. i.e. we are adding these to component classes but we are using these in a completely different part of the code (Sketch). Is it truly necessary -- can't the LLM understand how to combine the docstring with the type of the component?

I don't love it either, but the docstring description of value and its type together aren't enough. By default, the type of value is used if no _value_description is provided (see how gr.Textbox doesn't need a _value_description, for example, because it is enough to say that value will just be a string). But for other components, many other kwargs are needed to know the final type. For example, the LLM needs to know the choices of a dropdown so that it knows which values the function can receive (or return, if the dropdown is an output), and it needs to know whether the dropdown is multiselect to know whether to expect/return a list or a single string.
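For illustration, a minimal sketch of how a dropdown might build such a description from its kwargs (the wording is hypothetical, not necessarily the exact strings used in this PR):

# Hypothetical sketch inside gr.Dropdown.__init__
choices_str = ", ".join(repr(c) for c in (self.choices or []))
if self.multiselect:
    self._value_description = f"a list containing any of the following choices: {choices_str}"
else:
    self._value_description = f"one of the following choices: {choices_str}"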

I think it's worth adding the other supported tasks here

Ok, I plan on adding most tasks in a follow-up PR, but I've added image generation for now.

@freddyaboulton
Collaborator

Nice @aliabid94 ! Just tried out a couple of examples (image gen via API, chatbot via inference provider, simple image editing) and the experience was great. I also don't love ._value_description but was impressed at how well the generated function inferred the correct types.

Some feedback on the experience:

  • When you go back to edit a sketch after "Save and Render", it would be nice if the function tab also opened.
  • It would be nice if the LLM's generation used the previous generation in its context. In my simple image editing example, I asked the LLM to generate code that either flipped an image, applied a sepia filter, or turned it black and white depending on a drop down value. After saving and rendering, I noticed the sepia logic had a bug. So I went back and pasted the error message and told it to fix the sepia logic. But it rewrote everything and although it fixed the sepia filter it broke the black and white filter lol.
  • The LLM doesn't know about streaming outputs. It generated the correct code for using the LLM inference providers but then when I asked it to stream the response it generated this:
def chat(multimodaltextbox, chatbot):
    import huggingface_hub
    from io import StringIO
    import sys

    # Initialize the inference client
    client = huggingface_hub.InferenceClient()

    # Prepare the input for the chat model
    messages = chatbot + [{'role': 'user', 'content': multimodaltextbox['text']}]

    # Function to capture streaming output
    class Capturing(list):
        def __enter__(self):
            self._stdout = sys.stdout
            sys.stdout = self._stringio = StringIO()
            return self
        def __exit__(self, *args):
            self.extend(self._stringio.getvalue().splitlines())
            del self._stringio    # free up some memory
            sys.stdout = self._stdout

    # Capture the streaming response
    with Capturing() as output:
        response = client.chat_completion(messages, stream=True)

    # Append each streamed response to the chatbot
    for line in output:
        chatbot.append({'role': 'assistant', 'content': line})

    # Clear the textbox for the next message
    cleared_textbox = {'text': '', 'files': []}

    return chatbot, cleared_textbox
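For comparison, a streaming handler in Gradio would typically be a generator that yields the growing chat history as chunks arrive. A rough sketch (assuming huggingface_hub's streaming chat_completion and the Chatbot "messages" format; not code from this PR):

def chat(multimodaltextbox, chatbot):
    import huggingface_hub

    client = huggingface_hub.InferenceClient()

    # Add the user's message, plus an empty assistant message to fill in as chunks arrive.
    chatbot = chatbot + [
        {"role": "user", "content": multimodaltextbox["text"]},
        {"role": "assistant", "content": ""},
    ]

    # chat_completion(..., stream=True) yields chunks; yielding from this function
    # lets Gradio update the Chatbot incrementally.
    for chunk in client.chat_completion(chatbot[:-1], stream=True):
        delta = chunk.choices[0].delta.content or ""
        chatbot[-1]["content"] += delta
        yield chatbot, {"text": "", "files": []}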

@aliabid94
Collaborator Author

When you go back to edit a sketch after "Save and Render", it would be nice if the function tab also opened.

Done.

It would be nice if the LLM's generation used the previous generation in its context. In my simple image editing example, I asked the LLM to generate code that either flipped an image, applied a sepia filter, or turned it black and white depending on a drop down value. After saving and rendering, I noticed the sepia logic had a bug. So I went back and pasted the error message and told it to fix the sepia logic. But it rewrote everything and although it fixed the sepia filter it broke the black and white filter lol.

The LLM doesn't know about streaming outputs. It generated the correct code for using the LLM inference providers but then when I asked it to stream the response it generated this:

I think there's still a bit of work to be done on improving the prompting, I'll do these in a follow up PR

@aliabid94 aliabid94 merged commit 4d78710 into main Mar 18, 2025
21 of 22 checks passed
@aliabid94 aliabid94 deleted the sketch_function_code branch March 18, 2025 20:16
freddyaboulton pushed a commit that referenced this pull request Mar 20, 2025
* changes

* changes

* add changeset

* changes

* changes

* changes

* changes

* changes

* changes

* changes

---------

Co-authored-by: Ali Abid <[email protected]>
Co-authored-by: gradio-pr-bot <[email protected]>
Co-authored-by: Abubakar Abid <[email protected]>
freddyaboulton added a commit that referenced this pull request Mar 21, 2025
* WIP

* Fix

* roughdraft

* Workinig

* query params

* add changeset

* modify

* revert

* lint

* Code

* Fix

* lint

* Add code

* Fix

* Fix python unit tests

* Update `markupsafe` dependency version (#10820)

* changes

* add changeset

* type

* add changeset

---------

Co-authored-by: gradio-pr-bot <[email protected]>

* Adds a watermark parameter to `gr.Chatbot` that is added to copied text (#10814)

* changes

* add changeset

* format'

* test

* copy

* changes

* doc

* format

---------

Co-authored-by: gradio-pr-bot <[email protected]>

* Fix gr.load_chat (#10829)

* changes

* add changeset

---------

Co-authored-by: Ali Abid <[email protected]>
Co-authored-by: gradio-pr-bot <[email protected]>

* Fix typo in docstring of Request class in route_utils.py (#10833)

* Fix cell menu not showing in non-editable dataframes (#10819)

* remove editable condition

* - add test
- improve html semantics

* add changeset

* fix test

* fix test

* - fix test
- fix column widths changing on sort

* swap e2e for story

---------

Co-authored-by: gradio-pr-bot <[email protected]>

* Sketch code generator (#10824)

* changes

* changes

* add changeset

* changes

* changes

* changes

* changes

* changes

* changes

* changes

---------

Co-authored-by: Ali Abid <[email protected]>
Co-authored-by: gradio-pr-bot <[email protected]>
Co-authored-by: Abubakar Abid <[email protected]>

* chore: update versions (#10811)

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* minor fixes

* fix

* Add guide

* Minor tweaks

* Address comments

---------

Co-authored-by: gradio-pr-bot <[email protected]>
Co-authored-by: Abubakar Abid <[email protected]>
Co-authored-by: aliabid94 <[email protected]>
Co-authored-by: Ali Abid <[email protected]>
Co-authored-by: Abdesselam Benameur <[email protected]>
Co-authored-by: Hannah <[email protected]>
Co-authored-by: Gradio PR Bot <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Labels: None yet
Projects: None yet
4 participants