Added support for google specific arguments for video analysis #2110

Sumered · 2025-07-01T15:46:37Z

Added support for Google-model-specific arguments when processing video. The arguments are:

`media_resolution`: a model setting. Setting it to LOW instead of the default HIGH results in roughly 3x fewer input tokens consumed for video input.
`fps`: a video-specific setting, set via `vendor_metadata` in FileUrl/BinaryContent, that controls frame sampling. The default is 1.0 for Google models. A lower value decreases the number of input video tokens; a higher value increases analysis quality for highly dynamic videos.
`start_offset`: a video-specific setting, set via `vendor_metadata` in FileUrl/BinaryContent, that controls the start offset of the video. Useful for capping token consumption per video. According to the docs, the value must end with `s`, e.g. `300s`.
`end_offset`: a video-specific setting, set via `vendor_metadata` in FileUrl/BinaryContent, that controls the end offset of the video. Useful for capping token consumption per video. It uses the same format as `start_offset`.
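For illustration, the three per-video keys could be assembled and sanity-checked before being handed over as `vendor_metadata`. This is only a sketch: `build_video_metadata` is a hypothetical helper, not part of this PR, and the offset pattern (digits, optional fraction, trailing `s`) is my assumption about the accepted format.

```python
from __future__ import annotations

import re
from typing import Any

# Assumed offset format: a number of seconds followed by "s", e.g. "300s" or "1.5s".
_OFFSET_RE = re.compile(r"\d+(\.\d+)?s")


def build_video_metadata(
    fps: float | None = None,
    start_offset: str | None = None,
    end_offset: str | None = None,
) -> dict[str, Any]:
    """Hypothetical helper: return a dict holding only the keys that were set."""
    metadata: dict[str, Any] = {}
    if fps is not None:
        if fps <= 0:
            raise ValueError("fps must be positive")
        metadata["fps"] = float(fps)
    for key, value in (("start_offset", start_offset), ("end_offset", end_offset)):
        if value is None:
            continue
        if not _OFFSET_RE.fullmatch(value):
            raise ValueError(f"{key} must look like '300s', got {value!r}")
        metadata[key] = value
    return metadata
```

The resulting dict, for example `build_video_metadata(fps=0.5, start_offset="60s", end_offset="300s")`, is what would be passed as `vendor_metadata` on the file object.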

Official Google docs for those new arguments:
https://ai.google.dev/gemini-api/docs/video-understanding
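To make the token impact concrete, here is a rough back-of-the-envelope estimator. The per-frame and per-second constants are what I recall from the Google docs linked above (about 258 tokens per frame at default resolution, 66 at low, plus 32 audio tokens per second); treat them as assumptions and check the docs before relying on them. `estimate_video_tokens` is a hypothetical function, not part of this PR.

```python
# Assumed costs, recalled from Google's video-understanding docs; verify before use.
TOKENS_PER_FRAME = {"default": 258, "low": 66}
AUDIO_TOKENS_PER_SECOND = 32


def estimate_video_tokens(
    duration_seconds: float, fps: float = 1.0, resolution: str = "default"
) -> int:
    """Rough estimate: sampled frames times per-frame cost, plus audio cost."""
    frames = duration_seconds * fps
    frame_tokens = frames * TOKENS_PER_FRAME[resolution]
    audio_tokens = duration_seconds * AUDIO_TOKENS_PER_SECOND
    return round(frame_tokens + audio_tokens)
```

Under these assumptions a 60-second clip at 1 fps costs about 17400 tokens at default resolution and about 5880 at low resolution, which is in line with the roughly 3x reduction mentioned in the description.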

pydantic_ai_slim/pydantic_ai/messages.py

pydantic_ai_slim/pydantic_ai/models/google.py


Sumered · 2025-07-02T16:02:58Z

I'm still not sure why those tests are failing, so I'd be thankful if you could explain what I should do about them.

DouweM · 2025-07-02T16:05:45Z

pydantic_ai_slim/pydantic_ai/models/google.py

+                            start_offset=item.vendor_metadata.get('start_offset', None),
+                            end_offset=item.vendor_metadata.get('end_offset', None),
+                        )
+                        inline_data_dict['video_metadata'] = video_metadata  # type: ignore


Would this work?

Suggested change

-                        inline_data_dict['video_metadata'] = video_metadata  # type: ignore
+                        inline_data_dict['video_metadata'] = item.vendor_metadata  # type: ignore
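To show what the two variants do differently, here is a small sketch with plain dicts standing in for the real types. Both function names are hypothetical; the first mirrors the explicit `.get(...)` copying in the diff above, the second mirrors the suggested pass-through.

```python
from __future__ import annotations

from typing import Any


def copy_known_keys(vendor_metadata: dict[str, Any]) -> dict[str, Any]:
    """Explicit variant: always emits the three known keys, None when unset."""
    return {
        key: vendor_metadata.get(key)
        for key in ("fps", "start_offset", "end_offset")
    }


def pass_through(vendor_metadata: dict[str, Any]) -> dict[str, Any]:
    """Suggested variant: forwards exactly what the caller provided."""
    return dict(vendor_metadata)
```

With `{"fps": 2.0, "start_offset": "10s"}` as input, the explicit variant adds `end_offset: None`, while the pass-through keeps only the two keys that were set and would also forward any key the vendor adds later without a code change.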

DouweM · 2025-07-02T16:15:22Z

pydantic_ai_slim/pydantic_ai/messages.py

+    vendor_metadata: dict[str, Any] | None = None
+    """The vendor specific metadata for the file.
+    Currently supports only those keys:
+
+    fps: float,
+    start_offset: str (ex. 1800s),
+    end_offset: str (ex. 1800s)
+
+    And works only for google models for video analysis.
+    """


I'd prefer something like this:

Suggested change

-    vendor_metadata: dict[str, Any] | None = None
-    """The vendor specific metadata for the file.
-    Currently supports only those keys:
-
-    fps: float,
-    start_offset: str (ex. 1800s),
-    end_offset: str (ex. 1800s)
-
-    And works only for google models for video analysis.
-    """
+    vendor_metadata: dict[str, Any] | None = None
+    """Vendor-specific metadata for the file.
+
+    Supported by:
+    - `GoogleModel`: `VideoUrl.vendor_metadata` is used as `video_metadata`: https://ai.google.dev/gemini-api/docs/video-understanding#customize-video-processing
+    """

DouweM · 2025-07-02T16:18:47Z

uv.lock

@@ -1191,20 +1191,21 @@ wheels = [

 [[package]]
 name = "google-genai"
-version = "1.15.0"
+version = "1.23.0"


The test failure suggests that with the genai update, the previously recorded cassettes don't match anymore because post != POST... Can you try replacing method: POST with method: post in all the tests/models/cassettes/test_google/*.yaml files?
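A minimal sketch of that bulk replacement, in case it helps. The function names are hypothetical, and the regex assumes each cassette records the method on its own line as `method: POST`; review the resulting diff before committing.

```python
import re
from pathlib import Path

# Matches a whole line such as "    method: POST", leaving other uses of "POST" alone.
_METHOD_LINE = re.compile(r"^([ \t]*method:[ \t]*)POST[ \t]*$", re.MULTILINE)


def lowercase_post_method(text: str) -> str:
    """Return the cassette text with `method: POST` rewritten to `method: post`."""
    return _METHOD_LINE.sub(r"\1post", text)


def rewrite_cassettes(directory: Path) -> list[Path]:
    """Rewrite every *.yaml file in `directory` in place; return the changed paths."""
    changed: list[Path] = []
    for path in sorted(directory.glob("*.yaml")):
        original = path.read_text(encoding="utf-8")
        updated = lowercase_post_method(original)
        if updated != original:
            path.write_text(updated, encoding="utf-8")
            changed.append(path)
    return changed
```

From the repository root, it would be run as `rewrite_cassettes(Path("tests/models/cassettes/test_google"))`.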

Sumered and others added 4 commits July 1, 2025 16:39

Added support for google specific arguments for video analysis

cd2093a

Formatting

fa9df0f

Merge branch 'main' into google-video-processing-arguments

c15fbad

Changed snapshot in tests

ec379f3

Sumered mentioned this pull request Jul 2, 2025

Add media processing settings to GoogleModelSettings #2017

Open

DouweM requested changes Jul 2, 2025

DouweM self-assigned this Jul 2, 2025

DouweM added the awaiting author revision label Jul 2, 2025

Sumered and others added 2 commits July 2, 2025 17:27

Merge branch 'main' into google-video-processing-arguments

28f1fbc

Removed VendorMetadata class and replaced it with dict + simplified c…

664ec63

…ode a bit

DouweM requested changes Jul 2, 2025
