feat: add image input support for Gemini and Vertex providers #421

y-sudharshan · 2025-09-09T10:56:29Z

Accept base64, URL, and bytes for image input
Update payload construction and validation
Add documentation and tests for image scenarios
Ensure backward compatibility with text-only input

Description

This pull request adds image input support to the Google Gemini and Vertex providers in any-llm. The changes include:

Extending the input interface to accept images as base64-encoded strings, URLs, or raw bytes in the [messages] parameter.
Updating the payload construction and validation logic to handle image data according to Gemini and Vertex API requirements.
Implementing error handling for invalid image formats, corrupt data, and unsupported types.
Updating documentation (README and API docs) with clear instructions and examples for sending images.
Adding unit and integration tests for valid and invalid image input scenarios.
Ensuring backward compatibility so existing text-only workflows remain unaffected.
These changes enable multimodal (text + image) support for Gemini and Vertex, making any-llm more flexible for developers.

Checklist

I have added unit tests that prove my fix/feature works
New and existing tests pass locally
Documentation was updated where necessary
I have read and followed the contribution guidelines```

- Accept base64, URL, and bytes for image input - Update payload construction and validation - Add documentation and tests for image scenarios - Ensure backward compatibility with text-only input

codecov · 2025-09-10T11:33:18Z

Codecov Report

❌ Patch coverage is 15.00000% with 17 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/any_llm/providers/gemini/utils.py	15.00%	13 Missing and 4 partials ⚠️

Files with missing lines	Coverage Δ
src/any_llm/providers/gemini/utils.py	`55.08% <15.00%> (-33.92%)`	⬇️

... and 27 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-09-25T00:21:09Z

This PR is stale because it has been open 7 days with no activity. Remove stale label or comment or this will be closed in 3 days.

github-actions · 2025-10-08T00:21:18Z

This PR is stale because it has been open 7 days with no activity. Remove stale label or comment or this will be closed in 3 days.

github-actions · 2025-10-18T00:20:00Z

This PR is stale because it has been open 7 days with no activity. Remove stale label or comment or this will be closed in 3 days.

y-sudharshan and others added 2 commits September 9, 2025 16:20

feat: add image input support for Gemini and Vertex providers

a3e4d8d

- Accept base64, URL, and bytes for image input - Update payload construction and validation - Add documentation and tests for image scenarios - Ensure backward compatibility with text-only input

Merge branch 'main' into feature-gemini-vertex-image-support

b057df2

github-actions bot added the Stale label Sep 25, 2025

y-sudharshan added 3 commits September 27, 2025 15:07

feat: update source code, documentation, scripts, and tests

59483ea

chore: finalize merge and stage changes after syncing with remote branch

3879603

resolve issues

931ca79

github-actions bot removed the Stale label Sep 30, 2025

github-actions bot added the Stale label Oct 8, 2025

y-sudharshan added 2 commits October 10, 2025 21:28

gemini tool choice

de8e7b6

unit tests

ec9dde6

github-actions bot removed the Stale label Oct 11, 2025

github-actions bot added the Stale label Oct 18, 2025

y-sudharshan closed this Oct 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add image input support for Gemini and Vertex providers #421

feat: add image input support for Gemini and Vertex providers #421

Uh oh!

y-sudharshan commented Sep 9, 2025 •

edited

Loading

Uh oh!

codecov bot commented Sep 10, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 25, 2025

Uh oh!

github-actions bot commented Oct 8, 2025

Uh oh!

github-actions bot commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: add image input support for Gemini and Vertex providers #421

feat: add image input support for Gemini and Vertex providers #421

Uh oh!

Conversation

y-sudharshan commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

codecov bot commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Sep 25, 2025

Uh oh!

github-actions bot commented Oct 8, 2025

Uh oh!

github-actions bot commented Oct 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

y-sudharshan commented Sep 9, 2025 •

edited

Loading

codecov bot commented Sep 10, 2025 •

edited

Loading