docs(video-analyzer): add Video Analyzer application example #600

Open. ochoaughini wants to merge 1 commit into main from docs/video-analyzer-example.

ochoaughini

Description of the change

Adds a new documentation page — Video Analyzer — under site/en/gemini-api/docs/applications.
The page walks developers through using the Gemini API to:

  • Transcribe video audio to text (subtitles).
  • Generate scene descriptions with the vision model.
  • Produce a summary and build an embeddings index for semantic search.
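The last step above, semantic search over an embeddings index, can be sketched roughly as follows. This is a minimal illustration and not the code from the PR: `toy_embed` is a hypothetical bag-of-words stand-in for a real embedding call (in the actual page the vectors would presumably come from the Gemini API), and the segment texts are invented sample data.

```python
import math
from collections import Counter


def toy_embed(text: str) -> Counter:
    """Hypothetical stand-in for a real embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_index(segments):
    """Pair every text segment (e.g. a scene description) with its vector."""
    return [(seg, toy_embed(seg)) for seg in segments]


def search(index, query, top_k=1):
    """Return the top_k segments ranked by similarity to the query."""
    q = toy_embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [seg for seg, _ in ranked[:top_k]]


# Invented sample data standing in for transcript or scene-description text.
SEGMENTS = [
    "a chef chops onions in a kitchen",
    "a dog runs across a sunny beach",
    "the presenter summarizes quarterly sales figures",
]
INDEX = build_index(SEGMENTS)
```

Calling `search(INDEX, "dog on the beach")` ranks the beach segment first. Swapping `toy_embed` for a real embedding model would leave the indexing and ranking logic unchanged, provided `cosine` is adapted to dense vectors.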

Motivation

Developers frequently ask for an end-to-end example that combines multimodal (audio + vision) capabilities in a single app.
This doc fills that gap, showcasing best practices and giving them a runnable reference implementation.

Type of change

Documentation

Checklist

  • I have performed a self-review of my changes.
  • I have added detailed comments where applicable.
  • I have verified that my change does not break existing code.
  • My PR is based on the latest changes of the main branch (git pull --rebase upstream main).
  • I am familiar with the Google Style Guide for the language I have coded in.
  • I have read through the Contributing Guide and signed the Contributor License Agreement.

@ochoaughini requested a review from a team as a code owner on July 30, 2025, 07:46.

google-cla bot commented Jul 30, 2025

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@github-actions bot added the status:awaiting review and component:documentation labels on Jul 30, 2025.
@ochoaughini force-pushed the docs/video-analyzer-example branch from 852ce42 to 052515e on July 30, 2025, 07:57.
@ochoaughini (author) left a comment:

Thanks for taking a look!
This doc is meant to showcase a full multimodal workflow—audio + vision—in one place.
Happy to tweak wording, add more code, or adjust placement as needed. Appreciate your feedback!

@github-actions bot removed the status:awaiting review label on Jul 30, 2025.
@ochoaughini reopened this on Jul 30, 2025.
@github-actions bot added the status:awaiting review label on Jul 30, 2025.
Labels: component:documentation, status:awaiting review