Back to guide/General Productivity

Video/Audio Analysis

Gemini can process video and audio natively — a unique advantage over text-only models. The timestamped index makes long media files searchable and skimmable.

gemini-promptsmedia_type
Edit View
Prompt
Analyze the attached {{media_type}} and provide:

1. **Timeline summary**: Key moments with timestamps (format: [MM:SS])
   - What happens at each moment
   - Any text or graphics that appear on screen
   - Speaker changes (if multiple speakers)

2. **Full transcript** (if audio/spoken content exists):
   - Include speaker labels: [Speaker 1], [Speaker 2], etc.
   - Note [inaudible] segments and [background noise/music]
   - Include timestamps every 30 seconds

3. **Content analysis**:
   - Main topics discussed (with timestamp ranges)
   - Key claims or statements made
   - Action items or decisions mentioned
   - Tone/sentiment shifts throughout

4. **Searchable index**: Create a topic index so I can jump to specific parts:
   Topic -> Timestamp -> One-line summary

Variables to customize

{{media_type}}

Why this prompt works

Gemini can process video and audio natively — a unique advantage over text-only models. The timestamped index makes long media files searchable and skimmable.

Save this prompt to your library

Organize, version, and access your best prompts across ChatGPT, Claude, and Cursor.