Video/Audio Analysis
Gemini can process video and audio natively — a unique advantage over text-only models. The timestamped index makes long media files searchable and skimmable.
Analyze the attached {{media_type}} and provide:
1. **Timeline summary**: Key moments with timestamps (format: [MM:SS])
- What happens at each moment
- Any text or graphics that appear on screen
- Speaker changes (if multiple speakers)
2. **Full transcript** (if audio/spoken content exists):
- Include speaker labels: [Speaker 1], [Speaker 2], etc.
- Note [inaudible] segments and [background noise/music]
- Include timestamps every 30 seconds
3. **Content analysis**:
- Main topics discussed (with timestamp ranges)
- Key claims or statements made
- Action items or decisions mentioned
- Tone/sentiment shifts throughout
4. **Searchable index**: Create a topic index so I can jump to specific parts:
Topic -> Timestamp -> One-line summaryVariables to customize
Why this prompt works
Gemini can process video and audio natively — a unique advantage over text-only models. The timestamped index makes long media files searchable and skimmable.
Save this prompt to your library
Organize, version, and access your best prompts across ChatGPT, Claude, and Cursor.
Related prompts
Forcing the agent to plan before acting prevents premature execution and wasted steps. Explicit dependency mapping enables parallel execution and catches logical gaps early.
Tool Selection AgentThe ReAct pattern (Reason + Act) creates an explicit reasoning trace that improves tool selection accuracy. The error-handling rule prevents infinite retry loops.
Prompt CompressorExplicitly requiring all functional requirements to be preserved prevents the model from over-compressing and losing critical instructions.
Memory Management AgentExplicit memory read/write instructions create agents that improve over time. Categorization keeps memories organized, and the deduplication rule prevents context bloat.