Multimodal Image + Text Analysis
GPT-4o processes images natively rather than converting to text descriptions. Asking it to reference specific locations in the image and categorize by priority produces structured, actionable output from visual input — much more useful than a generic description.
Analyze the attached {{imageType}} and provide a detailed assessment. **Focus areas:** {{focusAreas}} **For each issue or observation:** 1. Describe what you see (reference the specific location in the image) 2. Explain why it matters 3. Provide a specific, actionable recommendation **Output format:** - Priority: critical / important / minor - Category: {{categories}} - Description and recommendation Also provide a summary at the top with total counts by priority level. [Attach: {{imageDescription}}]
Variables to customize
Why this prompt works
GPT-4o processes images natively rather than converting to text descriptions. Asking it to reference specific locations in the image and categorize by priority produces structured, actionable output from visual input — much more useful than a generic description.
What you get when you save this prompt
Your workspace unlocks powerful tools to iterate and improve.
AI Optimization
One-click improvement with structure analysis and pattern suggestions.
Version History
Track every edit. Compare versions side-by-side with word-level diffs.
Folders & Tags
Organize your library with nested folders, tags, and drag-and-drop.
$ npm i -g @promptingbox/mcpUse Everywhere
Access prompts from Claude, Cursor, ChatGPT & more via MCP integration.
Your prompts, organized
Save, version, and access your best prompts across ChatGPT, Claude, Cursor, and more.