Multimodal Looker
claude sonnet
Visual analysis agent for images, PDFs, diagrams, and screenshots with detailed description and comparison capabilities.
Install
curl -o ~/.claude/agents/multimodal-looker.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/multimodal-looker.md
Description
<Agent_Prompt>
You interpret media files that cannot be read as plain text.
Your job: examine the attached file and extract ONLY what was requested.
When to use you
- Media files the Read tool cannot interpret
- Extracting specific information or summaries from documents
- Describing visual content in images or diagrams
- When analyzed/extracted data is needed, not raw file contents
When NOT to use you
- Source code or plain text files needing exact contents (use Read)
- Files that need editing afterward (need literal content from Read)
- Simple file reading where no interpretation is needed
How you work
- Receive a file path and a goal describing what to extract
- Read and analyze the file deeply
- Return ONLY the relevant extracted information
- The main agent never processes the raw file — you save context tokens
File type guidelines
- PDFs: Extract text, structure, tables, data from specific sections
- Images: Describe layouts, UI elements, text, diagrams, charts
- Diagrams: Explain relationships, flows, architecture depicted
- Screenshots: Identify UI state, error messages, visual bugs
Response rules
- Return extracted information directly, no preamble
- If info not found, state clearly what's missing
- Match the language of the request
- Be thorough on the goal, concise on everything else
Your output goes straight to the main agent for continued work.
</Agent_Prompt>
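The "When to use you" and "When NOT to use you" rules in the prompt can be approximated with a rough first-pass check on the file extension. The sketch below is only an illustration: the `route_file` function and its extension list are hypothetical and not part of the agent, and a real decision also depends on intent (for example, whether the file will be edited afterward).

```shell
#!/bin/sh
# Hypothetical helper: choose between the Read tool and the multimodal-looker
# agent using only the file extension. The extension list is an assumption
# made for illustration; it does not come from the agent prompt.
route_file() {
  # Lowercase the path so that .PNG and .png are treated alike.
  path=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$path" in
    *.pdf|*.png|*.jpg|*.jpeg|*.gif|*.webp)
      # Media the Read tool cannot interpret as plain text.
      echo "multimodal-looker" ;;
    *)
      # Source code and plain text: exact contents are needed, so use Read.
      echo "read" ;;
  esac
}
```

For example, `route_file docs/architecture.PNG` prints `multimodal-looker`, while `route_file src/main.py` prints `read`.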
Capabilities
- Media files the Read tool cannot interpret
- Extracting specific information or summaries from documents
- Describing visual content in images or diagrams
- When analyzed/extracted data is needed, not raw file contents
Related Items
From the same repository — designed to work together
curl -o ~/.claude/agents/multimodal-looker.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/multimodal-looker.md && \
curl -o ~/.claude/agents/oracle.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/oracle.md && \
curl -o ~/.claude/agents/atlas.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/atlas.md && \
curl -o ~/.claude/agents/prometheus.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/prometheus.md && \
curl -o ~/.claude/agents/metis.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/metis.md && \
curl -o ~/.claude/agents/librarian.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/librarian.md && \
curl -o ~/.claude/agents/boss.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/core/boss.md
Oracle
Strategic technical advisor for architecture decisions and complex debugging with deep reasoning capabilities.
curl -o ~/.claude/agents/oracle.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/oracle.md
Atlas
Master task orchestrator for delegation and coordination across multiple specialized sub-agents with priority-based scheduling.
curl -o ~/.claude/agents/atlas.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/atlas.md
Prometheus
Strategic planning consultant with interview-based workflow that extracts requirements through structured questioning before creating detailed plans.
curl -o ~/.claude/agents/prometheus.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/prometheus.md
Metis
Pre-planning intent analyst that detects ambiguity, clarifies requirements, and ensures alignment before work begins.
curl -o ~/.claude/agents/metis.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/metis.md
Librarian
Open-source codebase understanding agent that analyzes GitHub repositories with evidence-based documentation.
curl -o ~/.claude/agents/librarian.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/omo/librarian.md
Boss
Dynamic meta-orchestrator that classifies intent, selects optimal models, and delegates to specialized sub-agents with full context management.
curl -o ~/.claude/agents/boss.md https://raw.githubusercontent.com/sehoon787/my-claude/main/agents/core/boss.md
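The per-agent curl commands above all follow the same pattern, so they could be expressed as a loop. The following is a minimal sketch, not an official installer: the base URL and agent paths are copied from the commands above, while the `agent_url` and `install_agents` functions and the `AGENTS_DIR` variable are hypothetical names introduced here. It also creates the target directory first, since `curl -o` does not create missing directories.

```shell
#!/bin/sh
# Base URL and agent paths are taken from the curl commands listed above.
# AGENTS_DIR, agent_url, and install_agents are illustrative names.
BASE_URL="https://raw.githubusercontent.com/sehoon787/my-claude/main/agents"
AGENTS_DIR="${AGENTS_DIR:-$HOME/.claude/agents}"

# Print the raw GitHub URL for a repo-relative agent path such as "omo/oracle".
agent_url() {
  printf '%s/%s.md\n' "$BASE_URL" "$1"
}

# Download every listed agent, stopping at the first failure.
install_agents() {
  mkdir -p "$AGENTS_DIR" || return 1
  for agent in omo/multimodal-looker omo/oracle omo/atlas omo/prometheus \
               omo/metis omo/librarian core/boss; do
    name=${agent##*/}   # strip the directory part, e.g. "omo/oracle" -> "oracle"
    # -f fails on HTTP errors instead of saving an error page,
    # -sS is quiet but still reports errors, -L follows redirects.
    curl -fsSL -o "$AGENTS_DIR/$name.md" "$(agent_url "$agent")" || return 1
  done
}
```

After sourcing the snippet, running `install_agents` would fetch all seven files into `~/.claude/agents` (or into `$AGENTS_DIR` if that variable is set).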