The `multimodal-specialist` agent in Orchestkit has no image-generation table, so OpenAI's `gpt-image-1` and the upcoming `gpt-image-2` are not covered. Its only OpenAI image functionality is analysis (GPT-5 vision), which leaves generation and editing, including input-fidelity controls and tool-use compatibility, unrepresented.

## Context

`multimodal-specialist` currently lists Vision, Audio, Video Generation, and Embedding model tables, but **no image-generation table**. OpenAI's `gpt-image-1` (and the upcoming `gpt-image-2`) is a notable gap: it covers both generation and editing in one API, with input-fidelity controls and tool-use compatibility.

Today the agent's only OpenAI image touchpoint is *analysis* (GPT-5 vision). Generation is unrepresented: Kling 3.0 / Sora 2 / Veo 3.1 / Runway are video, not still image. Users asking "generate a hero image for this PR" get no guidance from the agent.

## Proposed scope

**Phase 1, agent + skill update (small):**

- Add **Image Generation** table to `src/agents/multimodal-specialist.md` (between Vision and Audio tables) covering GPT Image 2, Imagen 4, Flux 1.1 Pro, Ideogram 3, Recraft V3, with cost/use-case columns.
- Update `multimodal-specialist` description frontmatter to mention image generation.
- Update `src/skills/multimodal-llm/SKILL.md` with the Open