# Benchmark Agent UI Design Prompts

<!-- model: gemini, aspectRatio: 16:9, count: 1 -->

## doc-qa-agent
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "doc-qa-agent" — document Q&A system. Dark theme. Three-column layout: left sidebar lists uploaded PDFs as file cards with page count badges and hover states. Center column is a PDF viewer with yellow-highlighted text chunks showing which passages were retrieved. Right column is a chat panel — user messages right-aligned in zinc-700 bubbles, AI answers left-aligned in zinc-800 cards. Tool calls shown as collapsible Accordion rows labeled "🔍 Searched 847 chunks · 12ms". Source citations rendered as small yellow Badge chips [p.42] [p.17] inline in the answer text. Top navbar: document title breadcrumb, Upload PDF Button, embedding progress bar. Streaming answer has an animated blinking cursor. Color palette: zinc-950 background, zinc-800 cards, yellow-400 highlights, white text. Shadcn-style components throughout. High fidelity web UI screenshot.

## customer-support-agent
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "customer-support-agent" — AI customer support dashboard. Warm light theme. Three-panel layout: left sidebar lists active conversations with user avatar, name, green/yellow/red status dot, and last message preview truncated. Center panel is a chat window — rounded message bubbles, agent messages in indigo-50 with left avatar, user messages right-aligned in white. Above chat: a segmented confidence bar component (High / Medium / Low) with color fill and label. Orange outlined "Escalate to Human 🙋" Button appears when confidence drops to Low. Right panel: three large stat cards — Resolution Rate 87% with upward arrow, Avg Response 1.2 min with sparkline, Open Tickets 14 with downward trend. Below stats: Tabs component — Active | Resolved | Escalated. Color palette: white background, indigo-600 accent, warm gray-100 sidebar. Polished shadcn SaaS product UI screenshot.

## deploy-monitor
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "deploy-monitor" — AI deployment monitoring and incident responder. Dark high-density ops dashboard. Top row: four stat cards — Uptime 99.97% in green with pulse dot, P95 Latency 142ms in yellow, Active Incidents 1 in red with animated ring, Deployments Today 7 in white. Below: full-width recharts-style line chart showing response time with a red shaded anomaly region and vertical "INCIDENT" marker. Left panel: scrollable endpoints Table — URL, status Badge (Healthy/Degraded/Down), last check timestamp, inline sparkline. Right panel: dark Card labeled "AI Incident Responder" with monospaced font streaming log lines like "Analyzing build #4821..." and a vertical Stepper: Logs ✓ → Builds ✓ → Deploys → Root Cause (spinning). Color palette: slate-950 background, red-500 alerts, green-400 healthy, white text. Dense ops dashboard screenshot.

## multi-model-router
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "multi-model-router" — AI model comparison playground. Dark theme with vibrant columns. Top bar: wide Textarea input "Enter your prompt..." with a bold "Race Models ▶" Button and model selector checkboxes. Main area: four equal-width columns — GPT-4o (blue top border), Claude (orange), Gemini (green), Llama (purple). Each column: colored Badge header with model name and version, a live streaming text content area with blinking cursor, footer row with tokens/sec displayed as a mini bar chart and cost "$0.0042" in dim text. Thin horizontal progress bar at column top races to 100% as response completes. Left sidebar: history list showing past races with winner highlighted in gold. Vote thumbs-up/down buttons at bottom of each column. Top right navbar: "Session cost: $0.18". Neutral-900 background, model accent colors. Screenshot.

## slack-pr-reviewer
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "slack-pr-reviewer" — AI pull request review bot dashboard. Clean light developer theme. Left sidebar: PR list — author avatar, PR title truncated to one line, repo name in dim text, colored status Badge: Reviewed (green) / Pending (yellow) / Flagged (red). Main area: PR detail view. Two Tabs — "AI Review" and "Thread". AI Review tab: diff-style code block with red/green line backgrounds, then below it an AI analysis section with three collapsible Alert components — 🔴 Bug: "Null pointer on line 47" / 🟡 Security: "Exposed API key" / 🟢 Style: "Rename variable". Thread tab: Slack-style chat messages with avatar, username, timestamp, threaded indent replies. Right sidebar: mini bar chart PRs by repo, stat "Avg review time: 4 min". Top navbar: platform toggle buttons Slack | Discord | GitHub. Screenshot.

## content-pipeline
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "content-pipeline" — AI content production pipeline dashboard. Light warm stone theme. Top: horizontal 5-step Stepper — Brief Submitted (green check) → Research (green check) → Draft (blue spinner, active) → Editorial Review (gray) → Published (gray). Active step "Draft" expands below the stepper into a Card showing a rich text article preview with a streaming cursor, word count badge, and Retry/Pause Buttons. Left sidebar: article brief list — title, status chip, deadline badge, author avatar. Right panel: stacked cards — OG Image Preview showing a social card mockup with title and image, then Social Posts with Twitter and LinkedIn card previews. Bottom-right: Toast notification "✅ Research complete — 8 sources found". Warm stone-100 background, amber-500 active step, sage-green completed steps. Screenshot.

## feature-rollout
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "feature-rollout" — feature flag and A/B testing management tool. Dark theme with confident typography. Top Tabs: Flags | Experiments | Analytics. Stats row: 7 active experiments, 2 ready to ship, 1 paused. Main area: dense data Table — Flag Name, Status (shadcn Toggle switch on/off), Rollout % (thin inline Progress bar with percentage label), Variants (A / B chips), Conversion % with delta. A row is selected and a Sheet slides in from the right: large Donut chart showing variant A 68% vs B 32%, Rollout Slider 0-100% set to 45%, Segment checkboxes (Power Users ✓, Free Tier ✓, Enterprise unchecked), then an "AI Recommendation" Card with badge "🚀 Ship Variant A — 94% confidence" and two-line reasoning text. Gray-900 background, violet-500 accents, green winning, red losing. Screenshot.

## event-driven-crm
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "event-driven-crm" — event-driven CRM with AI churn prediction. Light clean enterprise SaaS. Left sidebar: customer list — avatar, full name, company name, and a small colored health dot (green/yellow/red). Main area: customer profile page. Top row: avatar, name "Sarah Chen", company "Acme Corp", plan Badge "Pro", edit Button. Stats row: MRR $2,400 | Open Tickets 3 | Last Login 12 days ago. Below: horizontal Timeline component — nodes on a line with icons and dates: 🟢 Signup, 🟢 Purchase $240, 🟠 Support Ticket #882, 🔴 Churn Signal "Cancelled onboarding". Each node hoverable with a tooltip Card. Right panel: circular gauge chart "Health Score 34/100" in red, below it a red Alert Banner "Churn Risk: HIGH — 73% probability". Then: recommended actions list with Buttons: Send Re-engagement Email, Schedule Call, Apply Discount. White background, sky-blue accents. Screenshot.

## code-sandbox-tutor
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "code-sandbox-tutor" — interactive AI coding tutor for students. Light, playful but professional web app. Left sidebar: lesson curriculum tree — expandable sections with icons 🔵 Loops 🟢 Functions 🟡 Arrays, lesson items with completion checkmarks and lock icons for locked lessons. Currently active lesson highlighted. Center top: Monaco-style code editor with syntax-highlighted JavaScript, line numbers, and the student's code visible. Center bottom: split output panel — left half "Console Output" with printed output lines, right half "Live Preview" iframe showing rendered HTML result. Right sidebar: AI tutor chat — bot avatar labeled "Codey 🤖", chat bubbles with hints, a yellow Alert card "💡 Hint: check your loop bounds". Prominent green "▶ Run Code" Button above editor. Orange "🔧 Auto-fix" Button below error. Top bar: "Lesson 4: Arrays" title, XP bar 340/500, streak "🔥 7 days". Screenshot.

## multi-agent-research
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "multi-agent-research" — multi-agent parallel research orchestrator. Dark visualization-heavy theme. Center: a node-graph canvas (react-flow style) — central oval node "What causes inflation?" connected by animated dashed lines to four agent nodes arranged around it: blue rounded Card "🌐 Web Search — RUNNING", amber Card "📄 Document Analysis — DONE", purple Card "🗄️ Knowledge Base — WAITING", white Card "🧠 Orchestrator — SYNTHESIZING". Each Card shows agent name, colored status chip, and a one-line preview of findings. Below the graph: a "Synthesis Report" streaming output area with markdown formatting — headers, bullet points, and inline citations [1][2][3] rendered as superscript blue badges. Left sidebar: research history with timestamps. Top: question input with "Start Research ▶" Button. Status bar bottom: "4 agents · 23 sources · 2 min 14 sec". Slate-900 background. Screenshot.

## discord-game-master
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "discord-game-master" — AI tabletop RPG game master web companion app. Dark fantasy theme but as a real web app. Left sidebar: "The Party" section — four character cards each with fantasy portrait avatar, character name "Theron the Bold", class icon ⚔️🧙🏹🛡️, HP bar in red (65/100), XP bar in gold (340/500). Center: game feed as a chat-style scrollable log. GM narration in italic amber serif text. Player actions in white monospace. A dice roll result Card: large d20 icon, roll "18" in huge text, "+3 STR modifier", total "21" in bold, green "SUCCESS" Badge. Player input row at bottom with quick-action Buttons: ⚔️ Attack | 🔍 Investigate | 💬 Persuade | ✨ Custom. Right panel: "Current Scene" with a dungeon map image in a Card. Below: Initiative Order as a ranked list — colored dots, name, HP badge. Top navbar: "Campaign: Curse of Strahd", session timer "02:14:33", "🎨 Generate Scene Art" Button. Gray-950 background, amber-400 accents. Screenshot.

## compliance-auditor
<!-- model: gemini, aspectRatio: 16:9, count: 1 -->
Web app UI mockup for "compliance-auditor" — AI compliance auditing system for SaaS platforms. Light professional institutional theme. Sticky top Alert banner: "🚫 2 Critical findings blocking deploy — Review required" in red. Stats row: large "Compliance Score 94/100" in a green Badge, "Last Audit: 2 hours ago", "Open: 3 Critical · 7 Warnings · 42 Passed". Main area: full-width shadcn Table — columns: ID (#C-047), Category Badge (Infrastructure/Code/Secrets), Severity Badge (Critical red / Warning amber / Pass green), Description text, Status chip (Open/Resolved/Blocked). A row is expanded inline showing AI explanation paragraph and "Remediation Steps" as a numbered list with a "Mark Resolved" Button. Left sidebar: audit schedule — cron job list with next-run timestamps and play Button. Right panel: "Approval Workflow" vertical Stepper — Audit ✓ → AI Review ✓ → Human Approval (active spinner) → Deploy Unblocked. "Export PDF Report" Button at bottom. White background, red-600 critical, amber warning, emerald pass. Screenshot.
