Files
hermes-detective/docs/ideas/008-visual-detective.md
shoko ecfd0b1160 feat: Initial commit - Hermes Detective Agency concept
- Hermes Detective Agency: Open-ended mystery investigation game
- Roles: Chief (human), Witness (Kimi), Detective (Hermes)
- 5 difficulty levels, community cases, open-ended solving
- Scoring: Alignment %, Evidence %, Time
- Features: Retry, Journal, Observe mode
- Tech: Kimi Vision + Hermes Agent + Pollinations

Changelog:
- Research phase: Kimi capabilities, Hermes agent, image APIs
- Brainstorming: 14 ideas explored
- Comparison matrix: Detective selected as winner
- Concept finalized with all design decisions
2026-04-20 00:00:30 +00:00

8.5 KiB

Idea 008: Visual Detective

Date: 2026-04-19

Concept

Upload a "crime scene" or mystery image. Kimi analyzes every detail. Hermes pieces together clues and generates a detective story/hypothesis.

Why Strong

  • Heavy visual analysis (Kimi reads the scene)
  • Low reasoning (observation, not complex logic)
  • Storytelling naturally fits step-by-step
  • Mystery genre = engaging

User Flow

  1. Upload image (or get random daily mystery)
  2. Kimi: "I see a broken window, muddy footprints, overturned chair..."
  3. Hermes: "Based on these clues, here's what likely happened..."
  4. Output: Detective story with visual evidence

Tech

  • Kimi Vision: Scene analysis
  • Hermes: Narrative orchestration
  • Pollinations: Generate mystery images

Unique?

  • Nobody's doing "AI detective" with your photos
  • Could be daily mystery + community solving

009: Image Tarot Reader

Date: 2026-04-19

Concept

Upload any image. AI interprets it like a tarot card reading.

Why Strong

  • Fun/flirty, low stakes
  • Heavy visual analysis (Kimi interprets symbolism)
  • Storytelling fits perfectly
  • Shareable results

User Flow

  1. Upload image OR random draw
  2. Kimi: Analyzes composition, colors, objects, mood
  3. Hermes: "This represents [Tarot card]. Your reading: [Narrative]"
  4. Output: Tarot card + 3-card spread interpretation

Step-by-Step

🃏 Drawing your card...
👁️ Analyzing your image...

Visual Elements Detected:
• A winding road (path in life)
• Setting sun (endings/new beginnings)
• Standing figure (you, the observer)

🎴 Your Card: The Fool
Interpretation: A new journey awaits. Trust the path ahead...

Past: Confusion about direction
Present: Standing at the crossroads
Future: Leap of faith required

Tech

  • Kimi Vision: Symbol analysis
  • Hermes: Tarot narrative generation
  • Pollinations: Generate thematic card visuals

010: Color Emotion Translator

Date: 2026-04-19

Concept

Upload image. AI analyzes dominant colors and translates them into emotions/mood.

Why Strong

  • Pure visual analysis
  • Art/design focused
  • Generates color palette + emotion report
  • Useful for designers

User Flow

  1. Upload image
  2. Kimi: Extracts colors, analyzes saturation, harmony
  3. Hermes: Translates to emotions, generates palette
  4. Output: Color palette + emotion breakdown + suggested uses

Step-by-Step

🔍 Scanning colors...
🎨 Extracting dominant palette...

Detected Colors:
• #2D4A3E (Deep Forest Green) - 45%
• #F5E6D3 (Warm Cream) - 30%
• #8B4513 (Saddle Brown) - 15%
• #CD853F (Peru Gold) - 10%

🎭 Emotional Profile:
Primary: Grounded, natural, calm
Secondary: Warm, nostalgic, organic
Accent: Vintage, artisanal, trustworthy

💡 Recommendations:
• Brand Identity for eco-friendly products
• Interior design: cozy cabin aesthetic
• Packaging: artisanal food products

011: Before/After Time Machine

Date: 2026-04-19

Concept

Upload an old/historical photo. AI shows what it would look like today or vice versa.

Why Strong

  • Historical/educational angle
  • Visual transformation is compelling
  • Shows AI's understanding of time/changes

User Flow

  1. Upload old OR new photo
  2. Select transformation direction
  3. Kimi: Analyzes context, era, subject
  4. Hermes: Predicts/adapts to target era
  5. Output: Side-by-side transformation

Step-by-Step

📸 Analyzing source image...
📅 Detected era: 1950s New York Street

Identifying elements:
• Black & white photography style
• Vintage automobiles (1950s models)
• Fashion: fedoras, swing coats
• Architecture: Art Deco buildings

🔮 Projecting to 2024...

Transformation breakdown:
• Colorization: Added natural skin tones + sky colors
• Vehicles: Replaced with modern equivalents
• Architecture: Updated signage, added modern elements
• Fashion: Modernized while preserving style

✨ Your 1950s scene in 2024!

012: Visual Haiku Generator

Date: 2026-04-19

Concept

Upload any image. AI generates a haiku (5-7-5) based on visual elements.

Why Strong

  • Minimal reasoning, pure visual
  • Artistic/creative output
  • Japanese aesthetic + AI = unique
  • Highly shareable

User Flow

  1. Upload image
  2. Kimi: Analyzes scene, mood, elements
  3. Hermes: Crafts haiku (strict 5-7-5)
  4. Output: Image + haiku + syllable breakdown

Step-by-Step

🖼️ Analyzing your image...

Scene Elements:
• Autumn forest path
• Golden leaves falling
• Soft morning light through trees

✍️ Crafting haiku...

Forest whispers
Golden footsteps on leaves—
Silence speaks loud

📝 Syllable breakdown:
"Forest" (2) - whisper (2)
s(1) - il(1) -ence (1) - speaks (1) - loud (1)
"Golden" (2) - foot (1) -steps (1) - on (1) - leaves (1)
(5) - (7) - (5) ✅

013: Image Alchemy

Date: 2026-04-19

Concept

Upload two random images. AI "fuses" them into a new concept based on their shared elements.

Why Strong

  • Surprising/comedic combinations
  • Pure visual + semantic analysis
  • Unique creative output
  • Viral potential

User Flow

  1. Upload image A (or random)
  2. Upload image B (or random)
  3. Kimi: Analyzes both separately
  4. Hermes: Finds connections, creates fusion
  5. Output: New concept + fused image prompt

Step-by-Step

🌀 Analyzing Image A: A Viking ship
• Norse aesthetic
• Ocean voyage
• Historical warrior culture

🌀 Analyzing Image B: A Coffee shop
• Cozy atmosphere
• Barista craft
• Modern social space

🔮 Alchemizing...

Found connections:
• Craft (warrior's craft → barista's craft)
• Ritual (battle ritual → coffee ritual)
• Journey (ocean voyage → daily commute)

⚗️ Alchemy Result:

"THE VIKING BARISTA"

A warrior of the morning,
steering through storms of exhaustion,
claiming the sacred cup.

Your coffee shop serves mead in horn-shaped mugs,
the barista wears a helmet of foam,
and every latte is a conquest.

014: Visual Lie Detector

Date: 2026-04-19

Concept

Upload a photo + claim. AI analyzes if the image supports or contradicts the claim.

Why Strong

  • Useful in era of fake news
  • Pure visual verification
  • Educational about image analysis
  • "Is this real?" tool

User Flow

  1. Paste claim + upload image
  2. Kimi: Analyzes image details
  3. Hermes: Compares claim vs evidence
  4. Output: Verdict + reasoning

Step-by-Step

🔍 Analyzing claim: "This photo was taken in Paris"

🔬 Image Analysis:
• Architecture: Haussmannian buildings ✓
• Street signs: French ✓
• License plates: European format ✓
• Language: French on signs ✓
• Vegetation: Consistent with Paris climate ✓
• Shadows: Consistent with claimed time of day ✓

✅ VERDICT: LIKELY AUTHENTIC

Confidence: 94%
Supporting evidence: 8/8 elements match
Caveats: Metadata not verified

015: Object Archaeology

Date: 2026-04-19

Concept

Upload an object close-up. AI identifies it, tells its history/story.

Why Strong

  • Educational
  • Heavy visual (identification + knowledge)
  • Discovery/antiquities angle
  • Could work with museum APIs

User Flow

  1. Upload object photo
  2. Kimi: Visual identification + details
  3. Hermes: Tells object's "story"
  4. Output: Identity + history narrative

Step-by-Step

🔍 Scanning object...

Visual Analysis:
• Material: Ceramic
• Style: Ming Dynasty blue and white
• Pattern: Dragon with cloud motifs
• Technique: Underglaze blue

🏺 Object Identified:
Ming Dynasty (1368-1644) Blue and White Porcelain
Dragon Pattern Bowl

📜 The Story:
This bowl was crafted during the reign of Emperor Wanli,
at the height of Jingdezhen's porcelain production.
The dragon motif signifies imperial power and protection...

[Full historical narrative]

Quick Comparison Matrix

# Name Visual Reasoning Uniqueness Fun
007 Spot the Difference Heavy Light 9/10 8/10
008 Visual Detective Heavy Light 8/10 9/10
009 Image Tarot Heavy Light 8/10 10/10
010 Color Emotion Medium Light 7/10 7/10
011 Before/After Heavy Medium 8/10 8/10
012 Visual Haiku Heavy Light 9/10 8/10
013 Image Alchemy Heavy Light 10/10 10/10
014 Lie Detector Heavy Medium 9/10 8/10
015 Object Archaeology Heavy Medium 8/10 8/10

My top picks for uniqueness + fun:

  1. 013 Image Alchemy — Most unique, viral potential
  2. 009 Image Tarot — Fun, shareable, low friction
  3. 007 Spot the Difference — Game + AI demonstration
  4. 014 Visual Lie Detector — Useful, educational

What stands out to you?