feat: Initial commit - Hermes Detective Agency concept

- Hermes Detective Agency: Open-ended mystery investigation game
- Roles: Chief (human), Witness (Kimi), Detective (Hermes)
- 5 difficulty levels, community cases, open-ended solving
- Scoring: Alignment %, Evidence %, Time
- Features: Retry, Journal, Observe mode
- Tech: Kimi Vision + Hermes Agent + Pollinations

Changelog:
- Research phase: Kimi capabilities, Hermes agent, image APIs
- Brainstorming: 14 ideas explored
- Comparison matrix: Detective selected as winner
- Concept finalized with all design decisions

397
docs/ideas/008-visual-detective.md
Normal file
@@ -0,0 +1,397 @@
# Idea 008: Visual Detective

**Date:** 2026-04-19

## Concept

Upload a "crime scene" or mystery image. Kimi analyzes every detail. Hermes pieces together clues and generates a detective story/hypothesis.

## Why Strong

- Heavy visual analysis (Kimi reads the scene)
- Low reasoning (observation, not complex logic)
- Storytelling naturally fits step-by-step
- Mystery genre = engaging

## User Flow

1. Upload image (or get random daily mystery)
2. Kimi: "I see a broken window, muddy footprints, overturned chair..."
3. Hermes: "Based on these clues, here's what likely happened..."
4. Output: Detective story with visual evidence

## Tech

- Kimi Vision: Scene analysis
- Hermes: Narrative orchestration
- Pollinations: Generate mystery images

## Unique?

- Nobody's doing "AI detective" with your photos
- Could be daily mystery + community solving

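
The two-stage flow for this idea (a vision model lists clues, a second model turns them into a story) could be wired together roughly as follows. This is only a sketch: `investigate`, `CaseReport`, `fake_kimi`, and `fake_hermes` are hypothetical names, and the two stand-in functions replace real model calls whose APIs are not described here.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CaseReport:
    clues: List[str]  # observations from the vision stage
    story: str        # narrative from the reasoning stage


def investigate(
    image_path: str,
    observe: Callable[[str], List[str]],
    narrate: Callable[[List[str]], str],
) -> CaseReport:
    """Run the two stages in order: observe the scene, then explain it."""
    clues = observe(image_path)
    if not clues:
        raise ValueError("no clues observed; nothing to build a story from")
    return CaseReport(clues=clues, story=narrate(clues))


# Stand-ins for the real model calls (hypothetical; no vendor API is assumed).
def fake_kimi(image_path: str) -> List[str]:
    return ["broken window", "muddy footprints", "overturned chair"]


def fake_hermes(clues: List[str]) -> str:
    return "Based on these clues: " + ", ".join(clues) + "."


report = investigate("scene.jpg", fake_kimi, fake_hermes)
print(report.story)
```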
---
# Idea 009: Image Tarot Reader

**Date:** 2026-04-19

## Concept

Upload any image. AI interprets it like a tarot card reading.

## Why Strong

- Fun/flirty, low stakes
- Heavy visual analysis (Kimi interprets symbolism)
- Storytelling fits perfectly
- Shareable results

## User Flow

1. Upload image OR random draw
2. Kimi: Analyzes composition, colors, objects, mood
3. Hermes: "This represents [Tarot card]. Your reading: [Narrative]"
4. Output: Tarot card + 3-card spread interpretation

## Step-by-Step

```
🃏 Drawing your card...
👁️ Analyzing your image...

Visual Elements Detected:
• A winding road (path in life)
• Setting sun (endings/new beginnings)
• Standing figure (you, the observer)

🎴 Your Card: The Fool
Interpretation: A new journey awaits. Trust the path ahead...

Past: Confusion about direction
Present: Standing at the crossroads
Future: Leap of faith required
```

## Tech

- Kimi Vision: Symbol analysis
- Hermes: Tarot narrative generation
- Pollinations: Generate thematic card visuals

---
# Idea 010: Color Emotion Translator

**Date:** 2026-04-19

## Concept

Upload image. AI analyzes dominant colors and translates them into emotions/mood.

## Why Strong

- Pure visual analysis
- Art/design focused
- Generates color palette + emotion report
- Useful for designers

## User Flow

1. Upload image
2. Kimi: Extracts colors, analyzes saturation, harmony
3. Hermes: Translates to emotions, generates palette
4. Output: Color palette + emotion breakdown + suggested uses

## Step-by-Step

```
🔍 Scanning colors...
🎨 Extracting dominant palette...

Detected Colors:
• #2D4A3E (Deep Forest Green) - 45%
• #F5E6D3 (Warm Cream) - 30%
• #8B4513 (Saddle Brown) - 15%
• #CD853F (Peru Gold) - 10%

🎭 Emotional Profile:
Primary: Grounded, natural, calm
Secondary: Warm, nostalgic, organic
Accent: Vintage, artisanal, trustworthy

💡 Recommendations:
• Brand Identity for eco-friendly products
• Interior design: cozy cabin aesthetic
• Packaging: artisanal food products
```
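
The "Detected Colors" percentages above could also be computed deterministically instead of being left to the vision model. The sketch below is one possible approach, not the design chosen in this document: it snaps each RGB channel to a coarse bucket and counts pixels per bucket. The function name `dominant_colors` and the `step` bucket width are assumptions made for illustration, and the pixel list is synthetic rather than decoded from a real image.

```python
from collections import Counter
from typing import Iterable, List, Tuple

RGB = Tuple[int, int, int]


def dominant_colors(
    pixels: Iterable[RGB], top: int = 4, step: int = 32
) -> List[Tuple[str, float]]:
    """Return the `top` most common colors as (hex, percent) pairs.

    Each channel is snapped to the centre of a `step`-wide bucket so that
    near-identical shades are counted together.
    """
    def snap(v: int) -> int:
        return min(255, (v // step) * step + step // 2)

    counts = Counter((snap(r), snap(g), snap(b)) for r, g, b in pixels)
    total = sum(counts.values())
    if total == 0:
        return []
    return [
        (f"#{r:02X}{g:02X}{b:02X}", round(100 * n / total, 1))
        for (r, g, b), n in counts.most_common(top)
    ]


# Synthetic 100-pixel "image" using the four colors from the sample output.
pixels = (
    [(0x2D, 0x4A, 0x3E)] * 45
    + [(0xF5, 0xE6, 0xD3)] * 30
    + [(0x8B, 0x45, 0x13)] * 15
    + [(0xCD, 0x85, 0x3F)] * 10
)
palette = dominant_colors(pixels)
for hex_code, percent in palette:
    print(hex_code, percent)
```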
---
# Idea 011: Before/After Time Machine

**Date:** 2026-04-19

## Concept

Upload an old/historical photo. AI shows what it would look like today or vice versa.

## Why Strong

- Historical/educational angle
- Visual transformation is compelling
- Shows AI's understanding of time/changes

## User Flow

1. Upload old OR new photo
2. Select transformation direction
3. Kimi: Analyzes context, era, subject
4. Hermes: Predicts/adapts to target era
5. Output: Side-by-side transformation

## Step-by-Step

```
📸 Analyzing source image...
📅 Detected era: 1950s New York Street

Identifying elements:
• Black & white photography style
• Vintage automobiles (1950s models)
• Fashion: fedoras, swing coats
• Architecture: Art Deco buildings

🔮 Projecting to 2024...

Transformation breakdown:
• Colorization: Added natural skin tones + sky colors
• Vehicles: Replaced with modern equivalents
• Architecture: Updated signage, added modern elements
• Fashion: Modernized while preserving style

✨ Your 1950s scene in 2024!
```
---
# Idea 012: Visual Haiku Generator

**Date:** 2026-04-19

## Concept

Upload any image. AI generates a haiku (5-7-5) based on visual elements.

## Why Strong

- Minimal reasoning, pure visual
- Artistic/creative output
- Japanese aesthetic + AI = unique
- Highly shareable

## User Flow

1. Upload image
2. Kimi: Analyzes scene, mood, elements
3. Hermes: Crafts haiku (strict 5-7-5)
4. Output: Image + haiku + syllable breakdown

## Step-by-Step

```
🖼️ Analyzing your image...

Scene Elements:
• Autumn forest path
• Golden leaves falling
• Soft morning light through trees

✍️ Crafting haiku...

The forest whispers
Golden footsteps on the leaves
Silence speaks aloud

📝 Syllable breakdown:
Line 1: The (1) + for-est (2) + whis-pers (2) = 5
Line 2: Gol-den (2) + foot-steps (2) + on (1) + the (1) + leaves (1) = 7
Line 3: Si-lence (2) + speaks (1) + a-loud (2) = 5
(5) - (7) - (5) ✅
```
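
Because the flow promises a strict 5-7-5 haiku, the generated lines are worth checking before they are shown. A minimal sketch of such a check follows. It assumes the model is asked to return each word hyphenated by syllable (for example `for-est`), which sidesteps unreliable automatic syllable counting; `count_syllables` and `is_haiku` are hypothetical helper names.

```python
from typing import List

HAIKU_PATTERN = (5, 7, 5)


def count_syllables(line: str) -> int:
    """Count syllables in a line whose words are hyphenated by syllable."""
    return sum(
        len([part for part in word.split("-") if part])
        for word in line.split()
    )


def is_haiku(lines: List[str]) -> bool:
    """True if the lines have exactly 5, 7 and 5 syllables, in that order."""
    return tuple(count_syllables(line) for line in lines) == HAIKU_PATTERN


good = [
    "The for-est whis-pers",
    "Gol-den foot-steps on the leaves",
    "Si-lence speaks a-loud",
]
short = ["For-est whis-pers", "Gol-den foot-steps on leaves", "Si-lence speaks loud"]

print(is_haiku(good))   # three lines counted as 5, 7, 5
print(is_haiku(short))  # three lines counted as 4, 6, 4
```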
---
# Idea 013: Image Alchemy

**Date:** 2026-04-19

## Concept

Upload two random images. AI "fuses" them into a new concept based on their shared elements.

## Why Strong

- Surprising/comedic combinations
- Pure visual + semantic analysis
- Unique creative output
- Viral potential

## User Flow

1. Upload image A (or random)
2. Upload image B (or random)
3. Kimi: Analyzes both separately
4. Hermes: Finds connections, creates fusion
5. Output: New concept + fused image prompt

## Step-by-Step

```
🌀 Analyzing Image A: A Viking ship
• Norse aesthetic
• Ocean voyage
• Historical warrior culture

🌀 Analyzing Image B: A Coffee shop
• Cozy atmosphere
• Barista craft
• Modern social space

🔮 Alchemizing...

Found connections:
• Craft (warrior's craft → barista's craft)
• Ritual (battle ritual → coffee ritual)
• Journey (ocean voyage → daily commute)

⚗️ Alchemy Result:

"THE VIKING BARISTA"

A warrior of the morning,
steering through storms of exhaustion,
claiming the sacred cup.

Your coffee shop serves mead in horn-shaped mugs,
the barista wears a helmet of foam,
and every latte is a conquest.
```
---
# Idea 014: Visual Lie Detector

**Date:** 2026-04-19

## Concept

Upload a photo + claim. AI analyzes if the image supports or contradicts the claim.

## Why Strong

- Useful in era of fake news
- Pure visual verification
- Educational about image analysis
- "Is this real?" tool

## User Flow

1. Paste claim + upload image
2. Kimi: Analyzes image details
3. Hermes: Compares claim vs evidence
4. Output: Verdict + reasoning

## Step-by-Step

```
🔍 Analyzing claim: "This photo was taken in Paris"

🔬 Image Analysis:
• Architecture: Haussmannian buildings ✓
• Street signs: French ✓
• License plates: European format ✓
• Language: French on signs ✓
• Vegetation: Consistent with Paris climate ✓
• Shadows: Consistent with the apparent time of day ✓

✅ VERDICT: LIKELY AUTHENTIC

Confidence: 94%
Supporting evidence: 6/6 elements match
Caveats: Metadata not verified
```
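
The step from individual checks to a verdict could be made explicit with a simple tally. The sketch below is illustrative only: the `verdict` function, the `threshold` value, and the "LIKELY FALSE", "INCONCLUSIVE", and "INSUFFICIENT EVIDENCE" labels are assumptions, and it does not reproduce however the confidence figure in the sample output would be derived.

```python
from typing import Dict, Tuple


def verdict(checks: Dict[str, bool], threshold: float = 0.8) -> Tuple[str, str]:
    """Summarize evidence checks as (label, 'matched/total')."""
    if not checks:
        return "INSUFFICIENT EVIDENCE", "0/0"
    matched = sum(checks.values())  # True counts as 1, False as 0
    total = len(checks)
    ratio = matched / total
    if ratio >= threshold:
        label = "LIKELY AUTHENTIC"
    elif ratio <= 1 - threshold:
        label = "LIKELY FALSE"
    else:
        label = "INCONCLUSIVE"
    return label, f"{matched}/{total}"


paris_checks = {
    "architecture": True,
    "street signs": True,
    "license plates": True,
    "language": True,
    "vegetation": True,
    "shadows": True,
}
label, tally = verdict(paris_checks)
print(label, tally)
```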
---
# Idea 015: Object Archaeology

**Date:** 2026-04-19

## Concept

Upload an object close-up. AI identifies it, tells its history/story.

## Why Strong

- Educational
- Heavy visual (identification + knowledge)
- Discovery/antiquities angle
- Could work with museum APIs

## User Flow

1. Upload object photo
2. Kimi: Visual identification + details
3. Hermes: Tells object's "story"
4. Output: Identity + history narrative

## Step-by-Step

```
🔍 Scanning object...

Visual Analysis:
• Material: Ceramic
• Style: Ming Dynasty blue and white
• Pattern: Dragon with cloud motifs
• Technique: Underglaze blue

🏺 Object Identified:
Ming Dynasty (1368-1644) Blue and White Porcelain
Dragon Pattern Bowl

📜 The Story:
This bowl was crafted during the reign of Emperor Wanli,
at the height of Jingdezhen's porcelain production.
The dragon motif signifies imperial power and protection...

[Full historical narrative]
```
---
## Quick Comparison Matrix

| # | Name | Visual | Reasoning | Uniqueness | Fun |
|---|------|--------|-----------|------------|-----|
| 007 | Spot the Difference | Heavy | Light | 9/10 | 8/10 |
| 008 | Visual Detective | Heavy | Light | 8/10 | 9/10 |
| 009 | Image Tarot | Heavy | Light | 8/10 | 10/10 |
| 010 | Color Emotion | Medium | Light | 7/10 | 7/10 |
| 011 | Before/After | Heavy | Medium | 8/10 | 8/10 |
| 012 | Visual Haiku | Heavy | Light | 9/10 | 8/10 |
| 013 | Image Alchemy | Heavy | Light | 10/10 | 10/10 |
| 014 | Lie Detector | Heavy | Medium | 9/10 | 8/10 |
| 015 | Object Archaeology | Heavy | Medium | 8/10 | 8/10 |

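
One simple way to turn the matrix into a ranking is to add the Uniqueness and Fun scores for each idea. The sketch below copies the scores from the table; weighting the two columns equally is an assumption, and ties keep their table order because Python's sort is stable.

```python
# (uniqueness, fun) scores out of 10, copied from the comparison matrix.
scores = {
    "007 Spot the Difference": (9, 8),
    "008 Visual Detective": (8, 9),
    "009 Image Tarot": (8, 10),
    "010 Color Emotion": (7, 7),
    "011 Before/After": (8, 8),
    "012 Visual Haiku": (9, 8),
    "013 Image Alchemy": (10, 10),
    "014 Lie Detector": (9, 8),
    "015 Object Archaeology": (8, 8),
}

# Highest combined score first; equal totals stay in table order.
ranked = sorted(scores.items(), key=lambda item: sum(item[1]), reverse=True)

for name, (uniqueness, fun) in ranked:
    print(f"{uniqueness + fun:>2}  {name}")
```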
---
**My top picks for uniqueness + fun:**

1. **013 Image Alchemy**: Most unique, viral potential
2. **009 Image Tarot**: Fun, shareable, low friction
3. **007 Spot the Difference**: Game + AI demonstration
4. **014 Visual Lie Detector**: Useful, educational

What stands out to you?