Files
hermes-detective/docs/ideas/COMPARISON.md
shoko ecfd0b1160 feat: Initial commit - Hermes Detective Agency concept
- Hermes Detective Agency: Open-ended mystery investigation game
- Roles: Chief (human), Witness (Kimi), Detective (Hermes)
- 5 difficulty levels, community cases, open-ended solving
- Scoring: Alignment %, Evidence %, Time
- Features: Retry, Journal, Observe mode
- Tech: Kimi Vision + Hermes Agent + Pollinations

Changelog:
- Research phase: Kimi capabilities, Hermes agent, image APIs
- Brainstorming: 14 ideas explored
- Comparison matrix: Detective selected as winner
- Concept finalized with all design decisions
2026-04-20 00:00:30 +00:00

4.4 KiB

Ideas Comparison Matrix

Date: 2026-04-19
Purpose: Compare all ideas to select final concept


Scoring Criteria

Criteria Weight Description
Visual Analysis 30% Heavy Kimi use (aligned with Kimi's strength)
Multi-Turn 20% Not single-turn, builds over time
Human-AI Interaction 20% Human participates, not passive
Cost Efficiency 15% Low API costs (image gen vs analysis)
Uniqueness 10% Stand out from competitors
Fun/Engagement 5% Enjoyable to play/watch

Scoring: 1-5 (5 = best)


Full Comparison Matrix

# Idea Visual Multi-Turn Human-AI Cost Unique Fun Total
001 Visual Narrative Agent 4 4 3 2 3 4 3.5
002 Visual Memory Journal 3 3 2 3 4 3 3.0
003 Design Critic 3 2 2 3 2 3 2.6
004 Visual Poem 4 2 2 3 4 4 3.2
005 Scene Journey 4 3 2 2 3 4 3.2
007 Spot the Difference 4 2 3 2 4 5 3.4
008 Visual Detective 4 3 2 3 4 4 3.5
009 Image Tarot 4 2 3 3 4 5 3.5
013 Image Alchemy 4 2 3 2 5 5 3.6
014 Lie Detector 4 2 3 3 4 4 3.4
032v2 Art Critic 5 3 3 3 3 4 3.7
033v2 Detective 5 5 5 4 4 5 4.7
035 Guess Artist 5 2 3 3 3 4 3.5
Auction Auction 3 4 5 4 4 4 3.9

Top Contenders

Rank Idea Score Key Strengths
🥇 033v2 Detective 4.7 Best multi-turn, human directs, Kimi does real work
🥈 Auction 3.9 Human describes, human engages, cheap
🥉 032v2 Art Critic 3.7 Kimi visual analysis, multi-turn
4 013 Image Alchemy 3.6 Most unique, viral potential
5 009 Image Tarot 3.5 Fun, shareable

033v2 Detective — Why It Wins

Alignment with User Goals

User Goal How Detective Meets It
Heavy visual analysis Kimi analyzes each piece of evidence
Low reasoning Pattern matching, not complex logic
Multi-turn 5-7 rounds per case
Human-AI collaboration Human (Chief) directs the investigation
Cost efficient Mostly text between Kimi calls
Fun/engagement Mystery + competition

What Makes It Special

  1. Natural two-agent roles: Witness (sees) + Detective (thinks)
  2. Human as boss: Chief directs investigation, not passive observer
  3. Multi-turn structure: Each round builds the case
  4. Kimi's strength shines: Visual evidence analysis is the core mechanic
  5. Scoring system: Track cases solved, rounds taken, accuracy

Comparison to Other Games

Aspect Spot the Difference Tarot Alchemy Detective
Visual Analysis 4 4 4 5
Multi-Turn 2 2 2 5
Human Role Judge Receive Submit Direct
Narrative None Story Surprise Full Mystery
Replayability Medium Low Medium High

Recommendation

Go with 033v2 Detective.

Why Not Others

Idea Why Not
001 Visual Narrative Too similar to others, high cost
007 Spot Difference Fun but shallow (1-turn)
009 Image Tarot Not really interactive
013 Image Alchemy Unique but single interaction
Auction Good but less "AI demonstration"

Detective's Edge

  • Multi-turn = not just a quick demo
  • Human directs = active participation
  • Kimi sees evidence = clear AI capability showcase
  • Cost efficient = mostly text
  • Daily cases = reason to return

Next Steps for 033v2 Detective

  • Define case structure (5-7 evidence images)
  • Design Chief interface (what buttons/actions)
  • Plan Witness + Detective prompts
  • Mock up UI
  • Prototype with one case

Appendix: Ideas That Could Combine with Detective

Detective + Art Critic

Two types of daily content: Mystery case OR Art analysis

Detective + Auction

Hybrid mode: Evidence auction where Chief describes to Detective

Detective + Spot Difference

Mini-game within case: "Find the clue hidden in this photo"