- Hermes Detective Agency: Open-ended mystery investigation game - Roles: Chief (human), Witness (Kimi), Detective (Hermes) - 5 difficulty levels, community cases, open-ended solving - Scoring: Alignment %, Evidence %, Time - Features: Retry, Journal, Observe mode - Tech: Kimi Vision + Hermes Agent + Pollinations Changelog: - Research phase: Kimi capabilities, Hermes agent, image APIs - Brainstorming: 14 ideas explored - Comparison matrix: Detective selected as winner - Concept finalized with all design decisions
133 lines
4.4 KiB
Markdown
133 lines
4.4 KiB
Markdown
# Ideas Comparison Matrix
|
|
|
|
**Date:** 2026-04-19
|
|
**Purpose:** Compare all ideas to select final concept
|
|
|
|
---
|
|
|
|
## Scoring Criteria
|
|
|
|
| Criteria | Weight | Description |
|
|
|----------|--------|-------------|
|
|
| **Visual Analysis** | 30% | Heavy Kimi use (aligned with Kimi's strength) |
|
|
| **Multi-Turn** | 20% | Not single-turn, builds over time |
|
|
| **Human-AI Interaction** | 20% | Human participates, not passive |
|
|
| **Cost Efficiency** | 15% | Low API costs (image gen vs analysis) |
|
|
| **Uniqueness** | 10% | Stand out from competitors |
|
|
| **Fun/Engagement** | 5% | Enjoyable to play/watch |
|
|
|
|
**Scoring:** 1-5 (5 = best)
|
|
|
|
---
|
|
|
|
## Full Comparison Matrix
|
|
|
|
| # | Idea | Visual | Multi-Turn | Human-AI | Cost | Unique | Fun | **Total** |
|
|
|---|------|--------|------------|----------|------|--------|-----|-----------|
|
|
| 001 | Visual Narrative Agent | 4 | 4 | 3 | 2 | 3 | 4 | **3.5** |
|
|
| 002 | Visual Memory Journal | 3 | 3 | 2 | 3 | 4 | 3 | **3.0** |
|
|
| 003 | Design Critic | 3 | 2 | 2 | 3 | 2 | 3 | **2.6** |
|
|
| 004 | Visual Poem | 4 | 2 | 2 | 3 | 4 | 4 | **3.2** |
|
|
| 005 | Scene Journey | 4 | 3 | 2 | 2 | 3 | 4 | **3.2** |
|
|
| 007 | Spot the Difference | 4 | 2 | 3 | 2 | 4 | 5 | **3.4** |
|
|
| 008 | Visual Detective | 4 | 3 | 2 | 3 | 4 | 4 | **3.5** |
|
|
| 009 | Image Tarot | 4 | 2 | 3 | 3 | 4 | 5 | **3.5** |
|
|
| 013 | Image Alchemy | 4 | 2 | 3 | 2 | 5 | 5 | **3.6** |
|
|
| 014 | Lie Detector | 4 | 2 | 3 | 3 | 4 | 4 | **3.4** |
|
|
| 032v2 | Art Critic | 5 | 3 | 3 | 3 | 3 | 4 | **3.7** |
|
|
| **033v2** | **Detective** | **5** | **5** | **5** | **4** | **4** | **5** | **4.7** |
|
|
| 035 | Guess Artist | 5 | 2 | 3 | 3 | 3 | 4 | **3.5** |
|
|
| Auction | Auction | 3 | 4 | 5 | 4 | 4 | 4 | **3.9** |
|
|
|
|
---
|
|
|
|
## Top Contenders
|
|
|
|
| Rank | Idea | Score | Key Strengths |
|
|
|------|------|-------|---------------|
|
|
| 🥇 | **033v2 Detective** | **4.7** | Best multi-turn, human directs, Kimi does real work |
|
|
| 🥈 | Auction | 3.9 | Human describes, human engages, cheap |
|
|
| 🥉 | 032v2 Art Critic | 3.7 | Kimi visual analysis, multi-turn |
|
|
| 4 | 013 Image Alchemy | 3.6 | Most unique, viral potential |
|
|
| 5 | 009 Image Tarot | 3.5 | Fun, shareable |
|
|
|
|
---
|
|
|
|
## 033v2 Detective — Why It Wins
|
|
|
|
### Alignment with User Goals
|
|
|
|
| User Goal | How Detective Meets It |
|
|
|-----------|----------------------|
|
|
| Heavy visual analysis | Kimi analyzes each piece of evidence |
|
|
| Low reasoning | Pattern matching, not complex logic |
|
|
| Multi-turn | 5-7 rounds per case |
|
|
| Human-AI collaboration | Human (Chief) directs the investigation |
|
|
| Cost efficient | Mostly text between Kimi calls |
|
|
| Fun/engagement | Mystery + competition |
|
|
|
|
### What Makes It Special
|
|
|
|
1. **Natural two-agent roles:** Witness (sees) + Detective (thinks)
|
|
2. **Human as boss:** Chief directs investigation, not passive observer
|
|
3. **Multi-turn structure:** Each round builds the case
|
|
4. **Kimi's strength shines:** Visual evidence analysis is the core mechanic
|
|
5. **Scoring system:** Track cases solved, rounds taken, accuracy
|
|
|
|
### Comparison to Other Games
|
|
|
|
| Aspect | Spot the Difference | Tarot | Alchemy | **Detective** |
|
|
|--------|-------------------|-------|---------|---------------|
|
|
| Visual Analysis | 4 | 4 | 4 | **5** |
|
|
| Multi-Turn | 2 | 2 | 2 | **5** |
|
|
| Human Role | Judge | Receive | Submit | **Direct** |
|
|
| Narrative | None | Story | Surprise | **Full Mystery** |
|
|
| Replayability | Medium | Low | Medium | **High** |
|
|
|
|
---
|
|
|
|
## Recommendation
|
|
|
|
**Go with 033v2 Detective.**
|
|
|
|
### Why Not Others
|
|
|
|
| Idea | Why Not |
|
|
|------|---------|
|
|
| 001 Visual Narrative | Too similar to others, high cost |
|
|
| 007 Spot Difference | Fun but shallow (1-turn) |
|
|
| 009 Image Tarot | Not really interactive |
|
|
| 013 Image Alchemy | Unique but single interaction |
|
|
| Auction | Good but less "AI demonstration" |
|
|
|
|
### Detective's Edge
|
|
|
|
- **Multi-turn** = not just a quick demo
|
|
- **Human directs** = active participation
|
|
- **Kimi sees evidence** = clear AI capability showcase
|
|
- **Cost efficient** = mostly text
|
|
- **Daily cases** = reason to return
|
|
|
|
---
|
|
|
|
## Next Steps for 033v2 Detective
|
|
|
|
- [ ] Define case structure (5-7 evidence images)
|
|
- [ ] Design Chief interface (what buttons/actions)
|
|
- [ ] Plan Witness + Detective prompts
|
|
- [ ] Mock up UI
|
|
- [ ] Prototype with one case
|
|
|
|
---
|
|
|
|
## Appendix: Ideas That Could Combine with Detective
|
|
|
|
### Detective + Art Critic
|
|
Two types of daily content: Mystery case OR Art analysis
|
|
|
|
### Detective + Auction
|
|
Hybrid mode: Evidence auction where Chief describes to Detective
|
|
|
|
### Detective + Spot Difference
|
|
Mini-game within case: "Find the clue hidden in this photo"
|