Files
hermes-detective/docs/chosen-detective-game.md
shoko ecfd0b1160 feat: Initial commit - Hermes Detective Agency concept
- Hermes Detective Agency: Open-ended mystery investigation game
- Roles: Chief (human), Witness (Kimi), Detective (Hermes)
- 5 difficulty levels, community cases, open-ended solving
- Scoring: Alignment %, Evidence %, Time
- Features: Retry, Journal, Observe mode
- Tech: Kimi Vision + Hermes Agent + Pollinations

Changelog:
- Research phase: Kimi capabilities, Hermes agent, image APIs
- Brainstorming: 14 ideas explored
- Comparison matrix: Detective selected as winner
- Concept finalized with all design decisions
2026-04-20 00:00:30 +00:00

503 lines
13 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Project: Hermes Detective Agency
**Chosen Concept:** 033v2 Detective
**Date:** 2026-04-19
**Status:** Concept Finalized
**Tags:** hermes-agent, kimi-vision, game, multi-agent, open-ended, community
---
## Concept Summary
A mystery investigation game where a human (Chief) directs two AI agents — a **Witness** (powered by Kimi Vision) and a **Detective** (powered by Hermes) — to investigate visual cases.
**Core philosophy:** Open-ended solving. No single truth. Evidence guides, but multiple theories are valid.
---
## Elevator Pitch
> *"You're the Chief. Your Witness sees everything. Your Detective connects the dots. Build YOUR theory. See how it aligns with others."*
---
## The Story
You run a small detective agency. Your two AI assistants have superhuman abilities:
- **Witness** can look at any image and describe it perfectly — every detail, every inconsistency, every hidden clue.
- **Detective** can take those observations and build theories, spot patterns, and identify suspects.
Your job? **Direct the investigation.** Tell them what to look at. Ask the right questions. Build your theory.
**Key difference:** There's no single "right answer." The creator has an intended story, but your theory is valid if evidence supports it.
---
## Game Roles
### Chief (Human)
The player. You run the investigation.
| Action | Effect |
|--------|--------|
| Examine evidence | Witness + Kimi analyze |
| Question suspects | Detective probes, Witness watches |
| Compare items | Kimi highlights differences |
| Build theory | Cite evidence, form conclusion |
| Request truth | See creator's intended story (optional) |
### Witness (Agent A + Kimi)
The eyes. Analyzes visual evidence. Appears based on triggers.
| Input | Output |
|-------|--------|
| Crime scene photo | "I see glass shards, muddy footprints, a broken frame..." |
| Suspect photo | "This person has paint on their sleeve..." |
| Document | Extracts text, notes inconsistencies |
| Item close-up | Identifies details Chief might miss |
**Dynamic Appearance:** In harder cases, Witness doesn't appear until triggered.
### Detective (Agent B)
The brain. Builds theories, responds to questions.
| Input | Output |
|-------|--------|
| Witness observations | "Based on evidence, the thief entered through..." |
| Suspect profiles | "Suspect A has motive: insurance fraud..." |
| Human questions | "Good question, Chief. Let me look into that..." |
| Theory building | Helps Chief cite evidence for their theory |
---
## Difficulty System
### Difficulty Levels
| Difficulty | Description | Evidence | Suspects | Red Herrings | Plot Twist |
|-----------|-------------|----------|----------|---------------|------------|
| **Easy** | Obvious clues, clear path | 4-5 | 2 | ❌ | ❌ |
| **Medium** | Requires comparison | 6-7 | 3 | ❌ | ❌ |
| **Hard** | Red herrings present | 8-9 | 4 | ✅ | ❌ |
| **Hardcore** | Plot twist mid-case | 10-11 | 4 | ✅ | ✅ |
| **Impossible** | All elements, complex | 12+ | 5 | ✅ | ✅ |
### Daily Structure
```
One case per day, everyone gets the same case
Same difficulty for all players
Different case each day
```
### Starter Pack (5 Cases)
| Week | Difficulty | Theme |
|------|------------|-------|
| 1 | Easy | Simple theft |
| 2 | Medium | Missing person |
| 3 | Hard | Corporate fraud |
| 4 | Hardcore | Art heist (plot twist) |
| 5 | Impossible | Multi-layered conspiracy |
**Approach:** Add cases incrementally during development.
---
## Evidence System
### Evidence Types
| Type | What Kimi Sees | Example Clue |
|------|---------------|--------------|
| **Crime scene** | Scene layout, objects, anomalies | "Window was broken from inside" |
| **Surveillance** | People, actions, timestamps | "Person lingered at door for 3 minutes" |
| **Documents** | Text, handwriting, context | "Letter mentions 'meeting at midnight'" |
| **Photos** | People, items, locations | "Suspect's shoes match the footprint" |
| **Maps** | Routes, access points, exits | "Only one entrance visible to street" |
| **Items** | Condition, marks, connections | "Key is copy — grooves don't match original" |
### Evidence Citation
Evidence helps build theory. Not all evidence is required.
```
Chief's Theory: "I think Suspect B did it."
📎 Cited Evidence:
- Evidence #3: Crime scene photo
- Evidence #5: Security footage
- Evidence #8: Witness testimony
→ 3/10 evidence cited (30%)
💬 Detective: "That's a solid theory. The evidence
supports B, but have you considered Evidence #7?"
```
### Hints Embedded in Evidence
Not a separate button. Hints are part of the evidence design.
| Level | Visibility | Example |
|-------|-----------|---------|
| **Too obvious** | Easy to find | "Letter saying 'I did it'" |
| **Barely obvious** | Check certain places | "Muddy shoes near suspect's home" |
| **Not too obvious** | Requires attention | "Timeline inconsistency in letter" |
### Witness Trigger System
In harder cases, Witness appears based on triggers.
```
Trigger Example:
Turn 1: Chief examines crime scene photo
Turn 2: Chief finds a hair sample on the floor
↓ [Trigger activated]
Turn 3: 👁️ Witness appears
↓ "I recognize this hair... it belongs to Suspect B's dog"
Turn 4: Chief examines suspect's home
Turn 5: 👁️ Witness appears again (new trigger)
↓ "I saw Suspect B leaving the gallery at midnight..."
```
**Indicator:** Each piece of evidence has a note indicating if it triggers Witness appearance.
---
## Open-Ended Solving
### Core Philosophy
> **No single truth. Multiple valid theories.**
| Before | After |
|--------|-------|
| One correct answer | Multiple valid theories |
| Wrong accusation = Fail | Theory valid if evidence supports |
| One winner | Everyone discusses |
| Truth ends game | Truth is guidance, not mandate |
### Theory Building
```
👤 Chief builds theory:
"I think Suspect B did it, with help from Suspect A.
B had access (night guard), A had keys (curator).
They split the insurance money."
📎 Chief cites evidence:
- Evidence #3: Crime scene (window not broken)
- Evidence #5: Security footage (B was inside)
- Evidence #7: A has master keys
- Evidence #9: Financial records (recent debt)
💬 Detective responds:
"That's a coherent theory. Your cited evidence
supports collaboration between A and B."
```
### Truth Reveal
**Available anytime. Does NOT end the game.**
| When | Why |
|------|-----|
| After building theory | "Did I get it right?" |
| When stuck | "Give me guidance" |
| Never | "I want to figure it out myself" |
| After solving | "See how close I was" |
```
📜 THE TRUTH (Creator's Intended)
The case was designed as:
"A and B collaborated. A had keys, B had access.
But C was the real mastermind, funding the whole thing."
👤 Your theory:
"Suspect B acted alone."
💬 Comparison:
- Your theory missed the collaboration element
- You correctly identified B as main actor
- Evidence you cited: 80% relevant
- 🎯 65% alignment with intended truth
💬 But: Your theory is still valid based on evidence!
Discussion continues. Truth is guidance, not mandate.
```
---
## Scoring System
### Per Case Statistics
| Metric | Calculation |
|--------|-------------|
| **Time** | Turns × 10 min (simplified) |
| **Evidence** | Evidence cited / Total evidence |
| **Alignment** | How close to creator's intended story |
| **Coherence** | Theory makes sense based on evidence |
### Statistics Display
```
┌─────────────────────────────────────┐
│ 📊 CASE STATISTICS │
├─────────────────────────────────────┤
│ ⏱️ Time: 6 turns × 10 min = 60 min │
│ 📎 Evidence: 7/10 cited (70%) │
│ 🎯 Alignment: 85% with creator │
│ 💬 Theory coherence: Strong │
├─────────────────────────────────────┤
│ ⭐ Rating: Sharp Detective │
└─────────────────────────────────────┘
```
### Rating Tiers
| Alignment | Rating |
|-----------|--------|
| 90-100% | Master Detective |
| 75-89% | Sharp Detective |
| 50-74% | Promising Detective |
| 25-49% | Apprentice |
| 0-24% | Rookie |
---
## Retry & Journal System
### Multiple Attempts
User can solve same case multiple times.
```
Case #47 — The Hartwell Heist
Your Attempts:
├── Attempt #1: 85% alignment, 6 turns 📖
├── Attempt #2: 92% alignment, 4 turns 📖
├── Attempt #3: In progress...
└── Best: 92% alignment
```
### Journal Documentation
Every attempt is documented (solve or not).
```
Attempt #1: April 19, 2026
├── Status: Solved
├── Evidence cited: 7/10
├── Alignment: 85%
├── Theory: "Suspect B acted alone"
└── Notes: "Missed the A-B collaboration"
```
### Privacy Settings
| Setting | Description |
|---------|-------------|
| **Private** | Only you see your attempts |
| **Publish stats** | Everyone sees your stats (default) |
| **Publish journal** | Anyone can read your solve |
---
## Replay (Observe Mode)
Watch how others solved the case.
```
📺 OBSERVE MODE
@alice's Solve of Case #47
Turn 1: Examined crime scene
Turn 2: Found hair sample → Witness appeared
Turn 3: Questioned Suspect B
Turn 4: Examined financial records
Turn 5: Cited evidence, formed theory
Turn 6: Requested truth reveal
⏱️ 6 turns | 🎯 85% alignment | ⭐ Sharp
```
**Only published journals are observable.**
---
## Case Creation System
### Starter Cases
5 cases (one per difficulty) as templates.
**Source:** Real solved cases adapted for the game.
### Community Cases
Anyone can create and share cases.
#### Creation Flow
```
1. Choose reference case (optional)
"Let's base this on the Isabella Stewart Gardner theft"
2. Gather/create evidence
Upload images (crime scene, suspects, documents)
3. Write case brief
├── Title, difficulty
├── Suspect list (names, photos)
├── Evidence set
├── Hidden truth (creator's intended story)
├── Red herrings (optional)
├── Plot twist (optional)
└── Witness triggers (which evidence triggers Witness)
4. Test it
Play through yourself to verify solvability
5. Publish
├── Private link (friends only)
└── Public (case library)
```
### Case Format
```yaml
case:
title: "The Hartwell Heist"
difficulty: medium
difficulty_description: "Requires comparison of evidence"
evidence:
- id: 1
type: photo
image: crime_scene.jpg
description: "Crime scene photograph"
triggers_witness: true
hint_level: not_too_obvious
- id: 2
type: document
image: letter.jpg
description: "Anonymous letter found"
triggers_witness: false
hint_level: barely_obvious
suspects:
- name: "Suspect A"
photo: suspect_a.jpg
description: "Gallery curator"
truth:
summary: "A and B collaborated..."
alignment_criteria:
- "Correctly identified collaboration"
- "Identified A as key holder"
- "Identified B as main actor"
witness_triggers:
- evidence_id: 1
testimony: "I see glass on the floor inside..."
```
### Case Creator Tools
| Tool | Purpose |
|------|---------|
| **Skill** | Hermes skill for case creation guidance |
| **Validator** | Verify case format is correct |
---
## Community Moderation
### Discovery Philosophy
> **Community cases are the jungle. Direct links are the path.**
| Discovery Method | Quality | Effort |
|-----------------|---------|--------|
| Case library (browse) | Mixed (jungle) | Low |
| Direct link from creator | Same quality | Medium |
| Social media / community | Trusted (curated) | High |
### Quality Signals
| Signal | Description |
|--------|-------------|
| **Visits** | How many times case was played |
| **Reviews** | 👍 or 👎 (no text, requires effort to spam) |
```
Case #47B — "The Missing Heirloom"
├── Visits: 234
├── 👍 45 | 👎 3
└── Quality score: High
```
**Note:** Review manipulation is possible but requires effort. Not perfect, but workable.
### Sharing Flow
```
Creator creates case
Tests locally
Publishes to community
Shares link on social media / Discord
Players try directly from creator
```
---
## Summary of Decisions
| Element | Decision |
|---------|----------|
| Difficulty | 5 levels (Easy → Impossible) |
| Daily structure | One case per day, same for all |
| Timer | ❌ No (first phase) |
| Hints | ✅ Embedded in evidence |
| Retry | ✅ Unlimited attempts |
| Journal | ✅ Every attempt documented |
| Observe | ✅ Watch published solves |
| Privacy | Private by default |
| Publish | Stats always, journal optional |
| Scoring | Alignment %, Evidence %, Time |
| Open-ended | ✅ No single truth |
| Truth reveal | Available anytime |
| Case source | Real cases + community |
| Witness | Dynamic (triggers in hard cases) |
| Red herrings | ✅ Hard+ difficulty |
| Plot twist | ✅ Hardcore+ difficulty |
| Community | Visits + reviews (no auth) |
---
## What's Next
Once we finalize the concept:
- Technical architecture
- UI/UX design
- Prompt engineering
- Case creation template
- Prototype development
---
## Related Documents
- `docs/ideas/COMPARISON.md` — Full comparison matrix
- `docs/ideas/008-visual-detective.md` — Initial brainstorm