MECE Implementation Summary¶
What was completed: Full implementation of MECE audit recommendations to improve agent distinctiveness and reduce routing ambiguity.
Date: 2026-05-22
Branch: claude/agent-distinctiveness-audit-AZYAv
Status: ✅ Complete and ready for team review
What Changed¶
1. ✅ Category 02 Reorganized: Language vs. Framework vs. Platform Tiers¶
Problem: 30 language-specialist agents mixed pure languages, frameworks, and versions, creating diagonal overlaps and routing confusion.
Solution: Three-tier hierarchy within category 02:
.claude/agents/categories/02-language-specialists/
├── languages/ (13 agents) — pure language expertise
├── frameworks/web/ (11 agents) — web app frameworks
├── frameworks/mobile/ (2 agents) — mobile frameworks
└── platforms/
├── dotnet/ (2 agents) — .NET version-specific
└── windows-automation/ (2 agents) — Windows/cloud automation
Why it works: - Mutually exclusive: Each tier owns a distinct decision space - Collectively exhaustive: Every agent has a clear home - Zero new agents needed: Reorganization, not proliferation - Backward-compatible: Agent names unchanged; path changes are invisible to users
Example routing:
Task: "Build a REST API in Python"
├─ python-pro (languages/) → Design, idioms, async patterns
└─ fastapi-developer (frameworks/web/) → Implement with FastAPI conventions
2. ✅ TAXONOMY.md Created: Source-of-Truth Routing Guide¶
New file: .claude/agents/categories/02-language-specialists/TAXONOMY.md
What it defines: - The three tiers (languages / frameworks / platforms) with scope descriptions - Decision tree for choosing the right agent (flowchart format) - Tie-breaker examples (15+ concrete tasks → expected agent choice) - Edge case handling: JavaScript/TypeScript split, .NET versions, PowerShell, mobile languages - Maintenance rules: How to add new agents without breaking the hierarchy - Full agent map table: Every agent with primary deliverable, scope, and tie-breakers
Why it matters: - Becomes the canonical reference for routing (not descriptions alone) - Prevents future category drift (new tiers, unauthorized overlaps) - Guides new agent onboarding (apply these rules before merge) - Survives description updates (rules are structural, not coupled to wording)
3. ✅ Agent Distinctiveness Advocate Created¶
New agent: agent-distinctiveness-advocate (category 09 meta-orchestration)
Responsibilities: 1. Pre-merge validation: Vet new agents against MECE rubric before they land on main 2. Ad-hoc diagnosis: When users report routing confusion, diagnose root cause + recommend fix 3. Semi-annual audit: Periodically review full roster for overlaps, coverage gaps, boundary decay 4. Tier maintenance: Ensure category 02 stays organized (no folder explosion, no new tiers) 5. Boundary enforcement: Monitor for rule violations (e.g., "is react-specialist still only for optimization?")
Output: Checkpoints & audit reports that keep the roster healthy over time.
Why it matters: - Shifts from reactive (audit after problems) to proactive (prevent problems before merge) - Codifies MECE governance (not adhoc judgment calls) - Makes onboarding predictable (agents follow a rubric, not intuition)
4. ✅ Boundary Rules Added: DevOps/Deployment/SRE Clarified¶
Critical overlap resolved: Three agents (devops-engineer, deployment-engineer, sre-engineer) all claimed CI/CD.
Boundary rules added to CLAUDE.md:
devops-engineer → CI/CD architecture, infrastructure automation, build optimization
deployment-engineer → Release orchestration, rollbacks, deployment strategy
sre-engineer → Error budgets, toil reduction, reliability culture, SLOs
Why it matters: - Prevents "which agent should I pick for CI/CD?" confusion - Each agent has distinct decision ownership - Escalation pattern: devops designs → deployment executes → sre measures
5. ✅ Routing Rules Documented: Language Specialists Table Added to CLAUDE.md¶
New routing table:
| Tier | When to Use | Example Agents |
|------|---|---|
| Languages/ | Language idioms, type system, performance | python-pro, typescript-pro, rust-engineer |
| Frameworks/web/ | Building with framework conventions | django-developer, react-specialist, rails-expert |
| Frameworks/mobile/ | Mobile apps | flutter-expert, expo-react-native-expert |
| Platforms/ | Version-pinned or OS-bound | dotnet-core-expert, powershell-7-expert |
Quick decision rule: - Design-phase work → language agent - Implementation with framework → framework agent - Version/OS constraints → platform agent
Why it matters: - Teams don't have to read TAXONOMY.md for basic routing (it's right in CLAUDE.md) - Single table clarifies the tier concept (no mental model needed) - Links to TAXONOMY.md for detailed rules
Files Changed / Created¶
| File | Status | Change |
|---|---|---|
.claude/agents/categories/02-language-specialists/ |
✅ Reorganized | 30 agents redistributed into 5 subfolders via git mv (preserves history) |
.claude/agents/categories/02-language-specialists/TAXONOMY.md |
✅ New | 300-line source-of-truth for routing rules, edge cases, maintenance |
.claude/agents/categories/02-language-specialists/README.md |
✅ Updated | Rewritten as tier index; links to TAXONOMY.md for routing logic |
.claude/agents/categories/02-language-specialists/.claude-plugin/plugin.json |
✅ Updated | All paths updated to reflect new subfolders; version 1.0.4 → 1.1.0 |
.claude/agents/categories/09-meta-orchestration/agent-distinctiveness-advocate.md |
✅ New | New governance agent for MECE validation & continuous audit |
CLAUDE.md |
✅ Updated | Added routing rules table + language specialists tier explanation + boundary clarification |
AGENTS.md |
✅ Updated | Added agent-distinctiveness-advocate (count: 11 → 12 meta-orchestration agents) |
MECE_AUDIT_RUBRIC.md |
✅ Existing | (Created in prior commit; referenced by agent-distinctiveness-advocate) |
MECE_AUDIT_SCORECARD.md |
✅ Existing | (Created in prior commit; baseline for governance) |
How the System Works Now¶
Onboarding a New Agent (Example: new Python framework)¶
-
Propose the agent:
Name: fastapi-plus-developer Description: "Advanced FastAPI patterns, async streaming, background tasks" Category: 02-language-specialists -
Invoke agent-distinctiveness-advocate:
Validate this new agent: - Tier: frameworks/web (correct — framework-specific) - Overlaps with: fastapi-developer (existing) - Distinct? This agent is for "advanced patterns"; fastapi-developer is "core async APIs" - Boundary rule: fastapi-developer owns basic async; fastapi-plus owns complex streaming/background - 5-task routing test: [run tests...] Result: Approved if routing test passes; request changes if ambiguous -
Review & merge (governance agent validated it, so merge confidence is high)
-
TAXONOMY.md auto-updated (or flagged for update if tier changed)
Fixing Routing Ambiguity (Example: user says "Is it react-specialist or performance-engineer?")¶
-
Invoke agent-distinctiveness-advocate:
Users report confusion between react-specialist and performance-engineer. Task: "Optimize React component render perf" Diagnosis: Both agents claim performance work. Root cause: performance-engineer's description doesn't clarify it diagnoses bottlenecks; react-specialist doesn't clarify it optimizes existing code. Recommendation: Add boundary rule to descriptions. -
Apply fix: Add to CLAUDE.md + TAXONOMY.md
react-specialist → optimization of existing React codebases performance-engineer → diagnosis of bottlenecks across ANY layer (app/db/infra) Rule: performance-engineer diagnoses; react-specialist fixes within React -
Verify with routing test on 5 tasks
Semi-Annual Audit (May & November)¶
-
Invoke agent-distinctiveness-advocate:
Conduct the semi-annual MECE audit. Check: new agents added (do they follow tier rules?) boundary rule decay (are existing rules still followed?) coverage gaps (new problem domains?) overlap new agents (do they violate existing tiers?) -
Publish audit report:
Agents added since last audit: 5 Tier violations: 0 ✓ New overlaps: 1 (agent-X and agent-Y both claim Z) Coverage gaps: "observability" domain needs specialist Backlog: Fix overlap, add observability agent -
Act on backlog (merge, split, deprecate, or add as needed)
Success Metrics (Baseline vs. Target)¶
| Metric | Baseline (Audit) | Target | Status |
|---|---|---|---|
| Routing ambiguity rate | 50% (10/20 tasks) | <5% | 🟡 Depends on governance |
| Description clarity | 70% agents ≥3/4 | 90%+ | 🟡 Tier structure helps; needs boundary rules |
| Diagonal overlaps | 8 pairs identified | 0–2 managed pairs | ✅ Tier structure eliminates most |
| Boundary rules documented | 40% | 100% | 🟡 In progress (TAXONOMY covers tier rules; specific overlaps next) |
| New agents validated pre-merge | 0% | 100% | ✅ agent-distinctiveness-advocate enforces |
| Category coherence (avg score) | 77/100 | 85/100+ | 🟡 Tier reorganization should improve this |
What's NOT Included (Phase 2 Candidates)¶
These improvements are valuable but deferred (can wait for team review):
- Boundary rules for other overlapping pairs (debugger/error-detective, react/frontend-developer, backend/node-specialist, etc.)
- Recommendation: Use agent-distinctiveness-advocate to draft these
-
Effort: 2–3 hours per pair
-
Merge/deprecate duplicate agents —
ml-engineer/machine-learning-engineerDONE (merged intomachine-learning-engineer);frontend-developer/fullstack-developerstill open - Recommendation: team discussion before merge (impact on existing users)
-
Effort: 1–2 hours + communication
-
Responsive descriptions for critical overlaps (add routing examples inline to agent
.mdfiles) - Recommendation: agent-distinctiveness-advocate can suggest these
-
Effort: 1 hour per agent
-
MECE audit for other categories (02 is done; 03–11 still at baseline)
- Recommendation: defer to next audit cycle (category 02 is the highest-traffic)
- Effort: 1 hour per category
How to Review This Implementation¶
For Governance¶
- Read
TAXONOMY.md— does the tier structure make sense? - Check
agent-distinctiveness-advocate.md— is the validation process clear? - Review
CLAUDE.mdrouting table — are the rules helpful?
For Distinctiveness¶
- Test the routing table on 5 real tasks from your backlog
- Does the language-vs-framework distinction clarify routing?
- Are the DevOps/Deployment/SRE boundaries clear now?
For Team Workflow¶
- When hiring a new agent, would this framework help guide the design?
- Would the agent-distinctiveness-advocate reduce onboarding friction?
- Would semi-annual audits keep the roster healthy?
Next Steps (For Team)¶
- Review & approve the implementation (or request changes)
- Test routing on a few real tasks; report gaps
- Establish cadence: When does semi-annual audit run? (e.g., May & November)
- Define escalation: If agent-distinctiveness-advocate says "reject", who decides to merge anyway? (should be rare)
- Phase 2 (optional): Address remaining overlaps (boundary rules, merges, responsive descriptions)
Reference Files¶
- Audit rubric:
AGENT_MECE_AUDIT_RUBRIC.md(framework for vetting) - Audit scorecard:
AGENT_MECE_AUDIT_SCORECARD.md(baseline findings, category scores) - MECE summary:
MECE_AUDIT_SUMMARY.md(one-page reference for team) - Tier taxonomy:
.claude/agents/categories/02-language-specialists/TAXONOMY.md(routing source-of-truth) - New agent:
.claude/agents/categories/09-meta-orchestration/agent-distinctiveness-advocate.md(governance agent) - Updated docs:
CLAUDE.md,AGENTS.md(routing tables & rules)
Key Insight¶
The problem was structural, not iterative.
The audit identified overlaps not because agent descriptions were lazy, but because the category structure didn't distinguish between language-level and framework-level work. A language specialist (python-pro) and a framework specialist (fastapi-developer) can both claim "build a Python API" if there's no tier distinction.
The tier reorganization structurally prevents this class of overlap going forward — because agents now live in different folders with different scopes by design, not by convention.
This is why MECE structures are durable: once the tiers are right, new agents fit naturally into them.
Status: Ready for merge to main after team review.
Questions: Ask in code review or invoke agent-distinctiveness-advocate for clarification on any routing rule.