Skip to content

Agent Army MECE Audit โ€” Quick Reference

Overall Score: 72/100 (Fair โ€” actionable improvements exist)
Date: 2026-05-22
Agents: See AGENT_COUNTS.md โ€” organized across 11 categories
Routing ambiguity: 50% (target: <5%)


๐Ÿ”ด Critical Overlaps (Fix Immediately)

Agents Issue Impact Fix Effort
devops-engineer + deployment-engineer Both own CI/CD; no boundary rule High โ€” users don't know which to pick 2 hrs
debugger + error-detective Identical scope (root cause diagnosis) Medium โ€” duplicate capability Merge or split by local vs. distributed
ml-engineer + machine-learning-engineer RESOLVED โ€” merged into machine-learning-engineer; ml-engineer removed โ€” Done
devops-engineer + deployment-engineer RESOLVED โ€” boundary rule added (devops = build/operate CI/CD + infra; deployment = release/rollout strategy) โ€” Done
react-specialist + frontend-developer Both own React; no rule for greenfield vs. optimization High โ€” 15% of React tasks ambiguous Add rule: "frontend-developer=greenfield, react-specialist=optimization"
backend-developer + node-specialist + fastapi-developer Architecture vs. language vs. framework confusion High โ€” many server-side tasks ambiguous Add rule: "backend-developer=cross-language architecture, specialists=language/framework idioms"

โš ๏ธ Medium Overlaps (Add Decision Rules)

Can be resolved with explicit boundary rules (no merging needed):

  1. api-designer vs backend-developer โ€” "designer owns specs; backend owns API implementation"
  2. data-engineer vs dlt-engineer โ€” "engineer=generic tool-agnostic; dlt-engineer=dlt-specific optimization"
  3. documentation-engineer vs technical-writer โ€” "engineer designs systems; writer creates content"
  4. performance-engineer vs layer-specialists โ€” "performance-engineer diagnoses bottleneck; specialist fixes in their layer"
  5. security-auditor vs penetration-tester โ€” "auditor=comprehensive assessment; tester=exploitation + validation"
  6. platform-engineer vs kubernetes-specialist โ€” "platform=IDP end-to-end; k8s-specialist=k8s ops"
  7. legacy-modernizer vs refactoring-specialist โ€” "modernizer=strategy + sequencing; specialist=tactical code cleanup"
  8. Plus 6 more in full scorecard

๐Ÿ“Š Category Scores

Category Score Status Key Issue
11 ยท Enterprise Architecture 91/100 โœ… Exemplary None โ€” TOGAF-aligned structure is MECE-perfect
07 ยท Specialized Domains 85/100 โœ… Good None โ€” scoped by domain, clear
10 ยท Research & Analysis 88/100 โœ… Good Minor: add pipeline clarification
09 ยท Meta & Orchestration 87/100 โœ… Good Minor: orchestration vs. coordination boundary
08 ยท Business & Product 82/100 โœ… Good Minor: add rules for PM vs BA, PM vs Scrum
03 ยท Infrastructure 81/100 โš ๏ธ Fair ๐Ÿ”ด CRITICAL: devops/deployment overlap
04 ยท Quality & Security 79/100 โš ๏ธ Fair ๐Ÿ”ด debugger/error-detective merger
02 ยท Language Specialists 78/100 โš ๏ธ Fair Consider consolidating 4 JS agents; 5 PowerShell OK
06 ยท Developer Experience 76/100 โš ๏ธ Fair Minor: doc engineer vs. writer
05 ยท Data & AI 74/100 โš ๏ธ Fair ๐Ÿ”ด ml-engineer/machine-learning-engineer merger
01 ยท Core Development 65/100 ๐Ÿ”ด Fair ๐Ÿ”ด frontend-developer scope creep; fullstack redundancy

๐ŸŽฏ What Blocks Semantic Distinctiveness

Problem 1: Missing Decision Rules

Most overlaps exist because descriptions lack explicit boundary conditions.

Example (current):

react-specialist: "optimize existing React applications"
frontend-developer: "build complete frontend applications across React, Vue, Angular"
โ†’ Unclear: can frontend-developer optimize React? Does react-specialist build new React apps?

Fixed:

react-specialist: "optimize existing React codebases for performance, state management, hooks"
  - Use when: you have working React code that needs performance/architecture improvements

frontend-developer: "build new full-stack frontends across React/Vue/Angular; multi-framework architecture"
  - Use when: greenfield frontend work or selecting framework strategy

Rule: frontend-developer chooses framework; react-specialist optimizes React only

Problem 2: Granularity Mismatch (Diagonal Overlap)

One agent scoped to language level; another to framework level; they converge.

Example: - backend-developer (language-agnostic, architecture-focused) - node-specialist (language-specific) - fastapi-developer (framework-specific)

All three could own "build a Node.js FastAPI equivalent" task.

Fix: Add hierarchy rule:

backend-developer: Cross-language architecture (microservices, API design, scalability)
node-specialist: Node.js idioms, async patterns, npm ecosystem
fastapi-developer: FastAPI-specific async patterns, Pydantic validation

Route "build Node.js API" to: backend-developer (design) โ†’ node-specialist (implement)

Problem 3: Same Artifact, Different Focus

Both agents produce the same deliverable but from different angles.

Examples: - debugger and error-detective both diagnose root causes - data-analyst and data-scientist both analyze data - ml-engineer and machine-learning-engineer both deploy ML systems

Fix: Either merge (if truly identical) or split explicitly by scope.


๐Ÿ“‹ Routing Test Results: 20 Real Tasks

Ambiguity rate: 50% (10/20 tasks)

Tasks with clear single agent (40%): - Build a GraphQL API - Set up Kubernetes - Audit compliance - Develop smart contracts - Build a game - Implement payment processing - etc.

Tasks with 2+ ambiguous agents (60%): - "Optimize React app" โ†’ react-specialist OR performance-engineer - "Build Node.js API" โ†’ node-specialist OR backend-developer - "Set up CI/CD" โ†’ devops-engineer OR deployment-engineer - "Debug a bug" โ†’ debugger OR error-detective - "Build ELT pipeline" โ†’ data-engineer OR dlt-engineer - etc.


โœ… How to Fix (Phase 1: 1 Week, 5โ€“10 Hours)

Step 1: Merge or Deprecate (2 hrs)

  • Merge ml-engineer + machine-learning-engineer โ†’ keep one name
  • Merge debugger + error-detective โ†’ split by distributed (error-detective) vs. local (debugger), OR merge into debugger with sub-focus areas
  • Merge frontend-developer + fullstack-developer โ†’ if fullstack is rare, deprecate it

Step 2: Add Boundary Rules to AGENTS.md (4 hrs)

Create a "Routing Rules" section for each overlapping pair.

Template:

### react-specialist vs frontend-developer
- **react-specialist**: optimize existing React codebases; advanced React 18+ patterns; hooks/context/state mgmt
- **frontend-developer**: greenfield multi-framework work; framework selection; full-stack integration
- **When unsure**: frontend-developer if choosing tech, react-specialist if refining tech

Step 3: Test Routing on 5 Real Tasks (1 hr)

Pick 5 ambiguous tasks from your backlog; confirm each routes unambiguously with new rules.

Step 4: Publish + Communicate (1 hr)

Update AGENTS.md with routing rules; notify team of merged agents (deprecation notices).


๐Ÿš€ Before Adding New Agents to Your Backlog

Use the Agent MECE Checklist (from AGENT_MECE_AUDIT_RUBRIC.md, Part F):

---
name: [agent-name]
description: "..."
tools: [...]
model: [...]
category: [...]
---

## MECE Self-Check
โœ“ Primary deliverable: [artifact โ€” distinct from 5+ existing agents?]
โœ“ Overlaps with: [list any agents with same/adjacent deliverable]
โœ“ Boundary rule: [explicit condition separating from overlapping agents]
โœ“ Routing test: [5 tasks โ€” does each route unambiguously?]

Red flags for backlog candidates: - โŒ "Like X, but for Y" (e.g., "like data-engineer, but for Apache Spark") โ€” probably a sub-specialization, not a new agent - โŒ No clear boundary rule vs. existing agents - โŒ Same artifact as 2+ agents (unless you're splitting an oversized role) - โŒ Sits in a gap but collides with 3+ existing agents (fix overlaps first)

Green flags: - โœ… Solves a clearly unmet problem domain (e.g., edge computing, visual design systems) - โœ… Scopes cleanly (no diagonal overlap) - โœ… Boundary rules vs. adjacent agents are explicit - โœ… Routing test passes (new agent always gets picked unambiguously)


๐Ÿ“– Full Documentation

  • AGENT_MECE_AUDIT_RUBRIC.md: Complete framework (7 parts, includes templates)
  • AGENT_MECE_AUDIT_SCORECARD.md: Full audit with findings per category + improvement roadmap
  • MECE_AUDIT_SUMMARY.md โ† you are here

  1. Review critical overlaps above; draft mergers/deprecations
  2. Add 5 boundary rules to AGENTS.md (highest-impact overlaps first)
  3. Run 10-task routing test to validate fixes
  4. Apply checklist to each agent in your backlog before adding
  5. Re-audit semi-annually (or every 5 new agents)

Questions? The rubric is designed so non-agents can evaluate distinctiveness. Use it to onboard others.