Table of Contents >> Show >> Hide
- What “Everything’s In Play” Actually Means (Beyond the Buzzword Confetti)
- Our 16-Agent Roster: A Practical View of the “Core”
- 1) The Front Door Agent (Triage & Routing)
- 2) The Policy & Permissions Agent (The Adult in the Room)
- 3) The Knowledge Retrieval Agent (Grounding & RAG)
- 4) The Data Analyst Agent (SQL, Metrics, Dashboards)
- 5) The Workflow Orchestrator Agent (Long-Running Tasks)
- 6) The Tools & Integrations Agent (APIs, SaaS Connectors, “Computer Use”)
- 7) The Customer Support Agent
- 8) The Sales Ops Agent
- 9) The Finance Ops Agent
- 10) The HR & People Agent
- 11) The Engineering Productivity Agent
- 12) The Security Triage Agent
- 13) The QA & Evaluation Agent (Truth Serum)
- 14) The Observability Agent (Tracing, Cost, Latency)
- 15) The Governance Agent (Risk, Compliance, Controls)
- 16) The Legacy Vendor Agent (Yes, We Kept One)
- Why 15 of 16 Came From Modern Vendors: The Real Reasons (No “Digital Transformation” Posters Required)
- What Legacy Vendors Still Do Better (And Why They’re Not Going Away)
- A Buyer’s Scorecard: How to Choose Your Agent Stack Without Regrets
- Reality Check: Why Many Agent Projects Fail (And How to Not Be a Statistic)
- So Why Only 1 Legacy Vendor Agent? The Strategic Answer
- Field Notes: of Real Experience Building a 16-Agent Core
- SEO Tags
Not long ago, buying enterprise software felt like adopting a very expensive whale. You didn’t “choose tools” so much as “pick a vendor,” sign a contract the size of a novella, and spend the next three quarters learning what the vendor meant by the word implementation.
Then AI showed upspecifically, agentic AIand the whale got replaced by a school of very fast fish. Suddenly the question wasn’t “Which suite do we marry?” but “Which agents do we trust with the keys, the data, and the power to click buttons in production without turning our finance team into an improv troupe?”
That’s how we ended up with a surprising scoreboard: 16 core AI agents in production, and only 1 comes from a legacy vendor. The rest? A modern mix of cloud platforms, API-first builders, and best-of-breed components that ship weekly, integrate cleanly, and treat “observability” as a first-class citizen instead of a footnote.
This isn’t a victory lap against incumbents. It’s a map of a market where everything’s in playand where the “default vendor” is no longer default just because they’ve been cashing your maintenance check since flip phones roamed the earth.
What “Everything’s In Play” Actually Means (Beyond the Buzzword Confetti)
In the age of AI agents, software isn’t just a UI with workflows. It’s a system that reasons, retrieves, plans, calls tools, andif you’re braveexecutes multi-step actions. That changes procurement math in three ways:
- Value moves up the stack. The differentiator is less “feature list” and more “How reliably can this agent do the job?”
- Interoperability becomes oxygen. Multi-agent systems have to hand off tasks, share context, and respect permissions.
- Trust becomes measurable. If you can’t trace decisions, evaluate outputs, and enforce guardrails, you don’t have an agentyou have a vibe.
Modern platforms are designed around these realities. Many legacy suites are still retrofitting them, which is a bit like bolting a jet engine to a shopping cart: technically exciting, operationally terrifying.
Our 16-Agent Roster: A Practical View of the “Core”
When we say “core AI agents,” we don’t mean every experimental bot someone made after two cold brews and a conference keynote. We mean the agents that show up to work every day and touch real workflows.
1) The Front Door Agent (Triage & Routing)
This agent decides what the user actually wants, what data is needed, and which specialist agent should handle the request. Think of it as the world’s fastest dispatcherminus the clipboard, plus a healthy fear of ambiguity.
2) The Policy & Permissions Agent (The Adult in the Room)
Agents are only useful if they can access tools and data. They’re only safe if access is scoped correctly. This agent enforces “who can see what,” redacts sensitive fields, and blocks actions that violate policy.
3) The Knowledge Retrieval Agent (Grounding & RAG)
It finds the right internal documents, policies, tickets, and historical decisionsthen feeds the minimum necessary context into the reasoning loop. Great retrieval turns “LLM guesswork” into “enterprise-grade answers.”
4) The Data Analyst Agent (SQL, Metrics, Dashboards)
This agent translates business questions into queries, runs analysis, and returns explanations a human can trust. The secret sauce isn’t fancy mathit’s tight governance and reproducible results.
5) The Workflow Orchestrator Agent (Long-Running Tasks)
The difference between a demo and production is what happens after step three. This agent manages multi-step workflows, retries, approvals, escalations, and schedulingwithout “forgetting” what it was doing mid-flight.
6) The Tools & Integrations Agent (APIs, SaaS Connectors, “Computer Use”)
When an API exists, use it. When it doesn’t, automation still has to happen. Tool-calling, RPA, and “computer use” features expand what agents can do, but also increase the blast radius if you don’t govern them.
7) The Customer Support Agent
Summarizes tickets, drafts responses, suggests next-best actions, and can resolve simple issues. The best versions are grounded in your knowledge base and can cite internal policynot just sound confident.
8) The Sales Ops Agent
Updates fields, drafts follow-ups, generates call summaries, and keeps pipeline hygiene from becoming a group punishment. The trick is respecting CRM rules and not inventing “deal stages” like it’s writing fan fiction.
9) The Finance Ops Agent
Handles invoice exceptions, vendor questions, and policy-based expense categorization. It lives and dies by auditability: every decision needs a traceable reason.
10) The HR & People Agent
Answers policy questions, guides onboarding, and routes sensitive issues to humans. It’s a prime example of why privacy, redaction, and role-based access must be designed innot taped on later.
11) The Engineering Productivity Agent
Helps with code review summaries, incident writeups, runbook guidance, and release notes. It’s fantasticuntil it’s notso evaluation and safe defaults matter.
12) The Security Triage Agent
Summarizes alerts, correlates signals, and drafts investigation steps. It’s powerful, but only if it’s trained to be humble: “Here’s what I know, here’s what I don’t, and here’s what to check next.”
13) The QA & Evaluation Agent (Truth Serum)
Runs test suites, checks outputs against rubrics, and flags regressions in quality, cost, latency, and safety. If you don’t measure it, you don’t control it.
14) The Observability Agent (Tracing, Cost, Latency)
Watches agents in production: where they fail, what tools they call, how much they spend, and how often they hallucinate. This is the difference between “AI initiative” and “AI system.”
15) The Governance Agent (Risk, Compliance, Controls)
Maps controls to real risks: data leakage, prompt injection, insecure output handling, and supply chain vulnerabilities. It aligns behavior with internal policy and external frameworks.
16) The Legacy Vendor Agent (Yes, We Kept One)
Our one legacy-vendor agent is embedded where the vendor already owns the workflow end-to-endthink ticketing/ITSM or a deeply entrenched system-of-action. It’s there because adoption is frictionless and the workflow is native.
But here’s the punchline: everything around itrouting, evaluation, tracing, policy enforcement, data grounding, and cross-system orchestrationleans modern. That’s not bias. That’s gravity.
Why 15 of 16 Came From Modern Vendors: The Real Reasons (No “Digital Transformation” Posters Required)
1) Agent stacks are composable by nature
Agents aren’t a single product. They’re a system: models, tools, retrieval, memory, orchestration, governance, evaluation, and observability. Modern vendors tend to assume composability from day one: APIs, SDKs, plug-ins, and clean integration points.
2) Multi-model is the new multi-cloud
Enterprises increasingly want the option to route work to different models for cost, latency, privacy, or capability. When the platform assumes one “blessed” model forever, it ages like milk in August.
3) Tracing is not optional anymore
In production, you need to answer: “Why did the agent do that?” Modern agent tooling treats traces as a core artifact: every generation, tool call, handoff, and guardrail decision. Legacy suites often add observability laterif you ask nicely.
4) Evaluation is now a product requirement, not a research hobby
Teams need regression tests for agents the way they need tests for code. That means offline evals, online monitoring, rubrics, human feedback loops, and cost/latency budgets. Modern vendors build for this because they had to.
5) Security threats are agent-shaped
Prompt injection isn’t theoretical when your agent can call tools. Insecure output handling isn’t abstract when your agent can write to a database. Modern platforms increasingly ship guardrails, scanning, and policy controls because enterprise buyers demand themloudly.
6) Shipping speed matters more than brand nostalgia
Agent capabilities evolve fast: better tool calling, better grounding, better orchestration, better governance. Modern vendors iterate weekly. Legacy vendors often iterate on quarterly or annual release trains. In agent-land, that’s the difference between “leading” and “reading about it later.”
7) The suite tax is real
Suites are great when your main problem is buying fewer things. Agents are great when your main problem is doing more work. When the suite makes you compromise on tracing, evaluation, or model choice, “single vendor simplicity” turns into “single vendor limitation.”
What Legacy Vendors Still Do Better (And Why They’re Not Going Away)
Let’s be fair: legacy vendors have strengths that startups would pay real money to inherit.
- Distribution: They’re already deployed, budgeted, and blessed by procurement.
- Workflow ownership: They often own the UI and the underlying system-of-record or system-of-action.
- Enterprise controls: Compliance programs, certifications, and admin tooling are mature.
- Change management: Training, enablement, and support models existeven if they sometimes feel like paperwork cosplay.
That’s why our “one legacy agent” exists: where the vendor owns the workflow so completely that the adoption curve is basically a flat line. But outside those walled gardens, the market is brutally competitiveand the best components win.
A Buyer’s Scorecard: How to Choose Your Agent Stack Without Regrets
Start with outcomes, not demos
A demo is an agent on its best behavior. Production is an agent at 4:57 p.m. on a Friday, holding a live credential and facing a weird edge case the product team never imagined. Define success in operational terms:
- Accuracy and groundedness (with measurable evals)
- Latency targets by workflow type
- Cost budgets and routing strategies
- Permissioning and data access controls
- Traceability and incident response
Prefer platforms that assume heterogeneity
Your environment already contains multiple clouds, SaaS tools, data stores, and identity systems. If a vendor’s “integration strategy” is “please migrate everything into our universe,” that’s not a strategyit’s a hostage note.
Demand evidence of governance maturity
Look for alignment with recognized risk-management approaches, clear admin controls, audit logs, and an honest story about prompt injection and tool safety. If the vendor waves away risk, they’re either new or lying (sometimes both).
Reality Check: Why Many Agent Projects Fail (And How to Not Be a Statistic)
The market is loud, and not all of it is signal. Some “agents” are just chatbots wearing a trench coat. Analysts have warned about hype-driven projects getting canceled when costs rise and outcomes stay fuzzy. That doesn’t mean agents don’t workit means undisciplined implementations don’t work.
Common failure modes
- Unbounded scope: “Make an agent that does everything” is how you build nothing.
- No evaluation loop: If quality isn’t measured, quality will drift.
- No tracing: Debugging becomes séance-based engineering.
- Tool chaos: Too many tools, unclear permissions, and no safe defaults.
- Data mess: Bad sources + weak retrieval = confident nonsense.
A practical success pattern: thin slice → instrument → expand
Start with one workflow you can measure end-to-end. Instrument it with traces and evals. Add guardrails and human approvals where needed. Then expand to adjacent workflows. Agents scale like products, not like PowerPoints.
So Why Only 1 Legacy Vendor Agent? The Strategic Answer
Because in the agent era, control points changed.
Legacy vendors historically won by owning the application surface area. Agents win by owning execution quality: planning, grounding, tool use, safety, and measurability. Modern vendors are optimized for those control points. Many legacy vendors are catching upbut they’re catching up in public, while teams still have work to automate today.
The result is a best-of-breed reality: the “agent stack” looks more like a modular architecture than a suite. And the vendor that earns your trust is the one that proves reliability, transparency, and governanceregardless of how many golf tournaments they sponsor.
Field Notes: of Real Experience Building a 16-Agent Core
Here’s what surprised us most once we stopped demoing agents and started living with them.
First: prompting was the easy part. The hard part was everything the prompt touchespermissions, tool calls, data access, retries, and the awkward moment when an agent “helpfully” tries to do the right thing with the wrong account. We learned quickly that the most important feature isn’t eloquence; it’s control. Who can the agent act as? Which systems can it write to? What’s the approval path when confidence is low? If those answers aren’t crisp, your agent is basically a well-spoken intern with admin credentials. That’s not innovation. That’s a security incident waiting for an calendar invite.
Second: observability turns panic into progress. The first time an agent produced a weird answer, the instinct was to blame the model, then blame the retrieval, then blame the moon’s gravitational pull. Tracing changed everything. When we could see each stepwhat it retrieved, what it decided, which tool it called, and what came backwe stopped arguing and started fixing. “It hallucinated” became “it retrieved an old policy doc from 2019 and treated it as gospel.” That’s actionable. Also, it’s humbling. The model wasn’t being mischievous; it was being literal in the only way a machine can be.
Third: evals are the difference between improvement and superstition. Without a test harness, you can’t tell whether a new prompt, a new tool, or a new model made things better. You just have vibes and the loudest person in the room. We built evaluation sets the same way we built product QA: edge cases, representative workflows, and clear rubrics. Then we treated regressions like regressionsno excuses, no “but it felt smarter.” Agents don’t get performance reviews based on charisma.
Fourth: the “legacy agent” stayed because adoption is a real constraint. In one workflow, the legacy platform already owned the ticket queue, the approvals, the UI, and the reporting. Dropping in the vendor’s native agent let us deliver value in weeks, not quarters. But we still wrapped it with our modern control layer: routing logic, governance checks, and monitoring. In other words, we used the legacy agent like a specialized employee not as the manager of the whole workforce.
Finally: every successful agent has a job description. When we wrote crisp scopeswhat the agent does, what it never does, what it escalates, and how it proves successeverything got easier. Less drama, fewer surprises, more trust. And yes, fewer late-night messages that begin with, “Hey… weird question… did we give the expense agent permission to approve espresso machines?”