AI in banking

What 120+ bank deployments reveal about agentic AI strategy

27 May 2026
9
mins read

Most banks have agentic AI pilots. Very few have an agentic AI strategy. The difference between the two shows up in the P&L - not the demo room. McKinsey estimates that AI-first banks are scaling at double the rate of average institutions, and pioneers are pulling ahead by a 4% return-on-tangible-equity advantage. The gap isn't closing. It's widening every quarter that a bank spends in pilot purgatory.

Why most agentic AI programs stall before they matter

The failure mode is familiar. A bank launches a promising agentic AI pilot in disputes or onboarding. Results look good in isolation. Then the program hits the same five walls it always hits: fragmented data, unclear governance, no shared orchestration layer, competing priorities across business units, and a board that wants ROI before it approves the next phase. The pilot never graduates. Six months later, a new pilot starts somewhere else.

McKinsey's research on AI governance maturity in 2026 found that only about one-third of organizations reach a maturity level of three or higher in agentic AI controls - even as technical capabilities advance rapidly. The bottleneck is organizational and architectural, not technological.

An enterprise agentic AI banking strategy has to solve that structural problem. It can't be a portfolio of individual use cases. It has to be a coordinated operating model where agents, employees, and digital channels work from shared context and shared authority. Outcomes must align instead of diverging at every handoff. A portfolio of disconnected use cases doesn't compound - each one pays the same infrastructure tax from scratch.

The four decisions every C-suite must make first

Before deployment architecture or vendor selection, there are four strategic decisions that determine whether agentic AI scales or stalls. Getting these right up front saves 18 months of rework later.

1. Decide your posture

McKinsey describes three strategic postures for banks facing agentic AI: wait and see, adapt by becoming a product supplier behind agent interfaces, or compete to own the direct customer relationship. Most banks default to the first posture while telling themselves they're pursuing the third. The strategic conversation has to be explicit. A bank that decides to compete for the customer relationship needs an entirely different investment profile than one that decides to become an infrastructure provider for third-party agents. Both are defensible. Neither is compatible with the other. The board needs to choose.

2. Agree on the unit of transformation

Agentic AI programs that fund individual use cases keep re-paying the same integration, data, and governance costs from scratch. BCG's recent analysis of agentic AI in retail banking reinforces that neither the customer journey nor the underlying workflows can remain unchanged - transformation has to be structural. The practical implication: the unit of transformation should be a domain, not a use case. A bank that funds "agentic dispute resolution" as a domain - covering the full front-to-back workflow, the semantic model, agent configuration, governance rules, and the workspace employees use - ships faster on every subsequent case within that domain. A bank that funds individual dispute chatbots keeps starting over.

3. Set the autonomy policy before agents go live

Every agent operates at some level of autonomy - assistive, delegated, or autonomous. The strategic mistake is leaving this decision to the implementation team. Autonomy policy is a board-level risk decision. What actions can agents take without human approval? Which domains require delegated authority where a human approves each case? Where is full autonomy permissible within defined guardrails? Documenting this before deployment makes governance operational rather than retrospective. It also makes the regulatory conversation far simpler - you can show the examiner exactly what authority each agent holds and what evidence trail it leaves.

4. Identify the executive who owns the outcome

Agentic AI programs that report to technology organizations consistently underdeliver on business outcomes. The COO who owns cost-to-serve targets should sign off on the program. They should not just receive a quarterly status report from the engineering team. The Head of Lending owns the conversion metrics that agentic origination moves. The CDO owns the digital adoption numbers. Whoever holds the P&L accountability for the domain should sponsor the agentic program within it. Programs that conflate the technology and business ownership roles produce impressive demos and flat P&Ls.

Organizational readiness: what it requires

Banks tend to assess AI readiness at the model and data layer - do we have clean data, do we have the right models, do we have ML engineers? Those questions matter, but they're not the hard part. The hard part is operational readiness.

Operational readiness means the bank has defined the workflows that agents will participate in well enough that an agent can execute them. Vague SOPs produce vague agent behavior, so precision matters. It also means the bank has a shared semantic layer that gives agents a consistent view of customer state, account state, and case state across systems. An agent that reads the customer from five different systems and gets five different answers cannot be trusted to act. And it means the bank has a governance framework that pre-authorizes what agents can do, under what conditions, with what evidence trail. Governance designed after the fact is always too slow and too conservative to let agentic programs scale.

From what 120+ bank deployments reveal about agentic banking, the banks that move fastest to production share one characteristic: they didn't treat readiness as a pre-condition that had to be perfect before deployment started. They built readiness and deployed in parallel, domain by domain, starting with the workflows where they had the cleanest data and the most defined processes.

Agent orchestration architecture: the structural non-negotiable

Every bank eventually discovers that single-agent deployments have a ceiling. A dispute agent that doesn't share context with the KYC agent, the fraud agent, and the customer communications layer will resolve half the cases correctly. It will escalate the other half to a human who has to start from scratch. The value leaks at every handoff.

The architectural answer is an orchestration layer that coordinates deterministic workflows - the known, repeatable execution paths - with agentic workflows, where agents handle adaptive tasks within bounded authority. Both modes need to run on shared operational context. The customer's state at the moment a dispute is filed has to be the same state the resolution agent, the fraud review agent, and the CSR workspace all see. Without that shared foundation, multi-agent orchestration produces faster chaos, not faster resolution.

Jouk Pleiter, Backbase CEO, describes the architectural imperative this way: agents need "unified context, governed authority, and a shared source of truth that fragmented systems cannot provide." That's not a technology problem - it's an architecture decision. Banks that deploy agents on fragmented data foundations are building AI-enabled systems, not AI-native ones. The distinction matters enormously to long-term ROI.

The practical architecture requirement: a semantic layer for unified customer state, an orchestration layer for workflow coordination, and an authority layer that issues decision tokens for every agent action. None of these work without the others. No agent acts outside its defined scope. Every decision is traceable. This is what makes agentic AI safe enough to scale in a regulated institution - not the individual models, but the control plane they operate within. Nexus provides exactly this kind of shared orchestration foundation for banks building at enterprise scale.

For a deeper look at how agentic AI use cases map across banking departments, the pattern is consistent: departments that standardize on shared orchestration infrastructure ship new use cases in weeks rather than months.

Human-in-the-loop governance at enterprise scale

Human-in-the-loop is not a binary setting. It's a spectrum, and different domains operate at different points on it. Deloitte's research on agentic AI adoption across the banking value chain emphasizes that deploying agentic AI needs fresh thinking and a fundamental redesign of existing processes - not just a new model layered onto old workflows.

The governance framework has to match the risk profile of each domain. Credit underwriting at a large commercial bank probably stays at delegated autonomy - agents prepare the case, humans approve the decision, full evidence bundle attached. High-volume retail disputes with clear policy criteria can move to autonomous resolution within defined guardrails. Humans review exceptions and monitor accuracy metrics. The governance policy defines the boundary. The technology enforces it. The audit trail proves it to the regulator.

What breaks enterprise governance is when the policy lives in a document and the enforcement lives in hope. Every agent action needs to carry a traceable record of the policy applied, the authority granted, the model version used, and the decision outcome. That's not overhead - it's the asset that lets the bank expand agent autonomy over time. It demonstrates to the board and the regulator that the system behaves as specified. Autonomy is earned through evidence, not promised through marketing. Sentinel is designed to provide exactly that kind of continuous monitoring and control layer across agent deployments.

The AI data strategy implications are significant here too. Banks that store decision evidence as first-class operational data - not as logs buried in a data lake - can run the learning loops that improve agent accuracy over time. The system gets better. The governance case gets stronger. The scope of autonomous operation expands. That virtuous cycle is what separates banks that scale agentic AI from banks that manage it.

Measuring outcomes that matter to the board

The metrics that dominate most agentic AI programs - containment rate, deflection rate, task completion rate - are internal technology metrics. They matter for the engineering team. They don't move a board conversation.

The board metrics for an enterprise agentic AI banking strategy are four: cost-to-serve per domain, conversion rate on key origination journeys, staff productivity ratio (cases or applications processed per FTE), and time-to-resolution across serviced domains. Banks running AI-driven call center automation alongside broader agentic servicing programs report 30-40% reductions in cost-to-serve and 50-90% faster execution on routine case types. Those are board-level numbers.

Agentic AI's strategic value is elastic operations - the ability to scale throughput without scaling headcount linearly. A bank that grows 20% in lending volume while holding servicing headcount flat has demonstrated elastic operations. That outcome narrative is what secures the next phase of investment. Containment rate doesn't do that.

The banks pulling ahead on agentic AI in 2026 are not the ones with the most sophisticated models. They're the ones that made clear architectural choices early, aligned their governance framework to their risk posture, and measured the outcomes that matter to the business. For banks asking what it means to be truly AI-native, the answer starts with architecture. The strategy conversation has to start there.

Frequently asked questions

What is an agentic AI banking strategy?

An agentic AI banking strategy is an enterprise-level plan for deploying AI agents across banking operations - covering which domains to automate, how to govern agent authority, what orchestration architecture to build, and how to measure business outcomes. A sound agentic AI banking strategy treats the operating model, not the use case, as the unit of transformation.

How is an agentic AI banking strategy different from a general AI strategy?

A general AI strategy often covers model selection, data infrastructure, and individual use cases. An agentic AI banking strategy goes further - it addresses how agents act autonomously across multi-step workflows and how authority is delegated and governed across domains. It also defines how orchestration infrastructure ensures agents share context across the entire banking frontline rather than operating in isolation.

What organizational readiness does a bank need before deploying agentic AI at scale?

Banks need well-defined workflows agents can execute, a shared semantic layer that gives agents a consistent customer and case view across systems, and a pre-authorized governance framework that specifies what agents can do under what conditions. Readiness built domain by domain - starting with the cleanest data - outperforms waiting for enterprise-wide perfection.

How do banks govern human-in-the-loop oversight in an agentic AI program?

Governance works by matching autonomy level to domain risk: assistive for high-stakes decisions like commercial credit, delegated for complex servicing cases, and autonomous within defined guardrails for high-volume routine tasks. Every agent action should carry a traceable decision record - the policy applied, the authority granted, and the outcome - so the bank can demonstrate compliance and expand autonomy over time based on evidence.

What metrics should the board track for an enterprise agentic AI banking strategy?

Boards should track cost-to-serve per domain, conversion rates on key origination journeys, staff productivity (cases per FTE), and time-to-resolution across serviced domains. These reflect the core value of agentic AI - elastic operations, meaning the ability to scale throughput without scaling headcount linearly. Technology metrics like containment rate matter internally but don't capture the strategic business case.

About the author
Backbase
Backbase pioneered the Unified Frontline category for banks.

Backbase built the AI-native Banking OS - the operating system that turns fragmented banking operations into a Unified Frontline. Customers, employees, and AI agents work as one across digital channels, front-office, and operations.

Backbase was founded in 2003 by Jouk Pleiter and is headquartered in Amsterdam, with teams across North America, Europe, the Middle East, Asia-Pacific, Africa and Latin America. 120+ leading banks run on Backbase across Retail, SMB & Commercial, Private Banking, and Wealth Management.

Table of contents
Vietnam's AI moment is here
From digital access to the AI "factory"
The missing nervous system: data that can keep up with AI
CLV as the north star metric
Augmented, not automated: keeping humans in the loop