AI in banking

How Banks Must Prepare for the Mythos Threat

10 June 2026
9
mins read
For decades, banks have had one quiet advantage in cybersecurity: time. Mythos-class AI systems are closing that window permanently β€” and the security operating models built around patch management simply weren't designed for a world where that window no longer exists.

The time advantage is gone

For decades, banks have had one quiet advantage in cybersecurity: time. Attackers had to discover a vulnerability, then separately develop an exploit, then weaponize it. That chain of steps gave defenders a window to patch. Mythos-class AI systems are closing that window by collapsing all those steps into a single, continuous loop.

This is a structural shift in the nature of the threat itself, not just a speed improvement for attackers. And most bank security teams are still operating on a model designed for the world that existed before it.

Burc Yildirim, Security Director at Backbase said: "AI is starting to compress the vulnerability lifecycle. In the past, discovery, exploit development and weaponization were separate steps. They required time, specialized expertise, and coordination across different people and teams. With systems like Mythos, those steps start to collapse into a single loop."

When that loop closes, the entire logic of traditional security operations - discover, prioritize, patch - starts to break down. The logic of patch-first operations assumes a window between discovery and exploitation. That window is now effectively gone. Banks that understand this will redesign their operating model accordingly. Banks that don't will keep buying better scanners for a threat that doesn't move at scanner speed anymore.

We explored the technical mechanics of this shift in detail in what 120+ bank deployments reveal about the real Mythos threat. This piece is about what banks do next.

Stage one - Accepting that patch management is necessary but no longer sufficient

The first stage of preparation is the hardest, because it requires letting go of a mental model that has worked reasonably well for thirty years.

Patch management - tracking CVEs by severity and deploying fixes within agreed SLAs - is not going away. It remains a baseline hygiene requirement. But it was designed around a specific assumption: that there is a discoverable, published vulnerability with a known identifier before exploitation becomes a serious risk. Mythos-class systems break that assumption cleanly.

An AI system capable of reasoning across large codebases, API contracts, and configuration files doesn't need a CVE to find a path. It can identify logical flaws, authorization gaps, and chained attack paths that no human researcher has documented and no scanner has a signature for. The vulnerability lifecycle no longer starts with disclosure. It starts with the AI.

Banks built their SLAs around patch windows that no longer exist. "Every bank CISO I speak with nods when they hear that - and then I ask them what they've changed in their operating model as a result, and the room gets quiet." said Yildirim.

The implication is that banks need a parallel discipline alongside patch management, one that doesn't depend on known identifiers. That discipline is exposure management, and it operates on a fundamentally different logic.

Stage two - Shifting from vulnerability management to exposure management

Vulnerability management asks: "Do we have known weaknesses, and have we patched them?" Exposure management asks: "What can an attacker reach, exploit, and use to move laterally or cause systemic damage - right now, regardless of whether it has a name?"

Banking systems carry a specific risk: interconnection. A weakness in one component - an internal API, a cloud IAM role, a third-party data feed - can cascade across payment workflows, customer data stores, and regulatory reporting systems. A CVSS score simply doesn't capture that.

Exposure management prioritizes differently: not by severity score, but by whether a flaw can be reached, what breaks if it is, and whether it opens a path to something worse downstream. A medium-severity configuration error in an API gateway that sits upstream of your real-time payments rail is more dangerous than a high-CVSS vulnerability in an air-gapped legacy reporting system. Exposure management sees that. CVSS scoring doesn't.

This connects directly to why fragmentation compounds the AI threat in banking. Fragmented architectures create exactly the kind of complex, poorly-mapped interconnections that AI-assisted attackers are best at navigating. The banks most exposed to Mythos-class threats are often the ones whose own teams can't fully map their own attack surface.

Exposure management requires one concrete capability many banks lack: mapping security findings directly to business processes, not just to assets. When a novel vulnerability emerges - with or without a CVE - the security team can immediately answer: "Does this touch something that matters, and how fast can we contain it?"

Stage three - Knowing exactly where the AI-assisted threat is highest

Not every attack surface carries equal risk under AI-assisted exploitation. The areas where Mythos-class systems are most capable are predictable, because they share a common characteristic: logical complexity.

AI reasoning systems excel at finding flaws in logic - authorization rules that don't hold under edge cases, session management that behaves differently under specific sequences, API contracts that expose unintended data under certain parameter combinations. These are precisely the flaws that human security testers miss most often, because manual testing is slow and sampling-based. AI testing is exhaustive and fast.

The highest-risk surfaces for banks are APIs (especially internal and partner-facing APIs with complex authorization logic), authentication and authorization layers, session management, payment workflows, cloud IAM configurations, and third-party dependencies. These aren't chosen at random - they're the surfaces where the distance between "what the system is supposed to do" and "what it does under adversarial conditions" is widest. That is where AI reasoning has the most to find.

The practical output of this stage is a prioritized attack surface map - not a generic asset inventory, but a specific view of which surfaces are most likely to yield exploitable paths under AI-assisted reconnaissance, mapped against the business processes they support.

Here's what this looks like at an actual bank. Consider a mid-sized bank with a modern API gateway fronting its retail lending workflows. Before: security testing runs quarterly, covers the documented endpoints, and produces a report tied to known CVE references. The team patches what's flagged, closes the audit finding, moves on. After: continuous AI-assisted testing runs against the same gateway, but also against the undocumented internal endpoints, the token refresh logic, and the rate-limiting behavior under parallel request sequences. It also examines the interaction between the API layer and the downstream credit decisioning service. It finds a chained authorization flaw - no CVE, not in any scanner database - that allows a valid session token to query account data outside its permitted scope. The exposure management model flags it immediately as high-impact because it touches a regulated data flow. Remediation begins within hours, not the next quarterly cycle. That's the difference in operating model terms.

Stage four - Deploying the same AI capabilities on defense

The most important strategic response to AI-assisted attacks is deploying the same class of AI capabilities defensively - continuously, as a permanent part of the security operating cycle.

This means continuous penetration testing powered by AI, running against production-equivalent environments on an ongoing basis rather than quarterly engagements. It means AI-assisted code review integrated into the development pipeline, catching logical flaws before they reach production. It means AI-assisted configuration review across cloud infrastructure, API gateways, and IAM policies - looking for the same kind of chained logic errors that an attacker would look for. And it means attack path modeling that continuously updates as the architecture changes, rather than snapshots taken at audit time.

"The key message is simple - Mythos does not only mean attackers can find more vulnerabilities. It means defenders may lose the time advantage they used to rely on."

Losing that time advantage doesn't mean accepting defeat. It means accepting that the security operating model has to run at a different speed. The patch window has shrunk to the point where, in some threat scenarios, it's effectively gone. Detection and containment speed matter as much as prevention. Banks need to be investing in both.

The cryptographical framework for AI governance in banks is part of this same shift - understanding not just what AI agents can do for your operations, but what AI-class systems can do to them. Banks need to build governance structures that can respond at machine speed.

For banks running AI at scale on the frontline - using agents for customer interactions, credit decisions, and operational workflows - the stakes are even higher. AI banking software fails without a coordination layer, and that coordination layer is also an attack surface. The same AI reasoning capabilities that make banking agents powerful make them targets worth analyzing carefully.

The operating model that works

According to Burc, "Banks should not prepare by buying another scanner. They should prepare by redesigning their security operating model around speed, resilience, containment, and continuous validation."

That redesign has one non-negotiable: containment speed built into architecture, not added after an incident. Everything else - continuous testing, exposure mapping, CVE-independent triage - is scaffolding around that core. When a novel vulnerability is identified, isolation is measured in minutes, not days.

A banking leader's guide to agentic workflows is useful context here - because the same principles that make agentic systems powerful (autonomy, speed, continuous operation) need to be applied to the security function itself. Security at Mythos speed requires security operations that also run autonomously and continuously.

This is also why human-in-the-loop remains essential even as AI handles more of the operational cycle. AI-assisted detection and containment raises the floor. Human judgment on escalation, impact assessment, and recovery decisions remains the ceiling. The goal is not to remove humans from the loop. It's to ensure the loop is moving fast enough that human decisions happen at the right moments, not as bottlenecks.

Five questions every bank security leader should answer honestly

If a critical vulnerability in one of your core APIs were found today by an AI system with no CVE attached, how quickly could your security team detect abnormal behavior and isolate the affected component? The honest answer for many banks is "not fast enough" - because detection tooling is still calibrated to known signatures, and isolation requires manual escalation through multiple teams. Mapping that response timeline is the first concrete step.

Have you mapped your highest-risk attack surfaces - APIs, cloud IAM, payment workflows, third-party dependencies - specifically against AI-assisted exploitation patterns, not just known CVEs? Many attack surface maps are asset inventories with severity scores attached. AI-assisted exploitation doesn't follow severity scores - it follows logical complexity and interconnection. The mapping exercise has to be rebuilt with that logic in mind.

Is your current vulnerability prioritization model based on exploitability and systemic risk, or is it still driven primarily by CVSS scores and patch SLAs? CVSS was designed for a different threat environment. It measures severity, not reachability or business impact. A prioritization model built entirely on CVSS will consistently misallocate attention in ways that AI-assisted attackers can anticipate.

Do you have continuous offensive testing - AI-assisted or otherwise - running as a permanent part of your security lifecycle, or is it still a quarterly or annual exercise? Quarterly testing produces a quarterly snapshot of a surface that changes daily. Continuous testing is not just more frequent - it changes the entire feedback loop between development, operations, and security. The architecture decision and the testing decision have to be made together.

If you had to describe your current security operating model in one sentence, would that sentence include the words speed, containment, and continuous validation - or would it describe a queue? Many honest answers describe a queue: tickets raised, scored, assigned, resolved on SLA. Queues are not compatible with a threat that operates at AI speed. The model has to be rebuilt around a loop, not a pipeline.

The banks that answer these questions and act on the answers will be the ones still in control of their own infrastructure when Mythos-class capabilities become widely used. That window is shorter than most security roadmaps assume.

Security is no longer a posture you maintain - it's a speed you have to match.

About the author
Backbase
Backbase pioneered the Unified Frontline category for banks.

Backbase built the AI-native Banking OS - the operating system that turns fragmented banking operations into a Unified Frontline. Customers, employees, and AI agents work as one across digital channels, front-office, and operations.

Backbase was founded in 2003 by Jouk Pleiter and is headquartered in Amsterdam, with teams across North America, Europe, the Middle East, Asia-Pacific, Africa and Latin America. 120+ leading banks run on Backbase across Retail, SMB & Commercial, Private Banking, and Wealth Management.

Table of contents
Vietnam's AI moment is here
From digital access to the AI "factory"
The missing nervous system: data that can keep up with AI
CLV as the north star metric
Augmented, not automated: keeping humans in the loop