March 5, 2026

Who Is Accountable When an AI Agent Makes a Mistake?


If you’re running a business today, AI agents are probably already somewhere on your roadmap.

Most of the conversation around AI agents so far has focused on capability: how much they can automate, and how quickly they can be integrated into real workflows.

But another side of the conversation is now starting to surface in board meetings and risk discussions.

What happens when autonomous systems start making strategic decisions outside of direct human oversight?

Over the past year, that conversation has started to change.

As AI systems move from passive assistance toward autonomous action, organisations are beginning to ask more practical questions about accountability and operational risk. That shift is driving growing interest in emerging governance frameworks like AIUC-1.

At its core, AIUC-1 defines what responsible AI agent behaviour should look like at enterprise scale. It’s not an insurance product and it’s not designed to slow down adoption. The goal is to create a shared reference point for how autonomous systems are evaluated and governed once they begin interacting with real business processes. In simple terms, AIUC-1 is part of the developing rulebook for how autonomous systems behave inside organisations.

What’s driving this shift is clear: the role AI plays inside organisations is evolving rapidly.

For the past few years, most enterprise AI has lived in an assistive posture: models generated insights and drafted recommendations. Useful and impressive, but still largely advisory. The moment agents begin taking actions inside real workflows, the nature of the conversation changes. Suggestions carry limited liability; autonomous decisions carry real consequences.

Examples of this shift are already surfacing. In one recent case, Air Canada was held responsible for false information provided by its customer service chatbot. A customer asked the bot about bereavement fares and was given incorrect policy information. When the airline later refused to honour the discount, the customer took the dispute to a tribunal and won. The tribunal ruled that the airline was ultimately responsible for what its AI system told the customer.

None of this reflects a failure of the technology itself. It reflects the reality of autonomous systems interacting with complex business environments that were originally designed for human oversight.

This is a familiar pattern in technology. Automated trading systems once triggered flash crashes, but markets didn’t abandon algorithmic trading. They introduced circuit breakers and tighter controls. Cloud computing created entirely new security challenges, but organisations didn’t walk away from the cloud. They built security frameworks and compliance standards that allowed it to grow more safely.

And this is the point many organisations are now approaching with AI. Across industries there is growing momentum behind agentic systems that can execute tasks and interact directly with production environments. 

This is where efforts like AIUC-1 and similar governance frameworks start to make more sense. They’re early attempts to define what responsible agent behaviour should look like at scale. 

Before risk can be priced, it has to be legible.
Before it becomes insurable, it has to become repeatable.
You can’t underwrite what you can’t standardise.

Industries that live close to operational risk tend to recognise this pattern early. Real estate, mining, and hospitality have all spent decades building financial and compliance structures around operational exposure. When new forms of leverage appear in those sectors, the conversation quickly moves past what the technology can do and toward how the risk will be contained or priced.

AI agents are beginning to trigger that same instinct. And historically, this is exactly the point where markets start building the structure that makes new technology more trustworthy.

None of this suggests the technology is fragile or that adoption will slow in any significant way. If anything, the opposite is more likely. Capability will continue to advance and autonomous systems will become more deeply embedded in core workflows. What tends to separate early experimentation from AI that holds up in production isn’t performance alone. It is the surrounding trust architecture that allows organisations to deploy with confidence.

The interesting shift now is that the next bottleneck for many AI initiatives may not be technical capability at all. Increasingly, it’s becoming insurability and governance maturity. The onus will be on each business to demonstrate controlled autonomy in these environments.
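To make “controlled autonomy” concrete, here is a minimal sketch in Python. The names (ControlledAgent, issue_refund, the 500 threshold) are invented for illustration and are not drawn from AIUC-1 or any specific framework; the pattern is simply that every agent action is written to an audit log, and actions above a risk threshold are held for human approval instead of executing automatically.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable
import json

# Hypothetical risk tiers; real frameworks define their own taxonomy.
LOW, HIGH = "low", "high"

@dataclass
class AgentAction:
    name: str     # e.g. "issue_refund" (illustrative name)
    params: dict
    risk: str     # LOW runs automatically; HIGH is held for sign-off

@dataclass
class ControlledAgent:
    """Wraps agent actions so every one is logged, and high-risk
    actions are routed to an approver before they execute."""
    approver: Callable[[AgentAction], bool]  # stand-in for a human review step
    audit_log: list = field(default_factory=list)

    def execute(self, action: AgentAction, handler: Callable[..., object]):
        approved = action.risk == LOW or self.approver(action)
        # Legibility: every decision, approved or not, leaves a trace.
        self.audit_log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "action": action.name,
            "params": action.params,
            "risk": action.risk,
            "approved": approved,
        })
        if not approved:
            return None  # blocked, but still recorded
        return handler(**action.params)

# Usage: refunds above a policy threshold need a human in the loop.
def issue_refund(customer_id: str, amount: float) -> str:
    return f"refunded {amount} to {customer_id}"

agent = ControlledAgent(approver=lambda a: a.params.get("amount", 0) <= 500)
small = AgentAction("issue_refund", {"customer_id": "C42", "amount": 80.0}, LOW)
large = AgentAction("issue_refund", {"customer_id": "C7", "amount": 2500.0}, HIGH)
print(agent.execute(small, issue_refund))   # runs automatically
print(agent.execute(large, issue_refund))   # None: held for review
print(json.dumps(agent.audit_log, indent=2))
```

The specific gate matters less than the property it creates: logged, repeatable decision paths are what make agent behaviour legible enough to govern, and eventually to price.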

The organisations that move fastest over the next few years won’t just be the ones deploying more AI. They’ll be the ones that make their systems more transparent and ultimately insurable in ways the market can understand.

Eventually every powerful technology moves from a capability discussion to a risk discussion.

Ready to unlock the power of AI but wondering where to start?
Syfre’s AI Roadmap Workshops can help your business decide where to focus.