The Human-AI Playbook

11 Agents Are Not a Team

What football/soccer teaches us about AI-native software teams

By Ana Vásquez, PhD — Director of Operations, Celerik
and Edwin Duque — Director of Data & AI, Celerik

As the World Cup approaches, football — or soccer, depending on where you live — gives us a useful metaphor for understanding the future of AI-native software teams.

A football team is not simply eleven players on the field.

It is a system.

A formation.
A set of roles.
A coach.
A playbook.
Training routines.
Tactical discipline.
A way to recover shape under pressure.
And a scoreboard that tells the team whether the strategy is working.

The same is true for AI-native software teams.

11 agents are not a team.

A company can have copilots, automations and AI agents across the software development lifecycle and still fail to produce better business outcomes.

Why?

Because performance does not come from tools alone. Performance comes from the way work is designed around those tools.

From More Output to Sustainable Value - The difference a system of play makes

The superstar signing problem

Many organizations are still treating AI like a superstar signing.

They buy licenses.
They give people access.
They add copilots.
They experiment with agents.
They expect productivity to rise.

This is understandable. The tools are impressive. The demos are powerful. The promise of faster delivery is real.

But football gives us a warning: a club does not become a great team just because it signs a brilliant player.

A team still needs tactics, roles, training, trust and coordination.

The same happens with AI.

An organization does not become AI-native because it buys AI tools. It becomes AI-native when it redesigns how work happens.

AI is an amplifier

AI can make strong teams stronger.

It can also make weak systems more fragile.

In software development, this matters because AI can increase both the volume and velocity of work. It can help developers write code, generate tests, summarize documentation, explore alternatives, draft specifications and accelerate analysis.

But more output is not automatically more value.

More generated code can mean more things to review.
More automation can mean more hidden assumptions.
More speed can expose bottlenecks in testing, security, architecture and deployment.
More agents can create more coordination problems if no one owns the system.

This is why AI adoption should not be measured only by activity.

The question is not:

Are people using AI?

The better question is:

Is AI improving the performance of the whole system?

DORA’s work on AI-assisted software development is useful here because it frames AI as an amplifier: it can magnify the strengths of high-performing organizations and the dysfunctions of struggling ones. It also warns that the greatest returns do not come from tools alone, but from the underlying organizational system: internal platforms, workflow clarity and team alignment.

That is exactly the point.

AI does not eliminate the need for engineering discipline.

It makes engineering discipline more important.

AI and Human Collaboration - Building the future of Football / Soccer Teams

From tools to a system of play

In football, the system of play determines how the team moves together.

Who presses?
Who covers?
Who creates space?
Who protects the back line?
Who makes the final pass?
Who changes the rhythm when the match becomes difficult?

AI-native software teams need the same kind of clarity.

Who frames the problem?
Who writes the specification?
Who delegates to AI?
Who verifies the output?
Who owns the decision?
Who updates the workflow when the system fails?
Who measures whether AI is creating value or just generating activity?

This is where many enterprise AI programs are still too shallow.

They focus on access.
They focus on prompting.
They focus on tool training.

Those things matter, but they are not enough.

AI-native work requires more than tool usage. It requires work design.

The missing discipline: Human-AI Work Design

At Celerik, we are consolidating this approach as Human-AI Work Design.

Human-AI Work Design is the discipline of redesigning how people learn, delegate, verify, collaborate and are evaluated when AI becomes part of everyday work.

It is not just prompt engineering.
It is not just AI literacy.
It is not just tool training.

It is the design of the human-AI operating system inside the organization.

In software teams, this includes how a team defines requirements, creates specifications, delegates tasks to AI tools, verifies outputs, handles uncertainty, protects quality, documents decisions and measures business impact.

In other enterprise functions, the same discipline appears through rubrics, decision rules, escalation paths, acceptable-use policies, case libraries, review checkpoints and role-based capability development.

The principle is the same:

AI creates value when the work is redesigned around it.

Human-AI Work Design - 5 pillars for coordinated human-AI collaboration

Why “human in the loop” is not enough

Many companies say they have a human in the loop.

But that phrase is often too vague.

Which human?
At what moment?
With what authority?
Using what criteria?
Checking what type of risk?
Documenting what decision?
Escalating to whom?

A human in the loop without clear work design can become a rubber stamp.

Or worse: a person who is accountable for an AI-assisted decision without having enough context, time, skill or authority to verify it properly.

In football terms, saying “we have a human in the loop” is like saying:

We have a goalkeeper.

That is not enough.

The goalkeeper needs a defensive line, communication, positioning, coaching, match data and a team that knows what to do when pressure arrives.

In AI-native work, verification is a system, not a heroic individual act.

The VAR of AI

Football has VAR because speed and human perception are not enough in critical moments.

AI-native teams need their own version of VAR.

Not for every single action. That would slow the game too much.

But for the moments that matter:

security-sensitive outputs,
architectural decisions,
customer-facing logic,
high-risk automation,
financial or legal implications,
data privacy exposure,
and decisions that affect people.

This is not about distrusting AI.

It is about calibrated trust.

A good team does not trust blindly. A good team knows when to play fast and when to review.

AI Output Verification by Risk Level - How to route AI-generated work through the right checkpoints

The J-Curve of AI adoption

There is another reason why companies need a system of play.

AI adoption often creates an initial dip.

Teams need time to learn new tools.
They need to adapt workflows.
They need to manage the verification tax.
They need to update platforms, testing, documentation and governance.

That early turbulence does not necessarily mean failure.

It can be the tuition cost of transformation.

DORA’s ROI report describes this J-Curve as a temporary productivity dip and period of instability caused by the learning curve, verification tax and pipeline adaptation before long-term value is realized.

This is a strategic management issue.

Organizations that only buy tools may get stuck in the dip.

Organizations that redesign work can move through it.

J-Curve of AI Value Realization - Why AI adoption dips before it compounds

What an AI-native software team actually needs

An AI-native software team does not simply need more AI access.

It needs a system of play.

That system includes at least seven elements.

1. Role clarity

Teams need to know what humans own, what AI can assist, what AI can draft, and what must remain under human judgment.

Without role clarity, AI creates confusion instead of leverage.

2. Context architecture

AI is only as useful as the context it can access.

Internal standards, documentation, APIs, design decisions, user knowledge and business rules need to be available, current and usable.

Without context, AI produces generic output.

3. Specification discipline

The specification becomes the artifact that makes human intent legible to AI.

Vague intent produces vague output.

Clear specs help teams move from prompting to purposeful delegation.

4. Verification routines

AI-generated work must be checked through tests, reviews, rubrics, policies, risk rules and escalation paths.

Verification should not depend on individual heroics. It should be designed into the workflow.

5. Trust calibration

Teams need to learn when to rely on AI, when to challenge it, when to abstain and when to escalate.

Trust is not blind confidence.

Trust is disciplined judgment.

6. Workflow redesign

AI should not be inserted into broken processes as a thin layer.

The workflow itself must be redesigned.

Otherwise, AI will accelerate the bottlenecks, ambiguity and rework already present in the system.

7. Value metrics

The team should not measure only usage or speed.

It should measure throughput, quality, instability, rework, user value and business impact.

The scoreboard matters because it tells the team whether the system of play is actually working.

AI-Native Software Team - 7 dimensions to design how your team works with AI

The real question

The AI conversation in many companies still begins with tools.

Which model should we use?
Which coding assistant should we buy?
Which agent platform should we test?
Which license should we negotiate?

Those questions matter.

But they are not the first questions.

The first question is:

What kind of work system are we designing?

Because 11 agents are not a team.

A team needs a system of play.

AI-native software teams need the same.

They need roles.
They need context.
They need verification.
They need coaching.
They need metrics.
They need a way to learn together.
They need a way to recover when things go wrong.

The future of AI in software engineering will not be won by the organizations that simply buy the most tools.

It will be won by the organizations that learn how to redesign work around human judgment and machine capability.

That is the foundation of Human-AI Work Design,

And it is the first principle of The Human-AI Playbook:

11 agents are not a team.

From Agents to AI-Native Team - Why 11 agents without a system of play is not a team

They become a team only when we design the system that allows them to play with humans.

Work with us

If your organization is investing in AI for software development, do not begin by asking only how many tools, copilots, or agents you have.

Begin with a better question:

Do we have an AI system of play?

At Celerik, we help organizations move from AI tool adoption to Human-AI Work Design: redesigning workflows, roles, verification checkpoints, context architectures, and capability-building systems so AI can create sustainable business value.

If your teams are adopting AI but still struggling with trust, coordination, quality, governance, or measurable ROI, we can help you design the system before expecting the team to win.

Follow the series

This article opens The Human-AI Playbook, a series about how organizations can move from AI tool adoption to Human-AI Work Design.

In the coming articles, we will explore how AI-native teams can design better systems of play:

how to define human and AI roles,
how to build context architectures,
how to manage the verification tax,
how to calibrate trust,
how to design AI governance without slowing the team down,
and how to connect AI productivity with real business value.

Because the future of AI in software engineering will not be won by simply adding more agents.

It will be won by the organizations that learn how to make humans and AI play as a team.

Follow the series The Human-AI Playbook.

Next up: The First Bad Match Is Not Failure: What the J-Curve Teaches Us About AI Adoption

This perspective comes from years of work at the intersection of education, digital mediation, HCI/NUI, community participation, data, software development, and AI adoption. Human-AI Work Design is not just a technology framework; it is a continuation of a deeper question: how do people build meaning, judgment, and capability when new technologies change the way they live and work?

Explore the full Human-AI Playbook series:
View the master index and upcoming publications →

11 Agents Are Not a Team: Why AI-Native Software Teams Need a System of Play

11 Agents Are Not a Team

The superstar signing problem

AI is an amplifier

From tools to a system of play

The missing discipline: Human-AI Work Design

Why “human in the loop” is not enough

The VAR of AI

The J-Curve of AI adoption

What an AI-native software team actually needs

1. Role clarity

2. Context architecture

3. Specification discipline

4. Verification routines

5. Trust calibration

6. Workflow redesign

7. Value metrics

The real question

Work with us

Follow the series

You may also like these

The AI Scoreboard. Why AI Adoption Should Not Be Measured by Usage Alone

The First Bad Match Is Not Failure: What the J-Curve Teaches Us About AI Adoption

The Human-AI Playbook

Let’s Create Your Own Success Story

11 Agents Are Not a Team: Why AI-Native Software Teams Need a System of Play

The superstar signing problem

AI is an amplifier

From tools to a system of play

The missing discipline: Human-AI Work Design

Why “human in the loop” is not enough

The VAR of AI

The J-Curve of AI adoption

What an AI-native software team actually needs

1. Role clarity

2. Context architecture

3. Specification discipline

4. Verification routines

5. Trust calibration

6. Workflow redesign

7. Value metrics

The real question

Work with us

Follow the series

You may also like these

The AI Scoreboard. Why AI Adoption Should Not Be Measured by Usage Alone

The First Bad Match Is Not Failure: What the J-Curve Teaches Us About AI Adoption

The Human-AI Playbook

Let’s Create Your Own Success Story

Cookies