The Safety Theater
Walk into any AI product demo in 2026 and you'll hear the same phrase repeated like a mantra: "Don't worry, there's a human in the loop."
It's meant to be reassuring. It signals responsibility. Oversight. Control.
But here's the uncomfortable truth: in most cases, it's theater.
The human isn't catching bad outputs. They're rubber-stamping decisions they don't have time to evaluate, creating the illusion of control while the system runs at a speed no person can match.
The Speed Mismatch
Let's get concrete. An LLM can:
- Review 1,000 support tickets in 30 seconds
- Generate 50 code suggestions per minute
- Analyze security logs from 10,000 endpoints in real-time
- Draft legal contract variations faster than you can read the original
Meanwhile, a human can:
- Read maybe 3-5 tickets thoroughly in the same 30 seconds
- Evaluate one code suggestion properly
- Spot patterns in maybe 100 log entries if they're really good
- Review one contract clause at a time
The mismatch isn't just quantitative—it's structural. When you put a human "in the loop" at AI speeds, you create one of two outcomes:
- The bottleneck: The system slows to human pace, negating the entire point of automation
- The facade: The human approves without genuinely reviewing, becoming a liability shield rather than a quality gate
Neither one is what we actually want.
The Real Choice
Here's what nobody wants to admit: the "human in the loop" is often just delayed autonomy.
You start with approval-required. Then the approvals become automatic because the human can't keep up. Then you remove the human entirely because they weren't adding value anyway.
We saw this exact pattern at Link11 with threat detection:
- 2018: Every flagged IP required analyst review
- 2020: Analysts approved 98% automatically, reviewing only edge cases
- 2023: Automated approval with human audit instead of approval
- 2026: Full autonomy with circuit breakers and retrospective review
The transition was inevitable. The question wasn't whether to go autonomous—it was how to do it safely.
The Guardrail Philosophy
So if "human in the loop" doesn't work at AI speeds, what does?
Guardrails.
Not humans approving every decision. Humans defining the boundaries within which the AI operates autonomously.
Think of it like this:
- Bad: "Check with me before you block any IP address"
- Good: "Never block IPs from this allowlist. Always block IPs that hit these three criteria. For everything else, use your judgment and log it for my review."
The difference is profound:
- The first makes you the bottleneck
- The second makes you the architect
What Good Guardrails Look Like
After deploying AI agents across Link11's infrastructure, here's what I've learned about building trustworthy autonomy:
1. Hard Constraints (Non-Negotiable Rules)
These are the things the AI cannot do, period:
- Never modify production databases directly
- Never expose customer data outside approved systems
- Never disable security monitoring
- Never commit code without tests
These aren't suggestions. They're compiled into the system. Violating them should be technically impossible, not just discouraged.
2. Soft Boundaries (Requires Justification)
These are actions the AI can do, but must log reasoning for:
- Blocking a previously-allowed IP
- Scaling infrastructure beyond normal thresholds
- Making schema changes
- Sending customer communications
The AI operates, but creates a clear audit trail. Humans review after, not before.
3. Escalation Triggers (When to Wake a Human)
Define specific conditions that require human judgment:
- Confidence score below X%
- Financial impact above €Y
- Customer-facing change during high-traffic period
- Conflicting signals from multiple data sources
The key: these are objective thresholds, not vague "use your judgment" clauses.
4. Circuit Breakers (Emergency Stops)
Automatic kill switches when things go wrong:
- If error rate spikes, pause all deployments
- If cost exceeds budget by 50%, freeze scaling
- If customer complaints jump 3x, halt automated responses
- If system detects its own behavior changing unexpectedly, stop and alert
These protect you from runaway automation. The AI doesn't need permission to stop—it needs permission to continue when anomalies appear.
The Uncomfortable Reality
Here's what keeps me awake: we're not really choosing between "human in the loop" and "full autonomy."
We're choosing between:
- Designed autonomy with intentional guardrails, or
- Accidental autonomy where humans pretend to supervise but can't keep up
The second option is more dangerous because it creates false confidence. You think you have oversight. You don't.
Why I'm Choosing Guardrails
At Link11, I made a decision that made some people uncomfortable:
Our AI agents have real authority.
They can:
- Block IP addresses without asking
- Scale infrastructure in response to attacks
- Modify routing rules
- Generate customer communications
But they operate within a rigorously defined envelope:
- Hard-coded constraints prevent catastrophic errors
- Every action is logged with reasoning
- Escalation protocols are automatic and objective
- Circuit breakers stop runaway behavior
- Humans audit decisions after, not approve them before
The result?
- Faster: Response time to attacks dropped from minutes to milliseconds
- Safer: Fewer mistakes than when humans were the bottleneck
- Scalable: We handle 10x the traffic with the same ops team
- Transparent: Better audit trails than when decisions were in Slack DMs
The Future Is Already Here
Whether you like it or not, AI is moving faster than human review can follow.
High-frequency trading already operates at microsecond speeds—no human in that loop.
Content moderation at scale already uses AI—humans review samples, not every decision.
Fraud detection, network security, supply chain optimization—these are all domains where "human in the loop" has quietly become "human auditing the loop."
The question isn't whether to embrace autonomy.
It's whether you're going to design it intentionally or let it emerge by accident.
The Bottom Line
"Human in the loop" sounds responsible. But at LLM speeds, it's often just a way to avoid the hard work of:
- Defining clear boundaries
- Building real constraints
- Creating objective escalation rules
- Accepting that some decisions will be made without you
I'm not advocating for reckless automation.
I'm advocating for honest automation—where we stop pretending humans can review every decision and start building systems that actually deserve trust.
The choice isn't between control and chaos.
It's between designed autonomy with guardrails and accidental autonomy with prayer.
I know which one I'm betting on.
Follow the journey
Subscribe to Lynk for daily insights on AI strategy, cybersecurity, and building in the age of AI.
Subscribe →