What we actually believe.

Six stances we'll defend in public. Each one comes from owning deployments, not attending keynotes. The claims are not going to soften.

The conference circuit is not expertise.

Two kinds of people talk about AI in public. The first kind have run it in production — they've owned the deployments, debugged the failures at midnight, watched a model drift over six months and figured out why. The second kind have strong opinions about what other people should build.

The second kind dominate the conference circuit. They speak well, cite the right papers, name-drop the right labs. The slide decks are polished. The practical advice is thin. Nobody on a stage is accountable for whether what they recommend actually works once you're back at the office.

Running self-hosted models in production teaches you things no keynote does: what orchestration failure looks like at scale, how a model that worked in testing behaves under real data, what it actually costs to own the outcome rather than just describe it. That knowledge doesn't come from the conference; it comes from the incident report.

Headcount is a failure mode, not a plan.

The first sign that an AI roadmap is wrong: it requires twenty new hires to work. That's not an AI strategy; it's a staffing plan wearing AI's clothes. The point of well-built AI is to change the economics of doing work, to make more doable with the same team. Adding a team to operate the AI is the opposite of the point.

If every unit of AI output requires proportional human input to produce it, the leverage isn't there. You've built a more complicated process, not a better one. The goal is a system that scales its output without scaling its headcount, one where the incremental cost of doing more keeps falling as the system runs.

We've watched clients spend two years building AI systems that required a full-time team to prompt, monitor, and babysit. That's a proof-of-concept that never escaped the demo room. The work isn't done until the humans step back and the system keeps running.

Pilots are where ambition goes to die.

Almost every organization we talk to has a pilot. A demo that impressed someone. A prototype that worked in a controlled environment. Very few have that pilot running across the business, and fewer still have it running well a year after anyone last looked at it.

The gap between proof of concept and production deployment is where most AI investment quietly disappears. It isn't a technology gap. The models work. It's an execution gap: the orchestration that handles edge cases, the feedback loops that catch failures, the monitoring that tells you when something has drifted from what it was supposed to do.

Crossing that gap requires discipline most technology projects don't have, because the failure mode is subtle. A pilot fails loudly. A production deployment that isn't working correctly fails quietly, for months, while the organization assumes it's fine. Execution discipline is the only thing that separates a demo from a system.

Owning your models is leverage, not overhead.

When you run your AI on someone else's API, every piece of institutional knowledge you feed into it leaves the building. Your customer patterns, your internal frameworks, your proprietary processes — they enrich someone else's training corpus. The legal exposure alone is worth taking seriously. The competitive exposure is worse.

Self-hosting is not paranoia. It's data strategy. The organizations that win the next decade will be the ones whose AI systems have a compounding advantage: models that know things a competitor's model doesn't, because they were trained on internal knowledge that never went anywhere. That advantage can't be bought once someone else has it.

The cost of running your own models has fallen dramatically. The benefit hasn't. A model trained on your data, running on your infrastructure, improving from your feedback, is a different kind of asset than an API call. One compounds. The other bills you by the token and keeps nothing.

Most "AI" is a prompt wrapped around a borrowed API.

A prompt is not a product. Calling an OpenAI API is not a system. The reason most AI deployments stay in the demo room is that nobody builds what makes them durable: memory that persists across sessions, routing that handles the failure cases, feedback loops that catch drift, the mechanisms that make the system improve over time rather than degrade.

The real work is infrastructure. It's not glamorous and it doesn't fit on a slide. It's the difference between a system that works in a demo and one that works in production at midnight, six months after anyone last thought about it. Most firms stop at the demo. They call that a deployment.

Orchestration, memory, and feedback loops are where we live — not in the prompt, but in everything around it. A model is a component. What matters is what you build on top of it: the architecture that makes the whole thing reliable, self-correcting, and worth the investment.

The best AI deployment fits what you already have.

The instinct when pursuing AI is to start clean: new infrastructure, new data pipelines, a fresh architecture. That instinct is expensive and usually wrong. Most enterprises have years of valuable data already in motion — in their CRMs, their operational systems, their internal tools. The highest-ROI deployments we've seen don't replace those systems. They connect to them.

AI that runs alongside your existing workflows, consuming the data those workflows already produce, gets to production faster and gets used more. It doesn't require your team to change how they work before it can help them. It shows up inside the tools they already open every morning.

Starting over is the obvious move when the real problem is integration. We've learned to be skeptical of greenfield proposals when the data already exists. The question isn't whether to start fresh; it's whether the existing foundation is worth building on. Usually it is. The work is connecting to it correctly, not replacing it.

The complexity matrix is a liability, not a plan.

Every organization that's taken the tool-first approach to operations ends up with the same artifact: a matrix. An exception arises and a tool gets added. A new workflow appears and a new integration follows. The architecture that was supposed to handle everything accumulates coverage until it's expensive to maintain, slow to change, and brittle in ways that don't reveal themselves until something breaks under pressure.

The instinct behind it was right. You need to handle the full range of conditions. But the approach was wrong. Adding coverage adds fragility. The system gets heavier with every addition, and at some point changing any part of it requires touching everything else. The teams that manage it spend most of their time keeping it alive, not improving it. That's not a technology problem. It's a structural one.

AI's promise is the inverse of the matrix. A well-built AI system handles edge cases by being adaptive, not comprehensive. It absorbs the conditions the matrix was built to cover without inheriting the maintenance surface area. It gets more capable the longer it runs, not more complicated. The organizations that understand this aren't asking how to add AI to the matrix. They're asking how to replace the matrix with something that doesn't need to grow every time conditions change. That's the right question. It's also the harder one.

Disagree with one of these? Good.

A direct conversation, no deck, no discovery theater. You leave knowing what's worth building and what it takes.