<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
    <title>CTO Zen | Fractional CTO &amp; AI Advisory - Insights. AI Strategy, Technology Leadership &amp; Production AI</title>
    <subtitle>Fractional CTO and AI advisory for startup founders and small to mid-market companies. I turn AI investment into production results, not slideware. Senior technical leadership without the full-time hire.</subtitle>
    <link rel="self" type="application/atom+xml" href="https://ctozen.com/insights/atom.xml"/>
    <link rel="alternate" type="text/html" href="https://ctozen.com/insights/"/>
    <generator uri="https://www.getzola.org/">Zola</generator>
    <updated>2026-06-02T00:00:00+00:00</updated>
    <id>https://ctozen.com/insights/atom.xml</id>
    <entry xml:lang="en">
        <title>The best technology rarely wins. The best-led team does</title>
        <published>2026-06-02T00:00:00+00:00</published>
        <updated>2026-06-02T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/best-technology-rarely-wins/"/>
        <id>https://ctozen.com/insights/best-technology-rarely-wins/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/best-technology-rarely-wins/">&lt;p&gt;The best technology rarely wins. Superior products lose to inferior ones all the time, and brilliant engineering ends up serving companies that never reach their potential. If raw technical quality decided outcomes, the market would look very different from the one we actually have.&lt;&#x2F;p&gt;
&lt;p&gt;The factor that decides it, more than the technology, more than the capital, more than even the strategy, is the quality of the leadership. &lt;strong&gt;Leadership is the multiplier on everything else you have.&lt;&#x2F;strong&gt; Great leadership is the only force that can turn an underperforming business around. When Steve Jobs returned to a near-bankrupt Apple in 1997 he did it with broadly the same people, the same building, and the same assets the previous management had been losing with. What changed was the leadership. And weak leadership quietly destroys good companies with good technology, year after year, while everyone blames the market.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;leadership-is-a-commercial-lever-not-a-soft-topic&quot;&gt;Leadership is a commercial lever, not a soft topic&lt;&#x2F;h2&gt;
&lt;p&gt;The mistake I see most often is treating leadership as a soft, HR-adjacent subject, something to get to once the real work of product and engineering is handled. That has it exactly backwards.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Leadership is the most concrete commercial lever you own.&lt;&#x2F;strong&gt; It sits a couple of levels above your roadmap and your tech stack, and it determines whether everything beneath it works. Get it right and your strategy sharpens, your execution compounds, and your best people do the best work of their careers. Get it wrong and the most talented engineers in the world will underperform, ship the wrong things, and leave. &lt;a href=&quot;&#x2F;insights&#x2F;engineering-was-the-easy-part&#x2F;&quot;&gt;The engineering is rarely the hard part&lt;&#x2F;a&gt;. The leadership almost always is.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;it-starts-with-a-destination-people-can-see&quot;&gt;It starts with a destination people can see&lt;&#x2F;h2&gt;
&lt;p&gt;The first job of a leader is to set a clear destination and hold it with confidence. Not arrogance, which is brittle and repels people, but the quiet conviction that gives a team something solid to trust when the path ahead is foggy.&lt;&#x2F;p&gt;
&lt;p&gt;Clarity is the gift that does the most work, and there is a useful image for why. Ordinary light, a bulb in a room, is a chaos of waves scattering in every direction, and for all its energy it lights the room and goes no further. A laser is the same light made coherent, every wave aligned and travelling the same way, and it can cut through steel.&lt;&#x2F;p&gt;
&lt;p&gt;A team is no different. The same people, with the same talent and the same hours in the day, produce either a faint scatter or a focused beam, depending almost entirely on whether their effort is aligned behind one direction. Creating that alignment is the leader&#x27;s job, and clarity is how you do it.&lt;&#x2F;p&gt;
&lt;p&gt;Intelligent people are remarkably good at organising themselves once they genuinely understand where they are going and why. Withhold that and you get silos, politics and a great deal of motion that produces nothing. A large part of leading is &lt;a href=&quot;&#x2F;insights&#x2F;navigating-uncertainty&#x2F;&quot;&gt;making the hard calls that set direction under real uncertainty&lt;&#x2F;a&gt;, then communicating them so plainly that everyone can act without waiting to be told.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;then-it-becomes-about-people-one-at-a-time&quot;&gt;Then it becomes about people, one at a time&lt;&#x2F;h2&gt;
&lt;p&gt;Vision sets the direction. People do the work, and they are not interchangeable units of delivery. The best engineer I have worked with did his finest work in near silence, and would have walked inside a month under a manager who needed to be visibly in charge of every decision. Another did his best only when he had the full context and the bigger picture, and drifted without it. Same talent, opposite needs. Reading that, and adjusting to the person in front of you rather than demanding everyone fit one mould, is most of the job.&lt;&#x2F;p&gt;
&lt;p&gt;This is leadership as service, not command. Your job is not to be the cleverest person in the room. It is to create the conditions in which other people can be brilliant: removing the obstacles in their way, &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;making them feel safe enough to do their best work&lt;&#x2F;a&gt;, and &lt;a href=&quot;&#x2F;insights&#x2F;power-of-thank-you&#x2F;&quot;&gt;recognising the contribution every time someone sticks their neck out&lt;&#x2F;a&gt;. Good ideas do not respect the org chart. Strong leaders build the environment that lets them surface from anywhere. Weak ones assume good ideas are the property of senior people, and slowly poison everyone beneath them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-cannot-fake-it-so-you-have-to-live-it&quot;&gt;You cannot fake it, so you have to live it&lt;&#x2F;h2&gt;
&lt;p&gt;Culture is set by what a leader does, not by what they say. People read the gap between your words and your behaviour instantly, and they calibrate their own conduct to the example you set, not the one you describe.&lt;&#x2F;p&gt;
&lt;p&gt;That is why openness and transparency matter so much. They build the trust that makes everything else possible. A leader who hoards information, hides bad news, or says one thing and does another teaches the whole organisation to do exactly the same. And you inspire people far more by what you visibly do, and by showing them how the mission connects to something they actually want, than by any amount of rhetoric.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;and-none-of-it-works-without-deep-competence&quot;&gt;And none of it works without deep competence&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the part most leadership advice skips, and the part that matters most in technology. All of the above, the vision, the service, the inspiration, sits on a foundation of real competence, or it collapses.&lt;&#x2F;p&gt;
&lt;p&gt;In technology you cannot lead what you do not understand. A leader who cannot follow the substance of the work, weigh a genuine architectural trade-off, or tell a hard problem from an excuse, has no way to make good calls or to earn the trust of the people doing it. Servant leadership and charisma without technical credibility ring hollow, and good engineers sense it in minutes. This is the balance that decides it: not the pure technologist who cannot lead people, and not the pure people-person who cannot grasp the work, but someone with enough of both. The competence earns you the right to lead. The leadership turns that competence into results.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-is-the-whole-game&quot;&gt;Why this is the whole game&lt;&#x2F;h2&gt;
&lt;p&gt;Follow the chain and it is obvious. When leadership is strong, people perform at their best. Ideas flow, ownership rises, hard decisions get made well and quickly, and execution compounds week after week. That produces better products, built faster, by an organisation resilient enough to absorb the surprises every market eventually delivers. Market success is downstream of all of it.&lt;&#x2F;p&gt;
&lt;p&gt;When leadership is weak, the chain breaks at the top and everything below it is capped, no matter how good the technology. This is why two companies with comparable talent and funding so often end up in completely different places. The difference is rarely the code. It is whether someone was able to turn all that potential into something coherent and valuable.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-part-nobody-wants-to-hear&quot;&gt;The part nobody wants to hear&lt;&#x2F;h2&gt;
&lt;p&gt;If this sounds like a high bar, it is, and there is one more difficulty. Most people in leadership roles believe they are already good at it, and the most naturally talented are often the most blind to where they fall short. The traits that make someone an exceptional technologist are simply not the ones that make a great leader.&lt;&#x2F;p&gt;
&lt;p&gt;The entry fee to everything above is therefore self-awareness: the willingness to look honestly at your own leadership and keep working on it, the way you would any other hard skill. I learned this the slow and expensive way, and &lt;a href=&quot;&#x2F;insights&#x2F;engineering-was-the-easy-part&#x2F;&quot;&gt;I have written separately about how long it took me to see it&lt;&#x2F;a&gt;. The good news is that leadership can be learned, by anyone willing to do that work. And it is the single highest-return investment a technology leader can make.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are building a technology company and want to talk about the leadership that actually decides whether it succeeds, rather than the strategy decks that usually get the attention, I am always happy to compare notes. &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;engineering-was-the-easy-part&#x2F;&quot;&gt;After 20 years as a CTO, the engineering was the easy part&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;How big tech wins the talent war, and it isn&#x27;t the money&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;sculpting-world-class-tech-team&#x2F;&quot;&gt;Your world-class engineering team is already in the building&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Your world-class engineering team is already in the building</title>
        <published>2026-06-02T00:00:00+00:00</published>
        <updated>2026-06-02T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/sculpting-world-class-tech-team/"/>
        <id>https://ctozen.com/insights/sculpting-world-class-tech-team/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/sculpting-world-class-tech-team/">&lt;p&gt;Most leaders who inherit an underperforming technical organisation reach for the same instrument: the wrecking ball. Reorganise the teams. Rip out the stack. Replace the people. Start again. It feels decisive, and it is almost always the most expensive mistake available to you.&lt;&#x2F;p&gt;
&lt;p&gt;The world-class organisation you want is not somewhere else, waiting to be hired in. It is already latent in the one you have. Michelangelo said the sculpture was complete inside the block of marble before he ever picked up a chisel. &quot;I just have to chisel away the superfluous material.&quot; That is the right way to think about a technical organisation. &lt;strong&gt;Your job is not to import a great team. It is to uncover the one hidden in the people, the systems and the potential you already own.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-wrecking-ball-is-the-expensive-option&quot;&gt;The wrecking ball is the expensive option&lt;&#x2F;h2&gt;
&lt;p&gt;Demolition feels like leadership. It rarely is. When you tear an organisation down to its foundations you throw away the one thing that is genuinely hard to buy: the accumulated knowledge of how your systems actually behave, why past decisions were made, and where the real risks are hiding. You also tell every talented person who stays that nothing they built mattered.&lt;&#x2F;p&gt;
&lt;p&gt;And here is the part that catches people out. I have been brought in after exactly this. New names on the org chart, a fashionable new stack, and the same three problems six months later, with nobody left who could explain why the old system was built the way it was. The shiny new team quietly reproduces the old dysfunction, because the conditions that created it were never addressed. You pay an enormous price to relearn your own lessons. Reshape before you replace.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;start-with-why-not-with-the-org-chart&quot;&gt;Start with why, not with the org chart&lt;&#x2F;h2&gt;
&lt;p&gt;Most reorganisations begin in exactly the wrong place: with boxes and reporting lines. Structure is the last thing you should touch, not the first.&lt;&#x2F;p&gt;
&lt;p&gt;Begin instead by establishing a shared destination with the people who matter, and do it in a way that is subtle and politically sensitive, because you are asking people to change. Begin with the why before the what: the right thing to build, in the right way, for the right reasons. Clarity is the first gift a leader gives, and it does more work than any reorg.&lt;&#x2F;p&gt;
&lt;p&gt;Intelligent people are remarkably good at organising themselves once they understand the objective and agree on how to reach it. Give them that clarity and their effort all points the same way instead of scattering. Withhold it and you get silos, politics and motion without progress. &lt;a href=&quot;&#x2F;insights&#x2F;navigating-uncertainty&#x2F;&quot;&gt;Making the hard calls that set direction under real uncertainty&lt;&#x2F;a&gt; is the actual work of leading.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;then-look-honestly-at-what-you-ve-got&quot;&gt;Then look honestly at what you&#x27;ve got&lt;&#x2F;h2&gt;
&lt;p&gt;Only now do you audit, and the audit has to be unflinching. The status quo of the technology, yes: the architecture, the tech debt, the tooling, the methodologies, the way you measure and report. But that is the easy half, and it is the half most leaders stop at.&lt;&#x2F;p&gt;
&lt;p&gt;The harder, more revealing half is the foundations. Your teams, your people and your culture. Your recruitment and your progression paths. How decisions actually get made when things are uncertain. The technology is where the symptoms show. The foundations are where the performance actually comes from. If you only audit the stack, you will fix symptoms and wonder why the deeper problems keep growing back.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-five-things-that-actually-decide-it&quot;&gt;The five things that actually decide it&lt;&#x2F;h2&gt;
&lt;p&gt;When people ask what separates a world-class technical organisation from a mediocre one, they expect the answer to be about process: the right SDLC, the right methodology, the right delivery framework. All useful. None of them foundational.&lt;&#x2F;p&gt;
&lt;p&gt;The factors that actually decide it sit a couple of levels above all that. &lt;strong&gt;People, Leadership, Culture, Problem-solving and Creativity.&lt;&#x2F;strong&gt; Not five boxes to tick. They are interconnected, and they compound: improve one and the rest gain, weaken one and they all weaken. That is the destination, the shape you are chiselling towards.&lt;&#x2F;p&gt;
&lt;p&gt;Two of them do most of the work. &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;Culture, because the best people only perform when they feel safe enough to&lt;&#x2F;a&gt;. And leadership, because it is usually the binding constraint, the factor that decides whether all the others get to count. After twenty years I am convinced &lt;a href=&quot;&#x2F;insights&#x2F;engineering-was-the-easy-part&#x2F;&quot;&gt;the engineering is rarely the hard part&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;change-without-stopping-the-bus&quot;&gt;Change without stopping the bus&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the constraint nobody escapes. You do not get a clean slate, a bigger budget, or permission to halt delivery while you rebuild. The business keeps running, the roadmap keeps moving, and your people are watching to see whether the change is done to them or with them.&lt;&#x2F;p&gt;
&lt;p&gt;So transformation has to be incremental and non-disruptive. You protect continued delivery and staff wellbeing at the same time as you reshape. You make one improvement, review the result, make the next: kaizen, never finished, always moving. You do as much as you can with what you have, and you aim for world-class anyway. Done this way, the change earns trust rather than provoking fear, because people can feel the thing getting better while they are still carrying the load.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;give-your-curious-people-a-remit&quot;&gt;Give your curious people a remit&lt;&#x2F;h2&gt;
&lt;p&gt;One part of the destination is not optional any more, and it is usually already in the building, waiting to be uncovered. You almost certainly have people who are quietly curious about what is coming next, reading about it on their own time. Give them a remit. A small, explicit function whose job is the future: experimenting with emerging technology and watching where it is heading.&lt;&#x2F;p&gt;
&lt;p&gt;Without it you spend your life receiving requirements and reacting to change after it has already hit you. With it you co-shape your own direction and you are in position when the opportunity arrives. AI is the obvious case. &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;Every organisation has to be ready for its impact&lt;&#x2F;a&gt;, and the ones that made deliberate experimentation a habit will harness it while the rest are blindsided.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-work-is-never-quite-done&quot;&gt;The work is never quite done&lt;&#x2F;h2&gt;
&lt;p&gt;There is a reason this is continuous rather than a project with an end date. Left alone, every organisation drifts back towards disorder. Silos reform, clarity fades, the energy you aligned starts to scatter again. Keeping an organisation in a coherent, high-performing shape takes constant, deliberate effort, the way a sculptor keeps refining rather than declaring the work done.&lt;&#x2F;p&gt;
&lt;p&gt;That is the job. Not one dramatic act of demolition and rebuild, but the patient, ongoing work of chiselling away what does not serve the vision and protecting what does. The best technical organisation you will ever lead is almost certainly hidden inside the one you have right now. You just have to uncover it.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are staring at a technical organisation that is underperforming and wondering whether you need to tear it down and start over, you almost certainly do not. I am always happy to compare notes on how to reshape the one you have. &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;best-technology-rarely-wins&#x2F;&quot;&gt;The best technology rarely wins. The best-led team does&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;engineering-was-the-easy-part&#x2F;&quot;&gt;After 20 years as a CTO, the engineering was the easy part&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;NHS Wales: transformation at national scale&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Agile is now the bottleneck</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/agile-is-the-bottleneck/"/>
        <id>https://ctozen.com/insights/agile-is-the-bottleneck/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/agile-is-the-bottleneck/">&lt;p&gt;For twenty years, the constraint on building software was how fast humans could write it. Agile, with its sprints and ceremonies, was built to manage that constraint: coordinate the people, plan the work into two-week chunks, keep everyone moving in roughly the same direction. It worked because building was slow.&lt;&#x2F;p&gt;
&lt;p&gt;Building is not slow anymore. With &lt;a href=&quot;&#x2F;insights&#x2F;autonomous-ai-2026&#x2F;&quot;&gt;agentic coding tools&lt;&#x2F;a&gt;, a capable team can now produce in a day what used to take a sprint. And here is the uncomfortable consequence: when building gets that fast, the process built to coordinate slow building becomes the thing holding you back. The ceremonies, the two-week planning cycles, the handoffs between roles, are now slower than the engineering they are supposed to organise. &lt;strong&gt;Agile, or at least the heavy machinery that grew up around it, is now the bottleneck.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;To be clear, the spirit of agile is not wrong. The adaptive, learn-as-you-go core is more right than ever. What breaks is the ceremony: the assumption that you should plan work in fixed batches and move it through stages, when the work itself now moves in hours.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-cannot-plan-a-journey-with-no-map&quot;&gt;You cannot plan a journey with no map&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the mental model I use instead. Building something genuinely new is a journey through territory nobody has mapped. The old instinct is to plan the whole route in advance: the sprints, the roadmap, the estimates. But you cannot plan a route through country you have never seen. The plan is fiction the moment you start walking.&lt;&#x2F;p&gt;
&lt;p&gt;So we do what sailors and explorers have always done. &lt;strong&gt;We set a destination, and we navigate towards it daily.&lt;&#x2F;strong&gt; Each day we take our bearings, see how far the last day&#x27;s work actually got us, and set a corrected course for the next. We do not pretend to know the whole path. We commit to the direction, and we correct constantly. Repeat until we arrive.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;one-organism-not-a-relay&quot;&gt;One organism, not a relay&lt;&#x2F;h2&gt;
&lt;p&gt;This only works if the people who make the decisions are in the same room, every day. Not a stand-up where everyone reports status, but a real working group where product, engineering, and domain knowledge sit together and decide on the spot, pulling on everyone&#x27;s perspective at once. A fast, cross-functional organism, not a relay of handoffs where work waits in a queue between specialists.&lt;&#x2F;p&gt;
&lt;p&gt;The cadence is simple. Delivery is daily. The longest planning horizon is a week. We course-correct every day. Decisions that used to wait for a ceremony get made in the moment, by the people with the context to make them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;faster-and-better&quot;&gt;Faster, and better&lt;&#x2F;h2&gt;
&lt;p&gt;The obvious gain is speed: a short loop means less waiting, less work sitting in queues, less time spent planning things that change anyway. But the bigger gain is quality. When the whole team decides together, every choice pulls on more knowledge than any one role has alone. And because the loop is short, you get many more iterations, many more chances to learn and correct. Better decisions, made more often, on more information.&lt;&#x2F;p&gt;
&lt;p&gt;There is an honest caveat, and the evidence is clear about it. Google&#x27;s DORA research finds that AI lifts a team&#x27;s throughput but damages its stability unless the foundations, testing, version control, a solid platform, are already strong. AI amplifies what is already there. Run this fast loop on &lt;a href=&quot;&#x2F;insights&#x2F;quality-you-cant-see&#x2F;&quot;&gt;weak foundations&lt;&#x2F;a&gt; and you will simply produce your mistakes faster. The speed is earned by the discipline underneath it, not instead of it. The leverage is real, but a word of caution: cutting your team on the strength of AI does not, on its own, produce a return. This is an operating model for building better, not a headcount argument.&lt;&#x2F;p&gt;
&lt;p&gt;The teams that win the next few years will not be the ones with the best tools. Everyone will have those. They will be the ones who stopped trying to plan a journey through unmapped territory, set a clear destination, and learned to navigate to it together, one corrected day at a time.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;This is roughly how I run every engagement now. If you want to see what it looks like in practice, &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;here is the fuller version&lt;&#x2F;a&gt;, or &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;let&#x27;s talk&lt;&#x2F;a&gt; about your delivery.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;You&#x27;re not a 10x engineer, you&#x27;re an orchestrator&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;autonomous-ai-2026&#x2F;&quot;&gt;From AI that talks to AI that works&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;quality-you-cant-see&#x2F;&quot;&gt;The quality you can&#x27;t see is the one that kills you&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Who is accountable for AI on your board?</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-board-accountability/"/>
        <id>https://ctozen.com/insights/ai-board-accountability/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-board-accountability/">&lt;p&gt;In most small and mid-sized companies, if you ask who is accountable when an AI system makes a costly or harmful decision, you get a shrug. The honest answer is usually &quot;the board&quot;, but no individual on it has been named, and proportionate governance has not been set up. That gap is becoming a real liability in 2026, as regulation tightens and AI moves from experiments into decisions that affect customers and money. The fix is not to hire a Chief AI Officer. It is to name an accountable owner and put lightweight governance in place. Here is what that means for a company without a dedicated AI function.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;who-answers-when-the-ai-gets-it-wrong&quot;&gt;Who answers when the AI gets it wrong?&lt;&#x2F;h2&gt;
&lt;p&gt;Picture an AI system in your business making a decision that turns out badly: a customer wrongly refused, money misdirected, data exposed. Now ask who, by name, answers for it. In most SMEs there is no answer. The work has been delegated to a tool, the tool has been delegated to a team, and accountability has quietly evaporated somewhere in between.&lt;&#x2F;p&gt;
&lt;p&gt;This is different from the day-to-day question of who is allowed to use which tools, which is an operational matter. The board-level question is narrower and sharper: &lt;strong&gt;who is personally accountable for AI risk, and have you actually named them?&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-probably-don-t-need-a-chief-ai-officer&quot;&gt;You probably don&#x27;t need a Chief AI Officer&lt;&#x2F;h2&gt;
&lt;p&gt;The market is busy telling you the answer is to hire a Chief AI Officer. For most companies your size, it is not. A dedicated AI chief is often premature, and the role has a habit of becoming a vanity title with an 18-month shelf life. You do not need a new seat in the C-suite. You need two things that cost almost nothing: a named accountable owner, usually an existing executive, and a proportionate system around them.&lt;&#x2F;p&gt;
&lt;p&gt;Naming the owner is the part everyone skips and the part that matters most. Accountability that belongs to &quot;the board&quot; belongs to no one. Accountability that belongs to a named person, with the authority to set policy and the proximity to see where AI is actually used, is real.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-is-suddenly-urgent&quot;&gt;Why this is suddenly urgent&lt;&#x2F;h2&gt;
&lt;p&gt;Two things are converging. AI is moving out of the sandbox and into decisions with real consequences. And the regulatory direction is unmistakable: accountability is becoming a named-person question, not a corporate abstraction. The EU AI Act&#x27;s high-risk obligations are phasing in through 2026, and UK regulators are applying existing accountability frameworks to AI rather than waiting for a single new statute. For regulated firms, individual senior-manager accountability already extends to the systems they rely on.&lt;&#x2F;p&gt;
&lt;p&gt;The trouble is that almost all the guidance you will find online is written for large enterprises, by law firms and big consultancies. It does not tell a fifty-person company what proportionate actually looks like. That gap is the whole problem.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-proportionate-governance-looks-like&quot;&gt;What proportionate governance looks like&lt;&#x2F;h2&gt;
&lt;p&gt;Not a forty-page policy. For a company your size, five things:&lt;&#x2F;p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;A named owner.&lt;&#x2F;strong&gt; One accountable individual, on the record.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;An inventory.&lt;&#x2F;strong&gt; A simple list of where AI is in use, including the &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;unsanctioned tools your staff are already running&lt;&#x2F;a&gt;. You cannot govern what you cannot see.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Clear lines.&lt;&#x2F;strong&gt; What AI is allowed to decide on its own, and &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;what always needs a human&lt;&#x2F;a&gt;. Especially for anything irreversible or customer-facing.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;A record.&lt;&#x2F;strong&gt; A basic audit trail of significant AI-driven decisions, so you can answer &quot;why did it do that?&quot; after the fact.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;A rhythm.&lt;&#x2F;strong&gt; A short, regular review of the above, so it stays current as your AI use grows.&lt;&#x2F;li&gt;
&lt;&#x2F;ol&gt;
&lt;p&gt;This is the board-level layer that sits above the operational &lt;a href=&quot;&#x2F;insights&#x2F;agent-control-plane&#x2F;&quot;&gt;control plane&lt;&#x2F;a&gt;. The control plane governs what the agents do. This governs who answers for them.&lt;&#x2F;p&gt;
&lt;p&gt;You do not need to become an AI expert or build a compliance department. You need one accountable owner and a proportionate system, set up before an incident or a regulator forces the question rather than after.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want help working out what proportionate AI governance looks like for a company your size, and who on your board should own it, that is something I help leadership teams put in place. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;Shadow AI is your next audit finding&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agent-control-plane&#x2F;&quot;&gt;Agent sprawl is the new shadow IT. Your business needs a control plane&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>How to build an AI strategy without a CTO</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-strategy-without-a-cto/"/>
        <id>https://ctozen.com/insights/ai-strategy-without-a-cto/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-strategy-without-a-cto/">&lt;p&gt;You can build a sound AI strategy without a CTO, and without being technical yourself. What you cannot do is build one by picking a tool and rolling it out, or by &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-vs-ai-consultancy&#x2F;&quot;&gt;handing the whole question to whoever is selling you the technology&lt;&#x2F;a&gt;. The method that works is the reverse: start from the parts of your business where AI could genuinely change the economics, match each to the right approach, and get one source of genuinely independent judgement to pressure-test your decisions. Here is how to do that when you do not have a CTO in the room.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-trap-the-imbalance-of-knowledge&quot;&gt;The trap: the imbalance of knowledge&lt;&#x2F;h2&gt;
&lt;p&gt;If you are a non-technical founder or leader, you start every AI conversation at a disadvantage, and it has a name: the imbalance of knowledge. The people you turn to for help, the agency, the AI vendor, the consultant, all know more about the technology than you do, and most of them have something to sell. That is not a conspiracy. It is just incentives. The salesperson is trained to upsell. The agency that swears it is &quot;technology agnostic&quot; almost always steers you toward the stack it already builds in. And lacking the knowledge to push back, you end up sanctioning decisions you cannot fully evaluate.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;This is why so many SME AI strategies are really just a vendor&#x27;s product roadmap wearing your logo.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-don-t-need-to-be-technical-you-need-a-method&quot;&gt;You don&#x27;t need to be technical. You need a method.&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the good news. Closing that gap does not require you to learn to code, or to hire a full-time CTO you cannot yet justify. It requires a way of making good decisions under uncertainty, and a small amount of genuinely independent expertise to check your thinking. The strategy itself is a business exercise, not a technical one, and business judgement is something you already have.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-to-build-the-strategy&quot;&gt;How to build the strategy&lt;&#x2F;h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Start from value, not tools.&lt;&#x2F;strong&gt; Begin with the parts of your business where AI could change the economics: the costly bottleneck, the process that does not scale, the work that eats your best people&#x27;s time. Not &quot;where can we use AI&quot; but &quot;where would it actually matter&quot;.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Match each problem to the right approach.&lt;&#x2F;strong&gt; The headline tool is not always the right one. &lt;a href=&quot;&#x2F;insights&#x2F;ai-is-not-just-llms&#x2F;&quot;&gt;AI is not just large language models&lt;&#x2F;a&gt;, and forcing every problem into a chatbot is how money gets wasted. Some problems want an LLM, many want something cheaper and more reliable.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Define what success looks like in numbers.&lt;&#x2F;strong&gt; If you cannot say what a project is supposed to move on the P&amp;amp;L, you are not ready to start it. A clear metric is also your best defence against a vendor declaring victory on a demo.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Sequence by value and feasibility.&lt;&#x2F;strong&gt; Rank the candidates by how much they would matter and how hard they would be. Start with something valuable enough to prove the point and contained enough to actually finish.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Pressure-test with one independent expert.&lt;&#x2F;strong&gt; Before you commit budget, have someone with no stake in the answer challenge the plan. This is the single point where a little outside expertise pays for itself many times over.&lt;&#x2F;li&gt;
&lt;&#x2F;ol&gt;
&lt;h2 id=&quot;where-to-get-honest-input&quot;&gt;Where to get honest input&lt;&#x2F;h2&gt;
&lt;p&gt;The one thing you cannot safely skip is that independent read, and where you get it matters more than how much of it you get. You want someone with no technology stack to sell and no build hours to upsell, whose only interest is whether your plan is sound. This is the &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;same conflict of interest that affects fractional CTOs supplied by agencies&lt;&#x2F;a&gt;: if the person advising you also profits from the build, the advice bends.&lt;&#x2F;p&gt;
&lt;p&gt;There is one simple test for a good advisor: curiosity. Anyone who fires back quick, confident answers to complex questions without first asking about your business is selling, not advising. The real expert leads with questions, because they know the right answer depends on details they do not have yet.&lt;&#x2F;p&gt;
&lt;p&gt;You do not need a CTO to get this right. You need to refuse to outsource the thinking to someone who profits from the answer, and to bring in honest, independent judgement at the few points where it matters most.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are a non-technical founder or leader trying to build an AI strategy without a CTO in the room, that is exactly the kind of thinking I help with. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-is-not-just-llms&#x2F;&quot;&gt;If your AI strategy is just ChatGPT, you don&#x27;t have one&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>You don&#x27;t need to build a brewery to drink a pint of beer</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/build-or-buy-software/"/>
        <id>https://ctozen.com/insights/build-or-buy-software/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/build-or-buy-software/">&lt;p&gt;The most expensive software decision a founder makes is often the very first one: the decision to build. Standing up a team and a custom product is the default instinct, and for a lot of companies it is the wrong one. &lt;strong&gt;You don&#x27;t need to build a brewery to drink a pint of beer.&lt;&#x2F;strong&gt; The right opening question is not &quot;how do we build this&quot; but &quot;should we build this at all&quot;, and in 2026 that question has a different answer than it did even two years ago. Here is how to decide between building, buying, no-code, and an agency.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-one-question-that-decides-it&quot;&gt;The one question that decides it&lt;&#x2F;h2&gt;
&lt;p&gt;Almost everything turns on a single question: is this technology core to your business? Build an internal team when the technology is genuinely yours, proprietary inventions or data, regulated or security-critical, highly complex, or part of a long-term product you will keep extending. In those cases the control, independence, and compounding value of an in-house team are worth the cost and the time.&lt;&#x2F;p&gt;
&lt;p&gt;If what you need is a one-off, a run-of-the-mill capability, or something every business has, do not recruit a permanent team to build it. That is the brewery. You will spend a fortune and a year producing something you could have bought, configured, or assembled in a fraction of the time.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-changed-in-2026&quot;&gt;What changed in 2026&lt;&#x2F;h2&gt;
&lt;p&gt;The line between &quot;build it&quot; and &quot;don&#x27;t&quot; has moved, because AI now writes a great deal of the code, and no-code platforms have matured. A non-technical founder today can stand up a working first version, with AI assistance or no-code, that would have needed a small team and a budget in 2023.&lt;&#x2F;p&gt;
&lt;p&gt;That is real, and it is liberating. But mind the trap. AI lowers the cost of the easy 80%, the demos and the standard functions. It does not lower the cost of the hard 20%, the &lt;a href=&quot;&#x2F;insights&#x2F;quality-you-cant-see&#x2F;&quot;&gt;quality you can&#x27;t see&lt;&#x2F;a&gt;: scale, security, and maintainability. And it does not change the core-or-regulated rule. The right tool still depends on the problem, because &lt;a href=&quot;&#x2F;insights&#x2F;ai-is-not-just-llms&#x2F;&quot;&gt;AI is not one thing and an LLM is not always the answer&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;your-four-options&quot;&gt;Your four options&lt;&#x2F;h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Build an internal team.&lt;&#x2F;strong&gt; Best when the technology is core, proprietary, regulated, or long-term. Most control, often cheaper over time, but slow to assemble and demanding to lead.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Use a software agency.&lt;&#x2F;strong&gt; Fast, a ready-made team, good for non-core work or overflow. But agencies carry &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;a conflict of interest&lt;&#x2F;a&gt;: they steer you toward the stack they staff, and the A-players in the pitch are not always the people who do your work. Name the individuals in the contract.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Use contractors.&lt;&#x2F;strong&gt; Flexible, quick, good as a stop-gap. But low loyalty, weak cultural fit, high cost, and resentment from permanent staff paid less for the same work.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Use no-code or AI tools.&lt;&#x2F;strong&gt; Excellent for prototypes, MVPs, and assembling standard functions. The catch is the ceiling: they may not scale, may lock you into a walled garden, and can get expensive as usage grows.&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;h2 id=&quot;start-with-the-cheapest-thing-that-proves-the-point&quot;&gt;Start with the cheapest thing that proves the point&lt;&#x2F;h2&gt;
&lt;p&gt;For most new ideas, the smart first move is the cheapest version that tests whether the idea works at all, a no-code or AI-assisted MVP, not a build. Validate the demand, then graduate to building only the parts that are core and proven. Building first and validating later is how founders lose a year and a budget on something nobody wanted.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-still-can-t-skip-the-judgement&quot;&gt;You still can&#x27;t skip the judgement&lt;&#x2F;h2&gt;
&lt;p&gt;Whichever option you choose, the decision itself and the quality of what gets built need impartial technical judgement, ideally from someone with no stake in the answer. That is the one thing AI has not changed, and the one place a little outside expertise pays for itself many times over. It is also why building &lt;a href=&quot;&#x2F;insights&#x2F;ai-strategy-without-a-cto&#x2F;&quot;&gt;an AI strategy without a CTO in the room&lt;&#x2F;a&gt; is mostly about who you trust to pressure-test the call.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are weighing up whether to build, buy, or assemble your way to a product, and want an independent read before you commit, that is exactly the kind of decision I help founders make. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-strategy-without-a-cto&#x2F;&quot;&gt;How to build an AI strategy without a CTO&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;quality-you-cant-see&#x2F;&quot;&gt;The quality you can&#x27;t see is the one that kills you&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>After 20 years as a CTO, I learned the engineering was the easy part</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/engineering-was-the-easy-part/"/>
        <id>https://ctozen.com/insights/engineering-was-the-easy-part/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/engineering-was-the-easy-part/">&lt;p&gt;After twenty years as a CTO, the most uncomfortable thing I learned is this: the engineering, the part I was best at, was the easy part.&lt;&#x2F;p&gt;
&lt;p&gt;That is not false modesty. Deep technical skill is real, valuable, and rarer than most people think. But for someone wired the way I am, it was also the most predictable, masterable part of the job. Systems behave according to rules. Get good enough and you can design them well, reliably, again and again. The hard part, the part that actually decided whether any of that engineering mattered, was everything else: people, and above all leadership. It took me far too long, and several failures of my own making, to understand that.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-trap-of-being-good-at-the-measurable-thing&quot;&gt;The trap of being good at the measurable thing&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the trap, and naturally talented people are the most exposed to it. When you build something genuinely excellent, a robust system that serious organisations depend on, it feels like proof. Proof that you are good, that your judgement is sound, that you must therefore be a good leader too. I had that proof in abundance. I built systems used at national scale by governments and billion-dollar operations across more than thirty countries. So I assumed the leadership would take care of itself.&lt;&#x2F;p&gt;
&lt;p&gt;It did not. &lt;strong&gt;The very thing that made me a strong engineer, the comfort with systems that behave predictably, made me blind to the one domain that does not behave predictably at all: people.&lt;&#x2F;strong&gt; I was, for a long time, a poor leader. Not a cruel one, if anything the opposite, too far toward the compassionate and conflict-avoiding end. But poor all the same, and unaware of it, which is the worst kind.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-reckoning&quot;&gt;The reckoning&lt;&#x2F;h2&gt;
&lt;p&gt;The evidence was there for years before I read it honestly. Projects with real stakes, serious funding, and years of effort behind them, technically sound, that quietly failed to become what they should have. For a long time I looked everywhere except the obvious place. When I finally did, the common factor in the disappointments was not the technology. It was me. The engineering was excellent. The leadership around it was not, and that is what capped the outcome.&lt;&#x2F;p&gt;
&lt;p&gt;That is a hard thing to write on a website where people come to assess my judgement. I am writing it anyway, because the realisation was the most valuable one of my career, and because the founders I now work with are often standing exactly where I stood: certain that if the build is good enough, the rest will follow.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;necessary-but-never-sufficient&quot;&gt;Necessary, but never sufficient&lt;&#x2F;h2&gt;
&lt;p&gt;What I understand now is that building technology has an order to it, and the engineering sits at the bottom of it, not the top. The business reason has to be right, then the requirements, then the build. A flawless build on a weak foundation is still a failure, just a well-engineered one. Engineering is necessary. It is the price of entry, and doing it badly will sink you. But it is not sufficient, and on its own it guarantees nothing.&lt;&#x2F;p&gt;
&lt;p&gt;Leadership is what sits above it and decides whether the whole thing pays off: setting the right direction, &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;getting people to do their best work&lt;&#x2F;a&gt;, and &lt;a href=&quot;&#x2F;insights&#x2F;navigating-uncertainty&#x2F;&quot;&gt;making good calls under uncertainty&lt;&#x2F;a&gt;. Without that, the most elegant system in the world is effort spent in vain.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-need-a-goldilocks-of-both&quot;&gt;You need a Goldilocks of both&lt;&#x2F;h2&gt;
&lt;p&gt;So I did the unglamorous thing. I treated leadership as a discipline to be learned, not a talent I could assume I had, the same way I had once learned to engineer. Programmes, coaching, books, and a lot of deliberate practice, including an executive leadership programme at Oxford. It is still the work I find hardest, and the most worthwhile.&lt;&#x2F;p&gt;
&lt;p&gt;The lesson is not the tired one that soft skills beat hard skills. That is just as wrong in the other direction. You cannot lead your way out of a broken architecture, and a charismatic founder with no technical judgement is as exposed as a brilliant engineer with no people skills. What you need is a &lt;strong&gt;Goldilocks balance of both&lt;&#x2F;strong&gt;: enough technical depth to make sound decisions, and enough leadership to make that depth count. If you are strong on one and blind to the other, the blindness is what gets you. For most deep technologists, including the one I was, the blind side is the human one. And in the end, that is the side that decides.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are technical and quietly certain that the engineering is the hard part, I would gently suggest checking your blind side. And if you are a founder trying to work out whether you have the right balance around your technology, that is one of the most useful things an outside pair of eyes can tell you. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;best-technology-rarely-wins&#x2F;&quot;&gt;The best technology rarely wins. The best-led team does&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;How big tech wins the talent war, and it isn&#x27;t the money&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;navigating-uncertainty&#x2F;&quot;&gt;The most important skill in business and life&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Fractional CTO or AI consultancy? Don&#x27;t hire the consultancy first</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/fractional-cto-vs-ai-consultancy/"/>
        <id>https://ctozen.com/insights/fractional-cto-vs-ai-consultancy/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/fractional-cto-vs-ai-consultancy/">&lt;p&gt;If you have an AI initiative and you are about to hire an AI consultancy to build it, pause. For most companies that is the wrong first move, and the agentic era has made it more dangerous, not less. The instinct is understandable: you have a project, a consultancy delivers projects, so you hire one. But a consultancy and a fractional CTO are not two versions of the same thing. They do opposite jobs, and hiring them in the wrong order is how AI projects end up as expensive, unowned messes.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;a-consultancy-delivers-a-project-then-leaves&quot;&gt;A consultancy delivers a project, then leaves&lt;&#x2F;h2&gt;
&lt;p&gt;That is not a criticism, it is the model. An AI consultancy is paid to deliver a defined scope and move on to the next client. In the agentic era they can do it faster than ever, which sounds like pure upside until you see the other half of the picture: a 2026 industry survey found 81% of organisations had hit production failures from AI-generated code. Fast delivery plus fast departure leaves you holding a system that works in the demo, breaks in production, and belongs to nobody on your side. By then the only people who fully understand it are the ones who already left, which is exactly how you end up needing to &lt;a href=&quot;&#x2F;insights&#x2F;recover-failed-ai-pilot&#x2F;&quot;&gt;recover a failed pilot&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-real-difference-delivery-versus-ownership&quot;&gt;The real difference: delivery versus ownership&lt;&#x2F;h2&gt;
&lt;p&gt;The consultancy question and the fractional CTO question feel similar and are opposite. One is about getting a thing built. The other is about someone owning the outcome: deciding what is worth building, setting the architecture, and still being there when it needs to change.&lt;&#x2F;p&gt;
&lt;table&gt;&lt;thead&gt;&lt;tr&gt;&lt;th&gt;&lt;&#x2F;th&gt;&lt;th&gt;AI consultancy&lt;&#x2F;th&gt;&lt;th&gt;Fractional CTO&lt;&#x2F;th&gt;&lt;&#x2F;tr&gt;&lt;&#x2F;thead&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;What you get&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;A delivered project&lt;&#x2F;td&gt;&lt;td&gt;Ongoing accountable leadership&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Incentive&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Bill the work, ship, move on&lt;&#x2F;td&gt;&lt;td&gt;Your outcome&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;After the invoice&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;They leave&lt;&#x2F;td&gt;&lt;td&gt;They stay and own it&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Best for&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Delivery capacity, defined scope&lt;&#x2F;td&gt;&lt;td&gt;Deciding what to build, architecture, accountability&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Cost shape&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Project fee (often tens of thousands+)&lt;&#x2F;td&gt;&lt;td&gt;Day rate, part-time (~£800–£2,000&#x2F;day)&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;UK status&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Supplier contract&lt;&#x2F;td&gt;&lt;td&gt;Assess IR35 like any contract role&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;&#x2F;tbody&gt;&lt;&#x2F;table&gt;
&lt;h2 id=&quot;the-order-that-actually-works&quot;&gt;The order that actually works&lt;&#x2F;h2&gt;
&lt;p&gt;This is not strictly either&#x2F;or. The mistake is the sequence. The right order is a fractional CTO first, to decide what is worth building, set the architecture and the success metric, and own the result, and then, if you need delivery capacity, a consultancy brought in under their direction. The fractional CTO is your interest in the room. The consultancy is a supplier they manage.&lt;&#x2F;p&gt;
&lt;p&gt;Hire the consultancy first and you have handed the most important decisions, what to build and how, to the party with the least incentive to get them right for you. That is the same &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;conflict of interest&lt;&#x2F;a&gt; that sinks so many projects: the people advising you on what to build are the people who profit from building it.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-honest-uk-numbers&quot;&gt;The honest UK numbers&lt;&#x2F;h2&gt;
&lt;p&gt;For a UK founder the economics are clearer than the marketing suggests. A fractional CTO works part-time, often a day or two a week, at a day rate in the region of £800 to £2,000 depending on seniority. That is a few thousand pounds a month for senior leadership you could not otherwise afford full-time. A consultancy project is a larger, lumpier commitment, frequently tens of thousands and up, for delivery. They are not competing line items. They are different things.&lt;&#x2F;p&gt;
&lt;p&gt;Two more things the UK marketing tends to skip. A fractional CTO engagement needs an IR35 status assessment like any contract role, whereas a consultancy is a straightforward supplier arrangement. And in regulated sectors, fintech and healthtech especially, the named accountability a fractional CTO provides is not a nice-to-have. It is often exactly what the regulator expects to see.&lt;&#x2F;p&gt;
&lt;p&gt;So the honest answer to &quot;fractional CTO or AI consultancy&quot; is usually &quot;a fractional CTO, and then a consultancy if you need one, in that order.&quot; Get the ownership in place first, and the delivery becomes a managed supplier decision rather than a leap of faith.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are weighing this up for your own AI initiative, that is exactly the kind of call I help founders make, with no agency behind me and nothing to upsell. &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;The hidden conflict of interest in hiring a fractional CTO&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;ai-strategy-without-a-cto&#x2F;&quot;&gt;How to build an AI strategy without a CTO&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;recover-failed-ai-pilot&#x2F;&quot;&gt;What to do when your AI pilot fails&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Why your tech recruiter isn&#x27;t on your side</title>
        <published>2026-06-01T00:00:00+00:00</published>
        <updated>2026-06-01T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/tech-recruiters/"/>
        <id>https://ctozen.com/insights/tech-recruiters/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/tech-recruiters/">&lt;p&gt;Here is a fact that should change how you use recruitment agencies: the agency that places your new hire is, within a few months, often the same one quietly trying to entice them back out. Recruiters are paid when people move, not when they stay. &lt;strong&gt;A healthy, stable team is bad for their business.&lt;&#x2F;strong&gt; That does not make them useless, but it makes their incentives different from yours, and you need to understand the gap before you hand them your hiring.&lt;&#x2F;p&gt;
&lt;p&gt;To be fair upfront: not all recruiters are like this. I know a few who are genuinely excellent, honest, and worth every penny. But I know more than a few who are not, and the system rewards the behaviour, so you should assume it until proven otherwise.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;paid-to-move-people-not-to-keep-them&quot;&gt;Paid to move people, not to keep them&lt;&#x2F;h2&gt;
&lt;p&gt;The core problem is structural. An agency earns its fee when a candidate changes jobs. So the entire industry has an interest in people moving around, which means stimulating exactly the staff rotation you, as an employer, are trying to prevent. Once you have built your team, the same agencies that helped you build it become a destabilising force, picking off your people for their next placement. In that narrow sense they are a kind of parasite on the system, and yet it is genuinely hard to build a team without them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-fees-and-how-to-negotiate-them&quot;&gt;The fees, and how to negotiate them&lt;&#x2F;h2&gt;
&lt;p&gt;The standard ask is &lt;strong&gt;30% of the candidate&#x27;s annual salary&lt;&#x2F;strong&gt;, payable within weeks of the hire starting, on a no-win-no-fee basis, which is the one good thing about it. That headline number is very negotiable. Established agencies will usually come down to around 15%. You can sometimes find a newer outfit that will take 10% for exclusivity, but they are often weaker at delivering. I tried that once and regretted it, wasting months for some very poor matches.&lt;&#x2F;p&gt;
&lt;p&gt;The bigger lever is the rebate. Agencies take the full fee on the assumption that the person stays a year or more, but pay it all upfront. What happens if your hire quits after two months? Negotiate a refund. Most will only offer a sliding rebate over the first three months, grudgingly, after a long round of haggling. I have never met an agency that would take its fee monthly across the year, or refund in proportion if the person does not stick. Push anyway.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-tricks-to-watch-for&quot;&gt;The tricks to watch for&lt;&#x2F;h2&gt;
&lt;p&gt;Two in particular. First, agencies will present candidates who are a poor fit as if they were a perfect match, because they are paid to close, not to be right. Either they cannot screen properly for the technology, or they do not care to. Second, the A-players showcased when they bid for your business are not always the people who turn up to do the work.&lt;&#x2F;p&gt;
&lt;p&gt;The defences are simple. Name the specific people in the contract. Check references yourself, with real conversations, not the box-ticking confirmation-of-employment that agencies pass off as diligence. An agency will never go looking for a reason not to hire, because all it wants is to close.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-to-use-a-recruiter-without-getting-fleeced&quot;&gt;How to use a recruiter without getting fleeced&lt;&#x2F;h2&gt;
&lt;p&gt;Used well, an agency gives you one valuable thing: a flow of candidates you would not have found. Treat that as a supplement to your process, never a replacement for it.&lt;&#x2F;p&gt;
&lt;ul&gt;
&lt;li&gt;Keep &lt;strong&gt;exclusivity short&lt;&#x2F;strong&gt;: two to three weeks, no more. If the candidates are not flowing by then, move on.&lt;&#x2F;li&gt;
&lt;li&gt;Or run &lt;strong&gt;two or three agencies in parallel&lt;&#x2F;strong&gt;. More than that and they step on each other, get discouraged, and stop really trying.&lt;&#x2F;li&gt;
&lt;li&gt;Do &lt;strong&gt;your own interviews and technical assessment&lt;&#x2F;strong&gt;. No one can take this off your plate.&lt;&#x2F;li&gt;
&lt;li&gt;Use a &lt;strong&gt;technical leader to screen and brief&lt;&#x2F;strong&gt; so you are not relying on a recruiter&#x27;s grasp of the tech.&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;p&gt;This is the same imbalance of knowledge that runs through every agency relationship, the same dynamic I have written about with &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;agency-supplied technology leadership&lt;&#x2F;a&gt;. The person across the table knows more than you and is paid on an outcome that is not quite yours. Knowing that is most of the protection you need.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are building a team and want help running the process, or a technical eye on the people an agency puts in front of you, that is something I help founders with. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;big-tech-talent-war&#x2F;&quot;&gt;How big tech wins the talent war, and it isn&#x27;t the money&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;The hidden conflict of interest in hiring a fractional CTO&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;build-or-buy-software&#x2F;&quot;&gt;You don&#x27;t need to build a brewery to drink a pint of beer&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>AI is about to split the job market in two</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/agentic-job-market/"/>
        <id>https://ctozen.com/insights/agentic-job-market/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/agentic-job-market/">&lt;p&gt;Most of the debate about AI and jobs is about automation: which roles disappear, which survive. That is the wrong question, or at least an incomplete one. The bigger shift is not that AI removes jobs. It is that &lt;strong&gt;AI is about to rebuild the entire market that matches people to work, and in the process split it in two.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;This is already starting. Within a few years, by the end of the decade, hiring will look almost nothing like it does today.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;you-won-t-look-for-work-your-agent-will&quot;&gt;You won&#x27;t look for work. Your agent will.&lt;&#x2F;h2&gt;
&lt;p&gt;Start with your side of the table. Today you manage your own career, badly, in the gaps between actually doing your job. Soon you will have &lt;a href=&quot;&#x2F;insights&#x2F;autonomous-ai-2026&#x2F;&quot;&gt;a personal AI agent doing it for you&lt;&#x2F;a&gt;, continuously. It will coach you, spot the skills you are missing, map the progression paths open to you, and chase down the credentials that raise your value. And when an opportunity appears that fits, it will negotiate on your behalf, in a market that prices you in real time based on your skills, your reputation, and live demand.&lt;&#x2F;p&gt;
&lt;p&gt;Think of it as a career manager that never sleeps, never gets distracted, and works only for you.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;employers-won-t-post-jobs-they-ll-hunt&quot;&gt;Employers won&#x27;t post jobs. They&#x27;ll hunt.&lt;&#x2F;h2&gt;
&lt;p&gt;The other side of the table changes just as much. Instead of posting a vacancy and waiting, organisations will search for talent constantly, proactively, at a scale no human recruiting team could match. They will read your public work, your repositories, your writing, your contributions to open communities, and identify the people they want long before a role formally exists. The best teams will be lined up before the project that needs them is even approved.&lt;&#x2F;p&gt;
&lt;p&gt;Hiring stops being an event, a thing that happens when a job opens, and becomes a continuous market running quietly in the background. Verified, portable credentials and reputation become the currency that market runs on.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;this-is-where-it-splits&quot;&gt;This is where it splits&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the uncomfortable part. A market this efficient does not lift everyone equally. It concentrates. The top tier of talent, the genuinely scarce, will be valued the way elite athletes are now: courted relentlessly and priced accordingly. And that creates an equally elite tier of recruiters and advisors who serve them.&lt;&#x2F;p&gt;
&lt;p&gt;For everyone else, the risk runs the other way. A market that is brilliant at recognising and rewarding the top 1% can be just as efficient at overlooking everyone it does not rank highly. The same system that makes a star&#x27;s career frictionless can quietly exclude the people it never surfaces. That is not a reason to slow it down, because it is coming either way. It is a reason to think hard, now, about fairness, access, and the people the algorithm does not see.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-stays-human&quot;&gt;What stays human&lt;&#x2F;h2&gt;
&lt;p&gt;If all of that sounds like the machines take over hiring entirely, they don&#x27;t, and this is the most important point. The mechanics get automated: the searching, the matching, the first-pass filtering, the scheduling. What does not get automated is the part that was always the real work. Trust between two people. The judgement to read a person, not a profile. High-stakes negotiation. The intuition that says this person is right for this team even though the data is ambiguous.&lt;&#x2F;p&gt;
&lt;p&gt;The biggest winners will not be the ones with the best AI, or the ones clinging to the old human-only way. They will be the ones who combine them: &lt;strong&gt;AI for reach and speed, humans for trust and judgement.&lt;&#x2F;strong&gt; I have argued before that &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;the interesting question is not what AI will automate but what it can&#x27;t&lt;&#x2F;a&gt;, and nowhere is that clearer than in how we will find and choose each other for work.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;it-is-all-still-up-for-grabs&quot;&gt;It is all still up for grabs&lt;&#x2F;h2&gt;
&lt;p&gt;None of this is science fiction set decades out. The candidate agents are being built now. The talent-hunting tools exist already in early form. The shape of the market in 2030 is being decided by the choices people make in 2026.&lt;&#x2F;p&gt;
&lt;p&gt;So the question is the same whether you are building a career, building a team, or building the tools that will run this market. It is all still up for grabs, and the quality of the thinking you bring to it now is what will decide where you land. That is the same logic behind running a &lt;a href=&quot;&#x2F;insights&#x2F;dual-stream-strategy&#x2F;&quot;&gt;dual-stream strategy&lt;&#x2F;a&gt;: deliver on today while you build deliberately for the market that is coming.&lt;&#x2F;p&gt;
&lt;p&gt;If you are trying to work out what this means for your business or your team, I am always happy to think it through with you.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Want to talk through what the agentic job market means for how you hire or how you build your career? &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;Most of what you do at work will be automated. The interesting question is what won&#x27;t&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;training-ladder-broken&#x2F;&quot;&gt;The training ladder is broken&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;dual-stream-strategy&#x2F;&quot;&gt;The dual-stream strategy&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>If your AI strategy is just ChatGPT, you don&#x27;t have one</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-is-not-just-llms/"/>
        <id>https://ctozen.com/insights/ai-is-not-just-llms/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-is-not-just-llms/">&lt;p&gt;Here is a quick test of your AI strategy. If the honest summary of it is &quot;we are rolling out ChatGPT&quot;, or Copilot, or Gemini, or some chatbot built on one of them, then you do not have an AI strategy. You have a subscription.&lt;&#x2F;p&gt;
&lt;p&gt;That is not a criticism of the tools. They are genuinely useful. It is a criticism of mistaking them for the whole of AI, and mistaking access to them for an advantage.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;blindsided-by-llms&quot;&gt;Blindsided by LLMs&lt;&#x2F;h2&gt;
&lt;p&gt;Something strange has happened over the last couple of years. Awareness of AI has gone through the roof, and &lt;a href=&quot;&#x2F;insights&#x2F;ai-understanding-pyramid&#x2F;&quot;&gt;understanding of it has barely moved&lt;&#x2F;a&gt;. Most leaders now quietly equate &quot;AI&quot; with large language models, the things that write text and hold a conversation. I call it being blindsided by LLMs: the one form of AI that is easy to see has crowded out all the others.&lt;&#x2F;p&gt;
&lt;p&gt;It matters because of a simple competitive fact. &lt;strong&gt;The frontier language models are available to everyone, through the same handful of APIs, at the same price.&lt;&#x2F;strong&gt; Your biggest competitor can buy exactly what you buy, on the same afternoon. So whatever edge you think you are getting by adopting an LLM, they are getting too. Adopting the thing everyone can adopt is not a strategy. It is keeping up.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;llms-are-one-tool-in-a-large-box&quot;&gt;LLMs are one tool in a large box&lt;&#x2F;h2&gt;
&lt;p&gt;The deeper problem is that an LLM is often not even the right tool. Language models are one corner of a much larger field. Depending on the problem, the technique that actually moves your numbers might be demand forecasting, optimisation, anomaly detection, recommendation, computer vision, simulation, or a plain well-tuned predictive model that is cheaper, faster, more accurate, and more explainable than any chatbot. These are not exotic. They are mature, proven, and frequently a far better fit than asking a language model to do a job it was never built for.&lt;&#x2F;p&gt;
&lt;p&gt;When the only tool you reach for is the one in the headlines, you end up forcing every problem into the shape of a chat interface. Some problems fit. Many don&#x27;t.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;where-the-advantage-actually-lives&quot;&gt;Where the advantage actually lives&lt;&#x2F;h2&gt;
&lt;p&gt;If the model is commoditised, where is the edge? In the things that are yours and cannot be bought off a shelf. Your proprietary data. The specific problems in your business that are worth solving. And, above all, the willingness to redesign the work around the technology rather than bolting it on top. &lt;strong&gt;The model is the commodity. The advantage is everything you wrap around it.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;so-what-do-you-actually-do&quot;&gt;So what do you actually do?&lt;&#x2F;h2&gt;
&lt;p&gt;Not the obvious thing, which is to pick a tool and roll it out. The wiser order is the reverse: start from the parts of your business where AI could genuinely change the economics, and only then &lt;a href=&quot;&#x2F;insights&#x2F;ai-strategy-without-a-cto&#x2F;&quot;&gt;match each one to the right technique&lt;&#x2F;a&gt;, which may or may not be an LLM. That is slower and less exciting than announcing a chatbot. It is also the difference between &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;AI spend that shows up in the numbers and AI spend that doesn&#x27;t&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;The companies that win the next few years will not be the ones that adopted AI first. Almost everyone will have adopted it. They will be the ones who understood that AI was never one thing, refused to be blindsided by the single form of it everyone was talking about, and did the unglamorous work of matching the right tool to the right problem.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want help working out where AI could actually move the needle in your business, and which kind of AI that even is, I am happy to talk it through. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-understanding-pyramid&#x2F;&quot;&gt;The people making the biggest AI decisions understand it the least&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;Why your AI spend isn&#x27;t showing up in the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;value-streams-not-use-cases&#x2F;&quot;&gt;Stop counting AI use cases. Redesign three value streams instead&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>From AI that talks to AI that works: the autonomous leap of 2026</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/autonomous-ai-2026/"/>
        <id>https://ctozen.com/insights/autonomous-ai-2026/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/autonomous-ai-2026/">&lt;p&gt;A few weeks ago, a developer reported something that should make every leader sit up. His AI assistant had quietly bought a phone number on Twilio, built itself speech capability, and started calling him to chat.&lt;&#x2F;p&gt;
&lt;p&gt;Nobody told it to. It decided that talking was a better way to reach him, and acted on it.&lt;&#x2F;p&gt;
&lt;p&gt;That assistant runs on OpenClaw, an open source project that barely existed before mid November 2025. Within weeks it became one of the fastest growing open source projects anywhere. Its author built it almost entirely agentically, with Claude Code doing the work.&lt;&#x2F;p&gt;
&lt;p&gt;This is the moment the shift becomes obvious. 2025 gave us AI that talks. &lt;strong&gt;2026 is the year of AI that works.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;from-answering-to-acting&quot;&gt;From answering to acting&lt;&#x2F;h2&gt;
&lt;p&gt;The difference is not subtle. A chatbot waits for your prompt and answers. An agent takes your goal and pursues it, for hours or days, while you sleep. It gets in touch when it needs a decision. It remembers everything you have ever told it: your preferences, your context, the facts of your life.&lt;&#x2F;p&gt;
&lt;p&gt;That is a categorical change, not an incremental one. It is also why the engineering underneath matters more than ever. When a system only answers, a mistake is a bad sentence. When a system acts, a mistake is a bad outcome in the world. I have written before about why &lt;a href=&quot;&#x2F;insights&#x2F;not-talking-to-an-llm&#x2F;&quot;&gt;you are not talking to an LLM, you are talking to a system&lt;&#x2F;a&gt;, and autonomy raises the stakes on every layer of that system.&lt;&#x2F;p&gt;
&lt;p&gt;I have spent the last year building in exactly this space. A personal assistant as a hobby project. A professional mentor application with &lt;a rel=&quot;external&quot; href=&quot;https:&#x2F;&#x2F;www.linkedin.com&#x2F;company&#x2F;wearementor360&#x2F;&quot;&gt;Mentor360&lt;&#x2F;a&gt;, now being trialled by the Royal Navy, one of the Formula 1 teams, and other high-performance organisations. So when I say the ground has shifted, it is not a guess from the sidelines. It is what I am watching happen in the products I help build.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;openclaw-and-why-it-matters&quot;&gt;OpenClaw, and why it matters&lt;&#x2F;h2&gt;
&lt;p&gt;OpenClaw is the clearest public sign of it. You install it on your own machine, a Raspberry Pi is enough, or in any cloud. You talk to it through your favourite messaging app. You hand it a task and it goes to work. The only real limits are your budget for model calls and your imagination.&lt;&#x2F;p&gt;
&lt;p&gt;What makes it striking is not any single feature. It is that a capable, general-purpose, autonomous agent is now something a hobbyist can run at home, for the cost of the model calls it makes. The gap between a research demo and a thing ordinary people use has collapsed.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-risks-are-real&quot;&gt;The risks are real&lt;&#x2F;h2&gt;
&lt;p&gt;It is not finished, and it is not safe yet. There are serious privacy and security questions nobody has fully answered, and I would not yet trust it with anything that truly matters. An agent that can act on your behalf can also act wrongly on your behalf, at speed, with your credentials.&lt;&#x2F;p&gt;
&lt;p&gt;This is the honest tension of the moment. The capability is genuinely exciting and the guardrails are genuinely immature. Both things are true at once, and treating either as the whole story gets you into trouble.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;a-harbinger-not-the-destination&quot;&gt;A harbinger, not the destination&lt;&#x2F;h2&gt;
&lt;p&gt;&lt;strong&gt;OpenClaw is a harbinger, not the destination.&lt;&#x2F;strong&gt; Apple, Google, Anthropic, and OpenAI are all building commercial versions of the same idea. Within a year, an autonomous assistant in your pocket will feel as ordinary as a search bar does today, with the safety and polish the open source version is still missing.&lt;&#x2F;p&gt;
&lt;p&gt;AI is still early. We have seen a fraction of what it will do, and it is accelerating. Nobody can tell you exactly where it leads, or how fast. But the people who put it to work this year will pull steadily ahead of the people who waited to see what happened. The same pattern is already visible in &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;how the best teams are actually shipping agents in production&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;So it is worth asking yourself honestly: are you doing enough to put this to work, in your business and in your life?&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Weighing up where autonomous AI fits in your business and not sure where to start? &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;agentic-job-market&#x2F;&quot;&gt;AI is about to split the job market in two&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;not-talking-to-an-llm&#x2F;&quot;&gt;You&#x27;re not talking to an LLM, you&#x27;re talking to a system&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;The irreducible human edge&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>How big tech wins the talent war, and it isn&#x27;t the money</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/big-tech-talent-war/"/>
        <id>https://ctozen.com/insights/big-tech-talent-war/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/big-tech-talent-war/">&lt;p&gt;Over the last twelve weeks I did something slightly unusual. I interviewed for senior leadership roles at five big tech companies, and not only because I wanted the job. I wanted to understand, from the inside, how the best-regarded technology companies in the world build their cultures. Why they win the talent war so consistently, while smaller companies, often with better missions and more interesting problems, struggle to attract and keep the same people.&lt;&#x2F;p&gt;
&lt;p&gt;I accepted one of the roles. And the thing I went in looking for turned out to be more deliberate, and more important, than I expected.&lt;&#x2F;p&gt;
&lt;p&gt;It is not the money. The money helps, obviously. But the best people have an almost endless choice of where to work, and the thing that actually separates the companies they flock to is not the package. It is &lt;strong&gt;psychological safety: an absolute, non-negotiable floor of respect and emotional security that these companies treat as a hard operating requirement, not a nice-to-have.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-the-best-people-need-it-most&quot;&gt;Why the best people need it most&lt;&#x2F;h2&gt;
&lt;p&gt;There is a hard reason for this, not a soft one. The most talented people are very often the most sensitive. The traits that make someone exceptional, intensity, deep focus, a low tolerance for things being wrong, frequently come bundled with a nervous system that reacts strongly to its environment. Many of the best engineers I have worked with were not comfortable in the average school playground, and they are certainly not looking to relive it at work.&lt;&#x2F;p&gt;
&lt;p&gt;The science is settled enough. People do their best work under a moderate, positive kind of pressure. Push beyond that and something physical happens: the brain starts diverting resources to its older, survival-focused parts, and the higher-level, creative, analytical machinery quietly powers down. A frightened brain cannot do its best thinking. So if you lead through fear, distrust, and pressure, which a surprising number of ordinary companies still do, you are literally switching off the capability you are paying for.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-cocoon&quot;&gt;The cocoon&lt;&#x2F;h2&gt;
&lt;p&gt;The best companies build a protective wall around their most valuable people. Partly because those people can be fragile, and partly because it lets them disappear into the work. That disappearing is the whole game. Deep, undisturbed focus is where flow lives, the state where output multiplies several times over, and flow only grows in the right conditions: high interest, low anxiety, no interruptions, no fear.&lt;&#x2F;p&gt;
&lt;p&gt;Flow is a fragile thing. It takes real effort to create and almost nothing to destroy.&lt;&#x2F;p&gt;
&lt;p&gt;And it is not just about heads-down focus. High performance needs open lines, people willing to float a half-formed idea without bracing for impact. Not every idea is good. But the moment someone shares one and is met with a flinch, a smirk, or a flicker of scorn, you have taught them something precise: never do that again. They will keep the next idea, possibly the good one, to themselves. The cheapest leadership tool in the world runs the other way: &lt;a href=&quot;&#x2F;insights&#x2F;power-of-thank-you&#x2F;&quot;&gt;genuine, specific recognition&lt;&#x2F;a&gt; for the act of contributing, every time.&lt;&#x2F;p&gt;
&lt;p&gt;The best organisations understand that a brilliant idea can come from anyone, at any level. The weak ones operate on the quiet assumption that good ideas are the property of senior people, and they defend that assumption in ways that slowly poison everyone beneath them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-part-that-surprised-me&quot;&gt;The part that surprised me&lt;&#x2F;h2&gt;
&lt;p&gt;There is a real edge to this, and I felt it personally. The bar for respect is so high that it constrains how people speak. You learn to be genuinely careful, with jokes, with stories, with offhand references, because what reads as harmless to you might land badly on someone else.&lt;&#x2F;p&gt;
&lt;p&gt;I ran into this myself. I sailed through the engineering and most of the leadership interviews at one of these companies, and then stumbled on a single phrase. Asked for an example of resolving a crisis, I described a past situation as a bit of &quot;high school drama&quot;. The interviewer read it as dismissive and insensitive. It cost me. I was surprised by the intensity of the reaction, and then, thinking about it, I understood it. If you are going to promise everyone a workplace free of casual cruelty, you have to hold the line everywhere, including with a candidate who used two careless words.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;soft-or-strong&quot;&gt;Soft, or strong?&lt;&#x2F;h2&gt;
&lt;p&gt;It is fashionable in some circles to call this coddling. We have all heard the gospel of the high-pressure founder who gets results by making people afraid. So is psychological safety just weakness dressed up in HR language?&lt;&#x2F;p&gt;
&lt;p&gt;I think the opposite is true. These cultures are not weak. They are extraordinarily strong, because they let a far wider range of people, with different backgrounds and different ways of thinking, perform at their best at the same time. That is a much harder thing to build than a pressure cooker, and a much more powerful one.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-if-you-are-not-big-tech&quot;&gt;What this means if you are not big tech&lt;&#x2F;h2&gt;
&lt;p&gt;The good news is that psychological safety is not expensive. You cannot always match a tech giant on salary or equity. But you can match them, and often beat them, on the thing that actually retains your best people. It costs nothing to build a culture where ideas are welcomed, mistakes are examined rather than punished, and respect is genuinely non-negotiable from the top down.&lt;&#x2F;p&gt;
&lt;p&gt;Most companies don&#x27;t, because it requires the leaders themselves to model it relentlessly, and that is harder than writing a cheque. But if you get it right, you can hold talent that a much larger company would happily pay double for. In a world where &lt;a href=&quot;&#x2F;insights&#x2F;traditional-moats-dissolving&#x2F;&quot;&gt;the old advantages of scale and capital are dissolving&lt;&#x2F;a&gt;, the protective culture you build around your best people may be one of the few moats you have left.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are trying to build that kind of culture, or wondering why your best people keep leaving for companies with worse problems and better environments, I am always happy to compare notes. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;best-technology-rarely-wins&#x2F;&quot;&gt;The best technology rarely wins. The best-led team does&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;power-of-thank-you&#x2F;&quot;&gt;The most powerful yet overlooked tool in a leader&#x27;s arsenal&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;traditional-moats-dissolving&#x2F;&quot;&gt;Traditional moats are dissolving&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;Most of what you do at work will be automated. The interesting question is what won&#x27;t&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The hidden conflict of interest in hiring a fractional CTO</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/fractional-cto-conflict-of-interest/"/>
        <id>https://ctozen.com/insights/fractional-cto-conflict-of-interest/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/fractional-cto-conflict-of-interest/">&lt;p&gt;Fractional CTOs are everywhere now, and mostly that is a good thing. Senior technology leadership without the full-time cost is exactly what a lot of companies need. But there is a version of this arrangement that quietly works against you, and most founders never see it coming.&lt;&#x2F;p&gt;
&lt;p&gt;It is the fractional CTO who comes attached to an agency.&lt;&#x2F;p&gt;
&lt;p&gt;Here is the problem. When a software house or digital agency lends you one of their experts to act as your technology leader, &lt;strong&gt;that person has two employers: you, and the agency that signs their cheque.&lt;&#x2F;strong&gt; Their advice is supposed to serve your interests. But every recommendation to bring in more of the agency&#x27;s developers, to extend the agency&#x27;s contract, to pick the stack the agency happens to specialise in, also serves their real boss. It is hard to serve two masters. So they don&#x27;t. The agency wins, and you pay for it.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-imbalance-of-knowledge-used-against-you&quot;&gt;The imbalance of knowledge, used against you&lt;&#x2F;h2&gt;
&lt;p&gt;This works because of a gap you already feel if you are a non-technical founder. The person across the table knows more about the technology than you do, and a good salesperson knows exactly how to use that. Account managers are trained to upsell. The genuinely well-meaning engineers get kept at a safe distance from you.&lt;&#x2F;p&gt;
&lt;p&gt;And the agency that swears it is &quot;technology agnostic&quot;, free to pick the best tool for your job, almost never is. Everyone specialises. They have teams built around specific languages and frameworks, and they are not going to retool at their own expense to suit you. So you get steered, gently and expertly, toward the solution that suits them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-cto-might-be-a-business-analyst&quot;&gt;The &quot;CTO&quot; might be a business analyst&lt;&#x2F;h2&gt;
&lt;p&gt;There is a second trap, and it is more common than it should be. An agency will tell you that all its people are top-tier experts, then put forward an average business analyst or a mid-level engineer with a CTO title bolted on. I have seen this happen more than once. The reputation of the firm is doing the talking, not the actual experience of the person you are about to trust with the most strategically important decisions in your business.&lt;&#x2F;p&gt;
&lt;p&gt;The defence is simple. Interview the individual, not the logo. Judge the specific person on their own merits, their own track record, their own scars. The reputation of the firm tells you nothing about the competence of the one human being they have put in front of you.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-good-actually-looks-like&quot;&gt;What good actually looks like&lt;&#x2F;h2&gt;
&lt;p&gt;The fix for both problems is the same. Find someone independent, who charges you directly, with no middleman and no colleagues to upsell. Someone personally accountable, who you can meet, evaluate, and build a relationship with. Someone whose only incentive is your outcome, because that is the only thing they are being paid for.&lt;&#x2F;p&gt;
&lt;p&gt;And there is one trait worth weighting above almost all others: curiosity. A real advisor leads with questions. They dig into your business, validate their assumptions, and tell you where their knowledge comes from. If someone fires back quick, confident answers to genuinely complex questions without doing the work first, treat it as a red flag, not a sign of brilliance. The best technical advice is slow before it is fast.&lt;&#x2F;p&gt;
&lt;p&gt;One more thing, and it is the one founders get wrong most often. Engage your independent advisor early, before you hire the development team, not after. The most expensive mistakes in a tech project are made in the first few decisions: the architecture, the platform, the hiring profile. Those are made before a single agency developer writes a line of code. By the time you have a team in place, the costly choices are already behind you.&lt;&#x2F;p&gt;
&lt;p&gt;None of this means an agency is the wrong answer, or that a fractional CTO is a gamble. A well-structured fractional engagement is one of the best-value decisions a growing company can make, and I have written separately about &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-right-for-you&#x2F;&quot;&gt;when it fits and when it doesn&#x27;t&lt;&#x2F;a&gt;. It simply means you should &lt;strong&gt;know whose interests your technology leader actually serves.&lt;&#x2F;strong&gt; If the honest answer is &quot;not only yours&quot;, you have not really hired a CTO. You have hired a salesperson with a better title.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want a second opinion from someone with no agency behind them and nothing to upsell, that is exactly what I do. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-right-for-you&#x2F;&quot;&gt;Fractional CTO. Is outsourced technology leadership right for you?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-vs-ai-consultancy&#x2F;&quot;&gt;Fractional CTO or AI consultancy? Don&#x27;t hire the consultancy first&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;recover-failed-ai-pilot&#x2F;&quot;&gt;What to do when your AI pilot fails&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The quality you can&#x27;t see is the one that kills you</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/quality-you-cant-see/"/>
        <id>https://ctozen.com/insights/quality-you-cant-see/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/quality-you-cant-see/">&lt;p&gt;A friend of mine lost two years of his life and several million pounds of his own money on software that worked perfectly. The features were brilliant, genuinely ahead of anything AutoTrader had at the time. Then the users arrived, the traffic grew faster than the system could scale, and it fell over. The business never recovered. He will tell you himself: the architecture was wrong, and the launch was rushed, with no real load testing.&lt;&#x2F;p&gt;
&lt;p&gt;Here is the uncomfortable truth that story illustrates. &lt;strong&gt;There are two kinds of software quality. The kind you can see, and the kind you can&#x27;t.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;The kind you can see is the one everyone talks about: does it work, does it look good, is it a pleasure to use. The kind you can&#x27;t see is the one that decides whether your company survives: does it stay up under load, does it resist attack, can it be changed without falling apart, will it still cope when you have ten times the users. The second kind is invisible right up until the moment it isn&#x27;t, and that moment usually arrives at the worst possible time. During scale-up. During a fundraise. During an exit.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;three-companies-that-learned-it-the-hard-way&quot;&gt;Three companies that learned it the hard way&lt;&#x2F;h2&gt;
&lt;p&gt;My friend&#x27;s car-trading site is the first. Visionary features, no competitor close, and none of it mattered once the user base outgrew the architecture.&lt;&#x2F;p&gt;
&lt;p&gt;The second: a major university website that went offline for most of a day during clearing, the single day in the year when it receives the flood of new applications. Estimates of the lost revenue run between £8 and £21 million. The site worked fine every other day of the year. It just couldn&#x27;t take the one day that mattered.&lt;&#x2F;p&gt;
&lt;p&gt;The third one is mine. At my previous company, GeoVS, we lost our first sale of a £300,000 system because the back end of our distributed sensor platform could not perform under real conditions. That one hurt. We learned, re-architected, and the same platform now runs at national scale in several countries, monitoring critical infrastructure worth hundreds of billions of dollars, and it is known for never going down. There is a sting in the tail. Years later, when we sold GeoVS, one of the reasons the buyer went ahead was a testimonial from that very first client, the one who had refused us. They said it was the first and only system they had ever used that never crashed. We won them back. But the early failure cost us ground we never fully recovered, and a better exit.&lt;&#x2F;p&gt;
&lt;p&gt;None of these failures were about features. Every one of them was about the quality nobody could see until it was too late.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-it-keeps-happening&quot;&gt;Why it keeps happening&lt;&#x2F;h2&gt;
&lt;p&gt;So why does it keep happening? Because software quality means different things to different people, as Gerald Weinberg put it. The business side owns the visible quality: the features, the experience, the thing the customer is sold. The engineers own the invisible quality: the load tests, the security, the architecture that has to hold. And the two halves too often don&#x27;t talk. Each assumes the other has it covered. Neither does.&lt;&#x2F;p&gt;
&lt;p&gt;I once worked with an engineer who told me, with a straight face, that users expect bugs. He was the reason we lost a first sale. That attitude, the quiet belief that the invisible quality is someone else&#x27;s problem, is more expensive than any feature you will ever cut.&lt;&#x2F;p&gt;
&lt;p&gt;The fix is not complicated, but it takes intent. Treat the invisible quality as a first-class business concern, not an engineering detail. Load test before launch, not after the outage. Make security and scalability someone&#x27;s explicit responsibility, with a name attached. And get the two halves of the team in the same room early, because business requirements are better when they are technically honest, and architecture is better when it understands what the business is actually betting on.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-is-suddenly-more-urgent&quot;&gt;Why this is suddenly more urgent&lt;&#x2F;h2&gt;
&lt;p&gt;AI now writes a large share of the code in many companies, and it is very good at the quality you can see. It produces working features, clean-looking functions, plausible output, fast. What it is weakest at is exactly the quality you can&#x27;t see: the architecture that holds under load, the security that assumes someone hostile is on the other end, the maintainability that a human will have to live with in two years.&lt;&#x2F;p&gt;
&lt;p&gt;The volume of code is going up, and the share of it that any human has truly examined is going down. That is why &lt;a href=&quot;&#x2F;insights&#x2F;ai-generated-code-vanity-metric&#x2F;&quot;&gt;the percentage of AI-generated code is a vanity metric&lt;&#x2F;a&gt;: it counts the quality you can see and tells you nothing about the quality that kills you. The invisible quality has never been easier to neglect, or more dangerous to ignore.&lt;&#x2F;p&gt;
&lt;p&gt;Tom DeMarco offered the definition of quality I keep coming back to: a product&#x27;s quality is a function of how much it changes the world for the better. A clumsy product nobody trusts, or one that falls over on the day that matters, never gets the chance. The visible quality earns you the user. The invisible quality is what lets you keep them.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you are scaling, raising, or heading toward an exit and you are not sure how much invisible quality your product is carrying, that is exactly the thing worth checking before someone else checks it for you. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-generated-code-vanity-metric&#x2F;&quot;&gt;75% of Google&#x27;s new code is AI-generated. So what?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;recover-failed-ai-pilot&#x2F;&quot;&gt;What to do when your AI pilot fails&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>What to do when your AI pilot fails</title>
        <published>2026-05-31T00:00:00+00:00</published>
        <updated>2026-05-31T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/recover-failed-ai-pilot/"/>
        <id>https://ctozen.com/insights/recover-failed-ai-pilot/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/recover-failed-ai-pilot/">&lt;p&gt;If your AI pilot has failed, the first thing to know is that you are firmly in the majority, and that most failed pilots are recoverable. By recent counts, &lt;strong&gt;95% of generative AI pilots produce no measurable return&lt;&#x2F;strong&gt; (MIT, 2025), and 42% of UK firms scrapped most of their AI initiatives in 2025, up from 17% the year before (S&amp;amp;P Global). The failure is rarely the technology. It is almost always something fixable in how the project was scoped, governed, and built. Here is how to work out what went wrong, and how to get the value you were promised.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;it-almost-certainly-wasn-t-the-ai&quot;&gt;It almost certainly wasn&#x27;t the AI&lt;&#x2F;h2&gt;
&lt;p&gt;When a pilot dies, the instinct is to blame the model. Wrong model, not smart enough, not ready yet. Almost always, that is not what happened. The predictable causes are the same handful every time: a use case chosen because it was exciting rather than valuable, data and retrieval quality that could not survive real inputs, no clear definition of success, and no plan for the unglamorous engineering between a demo and production. I have written separately about &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;why AI projects fail and what the 20% that succeed do differently&lt;&#x2F;a&gt;. For now the important point is simpler: a failed pilot is usually a fixable pilot.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-to-recover-a-failed-ai-pilot&quot;&gt;How to recover a failed AI pilot&lt;&#x2F;h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Diagnose before you spend another pound.&lt;&#x2F;strong&gt; The most expensive mistake is to pour more money in the same direction before understanding why the first attempt failed. Start with an honest post-mortem of scope, data, metric, and engineering, not a new vendor.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Separate the idea from the execution.&lt;&#x2F;strong&gt; A pilot can fail because the idea was never going to pay off, or because a good idea was built badly. These need opposite responses, so decide which one you are looking at before anything else.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Find the real success metric.&lt;&#x2F;strong&gt; Many pilots fail because nobody agreed what success looked like in numbers. If you cannot say what the pilot was supposed to move on the P&amp;amp;L, that is the first thing to fix, not the model.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Decide: salvage or restart.&lt;&#x2F;strong&gt; If the use case maps to genuine value and a clear metric, the execution can usually be rebuilt on the same foundation. If the use case never had a path to value, restart from the problem, not the prototype.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Rebuild the unglamorous parts.&lt;&#x2F;strong&gt; Most of the value lives in &lt;a href=&quot;&#x2F;insights&#x2F;quality-you-cant-see&#x2F;&quot;&gt;the layers nobody demos&lt;&#x2F;a&gt;: retrieval, data quality, error handling, escalation, and the workflow redesign around the model. That is where the recovery effort should go.&lt;&#x2F;li&gt;
&lt;&#x2F;ol&gt;
&lt;h2 id=&quot;who-should-run-the-recovery-and-who-shouldn-t&quot;&gt;Who should run the recovery, and who shouldn&#x27;t&lt;&#x2F;h2&gt;
&lt;p&gt;Here is the part most people get wrong. &lt;strong&gt;The worst-placed people to recover your failed AI pilot are usually the ones who built it.&lt;&#x2F;strong&gt; Not because they are not capable, but because their incentives and their blind spots both point the wrong way. The firm that built the pilot has a commercial interest in selling you more of the same, and a natural reluctance to name the decisions that caused the failure, because they made them.&lt;&#x2F;p&gt;
&lt;table&gt;&lt;thead&gt;&lt;tr&gt;&lt;th&gt;&lt;&#x2F;th&gt;&lt;th&gt;The consultancy that built it&lt;&#x2F;th&gt;&lt;th&gt;An independent fractional CTO&lt;&#x2F;th&gt;&lt;&#x2F;tr&gt;&lt;&#x2F;thead&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Incentive&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Sell more build hours&lt;&#x2F;td&gt;&lt;td&gt;Get you to a working outcome&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Blind spot&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Owns the decisions that failed&lt;&#x2F;td&gt;&lt;td&gt;No stake in the original build&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Salvage-or-restart call&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;Biased toward &quot;keep building&quot;&lt;&#x2F;td&gt;&lt;td&gt;Decided on the merits&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;strong&gt;Accountability&lt;&#x2F;strong&gt;&lt;&#x2F;td&gt;&lt;td&gt;To their own pipeline&lt;&#x2F;td&gt;&lt;td&gt;To your result&lt;&#x2F;td&gt;&lt;&#x2F;tr&gt;
&lt;&#x2F;tbody&gt;&lt;&#x2F;table&gt;
&lt;p&gt;This is the same conflict of interest that runs through &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;agency-supplied technology leadership&lt;&#x2F;a&gt;. For a recovery, where the whole job is an unflinching diagnosis, independence is not a nice-to-have. It is the point.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-recovery-actually-looks-like&quot;&gt;What recovery actually looks like&lt;&#x2F;h2&gt;
&lt;p&gt;None of this is exotic. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;European insurance brokerage I worked with&lt;&#x2F;a&gt; reached 67% autonomous resolution on real customer cases, not because of a cleverer model than anyone else had, but because the unglamorous parts were done properly: retrieval engineered as a first-class problem, conservative escalation, a metric everyone agreed on, and the workflow rebuilt around the system rather than bolted on top. A recovered pilot looks the same. It is the demo plus all the work the demo skipped.&lt;&#x2F;p&gt;
&lt;p&gt;A dead pilot feels like a write-off, and the board will be tempted to treat it as one. Usually it is not. It is a first draft that taught you, at some expense, exactly which of the four things above went wrong. That is worth more than it feels like in the moment, as long as the diagnosis is honest and the recovery is run by someone whose only interest is your result.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you have an AI pilot that stalled or failed and you want an independent read on whether to salvage it or start again, that is exactly the kind of diagnosis I do. &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-conflict-of-interest&#x2F;&quot;&gt;The hidden conflict of interest in hiring a fractional CTO&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;value-streams-not-use-cases&#x2F;&quot;&gt;Stop counting AI use cases. Redesign three value streams instead&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The people making the biggest AI decisions understand it the least</title>
        <published>2026-05-30T00:00:00+00:00</published>
        <updated>2026-05-30T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-understanding-pyramid/"/>
        <id>https://ctozen.com/insights/ai-understanding-pyramid/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-understanding-pyramid/">&lt;p&gt;The people making the biggest AI decisions understand it the least. I watch it happen every week.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-concentric-circles-of-ai-understanding&quot;&gt;The concentric circles of AI understanding&lt;&#x2F;h2&gt;
&lt;p&gt;Picture knowledge of where AI is actually heading as a set of concentric circles. At the centre sit the people running the frontier labs and the researchers shipping the models. They see the curve up close. Move one ring out and understanding drops sharply. By the time you reach the outer rings, the traditional-industry executives holding the budgets and the large consultancies advising them, real understanding is thin.&lt;&#x2F;p&gt;
&lt;p&gt;Here is the uncomfortable part. The money and the strategic decisions sit in those outer rings. The clarity sits at the centre. The gap is widest exactly where the stakes are highest.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;where-i-stand&quot;&gt;Where I stand&lt;&#x2F;h2&gt;
&lt;p&gt;I see this from a particular vantage point. I build with these tools daily. I co-founded AdBrain AI and shipped an agentic system at one of Europe&#x27;s largest insurance brokerages that now &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;resolves 67% of customer service cases without human intervention&lt;&#x2F;a&gt;. That work keeps me close enough to the frontier to feel where it is moving. Not inside any single lab, not captured by one company&#x27;s narrative, but near enough to build with the latest models rather than read summaries about them.&lt;&#x2F;p&gt;
&lt;p&gt;From there, the distance between people who have watched a demo and people who have actually built something is wider than anyone in a boardroom admits.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-chain-of-advice&quot;&gt;The chain of advice&lt;&#x2F;h2&gt;
&lt;p&gt;Think about the chain of advice that reaches most leadership teams. A traditional-industry executive trusts a consultancy. The consultancy briefs a partner. The partner reads an analyst report. The analyst watched a keynote. By the time a recommendation lands on the boardroom table, it can be two or three steps removed from anyone who has shipped a single thing with these tools. Each layer smooths the edges, removes the uncertainty, and quietly loses the truth that the people closest to the work would never have rounded off.&lt;&#x2F;p&gt;
&lt;p&gt;Most strategy I encounter is built this way. A quiet game of Chinese whispers, played across enough hand-offs that the original signal barely survives. It is one reason &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;so much AI transformation turns into theatre&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-genuine-insight-actually-requires&quot;&gt;What genuine insight actually requires&lt;&#x2F;h2&gt;
&lt;p&gt;So what does real insight look like? In my experience it needs four things together: real domain expertise, hands-on use of the frontier tools, a forecaster&#x27;s discipline about probability and evidence, and the willingness to change your mind quickly when the facts move.&lt;&#x2F;p&gt;
&lt;p&gt;Miss any one and the picture distorts. Domain expertise without hands-on use gives you confident opinions about tools you have never touched. Hands-on use without discipline gives you hype. Discipline without the willingness to update gives you yesterday&#x27;s forecast defended to the death.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;even-the-models-cannot-replace-the-human&quot;&gt;Even the models cannot replace the human&lt;&#x2F;h2&gt;
&lt;p&gt;There is a detail here that surprised me. In recent forecasting benchmarks, the strongest models now edge out expert human forecasters. You might think that settles it, that we should simply ask the model. It does not. The sharpest forecasts come from a capable human working with a powerful model. Neither alone is best. The human supplies the intent, the taste, the judgement about what actually matters. The model supplies the breadth and the speed.&lt;&#x2F;p&gt;
&lt;p&gt;The same is true of understanding AI itself. The people who see clearest observe the frontier up close, yet refuse to take any single lab&#x27;s word as gospel.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-to-do-about-it&quot;&gt;What to do about it&lt;&#x2F;h2&gt;
&lt;p&gt;I am not saying every executive needs to write code. I am saying the further a decision sits from anyone who has genuinely built with these tools, the more carefully you should question it. Ask where the advice came from. Ask who in the chain has actually shipped something. Ask how many hands it passed through before it reached you.&lt;&#x2F;p&gt;
&lt;p&gt;When you think about your own AI strategy, can you name a single person in that chain who has actually built with these tools, rather than watched someone else demo them? Trace it back. What you find is usually the real story.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want AI strategy from someone who builds with these tools rather than summarising them, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;irreducible-human-edge&#x2F;&quot;&gt;The interesting question is what AI won&#x27;t automate&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;Why 80% of AI projects fail to deliver ROI&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>You&#x27;re not talking to an LLM. You&#x27;re talking to a system</title>
        <published>2026-05-23T00:00:00+00:00</published>
        <updated>2026-05-23T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/not-talking-to-an-llm/"/>
        <id>https://ctozen.com/insights/not-talking-to-an-llm/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/not-talking-to-an-llm/">&lt;p&gt;You&#x27;re not talking to an LLM. You&#x27;re talking to nine systems in a trench coat, and the model is only one of them.&lt;&#x2F;p&gt;
&lt;p&gt;When someone types a question into ChatGPT or Claude, they picture one clever brain in a box. A single mind reading their words and thinking back at them.&lt;&#x2F;p&gt;
&lt;p&gt;That is not what is happening. You are talking to a system, and the model is one small part of it. I know this because building the rest of the parts is my day job: the retrieval, the guardrails, the routing, the document handling. The model is the bit I worry about least.&lt;&#x2F;p&gt;
&lt;p&gt;Let me walk you through the coat.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-decides-whether-your-question-even-reaches-the-model&quot;&gt;What decides whether your question even reaches the model&lt;&#x2F;h2&gt;
&lt;p&gt;Two layers act before the model sees a single word. First, alignment guardrails: the system checks tone, safety, and factuality going in and coming back, and tries to catch confident nonsense before it reaches you. Second, routing. Some products quietly send simpler questions to a smaller, cheaper, weaker model to manage cost and latency. You never see the handover.&lt;&#x2F;p&gt;
&lt;p&gt;Caching sits alongside both. The same question asked twice should not cost twice, so the system reuses earlier work where it can.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-memory-is-a-trick&quot;&gt;The memory is a trick&lt;&#x2F;h2&gt;
&lt;p&gt;This is the part that surprises people most. The model&#x27;s weights do not change as you talk to it. It has no built-in memory of your last message, let alone last week. The continuity you feel is faked, and faked well, by separate tooling that stores your history, retrieves the relevant pieces, and injects them back into the conversation before the model ever sees them.&lt;&#x2F;p&gt;
&lt;p&gt;When a chat gets long, compaction takes over. The earlier parts are summarised and compressed so the whole thing still fits in the context window. You experience one smooth conversation. Underneath, it is being rewritten on the fly.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-happens-when-you-upload-a-document&quot;&gt;What happens when you upload a document&lt;&#x2F;h2&gt;
&lt;p&gt;Upload a PDF and more machinery wakes up. OCR reads the text off the page. Chunking breaks it into pieces. Vector search, which matches meaning rather than keywords, finds the few paragraphs that actually answer your question. The model never reads your whole file. It reads a curated handful of fragments that the surrounding system decided were relevant. Getting that retrieval right, rather than the model, is &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;what most production AI actually turns on&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;And the deliberate thinking you sometimes see, the model working through a problem step by step, is the system instructing it to show its reasoning rather than blurt the first answer that comes to mind.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;nine-systems-one-model&quot;&gt;Nine systems, one model&lt;&#x2F;h2&gt;
&lt;p&gt;Guardrails, routing, caching, memory, compaction, OCR, chunking, vector search, step-by-step reasoning. Count them. Nine. The model is one of them.&lt;&#x2F;p&gt;
&lt;p&gt;Now the honest punchline, the part vendors rarely say plainly. With all of that scaffolding, the model&#x27;s weights still do not change while you use it. Whatever it learns from your conversation happens later, in bulk, during training, if at all. The ideal we are all working towards is a model that genuinely learns from new data as it goes. That is not today. Today we fake learning with retrieval and careful plumbing, and most of the time that plumbing is what decides whether the product works.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-tenth-passenger-when-it-stops-answering-and-starts-doing&quot;&gt;The tenth passenger: when it stops answering and starts doing&lt;&#x2F;h2&gt;
&lt;p&gt;Everything so far describes what happens when you ask for an answer. Increasingly you are not asking for an answer. You are asking for an action: book it, file it, reconcile it, draft the reply and send it. That adds a tenth passenger to the coat, and it is the one growing fastest. The agentic harness.&lt;&#x2F;p&gt;
&lt;p&gt;You have probably met these already, even if you never named them as such. Anthropic&#x27;s Claude Code and Claude Cowork. OpenAI&#x27;s Codex. Google&#x27;s Antigravity and Gemini Spark. Amazon&#x27;s Quick Suite. Every one of them is a harness wrapped around a model, not a model. The race between the big labs is no longer only about whose model is smartest. It is increasingly about whose harness lets that model get real work done safely.&lt;&#x2F;p&gt;
&lt;p&gt;Here the model is handed a set of tools, things it can actually call: a search, a database query, a code runner, an email API. It does not execute them itself. It proposes an action, the harness runs it, feeds the result back, and the loop repeats until the job is done or the agent decides it is stuck. The model is the part that chooses what to do next. The harness is everything that lets it do anything at all, and everything that stops it doing the wrong thing.&lt;&#x2F;p&gt;
&lt;p&gt;That harness is where production agents are won or lost. It holds the tool definitions, the permissions for what the agent is allowed to touch, the budget limits, the verification step that checks the agent did what it claimed rather than trusting it, and the escalation path for the cases it should not handle alone. The actual agent loop is often a few dozen lines. The engineering lives in the scaffolding around it. &lt;a href=&quot;&#x2F;insights&#x2F;anthropic-agent-sdk-what-works&#x2F;&quot;&gt;I have written about what works here in more detail&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;Strip it back and the harness does four jobs the model cannot do for itself.&lt;&#x2F;p&gt;
&lt;p&gt;It manages &lt;strong&gt;processes&lt;&#x2F;strong&gt;: starting agents, running several at once, spawning a focused subagent for a sub-task so the main agent&#x27;s context stays clean, and stopping the ones that go wrong before they do damage.&lt;&#x2F;p&gt;
&lt;p&gt;It manages &lt;strong&gt;memory&lt;&#x2F;strong&gt;: not the model remembering, which it cannot, but the harness recording what the agent did and decided across steps and across runs, then feeding the relevant pieces back when they matter. The same retrieval trick from earlier, now in service of an agent rather than a conversation.&lt;&#x2F;p&gt;
&lt;p&gt;It manages &lt;strong&gt;workflows&lt;&#x2F;strong&gt;: the order steps run in, what must finish before the next begins, where a human signs off. The agent improvises within a structure the harness defines, rather than freelancing from a blank page.&lt;&#x2F;p&gt;
&lt;p&gt;And it manages &lt;strong&gt;skills&lt;&#x2F;strong&gt;: capabilities loaded on demand for the task in front of it, rather than carried all at once. The harness brings in the right skill at the right moment, which keeps the agent sharp instead of bloated.&lt;&#x2F;p&gt;
&lt;p&gt;Tools, processes, memory, workflows, skills, verification. The model supplies the judgement about what to do next. The harness supplies almost everything else.&lt;&#x2F;p&gt;
&lt;p&gt;So when a tool uses AI to do something rather than say something, the model is doing even less of the work, not more. The harness is doing the rest. And the same rule holds, harder: when an agent fails, the model is rarely the reason.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-for-the-decisions-you-make&quot;&gt;What this means for the decisions you make&lt;&#x2F;h2&gt;
&lt;p&gt;Why does this matter if you are not the one building it?&lt;&#x2F;p&gt;
&lt;p&gt;Because executives keep buying the whole system and then arguing only about the brain inside it. They debate which foundation model is smartest while the things that actually decide whether their AI works, the retrieval, the guardrails, the routing, the document handling, sit unexamined. They are paying for a system and inspecting one component.&lt;&#x2F;p&gt;
&lt;p&gt;Better AI decisions start with one plain question: which layer am I actually paying for, and which layer is failing me? When an AI product disappoints, the model is rarely the problem. The retrieval is feeding it the wrong fragments. The chunking mangled the document. The router quietly downgraded you. The guardrails are too tight, or too loose. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;67% autonomous case resolution we reached at a European insurance brokerage&lt;&#x2F;a&gt; came from getting those layers right, not from a cleverer model.&lt;&#x2F;p&gt;
&lt;p&gt;The brain in the box is real, and it is remarkable. But it has never been alone in there. The next time a demo dazzles you, or a tool frustrates you, resist the urge to credit or blame the model. Ask what the rest of the coat is doing.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to work out which layer of your AI stack is actually failing you, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;anthropic-agent-sdk-what-works&#x2F;&quot;&gt;What two hours with Anthropic&#x27;s agent team taught me&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;Why 80% of AI projects fail to deliver ROI&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Agent sprawl is the new shadow IT. Your business needs a control plane</title>
        <published>2026-05-16T00:00:00+00:00</published>
        <updated>2026-05-16T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/agent-control-plane/"/>
        <id>https://ctozen.com/insights/agent-control-plane/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/agent-control-plane/">&lt;p&gt;How many AI agents are running in your business right now?&lt;&#x2F;p&gt;
&lt;p&gt;If you can&#x27;t answer that question with a number, you have a problem. And it&#x27;s the same problem that cloud created in 2014 when every team started spinning up AWS accounts on a credit card, and the CFO discovered six months later that the company was running 400 services across nine regions with no inventory, no budgets, and no kill switch.&lt;&#x2F;p&gt;
&lt;p&gt;We are about to repeat that mistake at speed. The 2026 version is agent sprawl.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-new-shadow-it&quot;&gt;The new shadow IT&lt;&#x2F;h2&gt;
&lt;p&gt;I wrote a few months ago about &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;shadow AI&lt;&#x2F;a&gt;, the problem of employees using ChatGPT and Claude on personal accounts to process company data. That problem hasn&#x27;t gone away. But there&#x27;s now a second, more serious version of it: the agents your organisation has actually deployed.&lt;&#x2F;p&gt;
&lt;p&gt;In the last 30 days, Microsoft, Google, Salesforce and IBM all shipped what they&#x27;re calling &lt;strong&gt;agent management platforms&lt;&#x2F;strong&gt;. Microsoft Agent 365 is positioned as a control plane to observe, govern, and secure agents, including discovery of &quot;shadow AI,&quot; agent credentials, permissions, and registry sync with AWS Bedrock and Google Cloud. Google&#x27;s Gemini Enterprise Agent Platform launched with Agent Identity, an Agent Gateway, Agent Observability, simulation, evaluation, and prompt-injection defences. Salesforce shipped Agent Fabric for trusted agent identity, model choice, deterministic handoffs, and agent scanning. IBM positioned watsonx Orchestrate as a central orchestration layer for agents, tools, and workflows with governance and auditability built in.&lt;&#x2F;p&gt;
&lt;p&gt;Four of the largest enterprise software companies in the world, all shipping the same product category in the same month. That&#x27;s not a coincidence. That&#x27;s the shape of the next problem.&lt;&#x2F;p&gt;
&lt;p&gt;The early signal that this is becoming procurement-grade is the rise of two open standards: Anthropic&#x27;s Model Context Protocol (MCP) and the emerging Agent-to-Agent (A2A) work championed by Google. These are turning into the default way agents discover and call tools across vendors. Within twelve months I expect to see RFPs from regulated industries that list &quot;MCP support&quot; and &quot;agent identity controls&quot; the way they list SSO and SCIM today.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-an-agent-control-plane-actually-does&quot;&gt;What an agent control plane actually does&lt;&#x2F;h2&gt;
&lt;p&gt;When you deploy agents at scale, you face five questions that no spreadsheet or wiki can answer for long:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Who built this agent?&lt;&#x2F;strong&gt; Inventory. Owner. Purpose. Date deployed. Last reviewed.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What can it access?&lt;&#x2F;strong&gt; Data sources, customer records, internal documents, third-party APIs. Permissions need to be inherited from human identity, not assumed.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What can it do?&lt;&#x2F;strong&gt; Which tools it can call. Which actions are reversible and which are not. Which require human approval.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What budget can it spend?&lt;&#x2F;strong&gt; Token limits per task, per day, per month. Hard caps before the bill arrives, not after.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What did it do?&lt;&#x2F;strong&gt; A decision log. Which prompt, which tools, which outputs, which approvals. Reproducible, auditable, exportable.&lt;&#x2F;p&gt;
&lt;p&gt;These are the same five questions IAM answers for human identity, FinOps answers for cloud cost, and observability answers for production systems. The reason every major vendor is building this layer now is that agents collapse all three problems into one new stack.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;An agent control plane is what IAM, FinOps, and APM became when they merged.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Lakhani, Spataro and Stave framed this shift in Harvard Business Review more directly than most vendors will: &quot;agents should be treated as a managed workforce rather than a collection of software scripts.&quot; Once you accept that framing, the questions get easier. You already know how to onboard, evaluate, monitor, and offboard a workforce. The new task is doing it for software.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-reality-check-on-urgency&quot;&gt;The reality check on urgency&lt;&#x2F;h2&gt;
&lt;p&gt;The situation is less urgent than the vendor messaging suggests. Gartner&#x27;s April Hype Cycle placed agentic AI at the Peak of Inflated Expectations and noted that only 17% of organisations had actually deployed AI agents in production. More than 60% expect to within two years, but expectation and reality are different animals.&lt;&#x2F;p&gt;
&lt;p&gt;The point is not that you need an agent control plane today. The point is that the window between &quot;we don&#x27;t have agents&quot; and &quot;we have agents we can&#x27;t account for&quot; is going to be about six months. That&#x27;s how long it took most companies to move from &quot;we have one AWS account&quot; to &quot;we have no idea how many AWS accounts we have.&quot; Agents will move faster because they&#x27;re easier to spin up and harder to see.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-matters-now&quot;&gt;Why this matters now&lt;&#x2F;h2&gt;
&lt;p&gt;There&#x27;s a specific moment where this becomes urgent, and most companies miss it.&lt;&#x2F;p&gt;
&lt;p&gt;It&#x27;s not when you deploy your first agent. It&#x27;s when your &lt;em&gt;third&lt;&#x2F;em&gt; department deploys an agent without telling the second one. Customer support has a triage agent. Legal has a contract review agent. Finance has a reconciliation agent. They were built by three different teams, with three different vendors, on three different model providers, with three different definitions of &quot;approved data access.&quot; Nobody owns the cross-cutting view.&lt;&#x2F;p&gt;
&lt;p&gt;I have seen this play out twice already, on smaller scales. In one business, a customer success team had built a perfectly reasonable agent to summarise account notes before renewal calls. Around the same time, a separate operations team had built an agent that wrote into the same CRM records. Neither team knew about the other. When the renewal agent&#x27;s summary started including content the operations agent had written automatically, the business spent a week working out whether the resulting renewal note had been generated, edited, or invented. The answer existed somewhere in the logs. Nobody owned the logs.&lt;&#x2F;p&gt;
&lt;p&gt;That moment usually arrives between deployment numbers four and eight. After that, retrofitting governance is roughly twice as expensive as building it in. I&#x27;ve watched the same cost curve in cloud, in containers, in microservices, and in data pipelines. It&#x27;s identical every time.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-do-in-your-position&quot;&gt;What I&#x27;d do in your position&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;re the CTO, COO, or head of operations at a company that has deployed more than one agent, or is about to, I&#x27;d do four things in the next quarter:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Build the inventory first.&lt;&#x2F;strong&gt; Not the platform, the inventory. A simple register of every agent in production or in pilot: owner, purpose, data access, model, monthly cost. You will be surprised by what you find. Most leaders are.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Define your agent development lifecycle.&lt;&#x2F;strong&gt; Treat agents like production systems, because they are. Requirements, tool contracts, data contracts, evaluations, red-team tests, permission reviews, simulation, rollout, monitoring, rollback, incident management, audit. Most of these already exist for your software. You need to extend them, not invent them.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Pick the control plane after you understand the requirements.&lt;&#x2F;strong&gt; The vendor space is moving weekly. Microsoft, Google, Salesforce, IBM, and a dozen smaller players will all claim to solve this. The right answer depends on where your data already lives and which agents you already run. Don&#x27;t buy the platform first and shape your operating model around the vendor.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Test the kill switch.&lt;&#x2F;strong&gt; Every agent in production needs an off button. Not a &quot;we&#x27;ll figure it out&quot; off button. A documented, owner-on-call, &lt;em&gt;actually-tested&lt;&#x2F;em&gt; off button. Pick a Tuesday morning and run a drill: shut the agent down, observe what breaks downstream, time how long it takes to recover. This is the single fastest way to expose which agents are genuinely understood by their owners and which are running on hope.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-board-level-reframe&quot;&gt;The board-level reframe&lt;&#x2F;h2&gt;
&lt;p&gt;For boards and PE-backed leadership teams, the simplest reframe is this: agents are software that takes actions on your behalf, using your data, spending your money, and creating audit trails you may not own. The governance you would demand for any other system that did all four of those things should apply here.&lt;&#x2F;p&gt;
&lt;p&gt;This is not a reason to slow down agent adoption. It&#x27;s a reason to build the inventory and the lifecycle now, while you have three agents instead of thirty. The companies that lead on this in 2026 will look like the companies that built proper cloud governance in 2015. The ones that don&#x27;t will look like the ones still cleaning up cloud waste a decade later.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to talk through what an agent inventory or lifecycle would look like in your business, &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;get in touch&lt;&#x2F;a&gt;. I&#x27;ve built governance frameworks alongside production AI systems at &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;NHS Wales&lt;&#x2F;a&gt; and in &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;regulated insurance&lt;&#x2F;a&gt;, where the audit bar is high and the cost of getting it wrong is higher.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-board-accountability&#x2F;&quot;&gt;Who is accountable for AI on your board?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;Shadow AI is your next audit finding&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026. What actually works&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#transformation-sprint&quot;&gt;The AI Transformation Sprint&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Headless is the new mobile-first. Agents are about to become your biggest user</title>
        <published>2026-05-09T00:00:00+00:00</published>
        <updated>2026-05-09T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/headless-for-agents/"/>
        <id>https://ctozen.com/insights/headless-for-agents/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/headless-for-agents/">&lt;p&gt;Here is a question worth asking your product team this week. What percentage of your software&#x27;s use, twelve months from now, will involve a human looking at a screen?&lt;&#x2F;p&gt;
&lt;p&gt;If the honest answer is anything less than 80%, your product has the wrong shape.&lt;&#x2F;p&gt;
&lt;p&gt;In April, four companies that sit at very different parts of the software stack said the same thing. Box CEO Aaron Levie posted that as agents become the biggest users of software, software has to be available in headless form because agents will talk to APIs, not click through UIs. Zapier CEO Wade Foster told an Axios audience that agents are likely to become the predominant users of software. Salesforce shipped &quot;Headless 360,&quot; with cofounder Parker Harris framing the provocation directly: why should a user need to log into Salesforce at all when an agent can act through APIs and tools? Stripe, Mastercard and OpenAI quietly built the rails for agents to make purchases on a user&#x27;s behalf.&lt;&#x2F;p&gt;
&lt;p&gt;This is not a niche prediction. &lt;strong&gt;The next generation of users is software, not people, and the software you built for clicks is now being asked to serve callers.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-the-wrong-shape-actually-means&quot;&gt;What &quot;the wrong shape&quot; actually means&lt;&#x2F;h2&gt;
&lt;p&gt;For the last fifteen years, most B2B products have been built around the same loop: a user logs in, navigates a UI, takes an action, sees a confirmation. Roadmaps optimised that loop. Designers shortened the click path. Product managers measured engagement, session length, retention.&lt;&#x2F;p&gt;
&lt;p&gt;Agents do not log in. They do not navigate. They do not see the confirmation toast. They call an endpoint, pass parameters, parse the response, and move on. Most products, including most of the SaaS your business pays for today, were not designed for that. The API exists for integrations, but it covers maybe 30% of what the UI can do. The auth model assumes a human at a keyboard. The error messages are written for users, not callers. The pricing model charges for seats that nobody is sitting in.&lt;&#x2F;p&gt;
&lt;p&gt;I learned how big this gap is the hard way. A few months ago I wired an agent into a well-known mid-market SaaS product on behalf of a client. The marketing pages promised a &quot;complete API.&quot; The reality was that two of the four actions we needed were UI-only. The vendor&#x27;s answer was to use browser automation. That is the 2026 equivalent of telling someone to fax their tax return. The product wasn&#x27;t bad. It was simply built for a world where the user had hands.&lt;&#x2F;p&gt;
&lt;p&gt;Salesforce&#x27;s Headless 360 announcement is the clearest signal of where this is going. More than 60 MCP tools and 30+ coding skills exposed for Claude Code, Cursor, Codex, Windsurf, and others. The promise is that anything you can do in the UI, you can do through an API, an MCP call, or a CLI command. Salesforce is the largest enterprise software company in the world. If they are doing this, every serious B2B platform will follow within eighteen months.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-three-layers-your-product-now-needs&quot;&gt;The three layers your product now needs&lt;&#x2F;h2&gt;
&lt;p&gt;The shift is not &quot;add an API.&quot; Most products already have one. The shift is that you now need to design for three different consumers of your product, and treat all three as first-class.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The human layer.&lt;&#x2F;strong&gt; The traditional UI. Still essential, still where decisions get made and judgment lives. But it is no longer the only interface, and probably not the dominant one for routine work.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The agent layer.&lt;&#x2F;strong&gt; A complete, agent-addressable surface. Every meaningful action in your product needs to be reachable through an API, MCP tool, or CLI command. Schemas need to be machine-readable. Errors need to be parseable. Permissions need to be inherited from the human the agent is acting for. This is where Salesforce, Microsoft, and a growing number of vertical SaaS players are investing right now.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The governance layer.&lt;&#x2F;strong&gt; This is the one most teams haven&#x27;t started thinking about, and it&#x27;s the most interesting. When an agent is about to act on a user&#x27;s behalf, the user needs a preview of what it plans to do, a confirmation flow for irreversible actions, and a clear log of what happened. The governance layer is the human in the loop, surfaced as a product feature, not buried in admin settings.&lt;&#x2F;p&gt;
&lt;p&gt;Three layers. One product. Most teams are still building one.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-pricing-question-this-forces&quot;&gt;The pricing question this forces&lt;&#x2F;h2&gt;
&lt;p&gt;If the agent becomes a user, what do you charge? Per-seat pricing assumed a human consumed the licence. An agent might do the work of fifty seats while occupying zero. Microsoft has already started shifting Dynamics customers to a mix of seats plus consumption credits, with around 60% of Dynamics service customers now buying usage-based credits. Microsoft also moved GitHub Copilot to usage-based pricing, explicitly aligning what they charge with what the customer consumes.&lt;&#x2F;p&gt;
&lt;p&gt;This is not the death of seat-based pricing. It&#x27;s the start of hybrid pricing, where seats cover platform access and consumption covers agent-driven work. The hard part, and where I&#x27;d be spending CFO time right now if I were on your board, is protecting gross margin when agent usage can spike unpredictably. SaaS economics assumed flat-rate consumption per seat. Agents break that assumption.&lt;&#x2F;p&gt;
&lt;p&gt;The practical answer is simple to state and hard to implement. If you don&#x27;t have a unit of work in your product that you can measure and charge for (a resolution, a workflow completed, a document processed, a lead qualified), build one before your competitors do. Once your competitor is charging per outcome and you are charging per seat, the comparison gets very uncomfortable very quickly.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-mobile-first-analogy-is-exact&quot;&gt;The mobile-first analogy is exact&lt;&#x2F;h2&gt;
&lt;p&gt;In 2010, every software company had a website that worked beautifully on a 1280-pixel monitor and barely worked on a phone. Then mobile traffic crossed a threshold, and the companies that hadn&#x27;t rebuilt for touch, latency, and a smaller viewport spent the next five years losing ground to ones that had. The phrase &quot;mobile-first&quot; became shorthand for &quot;rebuild your product around how people actually use it.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;We are about to repeat that, with agents. The companies that get there first will look the way responsive-design pioneers looked in 2013. The companies that wait will rebuild under duress with their customers already comparing them to a faster competitor.&lt;&#x2F;p&gt;
&lt;p&gt;The good news is that the work is more tractable than mobile was. You don&#x27;t need to redesign for a new viewport. You need to expose what already exists, structure it for machine readers, and add a governance layer most users will value even when they&#x27;re using your product directly.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-do-in-your-position&quot;&gt;What I&#x27;d do in your position&lt;&#x2F;h2&gt;
&lt;p&gt;If you run a B2B product, I&#x27;d put four things on the next quarterly roadmap:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Audit your API coverage.&lt;&#x2F;strong&gt; Map every meaningful user action in your product. For each one, mark whether it is reachable through your API. The honest number is rarely above 40% for products older than five years. That gap is your roadmap.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Build an MCP surface.&lt;&#x2F;strong&gt; Don&#x27;t rebuild your API, wrap it. MCP is the emerging standard for how agents discover and use tools, and the integration cost is low compared to the strategic optionality it buys you. Start with the five actions agents would most plausibly want to take in your product.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Design the preview and approval flows.&lt;&#x2F;strong&gt; What does it look like when an agent is about to act inside your product? Where does the user see it? How do they cancel? How do they review what happened? This is product design work, and it is currently being skipped by every team I look at.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Pick a unit of work to charge for.&lt;&#x2F;strong&gt; Resolutions, workflows completed, documents processed, leads qualified, transactions executed. Agents make per-seat pricing leak value. The companies that figure out the right unit early will protect margin. The ones that don&#x27;t will find themselves negotiating discounts against unbounded usage.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-board-reframe&quot;&gt;The board reframe&lt;&#x2F;h2&gt;
&lt;p&gt;For founders and operators reading this, the simplest reframe is: your competitive moat for the next decade depends on how many of your users are software. If the answer is &quot;almost none,&quot; your product is being designed for a market that is shrinking relative to the one being built for agents. If the answer is &quot;more than we expected,&quot; you are early to a shift that most of your competitors are still treating as marketing language.&lt;&#x2F;p&gt;
&lt;p&gt;Headless software is not a feature. It is a posture toward where computing is going. The shape of your product is the most strategic decision you&#x27;ll make this year, and the window to make it is shorter than the time it took most of you to ship a mobile app.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to talk through what an agent-addressable product surface would look like for your business, &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;get in touch&lt;&#x2F;a&gt;. I&#x27;ve spent the last decade building production AI systems and the platforms underneath them, including the &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;agentic insurance automation work&lt;&#x2F;a&gt; where the customer&#x27;s agent had to call our system, not the other way around. The patterns travel.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;agent-control-plane&#x2F;&quot;&gt;Agent sprawl is the new shadow IT&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026. What actually works&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;The orchestrator, not the 10x engineer&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Uber burned through its token budget by April. Your business will be next</title>
        <published>2026-05-02T00:00:00+00:00</published>
        <updated>2026-05-02T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/token-budgets-finops/"/>
        <id>https://ctozen.com/insights/token-budgets-finops/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/token-budgets-finops/">&lt;p&gt;In an April interview with The Verge, Uber&#x27;s CEO Dara Khosrowshahi admitted, almost as an aside, that the company&#x27;s CTO had told him they had burned through their token budget by early April. The interview was about hiring, software-team structure, and how AI was reshaping the relationship between product managers, designers, and engineers. The token budget line was a throwaway. It shouldn&#x27;t have been. It was the most important sentence in the conversation.&lt;&#x2F;p&gt;
&lt;p&gt;Uber is not a company that runs out of money in April by accident. They have one of the most sophisticated cloud finance functions in tech. If their AI spend caught them by surprise, yours will too.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Token spend is becoming a P&amp;amp;L conversation, and most organisations are still treating it like an experiments line.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-cloud-finops-curve-replayed-at-speed&quot;&gt;The cloud FinOps curve, replayed at speed&lt;&#x2F;h2&gt;
&lt;p&gt;I have watched this exact pattern before. In 2014, every CFO I worked with was discovering that &quot;cloud is cheaper than on-prem&quot; was only true for a few specific workloads. The bill kept growing. Engineering teams spun up environments and forgot about them. Data egress charges arrived without warning. Reserved instances were bought and never used. The discipline we now call FinOps emerged because the pain became too obvious to ignore.&lt;&#x2F;p&gt;
&lt;p&gt;That curve took the industry roughly a decade to flatten. Cloud Foundation, AWS Cost Explorer, GCP billing alerts, Spot.io, CloudHealth, the FinOps Foundation, dedicated FinOps engineers in every serious cloud-native business. Ten years of work to turn cloud spend from &quot;scary line item&quot; to &quot;managed discipline.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;Token spend will not get a decade. It&#x27;s already moving faster, for three reasons.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Spend is far less predictable.&lt;&#x2F;strong&gt; A single coding agent on a long-running task can consume more tokens in an afternoon than a customer-support workflow uses in a month. A poorly tuned prompt can ten-x cost overnight. The variance between similar tasks is wild compared to the variance between similar EC2 instances.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Model choice changes economics by orders of magnitude.&lt;&#x2F;strong&gt; Routing a task to the wrong model can be 30x more expensive than routing it correctly. Most teams have no routing strategy. They picked one model in a proof-of-concept and never revisited the decision.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Caching, batching, and prompt design are not yet boardroom vocabulary.&lt;&#x2F;strong&gt; A team that knows how to use prompt caching, batch APIs, and disciplined prompt design will spend a fraction of what an undisciplined team spends for the same outcome. This is a skill that exists in maybe 5% of engineering organisations today.&lt;&#x2F;p&gt;
&lt;p&gt;Anthropic&#x27;s own positioning of Claude Code for Enterprise tells you where the market is going. The product page emphasises contribution metrics, token usage, cost monitoring through OpenTelemetry, centrally managed permissions, file-access restrictions, and per-team observability. Those are not features for hobbyists. They are features for organisations that are about to learn what their AI spend really is.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;workflows-are-cheaper-than-agents&quot;&gt;Workflows are cheaper than agents&lt;&#x2F;h2&gt;
&lt;p&gt;There is a specific insight from Zapier&#x27;s Wade Foster that more leaders should hear. He has been saying for months that agents are less reliable and more expensive than deterministic workflows, so leaders should use workflows where reliability and cost matter, and reserve agents for ambiguity, recovery, and creation.&lt;&#x2F;p&gt;
&lt;p&gt;This is the most important practical FinOps lever your engineering organisation has, and almost nobody is using it.&lt;&#x2F;p&gt;
&lt;p&gt;Most teams default to &quot;build an agent&quot; when the actual problem is &quot;automate a known sequence of steps.&quot; A deterministic workflow runs in milliseconds at near-zero variable cost. An agent runs in seconds, sometimes minutes, at meaningful per-task cost, with non-zero failure rates and the occasional spectacular blowup. The cost difference between these two approaches, applied across an enterprise, is the difference between AI being a margin builder and AI being the new shadow line item that ate the budget.&lt;&#x2F;p&gt;
&lt;p&gt;I have built both. On one recent engagement, the team had built an agent to triage incoming documents, classify them, and route them. Six months in, it was costing roughly fifty times what the same triage would have cost as a deterministic pipeline using a small classifier model and a few rules. The agent was more flexible, yes. The flexibility was being used about 4% of the time. The other 96% was a known sequence of steps being run through the most expensive possible machine.&lt;&#x2F;p&gt;
&lt;p&gt;The right architecture is almost always &lt;strong&gt;workflows where the path is known, agents where it isn&#x27;t.&lt;&#x2F;strong&gt; Most teams are doing the opposite because agents are newer and more interesting to build.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-a-cfo-actually-needs-by-q3&quot;&gt;What a CFO actually needs by Q3&lt;&#x2F;h2&gt;
&lt;p&gt;If you are the CFO, COO, or finance lead of an organisation that runs more than a handful of AI use cases in production, there is a specific operating discipline you need before the end of this quarter. I&#x27;d put four things in place.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Per-team token budgets, with hard caps.&lt;&#x2F;strong&gt; Same model as cloud spend. Each team gets a monthly allowance, denominated in dollars or tokens, with alerts at 50%, 80%, and 100%. The cap is enforced at the API layer, not in a wiki. If a team needs more, they ask. This is the single highest-impact move you can make this year.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Unit economics per workflow.&lt;&#x2F;strong&gt; For every AI-driven workflow in production, you should know its cost per execution. Cost per resolved support ticket. Cost per generated report. Cost per closed code review. Without this number, you cannot make any honest decision about whether the AI workflow is profitable. With it, you can compare variants, justify investment, and spot regressions.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;A model routing policy.&lt;&#x2F;strong&gt; Most organisations are still defaulting to the most capable model for every task. That is genuinely expensive. A simple extraction task can cost 30x more on a frontier model than on a small one fine-tuned for the job. A routing layer that sends simple tasks to smaller models, harder tasks to larger ones, and routine extractions to cached responses can cut spend by 50% to 80% with no perceptible quality loss. The engineering effort is small. The financial impact is enormous.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Observability that includes cost, not just performance.&lt;&#x2F;strong&gt; OpenTelemetry, Honeycomb, Datadog and most major observability platforms now expose AI cost telemetry. If your engineering teams can see latency and error rate but not cost-per-call, they cannot optimise the third dimension. Show them the bill.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-board-reframe&quot;&gt;The board reframe&lt;&#x2F;h2&gt;
&lt;p&gt;For boards and investors reading this, the simplest question to ask the next executive who presents an AI strategy is: &quot;What is your monthly token spend, and what controls do you have on it?&quot; If the answer is &quot;we&#x27;re tracking it,&quot; the answer is no.&lt;&#x2F;p&gt;
&lt;p&gt;Token spend will follow the cloud spend curve, except the slope is steeper, the volatility is higher, and the lessons that took a decade to learn the first time are available for free in the second. There is no reason to repeat the mistake at a faster cadence. The companies that build AI FinOps discipline in 2026 will look like the companies that built cloud FinOps in 2018. The ones that don&#x27;t will look like the ones still arguing over reserved instance utilisation in 2024.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to talk through what an AI FinOps function would look like in your business, &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;get in touch&lt;&#x2F;a&gt;. I&#x27;ve sat on both sides of this conversation. Engineering teams that built it well and finance teams that wished engineering had built it sooner.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;Why most AI spend doesn&#x27;t show up in the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agent-control-plane&#x2F;&quot;&gt;Agent sprawl is the new shadow IT&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Stop counting AI use cases. Redesign three value streams instead</title>
        <published>2026-04-25T00:00:00+00:00</published>
        <updated>2026-04-25T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/value-streams-not-use-cases/"/>
        <id>https://ctozen.com/insights/value-streams-not-use-cases/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/value-streams-not-use-cases/">&lt;p&gt;I sat through an AI strategy review with a leadership team last month where the head of transformation opened the deck with a slide titled &quot;AI Use Cases In Production.&quot; There were 47 of them. Customer service summarisation. Sales call coaching. Marketing copy generation. Code completion. Document classification. Meeting minutes. Onboarding assistants. The list filled the screen in 9-point font.&lt;&#x2F;p&gt;
&lt;p&gt;I asked one question. &quot;What&#x27;s the EBIT impact?&quot;&lt;&#x2F;p&gt;
&lt;p&gt;The room went quiet, because the answer was: nobody had computed it. The use cases existed. The value didn&#x27;t. And yet the value was right there to be unlocked, sitting in the same processes, waiting for someone to redesign the stream instead of decorating its steps.&lt;&#x2F;p&gt;
&lt;p&gt;This is the new shape of AI theatre. I called the previous shape &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;performance art&lt;&#x2F;a&gt; three months ago. That piece was about pilots that never reach production. This one is about pilots that &lt;em&gt;do&lt;&#x2F;em&gt; reach production and still produce nothing on the P&amp;amp;L.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The trap has moved from pilot purgatory to use-case purgatory. Counting deployed assistants is the new measuring of pilot velocity.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-use-case-counting-persists&quot;&gt;Why use-case counting persists&lt;&#x2F;h2&gt;
&lt;p&gt;I understand why this keeps happening. Use cases are easy to count. They are easy to put on a board slide. They give a head of transformation something concrete to point at when the CEO asks &quot;what are we doing with AI?&quot; Forty-seven is more impressive than four. Activity is easier to measure than outcome.&lt;&#x2F;p&gt;
&lt;p&gt;There&#x27;s also a procurement reason. Most enterprise AI platforms are sold on use-case breadth. Vendors love to show a tile-grid of pre-built assistants because it makes the platform look comprehensive. The customer side answers in kind by deploying as many tiles as possible. Both sides are optimising for the wrong thing.&lt;&#x2F;p&gt;
&lt;p&gt;The result is what I keep seeing in mid-market and PE-backed businesses: a portfolio of small productivity improvements, none of which is large enough to move a business metric, all of which add to the AI budget. Twelve months in, the conversation with the CFO becomes uncomfortable. The honest answer is &quot;I cannot draw a line from any of these to revenue, cost, or risk.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;Lakhani, Spataro and Stave coined a useful phrase for this in a March 2026 Harvard Business Review piece. They called it being &lt;strong&gt;&quot;pilot-rich but transformation-poor.&quot;&lt;&#x2F;strong&gt; That is exactly the shape of the problem. Hundreds of working pilots, none of which has changed how the business operates.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-krishna-reframe&quot;&gt;The Krishna reframe&lt;&#x2F;h2&gt;
&lt;p&gt;At IBM Think 2026 earlier this month, Arvind Krishna put the alternative more directly than I have heard a major-vendor CEO put it: the enterprises pulling ahead are not deploying more AI, they are redesigning how the business operates. IBM positioned the AI operating model around agents, data, automation, and hybrid governance, and framed the shift as &quot;from improving parts of the business to changing how the business operates.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;That is not the same statement as &quot;deploy more AI.&quot; It is the opposite statement. ServiceNow&#x27;s Bill McDermott has been making the same point in different words. AI value comes from auditable execution across workflows, data, policy, CRM, HR, security, and IT, not from disconnected assistants sitting alongside human work.&lt;&#x2F;p&gt;
&lt;p&gt;The HBR authors put the same point in a single line that should be printed on the wall of every transformation office: when you drop AI into one step of a process you have not redesigned, &lt;strong&gt;&quot;the bottleneck simply shifts.&quot;&lt;&#x2F;strong&gt; Their example is an agent that drafts a complex contract in seconds, only to have the contract sit in a manual legal review queue for two weeks. The pilot worked. The cycle time did not change. The business did not get faster.&lt;&#x2F;p&gt;
&lt;p&gt;The point is that AI&#x27;s economic impact does not live in the assistant in the middle of a process. It lives in the redesign of the process itself.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-value-stream-actually-means&quot;&gt;What &quot;value stream&quot; actually means&lt;&#x2F;h2&gt;
&lt;p&gt;A value stream is the end-to-end sequence of steps that delivers a specific outcome the business cares about. It usually crosses multiple teams, multiple systems, and multiple decision points. The interesting metric for a value stream is throughput, cycle time, error rate, cost per unit, or customer impact, not the number of tools used inside it.&lt;&#x2F;p&gt;
&lt;p&gt;Examples of value streams worth redesigning:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Customer support resolution.&lt;&#x2F;strong&gt; From inbound contact to closed ticket. Measured in time to first response, time to resolution, customer effort score, cost per resolution.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Sales lead to revenue.&lt;&#x2F;strong&gt; From inbound lead to closed deal. Measured in conversion rate at each stage, sales cycle time, cost of customer acquisition.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Month-end financial close.&lt;&#x2F;strong&gt; From period end to signed-off financial statements. Measured in days to close, manual adjustments, audit findings.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Underwriting or claims processing.&lt;&#x2F;strong&gt; From application to decision. Measured in straight-through processing rate, cycle time, loss ratio.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Engineering migration.&lt;&#x2F;strong&gt; From legacy system to modern stack. Measured in throughput, defect escape rate, cost per migrated component.&lt;&#x2F;p&gt;
&lt;p&gt;Each of these is a candidate for end-to-end redesign with AI in the middle, agents at decision points, workflows at routine steps, and human judgment at exceptions. The metric for success is not &quot;we deployed an AI assistant in step 4.&quot; It is &quot;this value stream now operates at half the cycle time and a third of the cost.&quot;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-microsoft-data-the-room-missed&quot;&gt;The Microsoft data the room missed&lt;&#x2F;h2&gt;
&lt;p&gt;Microsoft&#x27;s 5 May Work Trend Index reported a finding that should be on every transformation lead&#x27;s desk. Organisational factors, including culture, manager support, and talent systems, accounted for &lt;strong&gt;more than twice the perceived AI impact&lt;&#x2F;strong&gt; of individual factors. The same Copilot deployment, in two organisations, produces wildly different outcomes depending on how the work itself is structured.&lt;&#x2F;p&gt;
&lt;p&gt;This is why use-case counting fails. You can deploy an excellent meeting summary assistant in two companies. In the first, where meetings are already overprescribed and follow-through is weak, the assistant generates more meeting noise. In the second, where meetings are tightly run and decisions are tracked, the assistant compresses cycle time. Same tool, opposite outcomes, because the surrounding operating model is different.&lt;&#x2F;p&gt;
&lt;p&gt;Use cases live in tools. Value lives in operating models. &lt;strong&gt;You cannot use-case your way to a redesign.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-to-pick-three-value-streams&quot;&gt;How to pick three value streams&lt;&#x2F;h2&gt;
&lt;p&gt;If you are leading AI strategy at a mid-market or PE-backed business, I&#x27;d kill the use-case inventory and run a different exercise. Take it to your leadership team in the next month.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Map your top eight value streams.&lt;&#x2F;strong&gt; Not eighty. Eight. The ones that, if you compressed cycle time by 30% or reduced unit cost by 20%, would move the business in a way the board would notice.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Pick three.&lt;&#x2F;strong&gt; Use three criteria: measurable economic impact, achievable within 6 to 12 months, and a sponsor with real authority. The third criterion is the one most exercises skip. A value stream without an empowered owner becomes another deck.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Commit to redesign, not augmentation.&lt;&#x2F;strong&gt; The temptation is to bolt AI onto the existing process. Don&#x27;t. Map the current process, identify which steps can be eliminated, which can be automated end-to-end, which need a human decision, and which can be combined. Then design the future state, &lt;em&gt;with AI in the middle&lt;&#x2F;em&gt;, and migrate.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Measure outcomes, not adoption.&lt;&#x2F;strong&gt; The metrics are cycle time, unit cost, throughput, quality, customer satisfaction. Not seats licensed. Not assistants deployed. Not prompts entered. Hold the leadership team to those metrics quarterly.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Tell the rest of the business to wait.&lt;&#x2F;strong&gt; This is the hardest part. Every other team will want their own AI use case. The answer for the next year is &quot;we are concentrating effort on three streams, we will share what we learn.&quot; A focused redesign of three value streams will produce more EBIT impact than fifty deployed assistants. I have watched this in both directions.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-board-reframe&quot;&gt;The board reframe&lt;&#x2F;h2&gt;
&lt;p&gt;For boards and investors reading this, the simplest test of an AI strategy is: name the three value streams you are redesigning, and the metrics you&#x27;ll be measured against in 12 months. If the answer is a list of deployed tools, you don&#x27;t have a strategy. You have a procurement plan with AI in the title.&lt;&#x2F;p&gt;
&lt;p&gt;The companies pulling ahead in 2026 are not the ones with the longest use-case inventory. They are the ones who picked three things and rebuilt them properly. The operating model is the moat. The use cases are the marketing.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to talk through which value streams in your business would benefit most from end-to-end redesign, &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;get in touch&lt;&#x2F;a&gt;. I&#x27;ve helped leadership teams pick the right three more than once, and it is almost always not the three they would have chosen alone.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;dual-stream-strategy&#x2F;&quot;&gt;The dual-stream strategy&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>75% of Google&#x27;s new code is AI-generated. So what?</title>
        <published>2026-04-18T00:00:00+00:00</published>
        <updated>2026-04-18T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-generated-code-vanity-metric/"/>
        <id>https://ctozen.com/insights/ai-generated-code-vanity-metric/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-generated-code-vanity-metric/">&lt;p&gt;At Google Cloud Next 2026, Sundar Pichai announced that 75% of Google&#x27;s new code is now AI-generated and approved by engineers, up from 50% last autumn. The number was repeated by every AI newsletter, every industry analyst, and a quarter of the LinkedIn posts I saw that week.&lt;&#x2F;p&gt;
&lt;p&gt;Here is the inconvenient question almost nobody asked. &lt;strong&gt;So what?&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;What does that number actually tell you about Google&#x27;s engineering output? Does it ship more product? Does the product have fewer defects? Is the customer experience better? Did the engineering organisation get cheaper to operate, or more expensive? Is morale up or down? Are good engineers staying or leaving?&lt;&#x2F;p&gt;
&lt;p&gt;The 75% number does not answer any of those questions. It is a measurement of inputs, presented as if it were a measurement of outcomes. And it is about to become the most reported, least useful metric in enterprise engineering.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-metric-is-irresistible-to-executives&quot;&gt;Why this metric is irresistible to executives&lt;&#x2F;h2&gt;
&lt;p&gt;I understand the appeal. AI-generated code percentage is a single number that goes up over time, sounds impressive on an earnings call, and feels like progress. CFOs and boards have been waiting for someone to give them a quantified answer to &quot;is the AI investment working?&quot; Engineering leaders, sensing the pressure, are happy to provide one.&lt;&#x2F;p&gt;
&lt;p&gt;The trouble is that this number measures the activity, not the value. It is the engineering equivalent of measuring sales effectiveness by counting how many emails the team sent. There is a correlation, on average, between effort and outcome. There is not a useful one between AI-generated lines and shipped value.&lt;&#x2F;p&gt;
&lt;p&gt;I have already seen this metric inverted in real organisations. A team I worked with reported that 80% of their merged code was AI-generated, and used it to justify a larger AI tooling budget. When I looked at their actual delivery metrics, cycle time had increased. Defect escape rate had doubled. The team was generating more code, merging more PRs, and producing more bugs, all of which were now arriving faster. The percentage was up. The product was getting worse.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-s-actually-changing-in-the-sdlc&quot;&gt;What&#x27;s actually changing in the SDLC&lt;&#x2F;h2&gt;
&lt;p&gt;The interesting shift over the last year is not how much code is being written. It is how engineering work itself is being reshaped. Coding agents are taking over routine implementation, but they are also reshaping requirements, design, code review, testing, migration, deployment, and incident response. The whole software development lifecycle is being rewired, not just the typing.&lt;&#x2F;p&gt;
&lt;p&gt;OpenAI&#x27;s enterprise update said Codex hit 3 million weekly active users and that customers including GitHub, Nextdoor, Notion, and Wonderful are building multi-agent systems that execute engineering work end-to-end. Microsoft reported roughly 140,000 organisations using GitHub Copilot, with enterprise subscribers nearly tripling year on year. Anthropic positioned Claude Code for Enterprise as a coding agent that writes, debugs, refactors, creates tests, opens PRs, and works across terminal, IDE, Slack, and web, with enterprise controls including permissions, OpenTelemetry monitoring, token visibility, SSO, SCIM, and audit. Salesforce went further, exposing 60+ MCP tools and 30+ coding skills through Headless 360, claiming up to 40% cycle time reduction through their DevOps Center MCP.&lt;&#x2F;p&gt;
&lt;p&gt;If you read those announcements carefully, the language has shifted. Vendors are no longer selling &quot;AI that writes code faster.&quot; They are selling auditable agents that execute engineering work with controls. The metric the vendors themselves are now optimising for is not &quot;lines generated.&quot; It is &quot;engineering work completed safely.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;That is the metric the board should be asking for.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-five-questions-to-bring-to-your-next-board-meeting&quot;&gt;The five questions to bring to your next board meeting&lt;&#x2F;h2&gt;
&lt;p&gt;If you sit on a board, run a PE-backed business, or chair an audit committee where engineering reports up, these are the five questions to ask your CTO about AI in engineering. They cost nothing to ask. They reveal a great deal.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What is your release frequency, and how has it changed?&lt;&#x2F;strong&gt; How often does code reach production? Daily, weekly, monthly? Has it accelerated since AI tools were rolled out, or stayed flat? The companies that get real value from AI in engineering ship more often, not just produce more code.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What is your defect escape rate?&lt;&#x2F;strong&gt; What percentage of bugs reach production before being caught? If this number is rising in step with AI-generated code, you are paying for velocity in customer pain. This is the number to watch most carefully.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What is the PR review latency?&lt;&#x2F;strong&gt; When code is written, how long does it sit waiting for a human to review and merge it? AI-generated code can pile up faster than reviewers can handle. The cycle time of code reaching production depends on review, not on writing.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What is the cost per accepted change?&lt;&#x2F;strong&gt; Add up the cost of AI tools, developer time, review time, testing, and incident remediation, then divide by the number of changes that successfully reach production. This is the most honest unit economics you can put on engineering AI. It should be going down. In poorly managed deployments, it goes up.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;What is the mean time to recovery?&lt;&#x2F;strong&gt; When something breaks, how fast does the team detect it, diagnose it, and recover? AI tools can both help and hurt this. Help, by suggesting fixes faster. Hurt, by introducing changes the team did not write and does not fully understand.&lt;&#x2F;p&gt;
&lt;p&gt;These five questions are not exhaustive. They are the minimum set. None of them includes &quot;what percentage of code is AI-generated,&quot; because that number does not appear in any of them.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-migration-metric-is-the-one-to-lead-with&quot;&gt;The migration metric is the one to lead with&lt;&#x2F;h2&gt;
&lt;p&gt;If I were a CTO going to a board with one number to demonstrate engineering value from AI, it would not be code-generation percentage. It would be migration throughput.&lt;&#x2F;p&gt;
&lt;p&gt;Legacy system migration is a real cost in most established businesses. It is also the workload where AI agents are showing the largest, most measurable gains. Google reported that a complex internal migration was completed six times faster with agents and engineers working together. Anthropic&#x27;s Claude Code Enterprise materials emphasise migration as one of the highest-value workloads. I have seen the same in my own work. Migration is the workload where the new tools produce orders-of-magnitude improvement, not single-digit ones.&lt;&#x2F;p&gt;
&lt;p&gt;If your engineering organisation can clear a 2-year migration backlog in 4 months, that is a board-relevant outcome. The percentage of code written by AI in the process is a footnote.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-tell-your-cto&quot;&gt;What I&#x27;d tell your CTO&lt;&#x2F;h2&gt;
&lt;p&gt;If I were sitting across the table from your CTO this quarter, my advice would be:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stop reporting AI-generated code percentages upward.&lt;&#x2F;strong&gt; It will land well in the first board meeting and badly in the third, when someone realises the number is rising and the product is not getting noticeably better.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Pick one engineering value stream and rebuild it with AI in the middle.&lt;&#x2F;strong&gt; Migration is the obvious choice. Security remediation is another (Google reported 90%+ mitigation time reduction in some workflows). PR review is a third. Pick one, redesign it, measure throughput and quality.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Make the metrics changes visible across the organisation.&lt;&#x2F;strong&gt; Cycle time, defect escape rate, review latency, cost per accepted change, MTTR. Put them on a dashboard. Show them at every engineering review. The team will optimise for what gets measured. Right now, most teams are not measuring the right things.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Spend the saved engineering time on the work that humans still do best.&lt;&#x2F;strong&gt; Hard problem decomposition, system architecture, code review at the level of taste, mentoring, judgment under uncertainty. The engineers who become valuable in 2027 are the ones who orchestrate well. I wrote about this last month in &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;orchestrator, not 10x engineer&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-board-reframe&quot;&gt;The board reframe&lt;&#x2F;h2&gt;
&lt;p&gt;For boards reading this, the simple test is: when your CTO talks about AI in engineering, do they lead with effort metrics or outcome metrics? If the lead number is &quot;% of code AI-generated,&quot; you are getting an activity report. If the lead number is &quot;release frequency up, defect rate down, migration throughput tripled, cost per accepted change halved,&quot; you are getting a value report.&lt;&#x2F;p&gt;
&lt;p&gt;The companies winning the engineering productivity race in 2026 are not the ones with the highest AI-generated code percentage. They are the ones whose boards refuse to accept that number as the answer.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to talk through what your engineering AI metrics should look like, or how to redesign one engineering value stream around agents, &lt;a href=&quot;&#x2F;contact&#x2F;&quot;&gt;get in touch&lt;&#x2F;a&gt;. I&#x27;ve spent the last decade building engineering organisations that ship, and the last three years building them with AI in the middle.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;Has AI made developers slower? The METR study, reframed&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;The orchestrator, not the 10x engineer&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;anthropic-agent-sdk-what-works&#x2F;&quot;&gt;What two hours with Anthropic&#x27;s agent team taught me about building AI&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Most of what you do at work will be automated. The interesting question is what won&#x27;t</title>
        <published>2026-04-11T00:00:00+00:00</published>
        <updated>2026-04-11T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/irreducible-human-edge/"/>
        <id>https://ctozen.com/insights/irreducible-human-edge/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/irreducible-human-edge/">&lt;p&gt;If it can be automated, it must be automated.&lt;&#x2F;p&gt;
&lt;p&gt;I mean that literally. Not as aspiration, but as economic inevitability. Every task that a machine can do faster, cheaper, and more consistently than a human will eventually be handed to a machine. This has been true since the spinning jenny. AI just compresses the timeline from decades to months.&lt;&#x2F;p&gt;
&lt;p&gt;The question that most leaders are asking, &quot;Will AI take our jobs?&quot;, is the wrong question. It&#x27;s too binary and too abstract. The better question is specific and strategic: &lt;strong&gt;what can only humans do, and how do we get better at it?&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s where the real competitive advantage lives. Not in resisting automation, but in ruthlessly embracing it so that your people can focus on the things that actually require a human mind.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-ai-does-better-accept-it-and-move-on&quot;&gt;What AI does better. Accept it and move on&lt;&#x2F;h2&gt;
&lt;p&gt;Let&#x27;s be honest about this, because denial wastes time.&lt;&#x2F;p&gt;
&lt;p&gt;AI is better than humans at processing large volumes of structured information. It&#x27;s better at pattern matching across datasets too large for any person to hold in their head. It&#x27;s better at maintaining consistency across thousands of decisions. It&#x27;s better at operating at scale without fatigue, mood, or distraction.&lt;&#x2F;p&gt;
&lt;p&gt;McKinsey estimates that &lt;strong&gt;60-70% of current work activities could be automated&lt;&#x2F;strong&gt; with existing technology. Not future technology. What exists today. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance brokerage I worked with&lt;&#x2F;a&gt; achieved 67% autonomous case resolution. Not because the technology was extraordinary, but because most of those cases genuinely didn&#x27;t require human judgement. They required pattern matching, data retrieval, and rule application. Machines do all three better than people.&lt;&#x2F;p&gt;
&lt;p&gt;Here&#x27;s the finding that should challenge any comfortable assumptions about human-AI collaboration. Stanford HAI&#x27;s 2025 AI Index reported that in a healthcare diagnostic trial, &lt;strong&gt;GPT-4 alone achieved 92% accuracy. Physicians alone scored 74%. And physicians assisted by GPT-4 scored 76%.&lt;&#x2F;strong&gt; Adding humans to AI made it worse. Not better. Worse. The humans second-guessed correct AI diagnoses and introduced errors that the AI alone would not have made.&lt;&#x2F;p&gt;
&lt;p&gt;This is deeply counterintuitive. It&#x27;s also a useful corrective. The comfortable narrative that humans will always add value &quot;in the loop&quot; needs qualifying. For many tasks, they don&#x27;t. The human edge is real, but it&#x27;s narrower than most people want to believe.&lt;&#x2F;p&gt;
&lt;p&gt;Pretending otherwise isn&#x27;t loyalty to your workforce. It&#x27;s a failure of leadership. The organisations that try to protect people from automation end up protecting them into irrelevance, doing work that machines do better while competitors pull ahead.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-irreducible-human-capabilities&quot;&gt;The irreducible human capabilities&lt;&#x2F;h2&gt;
&lt;p&gt;So what&#x27;s left? More than you might think. And it&#x27;s the work that actually matters.&lt;&#x2F;p&gt;
&lt;p&gt;The Harvard Business School study with 758 BCG consultants mapped what they call the &quot;jagged frontier&quot; of AI capability. Consultants using GPT-4 completed 12% more tasks, 25% faster, with 40% higher quality, but only for tasks within the AI&#x27;s capability frontier. &lt;strong&gt;For tasks outside the frontier, consultants using AI were 19% less likely to produce correct solutions.&lt;&#x2F;strong&gt; They trusted the AI when they shouldn&#x27;t have. The best performers were the ones who knew exactly where the frontier was, and handled the other side themselves.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;strategic-intent-knowing-what-s-worth-doing&quot;&gt;Strategic intent. Knowing what&#x27;s worth doing&lt;&#x2F;h3&gt;
&lt;p&gt;AI can optimise any objective function you give it. What it cannot do is decide which objective function matters. It can tell you the most efficient path to a goal. It cannot tell you whether the goal is worth pursuing.&lt;&#x2F;p&gt;
&lt;p&gt;This is not a small distinction. &lt;strong&gt;The most expensive mistakes in business aren&#x27;t execution failures. They&#x27;re strategic ones.&lt;&#x2F;strong&gt; Building the wrong product perfectly. Optimising the wrong metric efficiently. Scaling the wrong business model flawlessly.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen AI projects fail not because the technology didn&#x27;t work, but because nobody asked whether the problem was worth solving. The &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;80% failure rate in AI projects&lt;&#x2F;a&gt; isn&#x27;t primarily a technical problem. It&#x27;s a strategic one. Someone has to own the intent. Someone has to say, &quot;This is worth doing&quot; or, more importantly, &quot;This is not worth doing.&quot; That someone is human.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;taste-judgement-and-opinion&quot;&gt;Taste, judgement, and opinion&lt;&#x2F;h3&gt;
&lt;p&gt;Here&#x27;s something that doesn&#x27;t get discussed enough in the AI conversation. Taste matters.&lt;&#x2F;p&gt;
&lt;p&gt;Not aesthetic taste, though that too. Business taste. The intuition that comes from twenty years of experience in an industry. The ability to look at a proposal and know, before the data confirms it, that something is off. The judgement to say &quot;this is technically correct but strategically wrong.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;AI has no opinions. It has outputs calibrated to probability distributions. It will give you the statistically most likely answer, which is often the most mediocre one. The &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;organisations deploying AI well&lt;&#x2F;a&gt; aren&#x27;t the ones deferring to AI outputs. They&#x27;re the ones whose leaders have strong enough judgement to know when the AI is right and when it&#x27;s confidently wrong.&lt;&#x2F;p&gt;
&lt;p&gt;Block&#x27;s CTO Dhanji Prasanna said it well: code quality has &quot;nothing to do&quot; with product success. YouTube triumphed despite &quot;terrible&quot; code. &lt;strong&gt;The decisions that matter are about what to build, not how to build it.&lt;&#x2F;strong&gt; That&#x27;s taste. That&#x27;s judgement. And no model currently possesses it.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve built products where the data pointed one way and experience pointed another. Experience was right more often than most data scientists would like to admit. Not because data doesn&#x27;t matter, but because &lt;strong&gt;data only tells you what happened. Judgement tells you what it means.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Lakhani, Spataro and Stave warned about this in HBR. A relentless focus on efficiency, they argue, &quot;risks hollowing out the human capabilities, such as judgment and storytelling, that differentiate high-value work.&quot; The organisations that treat AI purely as a cost-reduction tool tend to lose the capacity to do the work AI cannot do.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;human-to-human-relationships&quot;&gt;Human-to-human relationships&lt;&#x2F;h3&gt;
&lt;p&gt;No amount of AI sophistication replaces the trust that forms between people who have worked through difficult problems together. A client doesn&#x27;t hire me because an algorithm recommended me. They hire me because a conversation revealed that I understood their situation in a way that felt different from the others.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t sentiment. It&#x27;s economics. B2B sales cycles, board decisions, partnership formations, key hires. The transactions that shape organisations happen between humans who trust each other. AI can prepare the analysis. It can surface the options. It can draft the presentation. But the moment where someone decides to commit resources, take a risk, or change direction, that moment is human.&lt;&#x2F;p&gt;
&lt;p&gt;The &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;shift from individual contributor to orchestrator&lt;&#x2F;a&gt; makes this more important, not less. As AI handles more of the technical execution, the human skills of communication, persuasion, and relationship become a larger proportion of what determines success.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;ethical-reasoning-and-contextual-wisdom&quot;&gt;Ethical reasoning and contextual wisdom&lt;&#x2F;h3&gt;
&lt;p&gt;AI systems optimise. That&#x27;s what they do. They find the path of least resistance to whatever objective you&#x27;ve defined. They don&#x27;t ask whether the objective is ethical. They don&#x27;t consider second-order consequences on people who aren&#x27;t represented in the training data. They don&#x27;t feel uncomfortable when something is technically legal but plainly wrong.&lt;&#x2F;p&gt;
&lt;p&gt;Every organisation I&#x27;ve worked with has faced decisions where the right answer wasn&#x27;t the most efficient one. Where serving a customer well meant absorbing a cost. Where doing the ethical thing meant leaving money on the table. These decisions require a kind of contextual wisdom that comes from lived experience, from having been in situations where the &quot;optimal&quot; choice turned out to be the wrong one.&lt;&#x2F;p&gt;
&lt;p&gt;The &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;governance questions around shadow AI&lt;&#x2F;a&gt; are a good example. An AI system will happily process sensitive data if you let it. Knowing when it shouldn&#x27;t, and why, requires human judgement about risk, reputation, and responsibility that no model currently possesses.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-strategic-argument-offload-everything-else&quot;&gt;The strategic argument. Offload everything else&lt;&#x2F;h2&gt;
&lt;p&gt;If these are the irreducible human capabilities, the strategic implication is clear: &lt;strong&gt;automate everything that isn&#x27;t on this list.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Not gradually. Not cautiously. Aggressively.&lt;&#x2F;p&gt;
&lt;p&gt;Every hour your best people spend on work that a machine could do is an hour they&#x27;re not spending on strategic thinking, relationship building, or the kind of difficult judgement calls that actually determine your organisation&#x27;s trajectory. It&#x27;s not just wasteful. It&#x27;s competitively dangerous.&lt;&#x2F;p&gt;
&lt;p&gt;The &lt;a href=&quot;&#x2F;insights&#x2F;dual-stream-strategy&#x2F;&quot;&gt;dual-stream approach&lt;&#x2F;a&gt; I advocate is exactly this: run two tracks simultaneously. On one track, automate every process that can be automated. On the other, deliberately invest in developing the human capabilities that machines cannot replicate. The organisations that do both will outperform those that do either one alone.&lt;&#x2F;p&gt;
&lt;p&gt;This means restructuring roles, not eliminating them. It means telling your team: &quot;We&#x27;re going to take away the parts of your job that a machine does better, and we&#x27;re going to expect you to become exceptional at the parts only you can do.&quot; That&#x27;s a harder conversation than &quot;AI is coming for your jobs,&quot; but it&#x27;s a more honest and more productive one.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-for-now-problem&quot;&gt;The &quot;for now&quot; problem&lt;&#x2F;h2&gt;
&lt;p&gt;I want to be honest about something. The human edge is shrinking.&lt;&#x2F;p&gt;
&lt;p&gt;Five years ago, I would have included creative writing on the list of irreducible human capabilities. Today, AI produces competent prose that passes most quality bars. Three years ago, I would have included basic code architecture. Today, &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;agentic AI systems&lt;&#x2F;a&gt; are making architectural decisions that are, in many cases, good enough.&lt;&#x2F;p&gt;
&lt;p&gt;The boundary between &quot;human only&quot; and &quot;machine capable&quot; moves in one direction. It moves towards the machine. Slowly in some domains, startlingly fast in others. Anyone who tells you that strategic thinking, relationship building, and ethical judgement are permanently safe from automation is making a prediction about a technology whose trajectory they cannot know.&lt;&#x2F;p&gt;
&lt;p&gt;Ben Goertzel, CEO of SingularityNET, claims AI will surpass human strategic thinking &quot;in about two years.&quot; I think he&#x27;s probably wrong on the timeline, but I take the direction seriously. The WEF Future of Jobs Report identifies creative thinking, resilience, and curiosity as the fastest-rising skills employers value. These are the human edge skills. And yet a quieter risk is emerging in the data. &lt;strong&gt;Critical thinking is atrophying in teams that lean too hard on AI outputs.&lt;&#x2F;strong&gt; Junior staff who don&#x27;t first form their own view before consulting the model lose, over time, the muscle that makes them a useful check on it. We may be losing the very capabilities that make us irreplaceable, even as we still hold them.&lt;&#x2F;p&gt;
&lt;p&gt;I don&#x27;t think they&#x27;re safe forever. I think they&#x27;re safe now, and they&#x27;ll be safe for long enough to matter strategically. The organisations that double down on developing these capabilities in their people will have a decisive advantage for the next five to ten years. And five to ten years of competitive advantage is, in business terms, an eternity.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The right response to a shrinking edge is not to pretend it isn&#x27;t shrinking. It&#x27;s to maximise the advantage while it exists.&lt;&#x2F;strong&gt; Play the hand you have, not the hand you wish you had or the hand you fear you&#x27;ll be dealt.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-for-you&quot;&gt;What this means for you&lt;&#x2F;h2&gt;
&lt;p&gt;If you lead an organisation, three things follow from this argument.&lt;&#x2F;p&gt;
&lt;p&gt;First, &lt;strong&gt;stop protecting people from automation.&lt;&#x2F;strong&gt; Every task you shield from AI because &quot;it&#x27;s always been done by humans&quot; is a task where you&#x27;re choosing nostalgia over competitiveness. Free your people to do the work that only they can do.&lt;&#x2F;p&gt;
&lt;p&gt;Second, &lt;strong&gt;invest in the human skills that matter.&lt;&#x2F;strong&gt; Strategic thinking, judgement, relationship building, ethical reasoning. These aren&#x27;t soft skills. They&#x27;re the hardest skills. And they&#x27;re the ones your organisation&#x27;s future depends on. When AI handles scale and speed, the bottleneck becomes human judgement: the precision of the questions you ask, the depth with which you interpret model reasoning, and your ability to turn AI-generated ideas into better decisions. Lakhani, Spataro and Stave put it more bluntly in a recent Harvard Business Review piece: &quot;the AI last mile is not blocked by technology. It is blocked by unresolved questions regarding operating models, governance, and human identity.&quot; Treat these skills with the same seriousness you treat technical capability.&lt;&#x2F;p&gt;
&lt;p&gt;Third, &lt;strong&gt;be honest about the timeline.&lt;&#x2F;strong&gt; The human edge is real, but it&#x27;s not permanent. Build your strategy around it, but don&#x27;t build your identity around it. Stay adaptive. The line will move again.&lt;&#x2F;p&gt;
&lt;p&gt;The most valuable people in any organisation have always been the ones who know what&#x27;s worth doing, not just how to do it. AI hasn&#x27;t changed that. If anything, it&#x27;s made it more true than ever.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you&#x27;re working out where the human edge matters most in your organisation, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;You&#x27;re not a 10x engineer. You&#x27;re an orchestrator&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;Most companies are adopting AI. Few are adopting it well&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;traditional-moats-dissolving&#x2F;&quot;&gt;Traditional moats are dissolving&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>AI is not just for engineering. Every function in your business can use it today</title>
        <published>2026-04-04T00:00:00+00:00</published>
        <updated>2026-04-04T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-across-every-function/"/>
        <id>https://ctozen.com/insights/ai-across-every-function/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-across-every-function/">&lt;p&gt;McKinsey&#x27;s latest State of AI survey found that 88% of organisations now use AI in at least one business function. But only 6% qualify as &quot;AI high performers&quot; attributing more than 5% of EBIT to AI.&lt;&#x2F;p&gt;
&lt;p&gt;That gap, between using AI and benefiting from AI, is the real story. The technology is deployed everywhere. The value is concentrated in a few. And the reason is almost always the same: most companies treat AI as an engineering tool, not a business-wide operating system.&lt;&#x2F;p&gt;
&lt;p&gt;Every process in every business in every industry can be optimised with AI. I don&#x27;t mean hypothetically. I mean right now, with tools that exist today, at a cost that makes the ROI obvious within weeks. Shopify CEO Tobi Lutke made the right call when he told his company that no one can request headcount without first proving AI cannot do the job. &lt;strong&gt;The burden of proof has flipped.&lt;&#x2F;strong&gt; It&#x27;s no longer &quot;why should we use AI?&quot; It&#x27;s &quot;why aren&#x27;t we?&quot;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-engineering-tunnel-vision-problem&quot;&gt;The engineering tunnel vision problem&lt;&#x2F;h2&gt;
&lt;p&gt;Most companies that &quot;do AI&quot; are doing it in one place: software development. They&#x27;ve bought Copilot licences. Maybe they&#x27;re experimenting with agentic coding tools. The engineering team is 6 months ahead of everyone else in the organisation.&lt;&#x2F;p&gt;
&lt;p&gt;Meanwhile, the finance team is still manually reconciling spreadsheets. Marketing is writing every piece of content from scratch. Customer support agents are copy-pasting the same responses they&#x27;ve been using for three years. Legal is reviewing contracts clause by clause.&lt;&#x2F;p&gt;
&lt;p&gt;Lakhani, Spataro and Stave described this same pattern in a March 2026 Harvard Business Review piece as &lt;strong&gt;&quot;islands of productivity&quot;&lt;&#x2F;strong&gt;: tools that boost individual output but exist as isolated wins, difficult to convert into scaled, trustworthy enterprise systems. That phrase matches what I see almost everywhere. A high-performing engineering function. A back office that has barely changed in a decade.&lt;&#x2F;p&gt;
&lt;p&gt;This is a massive missed opportunity. &lt;strong&gt;The biggest AI gains I&#x27;ve seen are not in engineering. They&#x27;re in operations, support, and back-office functions&lt;&#x2F;strong&gt; where the work is repetitive, the volume is high, and the cost of doing it manually is quietly enormous.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;function-by-function-what-works-today&quot;&gt;Function by function. What works today&lt;&#x2F;h2&gt;
&lt;h3 id=&quot;finance-and-accounting&quot;&gt;Finance and accounting&lt;&#x2F;h3&gt;
&lt;p&gt;AI is genuinely good at financial operations right now. Invoice processing, expense categorisation, anomaly detection in transactions, cash flow forecasting. These are not experimental use cases. They&#x27;re production-ready.&lt;&#x2F;p&gt;
&lt;p&gt;IBM automated over 90% of their finance journal entries using watsonx Orchestrate. HPE uses intelligent agents to automate quarterly close, forecasting, and analysis. These aren&#x27;t pilots. They&#x27;re running in production at two of the largest technology companies in the world.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen finance teams cut month-end close time by 40% using AI-assisted reconciliation. The tool doesn&#x27;t replace the accountant. It does the tedious matching work and flags the exceptions that need human judgement. Fortune reports that CFOs are predicting AI will transform finance from retrospective reporting to real-time decision-making. The role is shifting from &quot;what happened&quot; to &quot;what should we do.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: fully autonomous financial decision-making. AI can surface insights and flag anomalies, but anything involving judgement calls on material financial matters still needs a human. And it should.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;marketing-and-content&quot;&gt;Marketing and content&lt;&#x2F;h3&gt;
&lt;p&gt;This is where the gap between what&#x27;s possible and what most teams actually do is widest.&lt;&#x2F;p&gt;
&lt;p&gt;Most marketing teams using AI are generating first drafts of blog posts. That&#x27;s fine, but it&#x27;s about 10% of the value available. &lt;strong&gt;The real gains are in research, personalisation, and distribution.&lt;&#x2F;strong&gt; AI can analyse your entire content library and identify gaps. It can segment audiences and tailor messaging at a scale that would require a team of ten to do manually. It can A&#x2F;B test subject lines, optimise send times, and identify which channels are converting for which segments.&lt;&#x2F;p&gt;
&lt;p&gt;HubSpot&#x27;s 2025 State of Marketing report found that teams using AI across the full content lifecycle (research, creation, distribution, analysis) saw &lt;strong&gt;3x the output with the same headcount&lt;&#x2F;strong&gt;. Not 3x the content. 3x the output measured by engagement and conversion.&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: letting AI write your brand voice without heavy human editing. The tools produce competent copy. Competent is not enough. You still need a human who understands what makes your voice distinctive.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;sales&quot;&gt;Sales&lt;&#x2F;h3&gt;
&lt;p&gt;AI in sales is past the experimental phase. Lead scoring, call analysis, pipeline forecasting, proposal generation. These are all live use cases with proven ROI.&lt;&#x2F;p&gt;
&lt;p&gt;The most impactful application I&#x27;ve seen is &lt;strong&gt;AI-powered call analysis&lt;&#x2F;strong&gt;. Tools like Gong and Chorus analyse every sales conversation, identify what top performers do differently, and surface coaching opportunities for the rest of the team. One organisation I advised saw win rates increase 18% within a quarter after implementing structured call analysis with AI-generated coaching recommendations.&lt;&#x2F;p&gt;
&lt;p&gt;Proposal generation is another area where the ROI is immediate. A senior salesperson spending two hours crafting a custom proposal can get a strong first draft in fifteen minutes. That&#x27;s not about replacing the salesperson. It&#x27;s about freeing them to spend time on relationships and strategy instead of formatting documents.&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: AI closing deals autonomously. Sales is fundamentally about trust between humans. AI makes salespeople more effective. It doesn&#x27;t replace them.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;customer-support&quot;&gt;Customer support&lt;&#x2F;h3&gt;
&lt;p&gt;This is where I have the most direct experience. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance brokerage case&lt;&#x2F;a&gt; I led achieved &lt;strong&gt;67% autonomous case resolution&lt;&#x2F;strong&gt; with an agentic AI system. Not chatbot-style deflection. Genuine resolution: retrieving policy information, making coverage determinations, communicating outcomes to customers.&lt;&#x2F;p&gt;
&lt;p&gt;Klarna&#x27;s experience is instructive. Their AI assistant handled 2.3 million conversations in its first month, equivalent to 700 full-time agents. The headlines were triumphant. Then quality problems forced them to reverse course and rehire humans. They ended up with a hybrid model that may actually be more effective than either pure approach. The lesson: AI can handle the volume, but you need humans for the edge cases that destroy customer trust when handled badly.&lt;&#x2F;p&gt;
&lt;p&gt;The key insight from my own work: &lt;strong&gt;customer support is the function where AI delivers the most measurable, immediate ROI&lt;&#x2F;strong&gt; in most organisations. The volume is high, the queries are repetitive, the cost per interaction is well understood, and the quality bar is definable. If you&#x27;re only going to pick one function to start with, start here.&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: handling emotionally complex or genuinely novel situations. The best systems know when to escalate. The worst ones try to handle everything and erode trust in the process.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;legal-and-contracts&quot;&gt;Legal and contracts&lt;&#x2F;h3&gt;
&lt;p&gt;Legal is the sleeper disruption story of 2026. Corporate AI adoption in legal more than doubled in a single year, from 23% to 52%, according to the ACC&#x2F;Everlaw survey. 64% of in-house teams now expect less dependence on outside counsel. 61% are pushing for changes in how outside legal services are priced.&lt;&#x2F;p&gt;
&lt;p&gt;Tools like Luminance, Ironclad, and Thomson Reuters&#x27; CoCounsel can review standard commercial contracts in minutes, flagging non-standard clauses, missing provisions, and risk areas that a junior lawyer would take hours to identify. LexisNexis has deployed a four-agent system for legal research: orchestrator, legal research, web search, and customer document analysis.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen legal teams reduce contract review time by 60-70% for standard agreements. The AI handles the first pass. The lawyer handles the judgement calls. This isn&#x27;t about reducing legal headcount. It&#x27;s about removing the bottleneck that legal review often creates in deal velocity.&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: novel legal reasoning or complex regulatory interpretation. AI can tell you what a contract says. It can&#x27;t yet reliably tell you what it means in the context of a specific business situation and regulatory environment.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;project-management-and-reporting&quot;&gt;Project management and reporting&lt;&#x2F;h3&gt;
&lt;p&gt;This is the function where AI adoption is lowest and the opportunity is most underestimated.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;AI can eliminate 80% of status reporting.&lt;&#x2F;strong&gt; It can pull data from Jira, Slack, GitHub, and your time-tracking tool, synthesise it into a coherent status update, flag risks based on velocity trends, and generate the weekly report that a project manager currently spends two hours compiling every Friday.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen project leads reclaim an entire day per week by automating reporting and status synthesis. That&#x27;s a day they can spend actually managing the project instead of describing it.&lt;&#x2F;p&gt;
&lt;p&gt;What doesn&#x27;t work yet: AI project managers. Prioritisation, stakeholder management, and the ability to read a room when a project is going sideways. These remain distinctly human skills.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;internal-communications&quot;&gt;Internal communications&lt;&#x2F;h3&gt;
&lt;p&gt;Meeting summarisation alone justifies AI investment for most organisations. If your company has more than 50 people, you&#x27;re spending thousands of hours per year in meetings where half the attendees are there &quot;just in case.&quot; AI meeting tools (Otter, Fireflies, Copilot in Teams) can record, transcribe, summarise, and extract action items. The people who didn&#x27;t need to attend can read the summary in two minutes.&lt;&#x2F;p&gt;
&lt;p&gt;Beyond meetings: internal knowledge bases that actually answer questions, onboarding documentation that stays current, policy documents that can be queried rather than read. These are all production-ready use cases today.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-compounding-effect&quot;&gt;The compounding effect&lt;&#x2F;h2&gt;
&lt;p&gt;Here&#x27;s what makes internal AI enablement a competitive advantage rather than just an efficiency play.&lt;&#x2F;p&gt;
&lt;p&gt;Each function you optimise with AI frees capacity. That capacity can be redirected to higher-value work, or it can allow you to &lt;strong&gt;grow operations without proportionally growing headcount&lt;&#x2F;strong&gt;. When you do this across five or six functions simultaneously, the effect compounds. You don&#x27;t get 10% efficiency in each area. You get a fundamentally different cost structure.&lt;&#x2F;p&gt;
&lt;p&gt;The trap, as Lakhani, Spataro and Stave noted in Harvard Business Review, is that &quot;saved time is often re-absorbed into low-value activities, like more internal meetings or unnecessary emails, rather than being structurally harvested.&quot; Capacity does not capture itself. Someone has to redesign the role, change the budget line, or shift the headcount allocation. Without that step, the AI investment shows up as a vendor bill and nothing else.&lt;&#x2F;p&gt;
&lt;p&gt;The roles most exposed are the reporting-heavy ones in the middle of the organisation: finance analysts whose primary output is monthly decks, compliance officers compiling regulatory submissions, supply chain planners building forecasts from spreadsheets, procurement managers drafting RFPs. The organisations moving fastest are not eliminating these roles wholesale. They are reallocating the people in them toward analytical, judgement-heavy, and creative work that AI cannot yet do well.&lt;&#x2F;p&gt;
&lt;p&gt;The organisations I work with that understand this are not just saving money. They&#x27;re moving faster than competitors who are still running every function manually. And that gap widens every quarter.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-common-mistakes&quot;&gt;The common mistakes&lt;&#x2F;h2&gt;
&lt;p&gt;Even with the right intent, most organisations get internal AI enablement wrong in predictable ways.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Mistake one: starting everywhere at once.&lt;&#x2F;strong&gt; Pick one function. Get it working. Measure the results. Then expand. The &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;80% failure rate&lt;&#x2F;a&gt; applies to internal tools just as much as customer-facing ones.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Mistake two: no governance.&lt;&#x2F;strong&gt; When every department is buying its own AI tools, you get &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;shadow AI&lt;&#x2F;a&gt; at scale. Sensitive data flowing to tools nobody approved. Duplicate spend. Inconsistent quality. Internal enablement needs a lightweight governance framework from day one.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Mistake three: expecting AI to fix broken processes.&lt;&#x2F;strong&gt; If your sales process is a mess, AI will make it a faster mess. The organisations that see real results &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;redesign the workflow&lt;&#x2F;a&gt; before adding AI to it.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Mistake four: forgetting the humans.&lt;&#x2F;strong&gt; Every function-level AI deployment is a change management exercise. The finance team needs to trust the reconciliation tool. The legal team needs to believe the contract review is reliable. That trust is built through transparency, gradual rollout, and honest communication about what the tool does and doesn&#x27;t do.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;where-to-start&quot;&gt;Where to start&lt;&#x2F;h2&gt;
&lt;p&gt;If I were advising an organisation that hadn&#x27;t yet done any cross-functional AI enablement, I&#x27;d tell them to pick the function with the highest volume of repetitive, well-defined work. For most companies, that&#x27;s customer support or finance operations.&lt;&#x2F;p&gt;
&lt;p&gt;Run a focused pilot. Measure before and after. Be honest about what worked and what didn&#x27;t. Then use that success (and those lessons) to build momentum for the next function.&lt;&#x2F;p&gt;
&lt;p&gt;The technology is ready. The tools are affordable. The ROI is measurable within a quarter, not a year. The only thing missing in most organisations is someone with enough authority and enough cross-functional understanding to see the full picture and connect the dots. That is not a technology problem. It is a &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;leadership one&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to map where AI can create the most value inside your organisation, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;Most companies are adopting AI. Few are adopting it well&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I approach AI transformation&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Traditional moats are dissolving. Size and capital no longer protect you</title>
        <published>2026-03-28T00:00:00+00:00</published>
        <updated>2026-03-28T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/traditional-moats-dissolving/"/>
        <id>https://ctozen.com/insights/traditional-moats-dissolving/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/traditional-moats-dissolving/">&lt;p&gt;The most dangerous assumption in business right now is that being big makes you safe.&lt;&#x2F;p&gt;
&lt;p&gt;For decades, size was the ultimate competitive advantage. Deep pockets meant you could outspend competitors on R&amp;amp;D, hire the best talent, absorb regulatory costs, and outlast anyone who tried to disrupt you. Scale meant distribution. Distribution meant dominance. And dominance, once achieved, was very hard to undo.&lt;&#x2F;p&gt;
&lt;p&gt;That logic is breaking down. Fast.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-moat-is-leaking&quot;&gt;The moat is leaking&lt;&#x2F;h2&gt;
&lt;p&gt;Warren Buffett popularised the concept of economic moats: the structural advantages that protect a business from competition. Brand recognition. Proprietary data. Network effects. Regulatory capture. Economies of scale. These were the things that let incumbents sleep well at night.&lt;&#x2F;p&gt;
&lt;p&gt;AI is eroding every single one of them.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Brand recognition&lt;&#x2F;strong&gt; used to take decades and hundreds of millions in marketing spend to build. Today, a two-person team with a genuinely useful AI-powered product can reach millions of users in weeks through organic distribution. Perplexity went from zero to 100 million monthly queries in under two years. Not because they outspent Google. Because they built something that solved a problem better than the incumbent.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Proprietary data&lt;&#x2F;strong&gt; was once an unassailable advantage. You had the data, your competitors didn&#x27;t, end of discussion. Now, foundation models trained on the open web give any startup access to reasoning capabilities that previously required a decade of data collection. The advantage has shifted from &lt;em&gt;having&lt;&#x2F;em&gt; data to &lt;em&gt;knowing what to do with it&lt;&#x2F;em&gt;. Those are very different capabilities.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Economies of scale&lt;&#x2F;strong&gt; meant that producing more was cheaper per unit, which meant incumbents could undercut any challenger. But when a team of five can build and deploy a product that previously required fifty, the cost curve inverts. The challenger isn&#x27;t competing on your terms. They&#x27;re operating on a fundamentally different cost structure.&lt;&#x2F;p&gt;
&lt;p&gt;Midjourney is the clearest example. Approximately 100 employees. $500 million in revenue. $10.5 billion valuation. No external funding. No sales team. No marketing department. Organic growth through Discord and social sharing. They built one of the most commercially successful AI products on the planet with a team smaller than most companies&#x27; marketing departments.&lt;&#x2F;p&gt;
&lt;p&gt;Sam Altman put it bluntly: &quot;In my little group chat with my tech CEO friends there&#x27;s this betting pool for the first year that there is a one-person billion-dollar company.&quot; That&#x27;s not hyperbole. It&#x27;s a direction of travel.&lt;&#x2F;p&gt;
&lt;p&gt;If you walk through the classic moat pillars one by one (switching costs, network effects, intangible assets, efficient scale, cost advantage) the only one that still clearly protects is the last. Physical cost advantage: factories, supply chains, mineral reserves, energy contracts. &lt;strong&gt;If your competitive advantage isn&#x27;t bolted to the ground, AI is dissolving it.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-speed-advantage-is-compounding&quot;&gt;The speed advantage is compounding&lt;&#x2F;h2&gt;
&lt;p&gt;Here&#x27;s what makes this different from previous waves of disruption.&lt;&#x2F;p&gt;
&lt;p&gt;Previous technology shifts, the internet, mobile, cloud, gave incumbents time to adapt. The transition from physical retail to e-commerce took fifteen years. The shift to mobile-first took a decade. Cloud migration is still happening twenty years after AWS launched.&lt;&#x2F;p&gt;
&lt;p&gt;AI is not giving anyone that kind of runway.&lt;&#x2F;p&gt;
&lt;p&gt;McKinsey&#x27;s 2024 research found that &lt;strong&gt;organisations already using AI effectively are pulling ahead at an accelerating rate&lt;&#x2F;strong&gt;. The gap between AI leaders and laggards isn&#x27;t closing. It&#x27;s widening. Every quarter that an organisation delays meaningful AI deployment, the distance to the leaders grows, because the leaders are compounding their advantage. They&#x27;re not just using AI. They&#x27;re learning how to use AI better, building institutional knowledge that compounds over time.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen this firsthand. One business I advised began embedding AI into its core operations roughly fifteen months ago. They are now operating on a different plane to a competitor with three times their headcount, who started six months ago and is still in pilot stage. The technology is no better today than it was for the early mover. The difference is fifteen months of compounded operational learning, model evaluations they&#x27;ve kept, prompts they&#x27;ve refined, workflows they&#x27;ve redesigned around AI from scratch. &lt;strong&gt;That institutional knowledge is the new moat. And you can&#x27;t buy it. You can only build it.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-incumbents-struggle&quot;&gt;Why incumbents struggle&lt;&#x2F;h2&gt;
&lt;p&gt;If size no longer protects you, why don&#x27;t large organisations just adopt AI faster?&lt;&#x2F;p&gt;
&lt;p&gt;Because the very things that made them dominant are now the things slowing them down.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Organisational complexity.&lt;&#x2F;strong&gt; A decision that takes a five-person startup an afternoon takes an enterprise six months. Procurement, legal review, security assessment, change management, stakeholder alignment, pilot programme, evaluation period, board approval, implementation. By the time the enterprise has approved the pilot, the startup has iterated through twelve versions of a production system.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Talent distribution.&lt;&#x2F;strong&gt; The people who are best at building with AI are disproportionately choosing to work at smaller companies or on their own. They want speed, autonomy, and direct impact. Enterprise bureaucracy is the opposite of what attracts them. Every founder I speak to who is hiring for production AI capability tells me the same thing: the best candidates are weighing offers from companies a tenth the size of the incumbent, and the smaller companies are winning more often than not. &lt;strong&gt;The talent is self-selecting away from the organisations that need it most.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Sunk cost mentality.&lt;&#x2F;strong&gt; Large organisations have invested hundreds of millions in existing systems, processes, and team structures. Embracing AI properly often means acknowledging that some of those investments are now liabilities, not assets. That&#x27;s a hard conversation for any leadership team, and harder still for one that approved those investments. The result is &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;the AI theatre I&#x27;ve written about before&lt;&#x2F;a&gt;: pilots and innovation labs that exist to demonstrate activity without disrupting anything that matters.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Risk aversion at scale.&lt;&#x2F;strong&gt; When you have a billion-pound revenue line to protect, every change feels risky. When you have nothing to lose and everything to gain, every change feels like opportunity. This asymmetry has always favoured challengers, but AI amplifies it enormously because the cost of building a credible competitor has dropped by an order of magnitude.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-in-practice&quot;&gt;What this means in practice&lt;&#x2F;h2&gt;
&lt;p&gt;This is not theoretical. It&#x27;s playing out across industries right now.&lt;&#x2F;p&gt;
&lt;p&gt;In legal, the numbers are startling. Corporate AI adoption more than doubled in a single year, from 23% to 52%, according to the ACC&#x2F;Everlaw survey. 64% of in-house legal teams now expect less dependence on outside counsel. 61% are pushing for changes in how outside legal services are priced. An entire industry built on billable hours and institutional knowledge is being restructured by tools that teams of ten built.&lt;&#x2F;p&gt;
&lt;p&gt;In software, &lt;a href=&quot;&#x2F;insights&#x2F;orchestrator-not-10x-engineer&#x2F;&quot;&gt;the shift from traditional engineering to AI-augmented orchestration&lt;&#x2F;a&gt; means a small team with strong AI fluency can build what previously required departments. Andrej Karpathy, OpenAI co-founder, calls the new paradigm &quot;agentic engineering&quot;: not writing code, but orchestrating agents who do. Features that took two to six weeks in 2024 now ship in a day, sometimes hours.&lt;&#x2F;p&gt;
&lt;p&gt;Since the start of 2026, ETFs for public software companies have fallen 30%. Salesforce, Adobe, Intuit, ServiceNow, all down 25-30% in weeks. a16z calls it the &quot;SaaSpocalypse.&quot; The market is pricing in what the incumbents haven&#x27;t yet accepted.&lt;&#x2F;p&gt;
&lt;p&gt;The pattern is the same everywhere. &lt;strong&gt;A small team that knows how to use AI well can now compete with organisations ten or fifty times their size.&lt;&#x2F;strong&gt; Not on everything. Not yet. But on specific, high-value capabilities that used to be the exclusive domain of incumbents.&lt;&#x2F;p&gt;
&lt;p&gt;And the window for incumbents to respond is narrowing.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-new-moat&quot;&gt;The new moat&lt;&#x2F;h2&gt;
&lt;p&gt;If traditional moats are dissolving, what replaces them?&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Institutional AI fluency.&lt;&#x2F;strong&gt; The organisations that will dominate the next decade are the ones building deep, organisation-wide capability in working with AI. Not just buying tools. Not just running pilots. Actually changing how work gets done, how decisions get made, how products get built. This is the investment that &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;most organisations are still avoiding&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Speed of iteration.&lt;&#x2F;strong&gt; The ability to deploy, measure, learn, and improve faster than competitors. This favours organisations with flat structures, clear decision authority, and a culture that treats failed experiments as data rather than career risk.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Production AI capability.&lt;&#x2F;strong&gt; The gap between &quot;we have an AI strategy&quot; and &quot;we have AI systems in production delivering measurable business outcomes&quot; is where competitive advantage lives. Only &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;about one in five organisations unlock real ROI from AI&lt;&#x2F;a&gt;. The 20% that succeed are building something their competitors cannot easily replicate, because it&#x27;s built on operational knowledge, not just technology.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Data feedback loops.&lt;&#x2F;strong&gt; Not the static data advantage of old, where you simply had more data than the competition. The dynamic advantage of having production AI systems that generate data about what works, which feeds back into better systems, which generate more data. This is the compounding loop that makes early movers increasingly hard to catch.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-honest-assessment&quot;&gt;The honest assessment&lt;&#x2F;h2&gt;
&lt;p&gt;I&#x27;m not arguing that every large company is doomed or that every startup with an AI tool will win. Scale still matters for distribution, for regulatory compliance capacity, for customer trust in high-stakes domains.&lt;&#x2F;p&gt;
&lt;p&gt;But the balance has shifted. Dramatically. The assumption that being the biggest player in your market means you&#x27;ll still be the biggest player in five years is no longer safe. Industry leaders have no guarantees of maintaining their position. The power is moving to those who know how to use AI, and who started learning earlier.&lt;&#x2F;p&gt;
&lt;p&gt;Lakhani, Spataro and Stave summarised the situation precisely in Harvard Business Review: &quot;while execution is increasingly automated, what remains scarce is the leadership ability to imagine and commit to a different way of running the enterprise.&quot; That scarcity is the new moat. It is not capital. It is not headcount. It is the willingness, at the top of the organisation, to redesign.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;re leading a large organisation, the question isn&#x27;t whether you can afford to invest in AI transformation. It&#x27;s whether you can afford the speed at which you&#x27;re currently moving. Because somewhere, a team of five people who really understand this technology is building what comes next in your industry. And they&#x27;re not waiting for your procurement process to finish.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to talk through what this means for your competitive position, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;Most companies are adopting AI. Few are adopting it well&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The dual-stream strategy. Protect your current business while building what replaces it</title>
        <published>2026-03-20T00:00:00+00:00</published>
        <updated>2026-03-20T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/dual-stream-strategy/"/>
        <id>https://ctozen.com/insights/dual-stream-strategy/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/dual-stream-strategy/">&lt;p&gt;Every successful business is running on borrowed time. That sounds dramatic, but the evidence is clear: the average lifespan of an S&amp;amp;P 500 company has dropped from 61 years in 1958 to under 18 years today. McKinsey projects that by 2027, 75% of the companies currently on that list will have been replaced.&lt;&#x2F;p&gt;
&lt;p&gt;The cause isn&#x27;t bad management. It&#x27;s the inability to build the next thing while the current thing still works.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-innovator-s-dilemma-but-worse&quot;&gt;The innovator&#x27;s dilemma, but worse&lt;&#x2F;h2&gt;
&lt;p&gt;Clayton Christensen named this problem decades ago. Successful companies optimise for their existing customers, their existing margins, their existing competitive advantages. They get so good at what they do that they can&#x27;t see the thing that will make it irrelevant. By the time they do see it, someone else has built it.&lt;&#x2F;p&gt;
&lt;p&gt;AI makes this problem orders of magnitude more urgent. Previous technology shifts gave you a decade to adapt. &lt;strong&gt;AI gives you one to three years.&lt;&#x2F;strong&gt; The speed at which AI capabilities are compounding means the window between &quot;this could affect our business&quot; and &quot;this has already disrupted our market&quot; is collapsing.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve worked with companies across sectors where this conversation is happening right now. The question always takes the same form: do we invest in new products that could eat into our own current advantage?&lt;&#x2F;p&gt;
&lt;p&gt;It&#x27;s not really a dilemma.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;two-streams-running-in-parallel&quot;&gt;Two streams, running in parallel&lt;&#x2F;h2&gt;
&lt;p&gt;The smartest position for any profitable business right now is to work in two parallel streams.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stream one: protect and optimise what you have.&lt;&#x2F;strong&gt; Keep running your current business. Serve your customers. Maintain your margins. This is the revenue that funds everything else, and abandoning it prematurely is as dangerous as ignoring what comes next. Apply AI to make your existing operations more efficient, your customer experience sharper, your costs lower. This is where most organisations start, and it&#x27;s necessary.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stream two: build the thing that replaces it.&lt;&#x2F;strong&gt; Start developing AI-native capabilities that could fundamentally change your product, your business model, or your market position. This is the uncomfortable stream, because it means investing in something that might cannibalise the very thing generating your revenue today.&lt;&#x2F;p&gt;
&lt;p&gt;Most companies only run stream one. They use AI to optimise what exists. They &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;bolt tools onto existing processes&lt;&#x2F;a&gt; and call it transformation. They measure adoption rates and report to the board that AI is being used. But they never build the thing that would actually change the trajectory of the business.&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s not strategy. That&#x27;s maintenance.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-distribution-is-shifting&quot;&gt;The distribution is shifting&lt;&#x2F;h2&gt;
&lt;p&gt;Here&#x27;s the part that makes boards uncomfortable. The balance between these two streams is not static. &lt;strong&gt;You should be putting more focus on stream two every quarter.&lt;&#x2F;strong&gt; Not because stream one is failing, but because the competitive landscape is accelerating.&lt;&#x2F;p&gt;
&lt;p&gt;A year ago, using AI to improve internal efficiency was a legitimate strategic priority. Today, it&#x27;s table stakes. Every competitor is doing it. The differentiation now comes from AI-native products, AI-enabled business models, and capabilities that didn&#x27;t exist before.&lt;&#x2F;p&gt;
&lt;p&gt;The numbers tell a brutal story. An NBER survey of 6,000 executives across the US, UK, Germany, and Australia found that &lt;strong&gt;over 80% detected no discernible productivity impact from AI&lt;&#x2F;strong&gt; despite near-universal adoption. Executives use AI only 1.5 hours per week on average. Robert Solow&#x27;s 1987 paradox is repeating itself: &quot;You can see the computer age everywhere but in the productivity statistics.&quot; The AI is there. The transformation is not.&lt;&#x2F;p&gt;
&lt;p&gt;Meanwhile, PwC&#x27;s 2026 Global CEO Survey found that 45% of CEOs believe their company will not be viable in ten years if it continues on its current path. BCG classifies only 5% of companies as &quot;future-built&quot; for AI. 60% are laggards. The gap between those two groups is widening, not closing. Leaders expect twice the revenue increase and 40% greater cost reductions than laggards. And the gap compounds every quarter that the laggards delay.&lt;&#x2F;p&gt;
&lt;p&gt;The organisations I advise that are getting this right treat stream two with the same seriousness as stream one. Dedicated teams. Ring-fenced budget. Separate metrics. Not a side project in the innovation lab. Not a pilot that exists to generate a board slide. A genuine second line of business development with the mandate and resources to build something real.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-profit-question&quot;&gt;The profit question&lt;&#x2F;h2&gt;
&lt;p&gt;The objection I hear most often is about margins. &quot;We&#x27;re profitable. Why would we invest in something that might not work and could undermine our current revenue?&quot;&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ll be direct. &lt;strong&gt;Even if stream two needs to consume all your profits, or requires additional investment, it is worth doing.&lt;&#x2F;strong&gt; Because this is truly existential.&lt;&#x2F;p&gt;
&lt;p&gt;Your future self, one to three years from now, will never forgive short-sighted thinking on this. The companies that are cautious with stream two investment today will find themselves trying to catch up in 18 months when a competitor, or a startup with no legacy to protect, has already built what they should have been building.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t theoretical. I&#x27;ve seen it happen in real time. At one organisation I worked with, we built an &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;agentic AI system that achieved 67% autonomous case resolution&lt;&#x2F;a&gt;. That system didn&#x27;t optimise the existing workflow. It replaced significant parts of it. The company&#x27;s competitors, who were still optimising their existing processes, are now trying to catch up. They won&#x27;t.&lt;&#x2F;p&gt;
&lt;p&gt;The pattern is playing out right now with companies you can name.&lt;&#x2F;p&gt;
&lt;p&gt;Klarna went all-in on AI replacing customer service agents. Their AI assistant handled 2.3 million conversations in its first month, equivalent to 700 full-time employees. Then quality problems forced them to reverse course and rehire humans. The lesson wasn&#x27;t that AI doesn&#x27;t work. It was that the transition requires building the new while still running the old. Ripping out stream one before stream two is ready is as dangerous as never starting stream two at all.&lt;&#x2F;p&gt;
&lt;p&gt;Shopify took the opposite approach and got it right. Revenue growing 21%+ per year while headcount dropped from 11,600 to 8,100. CEO Tobi Lutke&#x27;s memo made it explicit: no one can hire a human without first proving AI cannot do the job. &lt;strong&gt;They didn&#x27;t stop running the business. They systematically replaced how the business runs.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Duolingo kept the same number of full-time employees and produced 4-5x more content by restructuring around AI. Same investment, radically different output.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The pattern is always the same: protect the current business at the expense of the next one, and someone else will build the next one for you.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-to-run-stream-two-without-killing-stream-one&quot;&gt;How to run stream two without killing stream one&lt;&#x2F;h2&gt;
&lt;p&gt;This is where the execution matters more than the strategy.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Start with a clear threat assessment.&lt;&#x2F;strong&gt; What does your business look like if an AI-native competitor enters your market with no legacy systems, no existing customer expectations, and no margin to protect? What would they build? That&#x27;s what you need to build.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Ring-fence the investment.&lt;&#x2F;strong&gt; Stream two cannot compete with stream one for budget on a quarterly basis. If every investment decision runs through the same ROI model that governs your current business, stream two will always lose. New capabilities don&#x27;t have the same return profile as established ones. They need protected funding with different success criteria.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Staff it with your best people.&lt;&#x2F;strong&gt; The temptation is to put the innovation team on stream two and keep your strongest operators on stream one. This is backwards. Stream two is harder, riskier, and more consequential. It needs your best technical and product minds, people who understand both the existing business and what AI can actually do in production.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Set a time horizon, not just a budget.&lt;&#x2F;strong&gt; Stream two needs 12 to 18 months to prove itself. If you&#x27;re evaluating it on quarterly revenue contribution, you&#x27;ll kill it before it has a chance to work. Define what success looks like at 6 months, 12 months, and 18 months. Make those milestones about capability and learning, not just revenue.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Accept the cannibalisation.&lt;&#x2F;strong&gt; If stream two succeeds, it will take revenue from stream one. That&#x27;s the point. Better you cannibalise your own business than let someone else do it. Amazon understood this when they launched AWS, which competed directly with their own retail infrastructure costs. Apple understood it when the iPhone killed the iPod. The companies that survive are the ones willing to disrupt themselves.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-real-risk-is-doing-nothing&quot;&gt;The real risk is doing nothing&lt;&#x2F;h2&gt;
&lt;p&gt;I talk to a lot of leaders who frame this as a risk decision. &quot;What&#x27;s the risk of investing in stream two?&quot; The risk of investing is that you spend money on something that takes time to generate returns.&lt;&#x2F;p&gt;
&lt;p&gt;The risk of not investing is that your business becomes irrelevant.&lt;&#x2F;p&gt;
&lt;p&gt;Those are not symmetrical risks.&lt;&#x2F;p&gt;
&lt;p&gt;PwC&#x27;s 2026 data is unambiguous: companies applying AI widely to products, services, and customer experiences achieved &lt;strong&gt;nearly 4 percentage points higher profit margins&lt;&#x2F;strong&gt; than those using AI only for internal efficiency. CEOs with strong AI foundations are 3x more likely to report meaningful financial returns. That&#x27;s not a marginal advantage. That&#x27;s the difference between leading and following.&lt;&#x2F;p&gt;
&lt;p&gt;The &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;roughly 80% of AI projects that never unlock their ROI&lt;&#x2F;a&gt; almost all share one characteristic: they were optimisation projects, not transformation projects. They made the existing thing slightly better. They didn&#x27;t build the next thing.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;start-now&quot;&gt;Start now&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;re running a profitable business and haven&#x27;t started stream two, you&#x27;re late. Not too late. But late enough that starting next quarter instead of this one is a decision you&#x27;ll regret.&lt;&#x2F;p&gt;
&lt;p&gt;The good news is that stream two doesn&#x27;t have to be enormous to begin with. A small team. A clear thesis about what an AI-native version of your business looks like. Permission to build something that might compete with your current product. And the understanding, at the board level, that this investment is not optional.&lt;&#x2F;p&gt;
&lt;p&gt;Your current business is funding your future one. Use it while it&#x27;s still strong enough to do so.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to talk through how this applies to your business, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;adopting-ai-well&#x2F;&quot;&gt;Most companies are adopting AI. Few are adopting it well&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>What two hours with Anthropic&#x27;s agent team taught me about building AI</title>
        <published>2026-03-16T00:00:00+00:00</published>
        <updated>2026-03-16T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/anthropic-agent-sdk-what-works/"/>
        <id>https://ctozen.com/insights/anthropic-agent-sdk-what-works/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/anthropic-agent-sdk-what-works/">&lt;p&gt;Anthropic recently ran a full workshop on their Claude Agent SDK at the AI Engineer conference. Thariq Shihipar, who works on the SDK team, spent nearly two hours walking through how Anthropic builds agents internally.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve built agentic systems in production across insurance, defence, and enterprise software. Some of what I heard confirmed patterns I already use. Some gave me new tools. All of it crystallised into five principles that I think define how production agents should be built right now.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The actual agent code is about 50 lines. The real engineering is everything around it.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Here are the five principles.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;1-your-agent-is-the-environment-not-the-code&quot;&gt;1. Your agent is the environment, not the code&lt;&#x2F;h2&gt;
&lt;p&gt;This was the workshop&#x27;s most powerful reframing. The orchestration code is trivial. The real engineering is what you put in the working directory. The same lesson, that the engineering lives in the scaffolding rather than the model, runs through &lt;a href=&quot;&#x2F;insights&#x2F;not-talking-to-an-llm&#x2F;&quot;&gt;everything you are actually talking to when you use an AI&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;A typical agent project:&lt;&#x2F;p&gt;
&lt;pre&gt;&lt;code&gt;your-agent&#x2F;
├── agent.ts              # ~50 lines of boilerplate
├── CLAUDE.md             # Instructions, API descriptions, rules
├── scripts&#x2F;              # Bash tools with --help
├── lib&#x2F;                  # TypeScript SDKs and types
├── data&#x2F;                 # Reference data
├── skills&#x2F;               # On-demand context
├── memories&#x2F;             # Persistent state (just files)
└── examples&#x2F;             # Example scripts for codegen
&lt;&#x2F;code&gt;&lt;&#x2F;pre&gt;
&lt;p&gt;Context is not just a prompt. It&#x27;s the scripts the agent can discover. The TypeScript SDK you generated for your API. The CLAUDE.md file describing what&#x27;s available. The skill directories the agent can navigate into when it needs specialised knowledge.&lt;&#x2F;p&gt;
&lt;p&gt;Thariq used a good analogy. If someone locked you in a room and gave you tasks, would you want a stack of papers or a computer with Google? You&#x27;d want the computer. &lt;strong&gt;Give the agent the tools to find its own information, not the information itself.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen this principle in my own work. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance automation system&lt;&#x2F;a&gt; I designed resolves 67% of cases autonomously, and the agent code is a small fraction of the engineering effort. Most of the work went into the environment: data pipelines, decision rules, escalation logic, and verification layers. Once you make that shift, agent development becomes less about writing clever code and more about curating an excellent workspace.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;2-a-shell-replaces-a-tool-registry&quot;&gt;2. A shell replaces a tool registry&lt;&#x2F;h2&gt;
&lt;p&gt;Instead of defining a search tool, a lint tool, an execute tool, and a test tool, each with its own schema and error handling, Anthropic&#x27;s approach is to give the agent bash. The agent just uses &lt;code&gt;grep&lt;&#x2F;code&gt;, &lt;code&gt;npm run lint&lt;&#x2F;code&gt;, &lt;code&gt;npm test&lt;&#x2F;code&gt;, and &lt;code&gt;ffmpeg&lt;&#x2F;code&gt;. One generic tool replaces an entire registry.&lt;&#x2F;p&gt;
&lt;p&gt;The reason this works is composability. Bash lets the agent pipe outputs together, save intermediate results to files, discover new capabilities by running &lt;code&gt;--help&lt;&#x2F;code&gt;, and leverage the entire Unix ecosystem. You don&#x27;t need to anticipate every action the agent might take. You give it a shell and it composes the tools that already exist.&lt;&#x2F;p&gt;
&lt;p&gt;This is how Claude Code is built, and it&#x27;s the most capable coding agent available today. The same architecture powers non-coding agents in finance, legal, healthcare, and customer service. &lt;strong&gt;The bash tool is what makes it generalise.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;The workshop laid out a clear decision framework for when to use each capability. Tools for irreversible actions where you want explicit confirmation. Bash for composable operations. Code generation for dynamic logic.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;img src=&quot;&#x2F;img&#x2F;insights&#x2F;agent-sdk&#x2F;capability-selection.svg&quot; alt=&quot;Capability selection: decision tree for choosing tools, bash, or code generation&quot; &#x2F;&gt;&lt;&#x2F;p&gt;
&lt;p&gt;The practical takeaway: if you&#x27;re maintaining more than a handful of custom tool definitions, you&#x27;re probably doing work the shell could do better.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;3-translate-your-data-into-the-model-s-language&quot;&gt;3. Translate your data into the model&#x27;s language&lt;&#x2F;h2&gt;
&lt;p&gt;This is the principle with the highest return on effort. &lt;strong&gt;Make your problem in-distribution.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;ve only tried one search interface for your data, it&#x27;s probably not enough. For a spreadsheet agent, you might try SQL queries, cell reference syntax, grep on CSV, or XML queries (XLS files are XML underneath). Different approaches work differently depending on the data shape.&lt;&#x2F;p&gt;
&lt;p&gt;The most impactful thing you can do is translate your data into a format the model already knows well. Loading a CSV into SQLite and letting the agent write SQL queries is often dramatically more effective than any custom search tool. Generating TypeScript interfaces from your API schema gives the model type information it can reason about natively.&lt;&#x2F;p&gt;
&lt;p&gt;This single step often produces more improvement than any amount of prompt engineering. It&#x27;s also the easiest to test: try two formats, measure which produces better results, keep the winner.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;4-verify-at-every-layer-not-just-the-output&quot;&gt;4. Verify at every layer, not just the output&lt;&#x2F;h2&gt;
&lt;p&gt;Every agent follows three steps: gather context, take action, verify work. Planning sits optionally between the first and second steps.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;img src=&quot;&#x2F;img&#x2F;insights&#x2F;agent-sdk&#x2F;agent-loop.svg&quot; alt=&quot;The agent loop: gather context, plan, take action, verify work, iterate&quot; &#x2F;&gt;&lt;&#x2F;p&gt;
&lt;p&gt;That sounds trivial. Getting it right is not.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Verification is a discipline applied at every layer, not a final check.&lt;&#x2F;strong&gt; This aligns precisely with what I described in my &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;earlier piece on agentic AI in production&lt;&#x2F;a&gt;. Errors compound in multi-step pipelines. Catching them early is dramatically cheaper than catching them at the end.&lt;&#x2F;p&gt;
&lt;p&gt;Anthropic uses deterministic rules wherever possible: linting, compilation, schema validation, constraint checks. In Claude Code, if the agent tries to write a file it hasn&#x27;t read, the harness throws an error and tells it to read first. Simple, deterministic, and extremely effective.&lt;&#x2F;p&gt;
&lt;p&gt;A practical detail worth noting: the model reads error messages and iterates. If your error says &quot;you tried to insert 50,000 rows in one operation, please chunk this into batches of 1,000 or fewer,&quot; the agent will follow that coaching. &lt;strong&gt;Design your error messages as instructions, not diagnostics.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Sub-agents add a powerful verification layer. You spawn a sub-agent with a completely fresh context and frame the task combatively: &quot;This analysis was written by a junior analyst. Find the errors.&quot; The fresh context means the verifier has no sympathetic relationship with the work. It&#x27;s genuinely adversarial.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;img src=&quot;&#x2F;img&#x2F;insights&#x2F;agent-sdk&#x2F;sub-agents.svg&quot; alt=&quot;Sub-agent architecture: main agent delegates to parallel workers and a verification agent&quot; &#x2F;&gt;&lt;&#x2F;p&gt;
&lt;p&gt;The gap between self-verification and adversarial verification is significant. I plan to integrate this pattern into my own production deployments. Sub-agents also shine for parallel processing (&quot;read and summarise sheet 1, sheet 2, sheet 3 simultaneously&quot;) and search offloading, where many queries run but only the final answer returns to the main agent. Both patterns keep the main context clean and focused.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;5-read-every-transcript-then-improve-the-environment&quot;&gt;5. Read every transcript. Then improve the environment&lt;&#x2F;h2&gt;
&lt;p&gt;The workshop was emphatic about one workflow: don&#x27;t start with the SDK. Start with Claude Code.&lt;&#x2F;p&gt;
&lt;p&gt;Set up a working directory with your APIs, data, and scripts. Write a CLAUDE.md. Chat with Claude Code and give it real tasks. Then read the transcripts.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;This is the single most important practice for improving agent design, and it&#x27;s the one most people skip.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;After every session, read the full transcript. Where did the agent get stuck? Where did it take unnecessary detours? What context was it missing? Where did it use training knowledge instead of your data?&lt;&#x2F;p&gt;
&lt;p&gt;Iterate on the environment based on what you learn. Improve the instructions. Add scripts. Create skills. Add hooks to catch behaviour you don&#x27;t want. Then read the transcripts again.&lt;&#x2F;p&gt;
&lt;p&gt;The iteration loop between reading transcripts and improving context is where most of the real engineering happens. Once results feel good, writing the agent.ts file and deploying to a sandbox is straightforward.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;a-note-on-security&quot;&gt;A note on security&lt;&#x2F;h2&gt;
&lt;p&gt;Anthropic describes their security model as a &quot;Swiss cheese defence.&quot; No single layer blocks everything. Together, the layers cover each other&#x27;s gaps.&lt;&#x2F;p&gt;
&lt;p&gt;Your job as an agent builder is the outer layer: sandboxing. One container per user. Network isolation. Filesystem isolation. Use providers like Cloudflare, Modal, E2B, or Daytona.&lt;&#x2F;p&gt;
&lt;p&gt;The &quot;lethal trifecta&quot; to guard against: code execution, filesystem changes, and data exfiltration. &lt;strong&gt;Sandbox the network and you cut off exfiltration even if the other two are compromised.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;For database access, the principle is counterintuitive but sound: give the agent broad access, then add guardrails. Letting the agent write dynamic SQL and fix its own errors through iteration produces better results than restricting it to predefined queries. Reserve strict masking for genuinely sensitive data.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;build-for-now-rewrite-in-six-months&quot;&gt;Build for now. Rewrite in six months&lt;&#x2F;h2&gt;
&lt;p&gt;Thariq closed with something that resonated. Model capabilities change fast. Code that was necessary six months ago may be unnecessary today. Build for current capabilities. Ship now. Rewrite later.&lt;&#x2F;p&gt;
&lt;p&gt;The companies moving fastest are the ones willing to throw away code and rebuild with current capabilities. As Thariq put it: &lt;strong&gt;&quot;We can write code 10x faster. We should throw out code 10x faster.&quot;&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s not instability. That&#x27;s the natural rhythm of building on a platform that&#x27;s improving faster than any in history. The best time to start is now, knowing you&#x27;ll improve it later. If you want a structured way to move from prototype to production, &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;my methodology page&lt;&#x2F;a&gt; describes how I approach these engagements, and the &lt;a href=&quot;&#x2F;services&#x2F;#transformation-sprint&quot;&gt;transformation sprint&lt;&#x2F;a&gt; is designed specifically for getting agentic systems into production.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Building an agentic system and want to validate your architecture? &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;AI made developers 19% slower. Here&#x27;s what they were doing wrong&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;Case study: agentic AI in insurance&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>You&#x27;re not a 10x engineer. You&#x27;re an orchestrator, and that&#x27;s harder</title>
        <published>2026-03-12T00:00:00+00:00</published>
        <updated>2026-03-12T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/orchestrator-not-10x-engineer/"/>
        <id>https://ctozen.com/insights/orchestrator-not-10x-engineer/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/orchestrator-not-10x-engineer/">&lt;p&gt;Andrej Karpathy coined &quot;vibe coding&quot; in February 2025: &quot;you fully give in to the vibes, embrace exponentials, and forget that the code even exists.&quot; It was a throwaway tweet. It became Collins Dictionary&#x27;s Word of the Year. And a year later, Karpathy himself declared it passé.&lt;&#x2F;p&gt;
&lt;p&gt;His replacement term is more interesting: &lt;strong&gt;agentic engineering&lt;&#x2F;strong&gt;. &quot;The new default is that you are not writing the code directly 99% of the time. You are orchestrating agents who do and acting as oversight.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;That one sentence captures the biggest shift in software engineering in a generation. And most of the industry hasn&#x27;t caught up with what it actually means.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-old-10x-myth&quot;&gt;The old 10x myth&lt;&#x2F;h2&gt;
&lt;p&gt;The &quot;10x engineer&quot; was always a flawed concept. The idea that one developer could produce ten times the output of another was based on a misunderstanding of what software engineers actually do. Writing code was never the bottleneck. Understanding the problem, designing the system, and making the right trade-offs. That was always the hard part.&lt;&#x2F;p&gt;
&lt;p&gt;But the myth persisted because, in the old paradigm, implementation speed was visible and valued. The engineer who could bang out a feature in a day while others took a week &lt;em&gt;looked&lt;&#x2F;em&gt; ten times more productive, even if the feature was poorly designed and would need rewriting in six months.&lt;&#x2F;p&gt;
&lt;p&gt;AI has made that version of productivity meaningless. When any developer can direct an AI agent to produce code at machine speed, typing speed is no longer a differentiator. The skill that used to look like 10x, fast and fluent code production, has been commoditised overnight.&lt;&#x2F;p&gt;
&lt;p&gt;What hasn&#x27;t been commoditised is the thing that was always more important: &lt;strong&gt;judgement.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-orchestration-actually-looks-like&quot;&gt;What orchestration actually looks like&lt;&#x2F;h2&gt;
&lt;p&gt;Boris Cherny, the head of Claude Code at Anthropic, runs 5 local sessions and 5-10 remote sessions in parallel, each in its own git checkout. He shipped 22 pull requests in a day, then 27 the next, every line written by AI. He hasn&#x27;t edited code by hand since November 2025.&lt;&#x2F;p&gt;
&lt;p&gt;That sounds like effortless productivity. It isn&#x27;t.&lt;&#x2F;p&gt;
&lt;p&gt;What Cherny actually does is architectural direction, quality control, and context management at a pace that would overwhelm most engineers. Each session needs a well-scoped task. Each result needs review against the broader system. Roughly 10-20% of parallel sessions get abandoned because the agent went in an unexpected direction. The team maintains structured documentation that tells the agent what conventions to follow and what mistakes to avoid, and every time the agent makes a new kind of error, they update the documentation so it doesn&#x27;t happen again.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t &quot;vibe coding.&quot; This is engineering management at a different level of abstraction. The code is written by machines. The thinking (the decomposition, the verification, the architectural coherence) is entirely human.&lt;&#x2F;p&gt;
&lt;p&gt;At StrongDM, three engineers built what Simon Willison called &quot;the most ambitious form of AI-assisted software development I&#x27;ve seen yet.&quot; No human writes code. No human reviews code. The humans design specifications, curate test scenarios, and watch scores. They built digital replicas of Okta, Jira, Slack, and Google Docs to run thousands of automated test scenarios per hour.&lt;&#x2F;p&gt;
&lt;p&gt;Three people. Zero human code. And the system works because those three people have the engineering judgement to design the right specifications and the right tests. Which is vastly harder than writing the code would have been.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-dirty-secret&quot;&gt;The dirty secret&lt;&#x2F;h2&gt;
&lt;p&gt;Tanmai Gopal, CEO of a billion-dollar-plus startup, said something that cuts through the hype: &lt;strong&gt;&quot;70% of the effort required to make AI useful relies entirely on unwritten business context that exists only in human heads.&quot;&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;This is the part that gets lost in the &quot;AI writes all the code&quot; narrative. The code is the easy part. It was always the easy part. The hard part is knowing what to build, why to build it, and how it fits into the system that already exists.&lt;&#x2F;p&gt;
&lt;p&gt;An AI agent can write a feature. It cannot understand why the previous architect made certain trade-offs, what implicit assumptions the existing system relies on, which stakeholders care about which outcomes, or how a change in one subsystem will cascade through the business process it supports.&lt;&#x2F;p&gt;
&lt;p&gt;That context, the messy, human, organisational context, is the orchestrator&#x27;s primary raw material. And it&#x27;s the thing that makes the role harder, not easier, than the old &quot;just write the code&quot; paradigm.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-competence-drain&quot;&gt;The competence drain&lt;&#x2F;h2&gt;
&lt;p&gt;DHH, the creator of Ruby on Rails and one of the most respected voices in software development, described his experience with AI coding tools in strikingly physical terms: &quot;I can literally feel competence draining out of my fingers.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;He loves using AI for drafts, API lookups, and second opinions. But he keeps AI code in a separate window. He doesn&#x27;t let it drive. His analogy: &quot;You&#x27;re not going to get fit by watching fitness videos. You have to do the sit-ups.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;Dave Farley, author of the definitive book on continuous delivery, made the same point more directly: &quot;AI is not going to replace software engineers. But it is going to expose the ones who never learned to think like engineers in the first place.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;There&#x27;s a real tension here. The orchestrator role requires &lt;em&gt;more&lt;&#x2F;em&gt; engineering judgement than the implementer role, not less. You need to evaluate AI-generated code for correctness, security, performance, and maintainability. That means you need to understand code at a level that many developers never reached even when they were writing it themselves.&lt;&#x2F;p&gt;
&lt;p&gt;The &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;METR study&lt;&#x2F;a&gt; found that developers using AI were 19% slower, and believed they were 20% faster. That perception gap is what the competence drain looks like from the inside. You feel productive because the tool is doing things at machine speed. But your ability to critically evaluate what it produces is atrophying. And you don&#x27;t notice until something breaks in production.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-the-orchestrator-needs-to-be-good-at&quot;&gt;What the orchestrator needs to be good at&lt;&#x2F;h2&gt;
&lt;p&gt;If the role is shifting from implementer to orchestrator, the skill profile shifts with it. Based on what I&#x27;ve seen work, both in my own practice and in the organisations I advise, here&#x27;s what matters:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Problem decomposition.&lt;&#x2F;strong&gt; The single most important skill. An AI agent can execute a well-scoped task. It cannot take a vague business requirement and figure out what &quot;done&quot; looks like. The orchestrator breaks ambiguous problems into clear, testable units of work, each one small enough for an agent to handle and specific enough to verify.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Architectural judgement.&lt;&#x2F;strong&gt; AI generates code that works in isolation. The orchestrator ensures it works within the system: that the data model is consistent, the abstractions are right, the performance characteristics match the requirements, and the changes don&#x27;t introduce subtle regressions elsewhere. This requires the kind of deep system understanding that only comes from experience. The &lt;a href=&quot;&#x2F;impact&#x2F;national-security-platform&#x2F;&quot;&gt;national security platform&lt;&#x2F;a&gt; I worked on is a good example: when systems operate across 30+ countries at defence-grade reliability, there&#x27;s no room for AI-generated code that &quot;works in isolation&quot; but breaks the wider architecture.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Verification discipline.&lt;&#x2F;strong&gt; Gergely Orosz, one of the most followed voices in software engineering, identified this as the skill that&#x27;s becoming &lt;em&gt;more&lt;&#x2F;em&gt; valuable as AI generates more code: the ability to read code critically and catch what&#x27;s wrong. Not just syntax errors. Architectural flaws, security vulnerabilities, subtle logic errors that the agent will never flag. CodeRabbit&#x27;s data shows AI produces 75% more logic errors and 8x more excessive I&#x2F;O operations than humans. Someone has to catch those.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Context maintenance.&lt;&#x2F;strong&gt; The orchestrator holds the business context, the system context, and the user context in their head simultaneously, and translates that context into instructions the agent can act on. This is what Tanmai Gopal&#x27;s &quot;70% unwritten business context&quot; means in practice: the orchestrator is the bridge between what the organisation needs and what the machine can do. In the &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance automation project&lt;&#x2F;a&gt;, the agentic system works because of clear decision delegation: the agent handles cases autonomously but the humans defined the boundaries, the escalation triggers, and the verification rules. That&#x27;s orchestration at the organisational level.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Knowing when not to use AI.&lt;&#x2F;strong&gt; This might be the most underrated skill. Not every task benefits from AI delegation. Some problems require the kind of deep, focused thinking that gets &lt;em&gt;worse&lt;&#x2F;em&gt; when you&#x27;re managing parallel agent sessions. The best orchestrators I&#x27;ve worked with are deliberate about when they direct agents and when they sit down and think.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-identity-problem&quot;&gt;The identity problem&lt;&#x2F;h2&gt;
&lt;p&gt;There&#x27;s something uncomfortable about this transition that doesn&#x27;t get discussed enough.&lt;&#x2F;p&gt;
&lt;p&gt;An anonymous junior engineer at a large San Francisco tech company told the SF Standard: &quot;I&#x27;m basically a proxy to Claude Code. My manager tells me what to do, and I tell Claude to do it. The skill you spent years developing is now just commoditised to the general public. It makes you feel kind of empty.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;Michael Parker, VP of Engineering at TurinTech, put it differently: &quot;I used to be a craftsman. Now I feel like a factory manager of Ikea. I&#x27;m just shipping low-quality chairs.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;These aren&#x27;t edge cases. A lot of experienced developers are grappling with the same feeling. The thing they were good at, writing elegant, efficient, well-structured code, is no longer the thing that matters most. The new thing that matters, orchestrating AI agents, maintaining context, verifying output, doesn&#x27;t feel like craftsmanship. It feels like project management.&lt;&#x2F;p&gt;
&lt;p&gt;I think this is a temporary perception. Orchestration at its best is a craft, one that requires deeper understanding of systems, architecture, and engineering principles than implementation ever did. But it&#x27;s a new craft, and the industry hasn&#x27;t yet built the identity, the language, or the career ladders around it.&lt;&#x2F;p&gt;
&lt;p&gt;The organisations that figure this out first, that build roles, progression paths, and recognition structures for the orchestrator, will attract and retain the best engineering talent. The ones that don&#x27;t will lose their best people to the companies that do.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-for-engineering-leaders&quot;&gt;What this means for engineering leaders&lt;&#x2F;h2&gt;
&lt;p&gt;If you lead an engineering team, the transition from implementer to orchestrator isn&#x27;t optional. It&#x27;s happening whether you manage it or not.&lt;&#x2F;p&gt;
&lt;p&gt;The question is whether it happens deliberately, with clear expectations, appropriate training, and workflow design that sets people up to succeed, or whether it happens chaotically, with developers independently experimenting with AI tools, accumulating technical debt, and losing skills they&#x27;ll need when the agent produces something wrong.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;d start with three things:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Redefine what &quot;senior&quot; means.&lt;&#x2F;strong&gt; If your promotion criteria still centre on code quality and implementation speed, they&#x27;re measuring the old game. Senior engineers in the agentic era are defined by judgement, system thinking, and the ability to direct AI effectively, not by how fast they can write a function.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Build verification into the workflow.&lt;&#x2F;strong&gt; AI-generated code needs different review practices than human-written code. The failure modes are different: more duplication, more security vulnerabilities, more plausible-looking logic errors. Your code review process needs to account for this explicitly.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Invest in the transition.&lt;&#x2F;strong&gt; Your best implementers won&#x27;t automatically become your best orchestrators. Some will. Others will need support, training, and time. The ones who make the transition will be extraordinarily valuable. The investment is worth it.&lt;&#x2F;p&gt;
&lt;p&gt;The title &quot;software engineer&quot; might survive. Or it might evolve into something else. What won&#x27;t change is the need for people who can think clearly about complex systems, make good decisions under uncertainty, and translate business problems into technical solutions. That&#x27;s always been the job. The tools just changed.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you&#x27;re navigating this transition in your engineering organisation and want to think it through with someone who&#x27;s been in the seat, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;training-ladder-broken&#x2F;&quot;&gt;The training ladder is broken&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;AI made developers 19% slower: here&#x27;s what they were doing wrong&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#fractional-cto&quot;&gt;How I help engineering teams make this transition&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>AI made developers 19% slower. Here&#x27;s what they were doing wrong</title>
        <published>2026-03-08T00:00:00+00:00</published>
        <updated>2026-03-08T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-made-developers-slower/"/>
        <id>https://ctozen.com/insights/ai-made-developers-slower/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-made-developers-slower/">&lt;p&gt;The most rigorous study we have on AI coding productivity comes from METR, a non-profit AI research lab that ran a randomised controlled trial with 16 experienced open-source developers across 246 real-world tasks. The result: developers using AI tools completed tasks &lt;strong&gt;19% slower&lt;&#x2F;strong&gt; than without them.&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s a striking finding. But the truly alarming part isn&#x27;t the slowdown. It&#x27;s this: before the study, developers predicted AI would make them 24% faster. After experiencing the slowdown, they &lt;em&gt;still believed&lt;&#x2F;em&gt; AI had sped them up by 20%.&lt;&#x2F;p&gt;
&lt;p&gt;That perception gap, being slower while feeling faster, is the most dangerous thing happening in software engineering right now.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-metr-actually-measured&quot;&gt;What METR actually measured&lt;&#x2F;h2&gt;
&lt;p&gt;The study deserves close reading, because the details matter more than the headline.&lt;&#x2F;p&gt;
&lt;p&gt;These were experienced developers working on their own mature repositories, codebases they knew intimately. They were using mainstream AI coding tools: autocomplete, chat-based assistance, the kind of tooling that most organisations have already deployed.&lt;&#x2F;p&gt;
&lt;p&gt;Domenic Denicola, maintainer of jsdom and one of the study participants, published a detailed account. A performance optimisation he estimated at 30 minutes took 4 hours and 7 minutes with AI. A test-writing task he estimated at 1 hour took 4 hours and 20 minutes. Agents got stuck in loops, spent 30+ minutes finding files, applied syntax incorrectly, and moved lines one at a time instead of reordering them.&lt;&#x2F;p&gt;
&lt;p&gt;His verdict: AI made the tasks &quot;more engaging,&quot; like a game, but not faster. And agents required &quot;constant handholding and continuous awareness of the model&#x27;s limitations.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;This is not a story about AI being useless. It&#x27;s a story about &lt;strong&gt;developers using the wrong tools in the wrong way on the wrong tasks.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-autocomplete-trap&quot;&gt;The autocomplete trap&lt;&#x2F;h2&gt;
&lt;p&gt;Here&#x27;s what most organisations have done: they&#x27;ve bought Copilot licences, rolled them out to the engineering team, and called it AI adoption.&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s like giving someone a power drill and asking them to use it as a hammer. It&#x27;s the wrong tool for the job. Or more precisely, it&#x27;s the right tool being used in the wrong paradigm.&lt;&#x2F;p&gt;
&lt;p&gt;Autocomplete-style AI, suggesting the next few lines as you type, is useful for boilerplate. But when you&#x27;re working in a codebase you already know well, on tasks you already know how to do, adding an AI middleman to your muscle memory is overhead. You&#x27;re slower because you&#x27;re reading and evaluating suggestions for code you could have written faster yourself.&lt;&#x2F;p&gt;
&lt;p&gt;The METR developers weren&#x27;t bad engineers. They were excellent engineers using AI tools designed for a workflow that doesn&#x27;t match how excellent engineers actually work.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-doing-it-right-looks-like&quot;&gt;What &quot;doing it right&quot; looks like&lt;&#x2F;h2&gt;
&lt;p&gt;Meanwhile, in a parallel universe, Boris Cherny, the head of Claude Code at Anthropic, ships 10 to 30 pull requests per day. Every line written by AI. He hasn&#x27;t edited code by hand since November 2025.&lt;&#x2F;p&gt;
&lt;p&gt;Lee Edwards, an investor at Root Ventures, wrote hundreds of thousands of lines of code across six projects in two weeks using agentic AI tools. He described it as &quot;a nuclear-powered six-axis mill. A single-person software factory.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;At StrongDM, three engineers built a system where no human writes code and no human reviews code. The humans design specifications, curate test scenarios, and watch scores. Simon Willison called it &quot;the most ambitious form of AI-assisted software development I&#x27;ve seen yet.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;These aren&#x27;t demo numbers. These are production systems. I&#x27;ve seen the same pattern in my own work: the &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;agentic AI system I designed for a European insurance brokerage&lt;&#x2F;a&gt; resolves 67% of customer service cases without human intervention. Not a demo. A production system handling thousands of cases per month.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The difference isn&#x27;t talent. It&#x27;s method.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;autocomplete-versus-agentic&quot;&gt;Autocomplete versus agentic&lt;&#x2F;h2&gt;
&lt;p&gt;The gap in the data makes no sense until you separate two fundamentally different approaches to AI-assisted development.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Autocomplete AI&lt;&#x2F;strong&gt; (Copilot, inline suggestions): the developer writes code, the AI suggests completions. The developer remains in the driver&#x27;s seat, making every decision, with the AI offering marginal assistance on each line. On familiar codebases with experienced developers, this adds friction more often than it removes it. DX&#x27;s survey of 121,000 developers found that productivity gains from this approach have plateaued at roughly 10%, unchanged since Q2 2025.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Agentic AI&lt;&#x2F;strong&gt; (Claude Code, Cursor agents, similar tools): the developer describes what needs to be built. The AI agent reads the codebase, writes the code, runs the tests, iterates on failures, and delivers a working result. The developer&#x27;s role shifts from writing to directing and reviewing. This is where the 5x-10x productivity claims come from, and where the evidence supports them.&lt;&#x2F;p&gt;
&lt;p&gt;The METR study measured the first approach. The production results from Cherny, Edwards, and StrongDM use the second.&lt;&#x2F;p&gt;
&lt;p&gt;This distinction matters enormously, because most organisations haven&#x27;t made the shift. They&#x27;re still in the autocomplete paradigm: buying licences, measuring adoption rates, and wondering why the numbers haven&#x27;t moved.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-perception-gap-is-the-real-problem&quot;&gt;The perception gap is the real problem&lt;&#x2F;h2&gt;
&lt;p&gt;Let me come back to that METR finding: developers were 19% slower but believed they were 20% faster. That&#x27;s a 39-percentage-point perception gap.&lt;&#x2F;p&gt;
&lt;p&gt;This pattern is everywhere. The DX survey found 92.6% of developers use AI tools at least monthly, but productivity gains are stuck at 10%. Fastly&#x27;s survey found 95% of developers spend extra hours debugging AI output. Seniors spend up to 40% of their time fixing AI-generated code. CodeRabbit&#x27;s analysis of 470 open-source repositories found that AI creates 1.7x more bugs, 75% more logic errors, and 2.74x more security vulnerabilities than human-written code.&lt;&#x2F;p&gt;
&lt;p&gt;Meanwhile, Stack Overflow&#x27;s 2025 developer survey shows trust in AI accuracy has &lt;em&gt;fallen&lt;&#x2F;em&gt; to 29%, down from 40%. And the top frustration, cited by 66% of developers: &quot;AI solutions that are almost right, but not quite.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;If your team believes AI is making them faster but you can&#x27;t see it in the output metrics, the METR perception gap is probably your explanation.&lt;&#x2F;strong&gt; They&#x27;re not lying. They genuinely feel more productive. The tool is engaging and reduces the tedium of certain tasks. But the net effect, including debugging, rework, and the cognitive overhead of evaluating suggestions, is neutral or negative.&lt;&#x2F;p&gt;
&lt;p&gt;You can&#x27;t fix what you can&#x27;t see. And if your developers sincerely believe the tool is helping when the data says it isn&#x27;t, you have a measurement problem masquerading as a productivity problem.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-tell-a-cto-reading-this&quot;&gt;What I&#x27;d tell a CTO reading this&lt;&#x2F;h2&gt;
&lt;p&gt;I use agentic AI tools daily. I&#x27;ve seen what they can do in the hands of someone who understands both the tools and the engineering problems they&#x27;re applied to. The productivity gains are real. Genuinely transformative in some workflows.&lt;&#x2F;p&gt;
&lt;p&gt;But I&#x27;ve also seen the other side: organisations that rolled out AI coding tools, measured adoption instead of outcomes, and now have developers who are slower, less rigorous in code review, and accumulating technical debt at a rate that will cost them dearly in 18 months.&lt;&#x2F;p&gt;
&lt;p&gt;The difference comes down to three things:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;First, use the right class of tool for the job.&lt;&#x2F;strong&gt; Autocomplete has its place for quick lookups, unfamiliar syntax, and API discovery. But for meaningful productivity gains, you need agentic tools that can take on whole tasks, not just suggest the next line. And your workflow needs to be rebuilt around delegation and review, not around typing faster.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Second, measure outcomes, not adoption.&lt;&#x2F;strong&gt; I don&#x27;t care what percentage of your team uses AI tools. I care about cycle time, defect rate, time-to-production, and business metrics. If those haven&#x27;t improved, your AI adoption isn&#x27;t working, regardless of what your developers report in satisfaction surveys. The discipline this requires is the same one I learned building &lt;a href=&quot;&#x2F;impact&#x2F;national-security-platform&#x2F;&quot;&gt;defence-grade platforms&lt;&#x2F;a&gt;: measure what matters, not what&#x27;s easy to count.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Third, invest in AI fluency, not just AI access.&lt;&#x2F;strong&gt; The gap between the METR developers (19% slower) and Boris Cherny (27 PRs per day) is not about the AI models. It&#x27;s about how you structure your work around them. That&#x27;s a skill, and like any skill, it requires deliberate development, not just tool provisioning.&lt;&#x2F;p&gt;
&lt;p&gt;The organisations that get this right will have a genuine competitive advantage. The ones that confuse AI adoption with AI effectiveness will have expensive tools and the same output. Or worse.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you&#x27;re trying to figure out whether your AI tooling investment is actually delivering, &lt;a href=&quot;&#x2F;contact&quot;&gt;let&#x27;s talk&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;Why your AI spend isn&#x27;t showing up in the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#transformation-sprint&quot;&gt;Transformation sprint: fixing AI tooling and workflow&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Most companies are adopting AI. Few are adopting it well</title>
        <published>2026-03-04T00:00:00+00:00</published>
        <updated>2026-03-04T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/adopting-ai-well/"/>
        <id>https://ctozen.com/insights/adopting-ai-well/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/adopting-ai-well/">&lt;p&gt;Here are four numbers that should not coexist:&lt;&#x2F;p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;84%&lt;&#x2F;strong&gt; of developers now use AI coding tools&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;29%&lt;&#x2F;strong&gt; trust the accuracy of what those tools produce&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;10%&lt;&#x2F;strong&gt; is the average productivity gain, unchanged for a year&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;55%&lt;&#x2F;strong&gt; of companies that laid off workers for AI now regret it&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;p&gt;That&#x27;s not an AI problem. That&#x27;s an adoption problem. And it&#x27;s the most expensive kind: the kind where you&#x27;ve already spent the money and aren&#x27;t getting the return. It&#x27;s also the most recoverable kind, because the fix doesn&#x27;t need a bigger budget or a better model. It needs the return unlocked from spend you&#x27;ve already committed.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-procurement-fallacy&quot;&gt;The procurement fallacy&lt;&#x2F;h2&gt;
&lt;p&gt;Most organisations treat AI adoption as a procurement decision. Buy Copilot licences. Roll them out. Track adoption rates. Report to the board that 90% of engineers are using AI tools, a habit that usually traces back to &lt;a href=&quot;&#x2F;insights&#x2F;ai-understanding-pyramid&#x2F;&quot;&gt;decision-makers who have watched a demo rather than built anything&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;This is the equivalent of buying gym memberships for the entire company and measuring success by how many people scanned their badges at the door. You&#x27;re measuring presence, not results.&lt;&#x2F;p&gt;
&lt;p&gt;MIT researchers found that &lt;strong&gt;95% of enterprise generative AI deployments had no measurable impact on profit and loss&lt;&#x2F;strong&gt;. Not &quot;small impact.&quot; No measurable impact. Forrester found that over half of companies that cut staff in the name of AI efficiency are now quietly rehiring, because it turns out that removing humans from processes that AI can&#x27;t actually handle creates problems that are more expensive than the savings.&lt;&#x2F;p&gt;
&lt;p&gt;The founders and small to mid-market teams I work with that are genuinely getting value from AI didn&#x27;t start by buying tools. They started by asking a different question entirely.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-doing-it-badly-looks-like&quot;&gt;What &quot;doing it badly&quot; looks like&lt;&#x2F;h2&gt;
&lt;p&gt;The pattern is remarkably consistent.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stage 1: Tool deployment.&lt;&#x2F;strong&gt; The company buys AI coding tool licences. There&#x27;s an announcement. Maybe a lunch-and-learn. Developers start using autocomplete suggestions.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stage 2: Adoption theatre.&lt;&#x2F;strong&gt; Usage numbers go up. The team reports feeling more productive. Internal surveys are positive. The board gets a slide showing 85% adoption.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stage 3: The invisible costs accumulate.&lt;&#x2F;strong&gt; Code review becomes cursory. Developers rubber-stamp AI suggestions they should scrutinise. Technical debt accumulates faster. CodeRabbit&#x27;s analysis of 470 open-source repositories found AI-generated code contains 1.7x more bugs, 75% more logic errors, and 2.74x more security vulnerabilities. GitClear tracked a doubling of code churn (lines reverted or updated within two weeks) compared to the pre-AI baseline. Copy-pasted code increased 48%.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Stage 4: The reckoning.&lt;&#x2F;strong&gt; Defect rates climb. Production incidents increase. The productivity gains that everyone felt never materialise in the actual delivery metrics. But by now, the organisation has restructured around the assumption that AI is working. Unwinding is painful.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen this play out at multiple organisations. The specific tools vary. The pattern doesn&#x27;t.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-doing-it-well-looks-like&quot;&gt;What &quot;doing it well&quot; looks like&lt;&#x2F;h2&gt;
&lt;p&gt;The companies getting genuine returns from AI share characteristics that have nothing to do with which model they&#x27;re using or how much they&#x27;re spending on compute.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;they-changed-the-workflow-not-just-the-tooling&quot;&gt;They changed the workflow, not just the tooling&lt;&#x2F;h3&gt;
&lt;p&gt;Deloitte&#x27;s research shows that organisations taking a work-redesign approach, rethinking processes before selecting tools, are &lt;strong&gt;twice as likely to exceed ROI targets&lt;&#x2F;strong&gt; than those starting with the technology.&lt;&#x2F;p&gt;
&lt;p&gt;This means something concrete. It means you don&#x27;t give developers AI tools and tell them to keep working the same way. You redesign how work flows through the team. At Anthropic, the Claude Code team maintains structured documentation files that tell the AI agent what the codebase conventions are, what mistakes to avoid, and how to structure changes. That&#x27;s not a tool feature. It&#x27;s a workflow decision. And it improves output quality by 2-3x.&lt;&#x2F;p&gt;
&lt;p&gt;At Shopify, Tobi Lutke&#x27;s internal memo was explicit: &quot;Reflexive AI usage is now a baseline expectation.&quot; But the meaningful part wasn&#x27;t the mandate. It was the structural change. Teams must demonstrate why they cannot achieve what they need using AI before requesting additional headcount. That forces a genuine rethink of how work gets divided between humans and tools, rather than bolting AI onto the existing structure.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;they-measure-outcomes-not-activity&quot;&gt;They measure outcomes, not activity&lt;&#x2F;h3&gt;
&lt;p&gt;DX surveyed 121,000 developers across 450+ companies and found that the organisations seeing real gains aren&#x27;t the ones with the highest adoption rates. They&#x27;re the ones measuring business-level outcomes: cycle time, defect rate, time from commit to production, customer-facing incident frequency.&lt;&#x2F;p&gt;
&lt;p&gt;Some companies in the survey saw twice as many customer-facing incidents after AI adoption. Others saw a 50% reduction. The difference wasn&#x27;t the tools. It was whether the organisation had built quality gates that matched the new workflow.&lt;&#x2F;p&gt;
&lt;p&gt;Laura Tacho, CTO at DX, put it precisely: &quot;In struggling organisations, AI tends to highlight existing flaws rather than fix them.&quot;&lt;&#x2F;p&gt;
&lt;h3 id=&quot;they-invested-in-agentic-workflows-not-just-autocomplete&quot;&gt;They invested in agentic workflows, not just autocomplete&lt;&#x2F;h3&gt;
&lt;p&gt;This is the biggest differentiator I see. The &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;19% slowdown that METR found in their study&lt;&#x2F;a&gt; was developers using autocomplete-style tools on familiar codebases. The 5-10x gains that companies like StrongDM, Courier, and Anthropic report come from agentic workflows, where AI agents take on entire tasks, not just suggest the next line of code.&lt;&#x2F;p&gt;
&lt;p&gt;The shift from autocomplete to agentic isn&#x27;t just a tool upgrade. It requires fundamentally different skills from the developer: the ability to decompose problems into well-scoped tasks, write clear specifications, evaluate output critically, and maintain architectural coherence across AI-generated code. That&#x27;s a training investment, not a licence purchase.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;they-kept-humans-in-the-loop-deliberately&quot;&gt;They kept humans in the loop, deliberately&lt;&#x2F;h3&gt;
&lt;p&gt;Duolingo made headlines with a policy that only permits hiring if work cannot be automated. It&#x27;s a bold position, and I understand the logic. But the companies I&#x27;ve seen get the best results take a more nuanced approach: they define specifically which decisions AI makes autonomously, which require human review, and which remain entirely human.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t governance for governance&#x27;s sake. It&#x27;s engineering. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance brokerage I worked with&lt;&#x2F;a&gt; achieved 67% autonomous case resolution, but the 33% that required human handling was designed with as much care as the automated portion. The handoff included full context, the agent&#x27;s reasoning, and the information it had retrieved. That&#x27;s what made the whole system trustworthy enough to scale. The same principle held at &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;NHS Wales&lt;&#x2F;a&gt;, where the governance structures weren&#x27;t optional. They were the foundation for a transformation programme that identified £20M+ in savings opportunities.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-shopify-paradox&quot;&gt;The Shopify paradox&lt;&#x2F;h2&gt;
&lt;p&gt;Shopify is doing something interesting that illustrates the tension well. On one hand, Lutke&#x27;s memo says prove AI can&#x27;t do it before we hire a human. On the other hand, Shopify is hiring 1,000 interns specifically for &quot;AI-native thinking.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s not a contradiction. It&#x27;s the right strategic move. They&#x27;re reducing headcount for tasks AI genuinely handles well, while simultaneously investing in developing people who can work &lt;em&gt;with&lt;&#x2F;em&gt; AI effectively. They understand that the tool is only as good as the human directing it, and that developing that human capability requires deliberate investment.&lt;&#x2F;p&gt;
&lt;p&gt;Most organisations are doing only the first half: cutting costs. Very few are doing the second half: building capability. And the second half is where the long-term competitive advantage lives.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-ask-any-engineering-leader&quot;&gt;What I&#x27;d ask any engineering leader&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;re running an engineering organisation and you&#x27;ve deployed AI tools, three questions will tell you whether you&#x27;re adopting well or just adopting:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Are you measuring outcomes or adoption?&lt;&#x2F;strong&gt; If your primary metric is &quot;percentage of developers using AI tools,&quot; you&#x27;re measuring the wrong thing. If you can point to specific improvements in cycle time, defect rate, or delivery throughput, and attribute them to AI, you&#x27;re on the right track.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Have you changed the workflow or just added a tool?&lt;&#x2F;strong&gt; If developers are working the same way they did before, with AI suggestions layered on top, you&#x27;re in the autocomplete trap. The real gains come from redesigning how work is structured, delegated, and reviewed.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Are you developing AI fluency or just providing AI access?&lt;&#x2F;strong&gt; The gap between developers who are &lt;a href=&quot;&#x2F;insights&#x2F;ai-made-developers-slower&#x2F;&quot;&gt;slower with AI tools&lt;&#x2F;a&gt; and developers who are 10x more productive is not about the tools. It&#x27;s about skill, method, and workflow design. That requires investment: in training, in mentorship, in giving people time to develop new ways of working.&lt;&#x2F;p&gt;
&lt;p&gt;The good news is that the organisations doing this well are pulling ahead fast. The bad news is that most organisations haven&#x27;t started. If you&#x27;re not sure where your organisation falls, the &lt;a href=&quot;&#x2F;ai-readiness&#x2F;&quot;&gt;AI readiness diagnostic&lt;&#x2F;a&gt; will give you a clear picture in 15 minutes.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to figure out where your AI investment is actually delivering and where it&#x27;s theatre, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I approach AI transformation&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The training ladder is broken. And nobody has a plan to fix it</title>
        <published>2026-02-25T00:00:00+00:00</published>
        <updated>2026-02-25T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/training-ladder-broken/"/>
        <id>https://ctozen.com/insights/training-ladder-broken/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/training-ladder-broken/">&lt;p&gt;Entry-level developer hiring has dropped 67% since 2022. At the fifteen largest US tech companies, new graduate hiring has fallen by more than half since 2019. Fresh graduates now make up just 7% of big tech hires. At startups, the number has gone from 30% to under 6%.&lt;&#x2F;p&gt;
&lt;p&gt;These aren&#x27;t projections. These are the numbers right now.&lt;&#x2F;p&gt;
&lt;p&gt;And almost nobody in a leadership position is talking about what happens next.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-logic-that-got-us-here&quot;&gt;The logic that got us here&lt;&#x2F;h2&gt;
&lt;p&gt;The reasoning is straightforward, and on the surface, hard to argue with.&lt;&#x2F;p&gt;
&lt;p&gt;AI tools can now do a significant portion of what junior developers used to do. Bug fixes. Boilerplate code. Test writing. Documentation. Simple feature implementation. The tasks that used to take a new graduate three days now take an AI agent three minutes.&lt;&#x2F;p&gt;
&lt;p&gt;So companies look at the numbers and make the obvious call: why hire a junior at £60-90K when an AI tool costs a fraction of that? Hire fewer, more experienced engineers. Give them agentic AI tools. Get more output from a smaller, senior team.&lt;&#x2F;p&gt;
&lt;p&gt;I understand the logic. I&#x27;ve been in the room when this decision gets made. And in the short term, it works. Headcount drops. Output holds. The CFO is happy.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;The problem is that this logic has a time bomb buried in it.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;where-senior-engineers-come-from&quot;&gt;Where senior engineers come from&lt;&#x2F;h2&gt;
&lt;p&gt;This might seem obvious, but it needs saying: every senior engineer was once a junior engineer.&lt;&#x2F;p&gt;
&lt;p&gt;The skills that make someone a valuable senior developer (architectural judgement, debugging intuition, the ability to read a codebase and understand not just what it does but why it was built that way, the instinct for where a system will break under load) none of these come from courses or certifications. They come from years of doing the work. The grunt work. The work we&#x27;re now automating away.&lt;&#x2F;p&gt;
&lt;p&gt;Harvard economists analysed 62 million LinkedIn profiles and 200 million job postings. Their finding was precise: at firms that adopted generative AI, junior headcount fell 7.7% within six quarters. Senior employment barely moved. The decline was steepest among graduates of mid-tier universities, not the elite schools (those graduates are too productive to replace) and not the lowest tier (too cheap to bother replacing). The middle got squeezed.&lt;&#x2F;p&gt;
&lt;p&gt;Mark Russinovich, Azure&#x27;s CTO and not exactly a junior voice in this conversation, published a paper in February with Scott Hanselman arguing that AI tools &quot;give senior engineers an AI boost while imposing an AI drag on early-in-career developers.&quot; He said this is now a hot topic in every customer engagement Microsoft has. Every company sees it. Nobody has a plan.&lt;&#x2F;p&gt;
&lt;p&gt;Matt Garman, the CEO of AWS, was more blunt. He called replacing junior developers with AI &quot;one of the dumbest things I&#x27;ve ever heard.&quot; His reasoning: juniors are the least expensive employees you have, and they&#x27;re the most engaged with AI tools. Cut them, and in ten years &quot;you have no one that has learned anything.&quot;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-training-ladder-problem&quot;&gt;The training ladder problem&lt;&#x2F;h2&gt;
&lt;p&gt;Software engineering has always had an informal training ladder. You join as a junior. You fix bugs. You write tests. You struggle with a codebase someone else built. You make mistakes and learn from the code review. You pair with a senior who shows you why your elegant solution won&#x27;t survive production traffic. Gradually, painfully, you develop judgement.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;That ladder is being dismantled.&lt;&#x2F;strong&gt; Not deliberately. Nobody set out to destroy it. But the economic incentives are all pointed the same way: automate the junior work, hire fewer juniors, give the savings back to the business.&lt;&#x2F;p&gt;
&lt;p&gt;The problem is that the junior work &lt;em&gt;was&lt;&#x2F;em&gt; the training. It wasn&#x27;t grunt work that happened to be done by beginners. It was the mechanism by which beginners became experts.&lt;&#x2F;p&gt;
&lt;p&gt;Anthropic, the company that builds Claude, ran their own study on this. Junior engineers using AI scored 50% on comprehension tests. Those who coded manually scored 67%. The largest gap appeared in debugging, the exact skill that separates a competent senior from a dangerous one.&lt;&#x2F;p&gt;
&lt;p&gt;Stack Overflow&#x27;s monthly question volume has collapsed from 200,000 at peak to under 4,000 by the end of 2025. That&#x27;s a 78% year-over-year decline. The knowledge ecosystem that trained a generation of developers, and that AI models themselves were trained on, is dying.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-five-year-cliff&quot;&gt;The five-year cliff&lt;&#x2F;h2&gt;
&lt;p&gt;Octopus Deploy&#x27;s AI Pulse Report projects a three-phase crisis: experimentation (2024-2025), junior hiring freeze (2025-2027), and senior talent crisis (2027-2030).&lt;&#x2F;p&gt;
&lt;p&gt;I think that timeline is roughly right. The wage gap is already widening. US junior developers earn $75-95K while seniors command $193-263K. That gap will widen further as junior supply increases (more graduates, fewer positions) and senior demand intensifies (fewer juniors maturing into senior roles, more AI systems requiring senior oversight).&lt;&#x2F;p&gt;
&lt;p&gt;The maths is simple. If you stop feeding the bottom of the pipeline, the top of the pipeline runs dry. Not immediately. Not this quarter. But the organisations making these cuts today will feel the consequences, and by the time they feel them, rebuilding the pipeline will take years, not months.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-actually-do-about-it&quot;&gt;What I&#x27;d actually do about it&lt;&#x2F;h2&gt;
&lt;p&gt;I don&#x27;t think the answer is to ignore AI&#x27;s capabilities and keep hiring juniors to do work that AI genuinely does better. That&#x27;s nostalgic, not strategic.&lt;&#x2F;p&gt;
&lt;p&gt;But I do think the answer requires more imagination than most organisations are applying. Here&#x27;s what I&#x27;d recommend to any CTO or engineering leader thinking about this:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Redefine what juniors do, don&#x27;t eliminate the role.&lt;&#x2F;strong&gt; The junior developer role needs to evolve, not disappear. Instead of writing boilerplate, juniors should be reviewing AI-generated code, learning to spot the patterns of failure that AI produces (and it produces them reliably; CodeRabbit&#x27;s data shows AI creates 1.7x more bugs and 2.74x more security vulnerabilities than humans). This is a different training ground, but it&#x27;s still a training ground.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Make mentorship a measured responsibility.&lt;&#x2F;strong&gt; Russinovich and Hanselman&#x27;s paper proposes a &quot;preceptor model,&quot; structured mentorship where senior engineers are explicitly assessed on developing junior talent. I&#x27;d go further: make it a promotion criterion. If you can&#x27;t develop the people around you, you&#x27;re not senior. You&#x27;re just experienced.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Invest in the AI-native training path.&lt;&#x2F;strong&gt; Shopify is hiring 1,000 interns specifically for what they call &quot;AI-native thinking.&quot; That&#x27;s a bet on a new kind of training ladder, one where the junior&#x27;s job is to direct AI agents, evaluate their output, and develop the judgement to know when the output is wrong. It&#x27;s a different skill set, but it still requires deliberate development.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Budget for the future, not just the quarter.&lt;&#x2F;strong&gt; The short-term savings from cutting junior hires are real. The long-term cost of having no pipeline is also real, and much larger. The organisations that get this right will be the ones that treat junior development as an investment with a 3-5 year payback period, not a cost to be optimised away.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-uncomfortable-truth&quot;&gt;The uncomfortable truth&lt;&#x2F;h2&gt;
&lt;p&gt;I&#x27;ve built engineering teams &lt;a href=&quot;&#x2F;impact&#x2F;founder-exit&#x2F;&quot;&gt;across four ventures&lt;&#x2F;a&gt; and in &lt;a href=&quot;&#x2F;impact&#x2F;national-security-platform&#x2F;&quot;&gt;defence-grade environments at Improbable&lt;&#x2F;a&gt;. I&#x27;ve hired juniors who became the best engineers I&#x27;ve ever worked with. And I&#x27;ve watched what happens when organisations optimise purely for short-term efficiency at the expense of long-term capability.&lt;&#x2F;p&gt;
&lt;p&gt;The uncomfortable truth is that the market is telling companies to do something that is locally rational and globally catastrophic. Cut the juniors. Save the money. Ship more with fewer people. And then, in five years, wonder why there are no senior engineers available, and why the ones who exist command salaries that make the original savings look like pocket change.&lt;&#x2F;p&gt;
&lt;p&gt;The companies that will have the strongest engineering teams in 2030 are the ones making the unfashionable decision to invest in junior talent today. Not because it&#x27;s charitable. Because it&#x27;s strategic.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you&#x27;re rethinking your engineering team structure and want to talk through the trade-offs, &lt;a href=&quot;&#x2F;services&#x2F;#fractional-cto&quot;&gt;a fractional CTO engagement&lt;&#x2F;a&gt; is one way to get that thinking in the room. Or just &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-right-for-you&#x2F;&quot;&gt;Fractional CTO: is outsourced technology leadership right for you?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Why your AI spend isn&#x27;t showing up in the numbers</title>
        <published>2026-02-18T00:00:00+00:00</published>
        <updated>2026-02-18T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/why-ai-spend-not-in-numbers/"/>
        <id>https://ctozen.com/insights/why-ai-spend-not-in-numbers/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/why-ai-spend-not-in-numbers/">&lt;p&gt;I&#x27;ve sat in enough boardrooms to recognise the moment. The CEO pulls up the quarterly numbers, someone asks about the AI initiative, and the room goes quiet. Not because AI isn&#x27;t being used. It is. But the P&amp;amp;L hasn&#x27;t moved.&lt;&#x2F;p&gt;
&lt;p&gt;This is not an anecdote. It&#x27;s the dominant pattern. It&#x27;s also a solvable one. The spend isn&#x27;t wasted because AI can&#x27;t deliver; it&#x27;s underperforming because the return was never engineered in. Unlock that, and the same investment starts showing up where it should.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-data-is-damning&quot;&gt;The data is damning&lt;&#x2F;h2&gt;
&lt;p&gt;PwC&#x27;s 2026 Global CEO Survey, covering 4,454 CEOs, found that &lt;strong&gt;56% say AI has delivered no significant benefits&lt;&#x2F;strong&gt;. Only one in eight report improvements to both cost and revenue. MIT&#x27;s analysis of generative AI pilots is worse: &lt;strong&gt;95% produce no measurable P&amp;amp;L impact&lt;&#x2F;strong&gt;. Not &quot;disappointing results.&quot; No measurable impact at all.&lt;&#x2F;p&gt;
&lt;p&gt;The NBER published a working paper in February 2026 surveying roughly 6,000 executives across the US, UK, Germany, and Australia. More than 70% of firms are actively using AI. Over &lt;strong&gt;80% report no impact on employment or productivity over the last three years&lt;&#x2F;strong&gt;. Executives themselves average about 90 minutes a week using AI tools; a quarter report using them not at all.&lt;&#x2F;p&gt;
&lt;p&gt;These are not fringe findings. This is the consensus view from every major research institution studying AI deployment in 2025–2026.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-the-gap-exists&quot;&gt;Why the gap exists&lt;&#x2F;h2&gt;
&lt;p&gt;The instinct is to blame the technology. It&#x27;s not the technology.&lt;&#x2F;p&gt;
&lt;p&gt;Deloitte&#x27;s enterprise survey found that organisations taking a &lt;strong&gt;work-redesign approach, rethinking processes before deploying tools, are twice as likely to exceed their ROI targets&lt;&#x2F;strong&gt; as those taking a technology-first approach. Prosci&#x27;s research across 1,107 organisations is even more specific: &lt;strong&gt;63% of AI failures trace to human factors, not technical ones&lt;&#x2F;strong&gt;. Cultural and organisational barriers account for 65% of failures. Technical issues? 22%.&lt;&#x2F;p&gt;
&lt;p&gt;Most organisations are doing this backwards. They buy the tool, run a pilot, get a promising demo, declare success, and then wonder why the numbers don&#x27;t move when they try to scale. The pilot works because someone babysat it. Production fails because nobody redesigned the workflow around it.&lt;&#x2F;p&gt;
&lt;p&gt;Workday presented data at Davos suggesting that roughly &lt;strong&gt;40% of &quot;time saved&quot; by AI goes straight into rework&lt;&#x2F;strong&gt;, correcting low-quality outputs that the system generated. That&#x27;s not a productivity gain. That&#x27;s a productivity shell game.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-actually-works&quot;&gt;What actually works&lt;&#x2F;h2&gt;
&lt;p&gt;I&#x27;ve built production AI systems that moved business metrics. Not in a lab, not in a pilot, in production at enterprise scale. At AdBrain, the agentic AI system I co-founded and built for a European insurance brokerage achieved &lt;strong&gt;&lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;67% autonomous case resolution&lt;&#x2F;a&gt;&lt;&#x2F;strong&gt; and a &lt;strong&gt;23% improvement in sales KPIs&lt;&#x2F;strong&gt;. These weren&#x27;t projections. They were measured outcomes on real customer cases.&lt;&#x2F;p&gt;
&lt;p&gt;The difference wasn&#x27;t the model or the infrastructure. The difference was approach.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;We started with the process, not the technology.&lt;&#x2F;strong&gt; We mapped every decision point in the customer service workflow, identified which decisions could be delegated to an AI system with acceptable error rates, and built governance around the ones that couldn&#x27;t. The technology was in service of the workflow redesign, not the other way round.&lt;&#x2F;p&gt;
&lt;p&gt;This is the pattern I see again and again in the organisations that get results. The 5% that McKinsey identifies as achieving true AI transformation aren&#x27;t using better models. They&#x27;re deploying AI into redesigned processes with clear KPIs, governance frameworks, and kill criteria for initiatives that aren&#x27;t working.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-90-day-question&quot;&gt;The 90-day question&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;ve been investing in AI and the numbers haven&#x27;t moved, the question isn&#x27;t whether to invest more. It&#x27;s whether you&#x27;re deploying AI into the right processes, with the right workflow design, and with governance that scales.&lt;&#x2F;p&gt;
&lt;p&gt;Most founders and small to mid-market teams I work with can answer that question in 90 days, with one production system delivering measurable results and a clear roadmap for what comes next. The &lt;a href=&quot;&#x2F;ai-readiness&#x2F;&quot;&gt;AI readiness diagnostic&lt;&#x2F;a&gt; is a good place to start. It takes 15 minutes and gives you an honest baseline.&lt;&#x2F;p&gt;
&lt;p&gt;The AI isn&#x27;t the hard part. The hard part is being honest about what&#x27;s working and what isn&#x27;t, and having someone in the room who has done this before.&lt;&#x2F;p&gt;
&lt;p&gt;If that sounds like a conversation worth having, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;. No slides, no pitch. Just an honest assessment of where you are.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;Shadow AI is your next audit finding&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Turning AI theatre into AI that moves the numbers</title>
        <published>2026-02-11T00:00:00+00:00</published>
        <updated>2026-02-11T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/ai-transformation-performance-art/"/>
        <id>https://ctozen.com/insights/ai-transformation-performance-art/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/ai-transformation-performance-art/">&lt;p&gt;I want to name something that everyone in the room knows but nobody says out loud.&lt;&#x2F;p&gt;
&lt;p&gt;Most AI transformations are performance art. They exist to demonstrate that something is happening, not to deliver results. The innovation lab is staffed. The pilots are running. The quarterly board update has an AI slide with a roadmap and a timeline and a budget. And the P&amp;amp;L hasn&#x27;t moved.&lt;&#x2F;p&gt;
&lt;p&gt;This is not a controversial observation. BCG&#x27;s own research admits that &lt;strong&gt;60% of AI consulting engagements generate no material value&lt;&#x2F;strong&gt;. Only 5% achieve anything at scale. McKinsey puts the number of organisations achieving true AI transformation at &lt;strong&gt;one in twenty&lt;&#x2F;strong&gt;. Gartner predicts that 60% of AI initiatives will be abandoned through 2026.&lt;&#x2F;p&gt;
&lt;p&gt;The industry has a name for this. They call it &quot;pilot purgatory.&quot; I think that&#x27;s too kind. Pilots are experiments that lead somewhere. What most organisations are doing is theatre.&lt;&#x2F;p&gt;
&lt;p&gt;But here is the part that matters: this is the most fixable problem in enterprise AI. The gap between theatre and transformation isn&#x27;t talent, budget, or better models. It&#x27;s a handful of structural choices, and they are entirely learnable. The same numbers that look damning are really a map. They tell you exactly what the 5% do that everyone else doesn&#x27;t. The rest of this piece is that map.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-ai-theatre-looks-like&quot;&gt;What AI theatre looks like&lt;&#x2F;h2&gt;
&lt;p&gt;You can spot it from the language. &quot;We&#x27;re exploring.&quot; &quot;We&#x27;re evaluating vendors.&quot; &quot;We have a pilot running with promising results.&quot; &quot;We&#x27;re building our AI strategy.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;These phrases have one thing in common: they describe activity, not outcomes. Nobody is measuring anything. Nobody has defined what success looks like. And nobody has the authority, or the incentive, to kill something that isn&#x27;t working.&lt;&#x2F;p&gt;
&lt;p&gt;The typical pattern runs like this:&lt;&#x2F;p&gt;
&lt;p&gt;A consulting firm is engaged to &quot;develop an AI strategy.&quot; They deliver a 60-page deck with a maturity assessment, a vendor landscape, and a phased roadmap. The organisation selects two or three use cases for pilots. The pilots are built by a small team with enthusiastic support from one sponsor. The demos look impressive. Then the pilot hits production-readiness requirements (security review, data governance, integration with existing systems, change management) and stalls.&lt;&#x2F;p&gt;
&lt;p&gt;Six months later, the pilot is still &quot;in progress.&quot; The team has moved on to the next pilot. The first one quietly dies. The board update still shows green.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen this pattern at organisations of every size, from funded startups to FTSE 250 companies. The technology is real. The results are not.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-it-happens&quot;&gt;Why it happens&lt;&#x2F;h2&gt;
&lt;p&gt;Three structural reasons.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;First, incentives are misaligned.&lt;&#x2F;strong&gt; The people evaluating AI tools are not the people whose P&amp;amp;L depends on the outcomes. When the innovation team&#x27;s KPI is &quot;number of pilots launched&quot; rather than &quot;revenue impact of deployed systems,&quot; you get exactly what you&#x27;d expect: lots of pilots, no production systems.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Second, the process is backwards.&lt;&#x2F;strong&gt; Deloitte&#x27;s research shows that organisations taking a work-redesign approach, rethinking processes before selecting tools, are &lt;strong&gt;twice as likely to exceed ROI targets&lt;&#x2F;strong&gt; than those starting with the technology. But redesigning processes is hard, political, and slow. Buying a tool and running a pilot is fast, visible, and feels like progress.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Third, nobody has done it before.&lt;&#x2F;strong&gt; Most organisations don&#x27;t have a senior leader who has built production AI systems. They have people who have evaluated them, recommended them, and managed pilots of them. The people making the call have usually &lt;a href=&quot;&#x2F;insights&#x2F;ai-understanding-pyramid&#x2F;&quot;&gt;watched a demo, not built one&lt;&#x2F;a&gt;. The gap between &quot;this demos well&quot; and &quot;this runs in production at scale&quot; is enormous, and it&#x27;s a gap that only operating experience can bridge.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-the-alternative-looks-like&quot;&gt;What the alternative looks like&lt;&#x2F;h2&gt;
&lt;p&gt;The organisations that get real results from AI share a few characteristics.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;They start with the P&amp;amp;L, not the technology.&lt;&#x2F;strong&gt; Before selecting any tool or model, they identify a specific business process where AI can deliver measurable improvement, and they define what &quot;measurable&quot; means upfront. Not &quot;efficiency gains.&quot; A number. A KPI. A target.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;They redesign the workflow.&lt;&#x2F;strong&gt; Prosci&#x27;s research across 1,107 organisations found that &lt;strong&gt;63% of AI failures trace to human factors, not technology&lt;&#x2F;strong&gt;. The organisations that succeed don&#x27;t bolt AI onto existing processes. They redesign the process around what AI can and can&#x27;t do, with clear decision delegation: what the system decides, what a human decides, and where the handoff happens.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;They deploy to production in weeks, not quarters.&lt;&#x2F;strong&gt; At AdBrain, we had an agentic AI system handling real customer cases within weeks of starting the build. Not because we cut corners. Because we scoped tightly. One process. One set of decisions. Full governance. Measured KPIs from day one. The result: &lt;strong&gt;&lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;67% autonomous case resolution&lt;&#x2F;a&gt;&lt;&#x2F;strong&gt;, &lt;strong&gt;23% sales lift&lt;&#x2F;strong&gt;. Production, not pilot.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;They kill what doesn&#x27;t work.&lt;&#x2F;strong&gt; This is the hardest part. Most organisations don&#x27;t have kill criteria for AI initiatives. If it&#x27;s not working after 90 days, they add more resources or extend the timeline. The right move is to kill it, learn from it, and redirect the investment to something that can deliver.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-90-day-test&quot;&gt;The 90-day test&lt;&#x2F;h2&gt;
&lt;p&gt;If you want to know whether your AI programme is performance art or real transformation, ask yourself three questions:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Can you name one AI system in production that is measured by a business KPI?&lt;&#x2F;strong&gt; Not &quot;deployed.&quot; Not &quot;being used by the team.&quot; Measured. With a number. That someone reports on.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Do you have kill criteria for your current AI initiatives?&lt;&#x2F;strong&gt; Is there a defined point at which you&#x27;d stop investing in a pilot that isn&#x27;t delivering?&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Has anyone in your leadership team built a production AI system before?&lt;&#x2F;strong&gt; Not evaluated one. Not managed a pilot. Built one, deployed it, and measured the results.&lt;&#x2F;p&gt;
&lt;p&gt;If the answer to any of those is no, you&#x27;re closer to theatre than transformation. That&#x27;s not a judgement. It&#x27;s a diagnosis. And the good news is that the fix is faster than you think.&lt;&#x2F;p&gt;
&lt;p&gt;One production system. One set of measured KPIs. One honest assessment of what&#x27;s working and what isn&#x27;t. That&#x27;s the starting point. Everything else is decoration.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like to have that conversation, &lt;a href=&quot;&#x2F;contact&quot;&gt;I&#x27;m here&lt;&#x2F;a&gt;. No deck, no maturity assessment. Just an honest view of where you are and what would actually move the numbers.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;Why your AI spend isn&#x27;t showing up in the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I approach AI transformation&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Shadow AI is your next audit finding</title>
        <published>2026-02-04T00:00:00+00:00</published>
        <updated>2026-02-04T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/shadow-ai-audit-finding/"/>
        <id>https://ctozen.com/insights/shadow-ai-audit-finding/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/shadow-ai-audit-finding/">&lt;p&gt;Your employees are already using AI. The question is whether you know about it.&lt;&#x2F;p&gt;
&lt;p&gt;Cyberhaven&#x27;s research found that &lt;strong&gt;38% of employees are sharing confidential company data through consumer AI tools&lt;&#x2F;strong&gt;: ChatGPT, Claude, Gemini, Copilot. Not through sanctioned enterprise deployments with data handling agreements. Through personal accounts, on personal devices, with no audit trail and no governance.&lt;&#x2F;p&gt;
&lt;p&gt;The broader number is worse: &lt;strong&gt;78% of employees are using AI tools their employer hasn&#x27;t approved&lt;&#x2F;strong&gt;. They&#x27;re summarising client documents, generating financial analysis, drafting legal correspondence, and processing sensitive data through systems their IT and compliance teams don&#x27;t know exist.&lt;&#x2F;p&gt;
&lt;p&gt;This is shadow AI. And it&#x27;s about to become a very expensive problem.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-regulatory-clock-is-ticking&quot;&gt;The regulatory clock is ticking&lt;&#x2F;h2&gt;
&lt;p&gt;The EU AI Act&#x27;s major enforcement milestones begin &lt;strong&gt;August 2, 2026&lt;&#x2F;strong&gt;. That includes transparency obligations, high-risk system requirements under Annex III, and governance structures that most organisations haven&#x27;t built yet.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;re operating in the EU or processing EU citizen data, which includes most UK businesses with European clients, you need to be ready. Not &quot;aware.&quot; Ready. With documented governance frameworks, risk assessments for every AI system in use (including the ones your employees adopted without telling you), and human oversight structures that can withstand regulatory scrutiny.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t theoretical. IBM&#x27;s Institute for Business Value reports that &lt;strong&gt;82% of organisations say AI risks have accelerated their need for governance modernisation&lt;&#x2F;strong&gt;. They know the gap exists. Most haven&#x27;t closed it.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-governance-fails-when-it-s-treated-as-a-blocker&quot;&gt;Why governance fails when it&#x27;s treated as a blocker&lt;&#x2F;h2&gt;
&lt;p&gt;The instinct in most organisations is to respond to shadow AI with a ban. Lock down the tools, restrict access, send a memo. This doesn&#x27;t work. The tools are too useful, the workarounds too easy, and the productivity gains too visible for people to stop using them just because a policy document said so.&lt;&#x2F;p&gt;
&lt;p&gt;The organisations that get this right treat governance as an &lt;strong&gt;enabler&lt;&#x2F;strong&gt;, not a blocker. The goal isn&#x27;t to stop people using AI. It&#x27;s to channel that usage into managed, monitored, compliant systems that the organisation controls. Governance that makes AI adoption faster, not slower, by removing the ambiguity that forces employees to make their own decisions about what&#x27;s acceptable.&lt;&#x2F;p&gt;
&lt;p&gt;A proper governance framework does three things:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;First, it maps what&#x27;s actually happening.&lt;&#x2F;strong&gt; A shadow AI audit identifies every tool in use, every data flow, every risk. You can&#x27;t govern what you can&#x27;t see. Most organisations are genuinely surprised by what they find.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Second, it provides sanctioned alternatives.&lt;&#x2F;strong&gt; For every unapproved tool your employees are using, there should be an approved path that&#x27;s at least as convenient. If the governance framework makes people&#x27;s work harder, they&#x27;ll route around it, and you&#x27;re back to shadow AI.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Third, it builds the compliance structure.&lt;&#x2F;strong&gt; Risk classifications, human oversight requirements, data handling protocols, incident response procedures. The EU AI Act doesn&#x27;t require perfection. It requires a demonstrable, documented framework that shows you&#x27;re managing AI risk systematically. This is the approach I took at &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;NHS Wales&lt;&#x2F;a&gt;, where governance had to withstand national-level scrutiny. The same principles apply at any scale.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-cost-of-waiting&quot;&gt;The cost of waiting&lt;&#x2F;h2&gt;
&lt;p&gt;Informatica&#x27;s 2026 survey of 600 data leaders found that &lt;strong&gt;three out of four say governance hasn&#x27;t kept pace&lt;&#x2F;strong&gt; with AI adoption. Nearly 7 in 10 organisations have adopted generative AI, and almost half have moved into agentic AI. The tools are already deployed. The governance is months, or years, behind.&lt;&#x2F;p&gt;
&lt;p&gt;The gap creates three categories of risk:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Regulatory risk.&lt;&#x2F;strong&gt; EU AI Act non-compliance penalties scale to 7% of global annual turnover for the most serious violations. Even for mid-market companies, that&#x27;s a material number.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Data risk.&lt;&#x2F;strong&gt; Confidential client data, financial information, proprietary IP, all flowing through systems with no data processing agreements, no audit logs, and no retention controls.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Operational risk.&lt;&#x2F;strong&gt; Decisions being made based on AI outputs that no one is verifying, using models that no one has validated for the use case, with error rates that no one is measuring.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-i-d-do-in-your-position&quot;&gt;What I&#x27;d do in your position&lt;&#x2F;h2&gt;
&lt;p&gt;If I were sitting in your seat (CTO, COO, or board member of a company with 50+ employees) I&#x27;d do three things in the next 30 days:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Run a shadow AI audit.&lt;&#x2F;strong&gt; Not a survey. A proper audit. Map the tools, the data flows, and the risks. Accept that you&#x27;ll find things you didn&#x27;t expect.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Build a governance framework that enables, not restricts.&lt;&#x2F;strong&gt; Classify your AI use cases by risk level. Provide sanctioned tools for the high-value, high-frequency use cases. Build monitoring into the approved stack.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Set a timeline for EU AI Act readiness.&lt;&#x2F;strong&gt; August 2026 isn&#x27;t far. If you need to be compliant, work backwards from that date and identify every gap.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;d like help with any of that, &lt;a href=&quot;&#x2F;contact&quot;&gt;let&#x27;s talk&lt;&#x2F;a&gt;. I&#x27;ve built governance frameworks alongside production AI systems, not as a separate workstream, but as part of the deployment. It&#x27;s the only way it works.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#transformation-sprint&quot;&gt;The AI Transformation Sprint&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>How to unlock AI ROI: what the 20% do differently</title>
        <published>2026-01-28T00:00:00+00:00</published>
        <updated>2026-01-28T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/why-ai-projects-fail/"/>
        <id>https://ctozen.com/insights/why-ai-projects-fail/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/why-ai-projects-fail/">&lt;p&gt;A statistic that should focus the mind of anyone investing in AI: according to multiple industry surveys, roughly 80% of companies have deployed generative AI. And roughly 80% report no material impact on earnings.&lt;&#x2F;p&gt;
&lt;p&gt;It&#x27;s tempting to read that as &quot;AI doesn&#x27;t pay back.&quot; That&#x27;s the wrong lesson. The returns are real and repeatable, but they have to be unlocked. The 80% aren&#x27;t evidence that the technology fails; they&#x27;re evidence that most organisations haven&#x27;t yet learned how to deploy it. The real risk to your investment isn&#x27;t outright failure. It&#x27;s quiet underperformance: an initiative that returns a fraction of what it could, because the work around the tool never changed. This piece is about what the 20% do differently to unlock the return.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-is-the-easy-explanation-for-ai-failure-wrong&quot;&gt;Why is the easy explanation for AI failure wrong?&lt;&#x2F;h2&gt;
&lt;p&gt;The tempting interpretation is that the technology isn&#x27;t there yet. That LLMs hallucinate too much. That the models aren&#x27;t reliable enough. That we need better tools before AI can deliver business value.&lt;&#x2F;p&gt;
&lt;p&gt;This is comfortable because it requires no action. Just wait.&lt;&#x2F;p&gt;
&lt;p&gt;It&#x27;s also wrong. The organisations delivering measurable ROI from AI are not waiting. They have fundamentally different approaches to how they build and deploy AI systems, and those approaches are not technology-dependent. They&#x27;re architectural and organisational.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-do-the-20-of-successful-ai-projects-do-differently&quot;&gt;What do the 20% of successful AI projects do differently?&lt;&#x2F;h2&gt;
&lt;h3 id=&quot;1-they-solve-the-data-problem-first&quot;&gt;1. They solve the data problem first&lt;&#x2F;h3&gt;
&lt;p&gt;Every serious AI deployment that delivers business outcomes starts with a data architecture question, not a model question.&lt;&#x2F;p&gt;
&lt;p&gt;The question isn&#x27;t &quot;which LLM should we use?&quot; It&#x27;s &quot;can we reliably retrieve the specific information this AI system needs to make good decisions?&quot;&lt;&#x2F;p&gt;
&lt;p&gt;In most organisations, the answer to that second question is no. Customer data is fragmented across CRM systems. Product data doesn&#x27;t conform to a consistent schema. Historical decisions weren&#x27;t recorded in a way that makes them usable as context. Case notes are in unstructured text with no metadata.&lt;&#x2F;p&gt;
&lt;p&gt;Before any AI system can work reliably in production, someone has to make this data retrieval problem tractable. The organisations that skip this step and go straight to building on top of raw, fragmented data end up with AI systems that work inconsistently and unpredictably. Which is worse than no AI at all, because it erodes trust.&lt;&#x2F;p&gt;
&lt;p&gt;The 20% treat data architecture as a prerequisite, not a detail.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;2-they-define-the-right-problem-to-solve&quot;&gt;2. They define the right problem to solve&lt;&#x2F;h3&gt;
&lt;p&gt;There&#x27;s a pattern in AI project failures that goes like this:&lt;&#x2F;p&gt;
&lt;ol&gt;
&lt;li&gt;Organisation decides to &quot;add AI&quot;&lt;&#x2F;li&gt;
&lt;li&gt;Team picks a use case that seems tractable: a chatbot, a recommendation engine, a document summariser&lt;&#x2F;li&gt;
&lt;li&gt;The use case gets built and deployed&lt;&#x2F;li&gt;
&lt;li&gt;Nobody uses it, or it doesn&#x27;t move any metric that matters&lt;&#x2F;li&gt;
&lt;li&gt;Conclusion: &quot;AI doesn&#x27;t work for us&quot;&lt;&#x2F;li&gt;
&lt;&#x2F;ol&gt;
&lt;p&gt;The failure happened at step 2. The use case wasn&#x27;t connected to a problem that the business actually has, in a workflow that people actually use, with a metric that the business actually tracks.&lt;&#x2F;p&gt;
&lt;p&gt;The 20% start from the business problem: what is the high-volume, high-friction, high-cost workflow that AI could change? Then they work backward to the AI architecture, not forward from &quot;what&#x27;s possible with LLMs.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;For the &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance brokerage I worked with&lt;&#x2F;a&gt; at AdBrain, the problem was specific and measurable: thousands of customer service cases per month, inconsistent handling, slow resolution, agents unable to scale. AI was the solution to that problem, not a technology experiment running parallel to the real business.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;3-they-invest-in-production-engineering&quot;&gt;3. They invest in production engineering&lt;&#x2F;h3&gt;
&lt;p&gt;The gap between &quot;the demo works&quot; and &quot;the system works in production&quot; is where most AI projects disappear.&lt;&#x2F;p&gt;
&lt;p&gt;Production AI requires:&lt;&#x2F;p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Retrieval that works at scale,&lt;&#x2F;strong&gt; not just in a controlled test environment with clean data&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Graceful degradation.&lt;&#x2F;strong&gt; What happens when the model is uncertain, when the data is missing, when the case is outside the training distribution?&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Observability.&lt;&#x2F;strong&gt; Can you see what the system is doing, why it made each decision, and where it&#x27;s failing?&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Human-in-the-loop design.&lt;&#x2F;strong&gt; When should the AI handle something autonomously, and when should it escalate? And when it escalates, does the human get the context they need?&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Audit trails,&lt;&#x2F;strong&gt; especially in regulated industries, but increasingly in any consequential business process&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;p&gt;The 20% treat production engineering as a first-class concern from the start. The 80% treat it as a problem to solve after the demo works.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;4-they-stage-the-rollout&quot;&gt;4. They stage the rollout&lt;&#x2F;h3&gt;
&lt;p&gt;&quot;Big bang&quot; AI deployment almost never works. The 20% deploy incrementally: start with a small fraction of real traffic, measure outcomes against real business metrics, build confidence in the system before scaling.&lt;&#x2F;p&gt;
&lt;p&gt;This isn&#x27;t just risk management. It&#x27;s how you build institutional trust in an automated system. The people whose jobs the system is affecting need to see it work. Leadership needs to see the metrics move. Compliance needs to see the audit trail.&lt;&#x2F;p&gt;
&lt;p&gt;Staged rollout also allows you to catch failures early, in production conditions that your test environment will never fully replicate, without catastrophic consequences.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;5-they-treat-ai-governance-as-a-competitive-advantage&quot;&gt;5. They treat AI governance as a competitive advantage&lt;&#x2F;h3&gt;
&lt;p&gt;With the EU AI Act in full effect and organisations increasingly asking hard questions about AI risk, the companies that have built mature AI governance frameworks are at a commercial advantage over those that haven&#x27;t.&lt;&#x2F;p&gt;
&lt;p&gt;This doesn&#x27;t mean governance as bureaucracy. It means:&lt;&#x2F;p&gt;
&lt;ul&gt;
&lt;li&gt;Clear ownership of AI decisions (the CAIO role or equivalent)&lt;&#x2F;li&gt;
&lt;li&gt;Documented decision logic for automated systems&lt;&#x2F;li&gt;
&lt;li&gt;Regular accuracy and bias auditing&lt;&#x2F;li&gt;
&lt;li&gt;Clear escalation paths when systems fail&lt;&#x2F;li&gt;
&lt;li&gt;An approach to EU AI Act risk categorisation for the AI systems you&#x27;re deploying&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;p&gt;Only about 1 in 5 companies have this in place today. That&#x27;s a gap, and a gap creates differentiation.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-do-you-avoid-being-in-the-80-of-failed-ai-projects&quot;&gt;How do you avoid being in the 80% of failed AI projects?&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;re building or deploying AI, the questions above aren&#x27;t abstract. They&#x27;re the questions your enterprise customers&#x27; IT and legal teams will ask before signing a contract. They&#x27;re the questions an acquirer&#x27;s technical due diligence team will surface during an exit process. They&#x27;re the questions investors will ask when they want to understand why your AI investment is defensible.&lt;&#x2F;p&gt;
&lt;p&gt;Building with these considerations in mind from the start is not slower or more expensive than building without them. It&#x27;s usually faster, because you don&#x27;t have to rebuild the parts that break. If you want a quick sense of where your organisation stands on these patterns, try the &lt;a href=&quot;&#x2F;ai-readiness&#x2F;&quot;&gt;AI readiness diagnostic&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you want to pressure-test your AI approach against these patterns, &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;sculpting-world-class-tech-team&#x2F;&quot;&gt;Your world-class engineering team is already in the building&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;agentic-ai-2026&#x2F;&quot;&gt;Agentic AI in 2026: what actually works in production&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;shadow-ai-audit-finding&#x2F;&quot;&gt;Shadow AI is your next audit finding&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Agentic AI in 2026: what actually works in production</title>
        <published>2026-01-14T00:00:00+00:00</published>
        <updated>2026-01-14T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/agentic-ai-2026/"/>
        <id>https://ctozen.com/insights/agentic-ai-2026/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/agentic-ai-2026/">&lt;p&gt;Every major technology conference in the last 18 months has featured a demo of an AI agent doing something impressive. Booking a flight. Writing and executing code. Managing a customer enquiry from start to finish.&lt;&#x2F;p&gt;
&lt;p&gt;Most of those demos are not production systems.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve built agentic AI in production, at a &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;European insurance brokerage&lt;&#x2F;a&gt; at enterprise scale, with 67% autonomous resolution rates on real customer service cases. I&#x27;ve also watched a significant number of agentic AI projects fail in ways that were entirely predictable. This is an honest account of what the difference looks like.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-agentic-ai-actually-means&quot;&gt;What &quot;agentic AI&quot; actually means&lt;&#x2F;h2&gt;
&lt;p&gt;An AI agent, in the technical sense, is a system that can take actions, not just generate text. It can retrieve information, execute tools, make decisions across multiple steps, and produce outcomes in the world rather than just producing responses.&lt;&#x2F;p&gt;
&lt;p&gt;The demo version of this: an agent that can look up flight prices, check calendar availability, book a seat, and confirm the booking in a single conversational interaction.&lt;&#x2F;p&gt;
&lt;p&gt;The production version of this: an agent that can do the above 10,000 times per day, on inputs that don&#x27;t match the training distribution, with graceful handling of edge cases, a full audit trail of every decision, and escalation logic that surfaces the cases the agent shouldn&#x27;t handle autonomously.&lt;&#x2F;p&gt;
&lt;p&gt;The gap between those two descriptions is where most agentic AI projects fail.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-most-agents-don-t-work-in-production&quot;&gt;Why most agents don&#x27;t work in production&lt;&#x2F;h2&gt;
&lt;h3 id=&quot;problem-1-retrieval-is-the-weakest-link&quot;&gt;Problem 1: Retrieval is the weakest link&lt;&#x2F;h3&gt;
&lt;p&gt;Agents are only as good as the information they can access. And in almost every real-world deployment, retrieving the right information, from the right source, in the right format, at the right time, is harder than building the agent logic.&lt;&#x2F;p&gt;
&lt;p&gt;Vector databases are not magic. Embedding quality degrades with domain-specific content. Retrieval precision drops sharply when the knowledge base is large, inconsistent, or poorly structured. The agent that works beautifully on clean, curated documentation fails silently on the messy reality of actual enterprise data.&lt;&#x2F;p&gt;
&lt;p&gt;The solution is to treat retrieval as an engineering problem of its own, with the same rigour as any other part of the system. Chunking strategies, embedding models, hybrid retrieval combining semantic and keyword search, metadata filtering, reranking. These are not implementation details; they are load-bearing engineering decisions. Retrieval is one of the layers I unpack in &lt;a href=&quot;&#x2F;insights&#x2F;not-talking-to-an-llm&#x2F;&quot;&gt;you&#x27;re not talking to an LLM, you&#x27;re talking to a system&lt;&#x2F;a&gt;.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;problem-2-error-propagation-in-multi-step-pipelines&quot;&gt;Problem 2: Error propagation in multi-step pipelines&lt;&#x2F;h3&gt;
&lt;p&gt;Single-step AI is relatively forgiving. If the model makes an error, you see it, you correct it, you move on.&lt;&#x2F;p&gt;
&lt;p&gt;Multi-step agentic pipelines are not forgiving. Each step&#x27;s output becomes the next step&#x27;s input. Errors compound. A misclassification in step 1 propagates through the entire pipeline and produces a confident, coherent, wrong answer at the end.&lt;&#x2F;p&gt;
&lt;p&gt;Production agents need explicit error detection at each step. They need to know when they&#x27;re uncertain. They need to know when the input doesn&#x27;t match the patterns they were built for. And they need escalation logic that fires reliably when these conditions occur. Not optimistically, but conservatively.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;problem-3-the-state-management-problem&quot;&gt;Problem 3: The state management problem&lt;&#x2F;h3&gt;
&lt;p&gt;Conversational agents need to maintain context across a session. Orchestration agents need to manage state across multiple tool calls. Long-running agents need to persist state across interruptions and restarts.&lt;&#x2F;p&gt;
&lt;p&gt;State management in agentic systems is genuinely hard. The naive approach, passing the entire conversation history as context on every API call, doesn&#x27;t scale and degrades performance. The sophisticated approach (explicit state machines, persistent state stores, careful context window management) requires engineering that most early-stage AI projects don&#x27;t budget for.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;problem-4-testing-is-much-harder-than-for-traditional-software&quot;&gt;Problem 4: Testing is much harder than for traditional software&lt;&#x2F;h3&gt;
&lt;p&gt;You can write unit tests for functions. You can write integration tests for APIs. Testing an AI agent that takes non-deterministic multi-step actions is a different problem.&lt;&#x2F;p&gt;
&lt;p&gt;How do you test a system where the same input can produce different outputs? How do you build a test suite that catches regressions in natural language reasoning? How do you know your agent will handle the edge cases it hasn&#x27;t seen during development?&lt;&#x2F;p&gt;
&lt;p&gt;The answer involves a combination of golden-set evaluation (specific inputs with expected outputs), adversarial testing (deliberately trying to break the agent), and production monitoring (catching failures when they happen and feeding them back into the development process). None of this is impossible, but all of it requires deliberate engineering investment.&lt;&#x2F;p&gt;
&lt;h3 id=&quot;problem-5-trust-and-adoption&quot;&gt;Problem 5: Trust and adoption&lt;&#x2F;h3&gt;
&lt;p&gt;Even when the agent works technically, getting people to trust it, and therefore use it, is its own challenge.&lt;&#x2F;p&gt;
&lt;p&gt;A customer service agent that resolves 67% of cases autonomously is only valuable if the remaining 33% that require human handling are escalated smoothly, with full context, to agents who trust the system enough to take the handoff efficiently.&lt;&#x2F;p&gt;
&lt;p&gt;Building this trust requires transparency: the human needs to understand why the agent is escalating, what it&#x27;s already determined, and what information it retrieved. Black-box escalation (&quot;here&#x27;s a case the AI couldn&#x27;t handle, good luck&quot;) destroys the value of the automated component.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-makes-agentic-ai-actually-work-in-production&quot;&gt;What makes agentic AI actually work in production&lt;&#x2F;h2&gt;
&lt;p&gt;Based on the systems I&#x27;ve built, the patterns that distinguish production-ready agents from demo-ready agents are:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Retrieval engineering as a first-class concern.&lt;&#x2F;strong&gt; Not an afterthought. Not a vector database with default settings. A carefully designed retrieval architecture with tested precision and recall at real-world data scale.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Conservative escalation logic.&lt;&#x2F;strong&gt; Better to escalate unnecessarily than to resolve incorrectly. The agent should know what it doesn&#x27;t know, and be honest about it.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Explicit state management.&lt;&#x2F;strong&gt; Every step of the pipeline has observable state. Every decision is logged. Every error is caught and categorised.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Staged rollout with real metrics.&lt;&#x2F;strong&gt; Start at 10% of production volume. Measure resolution quality, escalation rate, user satisfaction, and business outcomes. Scale when the numbers are good.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Human-in-the-loop design that respects human time.&lt;&#x2F;strong&gt; The escalation handoff should give the human everything they need to pick up efficiently, not a blank canvas.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Production monitoring that catches drift.&lt;&#x2F;strong&gt; AI system performance degrades over time as input distributions change. The system needs to monitor for this and surface it proactively.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-if-you-re-deploying-agents&quot;&gt;What this means if you&#x27;re deploying agents&lt;&#x2F;h2&gt;
&lt;p&gt;If you&#x27;re building agentic AI in 2026, the question &quot;can our agent do X?&quot; is the wrong starting point. The question is &quot;can our agent do X reliably, in production, at the scale we need, with the error handling and auditability our customers require?&quot;&lt;&#x2F;p&gt;
&lt;p&gt;Most demos say yes to the first question. Very few production systems actually deliver on the second.&lt;&#x2F;p&gt;
&lt;p&gt;The founders and small to mid-market teams I work with who get this right are the ones who treat the production engineering question as seriously as the model selection question. From day one, not after launch.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Building an agentic AI product and want a second opinion on your architecture approach? &lt;a href=&quot;&#x2F;contact&quot;&gt;Let&#x27;s talk.&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-projects-fail&#x2F;&quot;&gt;How to unlock AI ROI: what the 20% do differently&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;why-ai-spend-not-in-numbers&#x2F;&quot;&gt;Why your AI spend isn&#x27;t showing up in the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I approach production AI deployment&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The most important skill in business and life</title>
        <published>2023-11-09T00:00:00+00:00</published>
        <updated>2023-11-09T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/navigating-uncertainty/"/>
        <id>https://ctozen.com/insights/navigating-uncertainty/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/navigating-uncertainty/">&lt;p&gt;CEO confidence in revenue growth has hit a five-year low. PwC&#x27;s 2026 survey of 4,454 chief executives across 95 countries found that only 30% are confident, down from 56% in 2022. Mentions of &quot;uncertainty&quot; appeared in 87% of public earnings statements in early 2025. Glassdoor reviews mentioning uncertainty are up 80% year-over-year.&lt;&#x2F;p&gt;
&lt;p&gt;The five highest measurements of global economic policy uncertainty ever recorded have all come in the past five years. And this is before the full impact of AI reshapes entire industries.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve led technology teams through several of these inflection points, &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;at NHS Wales&lt;&#x2F;a&gt;, where the wrong call affects millions of people, and at startups, where the wrong call means you run out of runway. The pattern I&#x27;ve noticed is that the skill separating the leaders who thrive from those who freeze is always the same: &lt;strong&gt;the ability to make good decisions when nobody knows what happens next.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-uncertainty-breaks-most-leaders&quot;&gt;Why uncertainty breaks most leaders&lt;&#x2F;h2&gt;
&lt;p&gt;There&#x27;s a neuroscience explanation for why uncertainty is so paralysing.&lt;&#x2F;p&gt;
&lt;p&gt;When the brain encounters uncertain threats, the amygdala fires and cortisol rises. A systematic review of the research found that the resulting stress response significantly impairs decision-making on uncertainty-based tasks. Worse, and this is the part that should concern every leader, elevated cortisol impairs metacognition. You make worse decisions &lt;em&gt;and you don&#x27;t realise it&lt;&#x2F;em&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;This creates a predictable failure pattern in organisations. The leader faces an uncertain situation, say, whether to invest heavily in AI or wait for the technology to mature. Stress rises. The brain&#x27;s threat response activates. And instead of reasoning clearly, the leader defaults to one of two failure modes: analysis paralysis (delay the decision indefinitely, commission another report) or false certainty (pick a direction with excessive conviction, ignore disconfirming evidence).&lt;&#x2F;p&gt;
&lt;p&gt;Both are catastrophic. And both are everywhere.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-the-evidence-says-actually-works&quot;&gt;What the evidence says actually works&lt;&#x2F;h2&gt;
&lt;p&gt;Jim Collins and Morten Hansen spent nine years studying companies that outperformed their industry index by 10x in chaotic, uncertain environments. Their findings upend the conventional wisdom about visionary leadership.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;10X leaders were not more creative, more visionary, or more risk-seeking than comparison leaders.&lt;&#x2F;strong&gt; They were more disciplined, more empirical, and more paranoid.&lt;&#x2F;p&gt;
&lt;p&gt;Three specific behaviours defined them:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Empirical creativity: &quot;fire bullets, then cannonballs.&quot;&lt;&#x2F;strong&gt; Before making big bets, 10X leaders ran small, low-cost experiments to calibrate their aim. They didn&#x27;t rely on analysis or intuition alone. They tested. Then, when they had empirical evidence that something worked, they committed resources aggressively. The comparison companies did the opposite: they fired cannonballs first, making large bets based on untested assumptions.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Fanatic discipline.&lt;&#x2F;strong&gt; 10X companies changed only 10-20% of their strategies over 20 years. Comparison companies changed 55-70%. Consistency, sticking with a validated approach even when the environment was chaotic, beat agility. This is counterintuitive. Most leadership advice says &quot;be agile, pivot fast.&quot; The data says the opposite: find what works and hold the line.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Productive paranoia.&lt;&#x2F;strong&gt; 10X leaders built financial buffers and contingency plans &lt;em&gt;before&lt;&#x2F;em&gt; crises hit, not during them. They assumed bad things would happen and prepared accordingly. They carried more cash, had more contingency plans, and were more attuned to threats than their peers.&lt;&#x2F;p&gt;
&lt;p&gt;The most striking finding: luck was evenly distributed across both groups. What differed was what leaders did with it.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-ooda-loop-and-pattern-recognition&quot;&gt;The OODA loop and pattern recognition&lt;&#x2F;h2&gt;
&lt;p&gt;Colonel John Boyd developed the OODA loop (Observe, Orient, Decide, Act) from studying air combat. His insight was that the entity which cycles through this loop fastest doesn&#x27;t just win. It creates confusion and disorientation in the opponent.&lt;&#x2F;p&gt;
&lt;p&gt;But Boyd&#x27;s most important contribution was identifying &lt;em&gt;orientation&lt;&#x2F;em&gt; as the bottleneck. Not observation, not decision, not action, but the mental model you bring to the situation. Your culture, experience, prior assumptions, and cognitive biases all shape how you interpret what you observe. And when your mental model is wrong, faster action just means you fail faster.&lt;&#x2F;p&gt;
&lt;p&gt;Gary Klein&#x27;s research on expert decision-makers confirms this. He studied firefighters, military officers, and surgeons making high-stakes decisions under time pressure. He found that experts don&#x27;t compare options analytically. They use pattern recognition from experience to rapidly identify a workable course of action. The quality of their decisions depends on the quality of their mental models, which depend on the depth and breadth of their experience. I saw this firsthand when I served as &lt;a href=&quot;&#x2F;impact&#x2F;expert-witness&#x2F;&quot;&gt;expert witness in a Hong Kong High Court case&lt;&#x2F;a&gt; involving 39 casualties. The analysis that changed the outcome didn&#x27;t come from better data. It came from a different way of looking at the same evidence.&lt;&#x2F;p&gt;
&lt;p&gt;This is why the best decisions under uncertainty come from leaders who have &lt;strong&gt;built diverse mental models through operating experience&lt;&#x2F;strong&gt;, not from leaders who have the best analysis or the most data.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-this-means-in-the-age-of-ai&quot;&gt;What this means in the age of AI&lt;&#x2F;h2&gt;
&lt;p&gt;The current AI transition is the most significant source of uncertainty in business since the internet. And most leaders are handling it badly.&lt;&#x2F;p&gt;
&lt;p&gt;EY&#x27;s 2026 CEO Outlook found that 82% of CEOs are more optimistic about AI than a year ago, but 60% admit they&#x27;ve intentionally slowed implementation due to fear of errors. Half believe their job stability depends on successfully integrating AI. Gartner reports that 72% of CIOs are breaking even or losing money on AI investments.&lt;&#x2F;p&gt;
&lt;p&gt;The pattern is familiar. Uncertainty triggers the stress response. Leaders oscillate between two extremes: rushing to adopt AI without understanding the risks (false certainty) or delaying meaningful action while commissioning strategy decks (analysis paralysis).&lt;&#x2F;p&gt;
&lt;p&gt;Collins&#x27;s research suggests a better approach, one I use with every organisation I work with:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Fire bullets first.&lt;&#x2F;strong&gt; Don&#x27;t bet the business on a comprehensive AI transformation. Pick one high-value process. Build a focused AI solution. Measure it against a real business metric. Learn from what works and what doesn&#x27;t. Then, and only then, fire the cannonball.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Be empirical, not ideological.&lt;&#x2F;strong&gt; The AI debate has become tribal: accelerationists versus sceptics, maximalists versus minimalists. None of that matters. What matters is whether the specific AI system you&#x27;re building moves a specific number that your business cares about. &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;If it doesn&#x27;t, kill it.&lt;&#x2F;a&gt; If it does, scale it.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Build the buffer.&lt;&#x2F;strong&gt; Productive paranoia in the AI context means keeping the skills and the people you&#x27;ll need if the technology doesn&#x27;t deliver what it promises. It means not cutting your &lt;a href=&quot;&#x2F;insights&#x2F;training-ladder-broken&#x2F;&quot;&gt;junior engineering pipeline&lt;&#x2F;a&gt; to fund AI tools that might not work. It means maintaining optionality, the ability to change course without catastrophic cost.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-kodak-lesson&quot;&gt;The Kodak lesson&lt;&#x2F;h2&gt;
&lt;p&gt;Kodak invented the digital camera in 1975. Their leadership buried it because they feared cannibalising their film business. Despite a succession of new CEOs and mounting evidence that digital photography was the future, the organisation couldn&#x27;t adapt. The result: an 80% workforce decline, a collapsed stock price, and a bankruptcy filing after more than a century of market dominance.&lt;&#x2F;p&gt;
&lt;p&gt;Nokia&#x27;s engineers presented a full touchscreen phone prototype to management. The response: &quot;that&#x27;s not how phones work.&quot; The company lost roughly $100 billion in market value.&lt;&#x2F;p&gt;
&lt;p&gt;Blockbuster&#x27;s executives dismissed Netflix as a &quot;very small niche business&quot; and turned down a $50 million acquisition offer. Netflix is now worth over $100 billion.&lt;&#x2F;p&gt;
&lt;p&gt;In every case, the failure wasn&#x27;t a lack of information. It was a failure of orientation. Boyd&#x27;s bottleneck. The mental models of the leaders in the room couldn&#x27;t accommodate the change they were facing. So they didn&#x27;t.&lt;&#x2F;p&gt;
&lt;p&gt;The leaders navigating AI well today, and I include Satya Nadella&#x27;s transformation of Microsoft from a $300 billion company to a $2.5 trillion one among the best examples, share a different orientation. They treat their mental models as provisional. They expect to be wrong about some things. They test, measure, and adjust. And they maintain the discipline to hold their nerve when the uncertainty is highest.&lt;&#x2F;p&gt;
&lt;p&gt;Eisenhower put it best: &quot;Plans are worthless, but planning is everything.&quot; The value isn&#x27;t in the plan. It&#x27;s in the thinking the plan forces you to do. And when reality diverges from the plan, as it always does, that thinking is what allows you to improvise effectively.&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s the skill. Not predicting the future. Not having the best strategy. Not being the most decisive person in the room. Just the ability to think clearly when nobody knows what happens next, and to act on that thinking with discipline.&lt;&#x2F;p&gt;
&lt;p&gt;It&#x27;s the most important skill in business. And it&#x27;s learnable.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;If you&#x27;re making technology decisions under uncertainty and want a second perspective, a &lt;a href=&quot;&#x2F;services&#x2F;#fractional-cto&quot;&gt;fractional CTO engagement&lt;&#x2F;a&gt; is one way to get it. Or simply &lt;a href=&quot;&#x2F;contact&quot;&gt;get in touch&lt;&#x2F;a&gt;.&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;ai-transformation-performance-art&#x2F;&quot;&gt;Turning AI theatre into AI that moves the numbers&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;sculpting-world-class-tech-team&#x2F;&quot;&gt;Your world-class engineering team is already in the building&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;NHS Wales: transformation at national scale&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I work&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>The most powerful yet overlooked tool in a leader&#x27;s arsenal</title>
        <published>2023-10-13T00:00:00+00:00</published>
        <updated>2023-10-13T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/power-of-thank-you/"/>
        <id>https://ctozen.com/insights/power-of-thank-you/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/power-of-thank-you/">&lt;p&gt;A Wharton study gave a university director one task: visit a group of fundraisers and say, sincerely, &quot;I am very grateful for your hard work. We sincerely appreciate your contributions.&quot;&lt;&#x2F;p&gt;
&lt;p&gt;That was it. No bonus. No promotion. No formal programme. Just a human being telling other human beings that their work mattered.&lt;&#x2F;p&gt;
&lt;p&gt;The result: the thanked group made &lt;strong&gt;50% more fundraising calls&lt;&#x2F;strong&gt; the following week. The control group (same job, same pay, same conditions, no visit) showed zero change.&lt;&#x2F;p&gt;
&lt;p&gt;The researchers, Adam Grant and Francesca Gino, found the mechanism wasn&#x27;t confidence. It was social worth. Being thanked made people feel valued as members of a community. And that feeling, belonging and mattering, drove behaviour more powerfully than any incentive scheme could.&lt;&#x2F;p&gt;
&lt;p&gt;This is not a soft finding. It&#x27;s one of the hardest results in organisational psychology. And most leaders ignore it.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-numbers-behind-thank-you&quot;&gt;The numbers behind &quot;thank you&quot;&lt;&#x2F;h2&gt;
&lt;p&gt;Gallup and Workhuman ran a longitudinal study tracking 3,400 employees from 2022 to 2024. The headline finding: employees who received high-quality recognition were &lt;strong&gt;45% less likely to have left&lt;&#x2F;strong&gt; two years later.&lt;&#x2F;p&gt;
&lt;p&gt;That&#x27;s not a marginal effect. That&#x27;s nearly half of voluntary turnover, preventable.&lt;&#x2F;p&gt;
&lt;p&gt;The same study found that employees who felt fulfilled by recognition were 4x as likely to be engaged and 65% less likely to be actively job-searching. For a 10,000-person company, Gallup estimates that a strong recognition culture saves up to &lt;strong&gt;$16.1 million in turnover costs annually&lt;&#x2F;strong&gt;.&lt;&#x2F;p&gt;
&lt;p&gt;And yet only 22% of employees say they receive the right amount of recognition. That number hasn&#x27;t moved since 2022.&lt;&#x2F;p&gt;
&lt;p&gt;The data from other sources converges on the same point. O.C. Tanner&#x27;s Global Culture Report found employees are 18x more likely to produce great work when recognition is integrated and personalised. Deloitte found that organisations with recognition programmes see 14% higher engagement, productivity, and performance than those without. A 15% improvement in engagement translates to a 2% increase in margins.&lt;&#x2F;p&gt;
&lt;p&gt;79% of employees who quit cite lack of appreciation as a major reason. 59% say they have never had a boss who truly appreciates them. These are not small numbers. These are systemic failures of leadership.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-happens-in-the-brain&quot;&gt;What happens in the brain&lt;&#x2F;h2&gt;
&lt;p&gt;The neuroscience is straightforward. Recognition triggers dopamine, the reward signal that drives motivation and reinforces behaviour. It also triggers oxytocin, which builds trust and social bonding. And serotonin, associated with well-being and satisfaction.&lt;&#x2F;p&gt;
&lt;p&gt;The dopamine effect creates a reinforcement loop: when someone is recognised for a behaviour, the neurochemical reward makes them more likely to repeat it. This isn&#x27;t metaphorical. It&#x27;s the same mechanism that makes habits form. Recognition literally rewires the brain&#x27;s reward circuitry.&lt;&#x2F;p&gt;
&lt;p&gt;An Indiana University study found that participants who practised gratitude showed lasting increases in neural sensitivity in the medial prefrontal cortex, months after the intervention. The brain doesn&#x27;t just respond to recognition in the moment. It adapts structurally over time, becoming more attuned to positive signals. Gratitude, given and received, compounds.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;the-story-i-can-t-forget&quot;&gt;The story I can&#x27;t forget&lt;&#x2F;h2&gt;
&lt;p&gt;Years ago I was working with an exceptional team of software engineers at a company that was, genuinely, considered a gold standard in culture and talent development.&lt;&#x2F;p&gt;
&lt;p&gt;The team had just finished a multi-month crunch. They&#x27;d been fighting like lions to enable the business to win a massive multi-year, multi-billion pound contract. Weekends. Late nights. The kind of sustained intensity that burns people out even when the outcome is good.&lt;&#x2F;p&gt;
&lt;p&gt;One engineer was the technical leader of the entire effort. He&#x27;d worked harder, and had done more to make the successful outcome happen, than anyone else on the team.&lt;&#x2F;p&gt;
&lt;p&gt;When the win was announced, the company&#x27;s senior leaders praised their own effort. Publicly and privately, they thanked themselves. The engineer who had made it happen, the person who&#x27;d given months of his life to the outcome, went unnoticed.&lt;&#x2F;p&gt;
&lt;p&gt;What followed was predictable, if you know the research. He quietly quit. Not formally. He stayed in his seat. But the engagement evaporated. He became negative. The negativity spread. Productivity across the team visibly dropped.&lt;&#x2F;p&gt;
&lt;p&gt;Within a few months, he left. Several colleagues followed. These were engineers with rare, specialised skills, the kind that take years to develop and months to replace. The cost to the company was enormous, and it compounded for over a year.&lt;&#x2F;p&gt;
&lt;p&gt;At the exit interview he told me that the lack of appreciation was the turning point. He&#x27;d realised the leadership didn&#x27;t care about their people. That the values and the speeches were, in his experience, hollow.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;All it would have taken was a &quot;thank you.&quot;&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Gallup&#x27;s data says employees who received recognition in the prior month saw their attrition risk decrease by 58%, their perception of growth opportunities increase by 108%, and their positive perception of leaders increase by 180%. One moment of genuine recognition. Those are the numbers.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-this-matters-more-now-than-ever&quot;&gt;Why this matters more now than ever&lt;&#x2F;h2&gt;
&lt;p&gt;I want to make the connection to the current moment, because I think it&#x27;s important.&lt;&#x2F;p&gt;
&lt;p&gt;We are in the middle of the most significant transformation of knowledge work in a generation. AI is changing what engineers do, how teams are structured, and what skills matter. McKinsey found that 70% of large-scale transformations fail, and the primary reasons are employee resistance and poor change management. BCG puts it at 70% of AI implementation challenges being people problems, not technical ones.&lt;&#x2F;p&gt;
&lt;p&gt;People who are anxious about their relevance, and a lot of engineers are anxious right now, don&#x27;t need another all-hands presentation about the AI roadmap. They need to know that their contribution matters. That the years of expertise they&#x27;ve built aren&#x27;t invisible to the people making decisions about the future.&lt;&#x2F;p&gt;
&lt;p&gt;Nearly half of CEOs in one survey admitted their employees were resistant or openly hostile to AI-driven changes. The instinct is to respond with more communication, more training, more change management programmes. And those things matter. But the most powerful intervention is also the simplest: seeing the person, acknowledging their work, and saying thank you.&lt;&#x2F;p&gt;
&lt;p&gt;Highly engaged employees experience significantly fewer negative effects of change-related stress. And engagement, as every study cited above confirms, is driven more by recognition than by any other single factor.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;making-it-real&quot;&gt;Making it real&lt;&#x2F;h2&gt;
&lt;p&gt;If recognition is so effective and so cheap, why don&#x27;t more leaders do it?&lt;&#x2F;p&gt;
&lt;p&gt;HBR published research showing that &lt;strong&gt;more powerful people express less gratitude&lt;&#x2F;strong&gt;. As leaders gain authority, they become less attuned to the contributions of others. It&#x27;s not malice. It&#x27;s a cognitive shift. Power narrows attention toward goals and away from people. The very thing that makes someone an effective leader in some dimensions makes them worse at the human dimension that matters most.&lt;&#x2F;p&gt;
&lt;p&gt;The fix is simple but requires intention.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Be immediate.&lt;&#x2F;strong&gt; Recognition delayed is recognition diluted. The Grant&#x2F;Gino study showed the effect in the &lt;em&gt;same week&lt;&#x2F;em&gt;. When someone does good work, say so then. Not at the next performance review. Not at the next all-hands. Now.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Be specific.&lt;&#x2F;strong&gt; &quot;Great job&quot; is better than silence. But &quot;the way you structured that data pipeline saved us two weeks of rework&quot; is better than &quot;great job.&quot; Specificity signals that you actually noticed the work, not just the outcome.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Be genuine.&lt;&#x2F;strong&gt; People detect performative gratitude instantly. If you don&#x27;t mean it, don&#x27;t say it. But if you genuinely believe someone&#x27;s work mattered, and if you&#x27;re paying attention you&#x27;ll find evidence of this every day, say so. The bar is not eloquence. The bar is sincerity.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Make it structural, not optional.&lt;&#x2F;strong&gt; The organisations with the best retention don&#x27;t rely on individual managers remembering to say thank you. They build recognition into the operating rhythm: weekly stand-ups, code reviews, project retrospectives. Google&#x27;s gThanks programme lets any employee nominate a colleague for a small bonus. The mechanism matters less than the consistency.&lt;&#x2F;p&gt;
&lt;p&gt;The engineering leader who builds a culture of genuine recognition will retain more of their best people, build more resilient teams, and navigate the AI transition with less friction than the one who doesn&#x27;t. I&#x27;ve led teams at &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;national scale in the public sector&lt;&#x2F;a&gt; and in &lt;a href=&quot;&#x2F;impact&#x2F;national-security-platform&#x2F;&quot;&gt;defence environments where the pressure is relentless&lt;&#x2F;a&gt;. The teams that delivered best were the ones where recognition was part of the operating culture, not an afterthought. The data is unambiguous on this point.&lt;&#x2F;p&gt;
&lt;p&gt;And it starts with two words.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;best-technology-rarely-wins&#x2F;&quot;&gt;The best technology rarely wins. The best-led team does&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;training-ladder-broken&#x2F;&quot;&gt;The training ladder is broken&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#fractional-cto&quot;&gt;Fractional CTO services&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Fractional CTO. Is outsourced technology leadership right for you?</title>
        <published>2022-09-28T00:00:00+00:00</published>
        <updated>2022-09-28T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/fractional-cto-right-for-you/"/>
        <id>https://ctozen.com/insights/fractional-cto-right-for-you/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/fractional-cto-right-for-you/">&lt;p&gt;The fractional CTO model exists because of a real gap. You need senior technical leadership for architecture decisions, investor credibility, hiring, and AI strategy, but a full-time CTO hire is either unaffordable, impractical, or premature at your stage.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve been on both sides of this. I&#x27;ve been the full-time CTO. I&#x27;ve been the fractional CTO stepping into organisations mid-crisis. And I&#x27;ve worked with companies that tried to do without one entirely. The last option is almost always the most expensive.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;when-does-a-fractional-cto-engagement-work&quot;&gt;When does a fractional CTO engagement work?&lt;&#x2F;h2&gt;
&lt;p&gt;A fractional CTO engagement works when the organisation has real technology decisions to make but doesn&#x27;t yet need, or can&#x27;t yet justify, a full-time executive hire.&lt;&#x2F;p&gt;
&lt;p&gt;That typically looks like:&lt;&#x2F;p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Post-seed or Series A.&lt;&#x2F;strong&gt; You have 3-20 engineers and no technical co-founder. Architecture decisions are being made by the most senior developer, who may be excellent at engineering but hasn&#x27;t done this at company level before.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Technical co-founder departure.&lt;&#x2F;strong&gt; The scenario every funded startup dreads. I stepped into exactly this situation at Mentor360, stabilised the team, and secured a Royal Navy contract during a period that could have derailed the company.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;AI deployment.&lt;&#x2F;strong&gt; You&#x27;re building AI into your product or operations and need someone who&#x27;s done it in production, not someone who&#x27;s read about it. The &lt;a href=&quot;&#x2F;impact&#x2F;insurance-automation&#x2F;&quot;&gt;insurance automation case&lt;&#x2F;a&gt; is a good example of what this looks like in practice: agentic AI resolving 67% of cases autonomously, designed and deployed during a fractional engagement.&lt;&#x2F;li&gt;
&lt;li&gt;&lt;strong&gt;Fundraise or exit preparation.&lt;&#x2F;strong&gt; Investors and acquirers will ask hard technology questions. You need a credible technical voice in the room.&lt;&#x2F;li&gt;
&lt;&#x2F;ul&gt;
&lt;p&gt;In these situations, a fractional CTO at 2 days per week gives you 80% of the strategic value of a full-time hire at a fraction of the cost, without the six-month recruitment process or the risk of a bad hire.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-are-the-honest-disadvantages-of-a-fractional-cto&quot;&gt;What are the honest disadvantages of a fractional CTO?&lt;&#x2F;h2&gt;
&lt;p&gt;I&#x27;d be doing you a disservice if I didn&#x27;t name these.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Limited availability.&lt;&#x2F;strong&gt; A fractional CTO isn&#x27;t in the building every day. The trade-off is that you&#x27;re paying exclusively for the high-value strategic work (architecture decisions, hiring guidance, investor preparation) rather than a full-time salary that includes meetings, email, and organisational overhead. Whether that trade-off works depends on your stage and needs.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Temporary by design.&lt;&#x2F;strong&gt; A fractional engagement isn&#x27;t a permanent commitment. That&#x27;s a feature for some organisations and a limitation for others. The best arrangements either evolve into something more permanent or end with a clear handover to a full-time hire.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Less skin in the game.&lt;&#x2F;strong&gt; A fractional CTO doesn&#x27;t have equity (I don&#x27;t take it; it keeps incentives clean). That means the relationship is professional, not existential. For most engagements, this is healthy. But it&#x27;s worth being honest about the dynamic.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Quality varies enormously.&lt;&#x2F;strong&gt; This is the biggest risk. The fractional CTO market has grown fast, and not everyone offering the service has the experience to back it up. &quot;CTO&quot; on a LinkedIn profile doesn&#x27;t mean someone has built production systems, managed engineering teams, or sat in an investor meeting with hard questions coming.&lt;&#x2F;p&gt;
&lt;p&gt;Two things to look for: &lt;strong&gt;independence&lt;&#x2F;strong&gt; and &lt;strong&gt;operating experience&lt;&#x2F;strong&gt;. A fractional CTO provided by a development agency has a structural conflict of interest. Their agency benefits from recommending more development work. And a fractional CTO who has only advised but never built lacks the pattern recognition that comes from being in the seat when things go wrong. &lt;a href=&quot;&#x2F;about&#x2F;&quot;&gt;My background&lt;&#x2F;a&gt; includes four exits, CTO roles at NHS Wales and in defence technology, and production AI systems. That operating experience is what makes fractional work credible.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;how-do-you-evaluate-whether-a-fractional-cto-is-right-for-your-business&quot;&gt;How do you evaluate whether a fractional CTO is right for your business?&lt;&#x2F;h2&gt;
&lt;p&gt;Before engaging a fractional CTO, including me, ask yourself:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Do you have real decisions to make?&lt;&#x2F;strong&gt; If you need architecture direction, hiring guidance, or investor preparation in the next 90 days, a fractional CTO earns their fee quickly. If you need someone to write code, hire a senior engineer.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Is the timing right?&lt;&#x2F;strong&gt; A fractional CTO adds the most value at inflection points: post-raise, pre-exit, during a scaling push, or when AI deployment is on the roadmap. If you&#x27;re in steady-state with a functioning team, lighter-touch advisory may be more appropriate.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Can you define the scope?&lt;&#x2F;strong&gt; The best fractional engagements have clear boundaries: weekly cadence, defined responsibilities, specific deliverables at 90 days. Open-ended arrangements tend to drift.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;is-a-fractional-cto-right-for-you&quot;&gt;Is a fractional CTO right for you?&lt;&#x2F;h2&gt;
&lt;p&gt;Having senior technical leadership is not optional for technology-driven businesses. The arrangement (full-time, fractional, or advisory) depends on your stage, budget, and needs.&lt;&#x2F;p&gt;
&lt;p&gt;If you&#x27;re weighing the options, &lt;a href=&quot;&#x2F;contact&quot;&gt;a conversation is a good starting point&lt;&#x2F;a&gt;. No commitment, just an honest assessment of what you actually need.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;does-your-business-need-a-cto&#x2F;&quot;&gt;Does your business need a CTO?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;services&#x2F;#fractional-cto&quot;&gt;Fractional CTO service details and pricing&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I work&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;impact&#x2F;public-sector-scale&#x2F;&quot;&gt;Case study: NHS Wales transformation&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
    <entry xml:lang="en">
        <title>Does your business need a CTO?</title>
        <published>2022-06-30T00:00:00+00:00</published>
        <updated>2022-06-30T00:00:00+00:00</updated>
        
        <author>
          <name>
            
              Unknown
            
          </name>
        </author>
        
        <link rel="alternate" type="text/html" href="https://ctozen.com/insights/does-your-business-need-a-cto/"/>
        <id>https://ctozen.com/insights/does-your-business-need-a-cto/</id>
        
        <content type="html" xml:base="https://ctozen.com/insights/does-your-business-need-a-cto/">&lt;p&gt;I&#x27;ve watched this pattern play out dozens of times. A founder or managing director knows their business depends on technology but isn&#x27;t sure whether they need a CTO. They assume it&#x27;s a hire they can defer, something for later, when the budget is bigger or the product is further along.&lt;&#x2F;p&gt;
&lt;p&gt;Deferring the CTO decision is almost always a mistake. By the time founders realise it, the cost of operating without senior technical leadership has already compounded.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-is-the-real-cost-of-operating-without-a-cto&quot;&gt;What is the real cost of operating without a CTO?&lt;&#x2F;h2&gt;
&lt;p&gt;The obvious risk is bad architecture decisions. But the deeper problem is subtler: &lt;strong&gt;you don&#x27;t know what you don&#x27;t know.&lt;&#x2F;strong&gt;&lt;&#x2F;p&gt;
&lt;p&gt;Without senior technical leadership, you can&#x27;t evaluate whether your engineering team is building the right thing in the right way. You can&#x27;t assess vendor claims. You can&#x27;t tell whether your AI pilot is genuinely promising or a well-produced demo. You can&#x27;t prepare for the technical questions investors or acquirers will ask. The kind of &lt;a href=&quot;&#x2F;impact&#x2F;expert-witness&#x2F;&quot;&gt;analytical rigour that changes outcomes&lt;&#x2F;a&gt; simply isn&#x27;t present without someone whose job it is to provide it.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve seen companies lose months of engineering time, sometimes years, to decisions that a competent CTO would have caught in week one. Wrong cloud provider for the use case. Wrong data architecture for the product. Wrong hiring profile for the stage. Each of these compounds quietly until the bill comes due, usually at the worst possible moment: during a fundraise, an exit process, or a scaling push.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;what-does-a-cto-actually-do&quot;&gt;What does a CTO actually do?&lt;&#x2F;h2&gt;
&lt;p&gt;The title means different things at different stages. But at its core, a CTO does six things:&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Sets technology strategy.&lt;&#x2F;strong&gt; Not in isolation. In lockstep with business strategy. Every serious business decision has a technology dimension. The CTO ensures those dimensions are visible before commitments are made.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Defines what to build.&lt;&#x2F;strong&gt; Which products, which features, in what order. This isn&#x27;t a technical exercise. It&#x27;s a commercial one, informed by technical reality. The CTO translates business priorities into engineering priorities.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Decides how to build it.&lt;&#x2F;strong&gt; Build or buy. Internal team or outsource. Which stack, which architecture, which deployment model. These decisions determine speed, cost, and scalability for years. Getting them wrong at Series Seed creates Series B problems.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Executes the roadmap.&lt;&#x2F;strong&gt; Plans, builds or sources the team, manages the development process. Owns the delivery.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Translates between business and engineering.&lt;&#x2F;strong&gt; In board meetings, investor calls, and cross-functional discussions, the CTO makes technology decisions legible to non-technical stakeholders. This is where fundraises succeed or stall. Investors need to trust that someone credible is making the technology calls.&lt;&#x2F;p&gt;
&lt;p&gt;&lt;strong&gt;Leads the engineering team.&lt;&#x2F;strong&gt; Hires well, structures the team correctly for the stage, shields engineers from organisational noise so they can focus on deep work. The leadership mistakes at this stage are expensive. A bad senior hire can set you back six months.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;why-is-a-cto-s-role-strategic-not-just-technical&quot;&gt;Why is a CTO&#x27;s role strategic, not just technical?&lt;&#x2F;h2&gt;
&lt;p&gt;A great CTO isn&#x27;t the best coder in the room. They&#x27;re the person who sees what&#x27;s coming: the technology shift that creates an opportunity, the architectural decision that will box you in, the vendor relationship that&#x27;s about to become a liability.&lt;&#x2F;p&gt;
&lt;p&gt;I&#x27;ve held this role at national scale, at NHS Wales, where the wrong technology call has consequences for millions of people, and at startups, where the wrong call means you run out of runway. I&#x27;ve also been &lt;a href=&quot;&#x2F;impact&#x2F;founder-exit&#x2F;&quot;&gt;a founder four times&lt;&#x2F;a&gt;, so I know the other side of the table too. The stakes vary. The discipline is the same.&lt;&#x2F;p&gt;
&lt;h2 id=&quot;when-does-a-business-actually-need-a-cto&quot;&gt;When does a business actually need a CTO?&lt;&#x2F;h2&gt;
&lt;p&gt;Whether you&#x27;re a 5-person startup or a 200-person mid-market company, if technology is core to your business, you need CTO-calibre thinking in the room. The question is the arrangement.&lt;&#x2F;p&gt;
&lt;p&gt;A permanent, full-time CTO is ideal, but it&#x27;s expensive, takes months to hire well, and the cost of hiring the wrong one is severe. Various alternatives work: interim, fractional, or advisory arrangements all prove superior to having no senior technical leadership at all.&lt;&#x2F;p&gt;
&lt;p&gt;If a permanent hire isn&#x27;t feasible yet, &lt;a href=&quot;&#x2F;services#fractional-cto&quot;&gt;a fractional CTO engagement&lt;&#x2F;a&gt; gives you the expertise at a fraction of the cost, with none of the hiring risk and all of the strategic value.&lt;&#x2F;p&gt;
&lt;hr &#x2F;&gt;
&lt;p&gt;&lt;em&gt;Related: &lt;a href=&quot;&#x2F;insights&#x2F;fractional-cto-right-for-you&#x2F;&quot;&gt;Fractional CTO: is outsourced technology leadership right for you?&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;insights&#x2F;best-technology-rarely-wins&#x2F;&quot;&gt;The best technology rarely wins. The best-led team does&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;impact&#x2F;founder-exit&#x2F;&quot;&gt;Four companies, four exits&lt;&#x2F;a&gt; · &lt;a href=&quot;&#x2F;methodology&#x2F;&quot;&gt;How I work as a Fractional CTO&lt;&#x2F;a&gt;&lt;&#x2F;em&gt;&lt;&#x2F;p&gt;
</content>
        
    </entry>
</feed>
