<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Cassi AI]]></title><description><![CDATA[Where AI meets reality: how to think about the future, forecasting, strategy, Whitehall, Westminster, business, geopolitics, tech, defence, national security.]]></description><link>https://blog.cassi-ai.com</link><image><url>https://substackcdn.com/image/fetch/$s_!FocW!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31b68b2e-789b-4bf7-93d6-8a43c28b04c3_640x640.png</url><title>Cassi AI</title><link>https://blog.cassi-ai.com</link></image><generator>Substack</generator><lastBuildDate>Tue, 09 Jun 2026 01:29:55 GMT</lastBuildDate><atom:link href="https://blog.cassi-ai.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Keith P Dear]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[cassi-ai@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[cassi-ai@substack.com]]></itunes:email><itunes:name><![CDATA[Keith Dear]]></itunes:name></itunes:owner><itunes:author><![CDATA[Keith Dear]]></itunes:author><googleplay:owner><![CDATA[cassi-ai@substack.com]]></googleplay:owner><googleplay:email><![CDATA[cassi-ai@substack.com]]></googleplay:email><googleplay:author><![CDATA[Keith Dear]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The Blindspot in Sequoia’s Future Tech Stack Vision: Judgement]]></title><description><![CDATA[Two recent essays from Sequoia's Jack Dorsey and Julien Bek paint a vision of future enterprises in a world of ever more capable AIs. We offer a review and challenge.]]></description><link>https://blog.cassi-ai.com/p/the-blindspot-in-sequoias-future</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/the-blindspot-in-sequoias-future</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Sun, 31 May 2026 06:00:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!cC0W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cC0W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cC0W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cC0W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7499458,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/199851160?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cC0W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!cC0W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f399a6-9534-460f-bb8f-c6f953beb0a0_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Gemini-generated image of the future tech stack. nb. not a perfect illustration of the essay&#8217;s content, and still some typos such as my favourite: &#8216;autonomed snrdes&#8217; (!). But close enough to prettify the essay.</figcaption></figure></div><p style="text-align: justify;">Sequoia&#8217;s recent AI theses share a blindspot. Jack Dorsey argues that AI can replace hierarchy inside the firm. Julien Bek argues that the biggest AI companies will stop selling tools and start selling the work itself. Both may be right. But both visions miss the key element of the stack: judgement.</p><p style="text-align: justify;">You can automate heuristics. You can automate routing, triage and much of the routine work that fills calendars and part-justifies organisational roles and structure. But the hard part of management is deciding when not to act, how to evaluate trade-offs, how to identify context shifts from weak signals that invalidate previous heuristics, how to anticipate and make the calls before the signal shows up in the data flows (when it is sometimes too late), to predict and pre-empt. That is judgement. Failing to do these things is the stuff of disciplinary and improvement plan action for middle-managers. Without it, speed is just a faster way to leave the road.</p><p style="text-align: justify;">In his co-authored article <em><a href="https://sequoiacap.com/article/from-hierarchy-to-intelligence/">From Hierarchy to Intelligence</a> </em>Jack Dorsey argues that &#8216;<em>&#8230;speed is the best predictor of start-up success&#8217;,</em> and that a company &#8220;world model&#8221; can replace much of the information flow once carried by middle-management. He is right that hierarchy evolved partly as an information-routing system. He is also right that remote-first, machine-readable companies give AI far richer raw material than older organisations ever did. But middle-management does not just move information. It makes judgement calls, constantly.</p><p style="text-align: justify;">Middle managers decide which customer complaint is justified despite not fitting the checklist of valid grievances, which delay can be tolerated, which opportunity is real, which risks are systemic and which temporary noise. They often take anticipatory action before the data is clear. Remove those layers and the decisions do not disappear. They rise. Soon the executive team is swamped by exceptions.</p><p style="text-align: justify;">To be fair, Dorsey does concede part of this. He says people at the edge will still handle intuition, cultural context, trust dynamics, ethical decisions, novel situations and high-stakes moments. But that concession is larger than he admits. In many organisations, that is not the residue of management. That is the job, and unless the top executives are going to manage all the exceptions and uncertainties &#8211; the judgement calls - themselves, they will be forced to reinvent middle-management again.</p><p style="text-align: justify;">Dorsey is right also that money is often an honest signal. It is not always a sufficient one. Take my own situation - a founder weighing a large but uncertain opening in insurance, a slowing renewal with a US defence prime that carries significant lifetime value, a safer but smaller renewal, or a bank opportunity with unclear timing - I cannot decide the prioritisation from revenue alone. The decision turns on expected value under uncertainty. Monetary value is not signal enough to make the call.</p><p style="text-align: justify;">Furthermore, as some of Cassi&#8217;s dilemmas above highlight, internal speed is rarely the issue in business. We can accept that speed could well be the best predictor of success and that creating greater speed through superior information flow and automated work co-ordination is generally a good thing &#8211; and assuming for the sake of argument the slightly problematic assumption that formal recorded data and the accounts are sufficient to do this &#8211; our own experience in government and the commercial sector is that slowness in <em>external </em>co-ordination is always much more of an issue. Speed-running <em>your</em> work only makes a difference on rare occasions. The main task becomes coordination with external actors, all running off (to you) asynchronous timetables.</p><p style="text-align: justify;">If internal speed mattered that much, it would be better tracked - yet while almost all companies track sales cycles, forecast when customers, investors and others will make decisions that affect revenue, cashflow etc., few if any organisations track internal decision-speed as key operational metric, nor is it something investors of all sizes and shapes show any particular interest in. If it were that predictive of success, you&#8217;d expect McKinsey to be turning up at every large corporate with a tried and tested formula to measure decision-speed, and improve it, and private equity would likewise make increasing decision-speed the central goal in turnarounds and post-takeover transformation. They don&#8217;t. Most organisations prioritise sound judgement over speed.</p><p style="text-align: justify;">Bek gets closer to the heart of the matter in <em><a href="https://sequoiacap.com/article/services-the-new-software/">Services: the New Software.</a></em> He separates intelligence from judgement, defining judgement as &#8220;knowing what to build next&#8221;. His copilot-autopilot distinction is sharp: copilots sell the tool; autopilots sell the work. But &#8220;knowing what to build next&#8221; is an insufficient definition, missing its prerequisite, the fundamental inescapable element of judgement: forecasting.</p><p style="text-align: justify;">The Cambridge Dictionary definition helps: judgement is the ability to form valuable opinions and make good decisions. Both halves of which again point you at forecasting. An opinion is valuable only if it improves your model of the world, and you can only know how good this is by testing how well it predicts. Similarly, taking the second half of the definition - a decision is a forecast with expected consequences attached: if we do X, Y becomes more likely. Strategy, tactics, prioritisation and planning are all, at their heart, conditional forecasts.</p><p style="text-align: justify;">Accept the argument above, and you are compelled to recognise that automating judgement is the frontier challenge. For Dorsey, the missing layer is a system that can decide when to act, when to wait, when to escalate and which trade-off to accept. For Bek, autopilots only capture the full labour budget once they can handle the judgement-heavy parts of the work, not just the intelligence-heavy ones.</p><p style="text-align: justify;">Current LLMs are not optimised for judgement. They are tuned, through RLHF and RLAIF, to produce answers that are coherent and acceptable to humans &#8211; optimised to seek human approval. As The New York Times <a href="https://www.nytimes.com/2026/01/25/opinion/ai-human-judgment.html">reported</a> in January, &#8220;judgment is the faculty we rely on when trade-offs are unavoidable and the right answer is not waiting to be computed&#8230; a uniquely human skill.&#8221; That view is echoed by the <a href="https://www.london.edu/think/good-news-for-human-beings-ai-doesnt-do-judgement">London</a> and <a href="https://www.hbs.edu/bigs/artificial-intelligence-human-jugment-drives-innovation">Harvard</a> Business Schools. It is also familiar. Each time a capability is labelled uniquely human, it tends to be temporary. Planning fell to chess engines, intuition to Go systems, bluffing to poker AIs. Creativity, discovery and reasoning have followed. Judgement will too.</p><p style="text-align: justify;">That is not to say this is easy. Today&#8217;s models can be shaped, constrained and fine-tuned into narrower behaviours, but their default orientation does not change. A system optimised for human approval &#8211; &#8220;<em>you&#8217;re absolutely right</em>&#8221;<sup>[1]</sup> - is ill suited to reliably &#8220;knowing what comes next&#8221;. That requires forecasting: reducing uncertainty about future states of the world. It is a different objective, it is hard, and optimising for it would mean telling your consumer base what they need to know, not what they want to hear &#8211; given (a) the dominant use case of LLMs is said to be therapy, (b) the timeless mantra that the customer is always right &#8211; delivering the unvarnished truth is likely to be value-destructive for currently dominant LLM companies. Moreover, building systems that do judgement well requires scaffolding, structure and empirical feedback that most current deployments lack. Dorsey&#8217;s model therefore assumes an optimisation function that does not yet exist in his proposed stack. It might be the most important frontier challenge. As Elon Musk has <a href="https://x.com/ElonClipsX/status/1911780383856697512?s=20">said</a> &#8220;The right metric for intelligence is probably the ability to predict the future. You&#8217;re as intelligent as you can predict the future well.&#8221;</p><p style="text-align: justify;">There are capabilities that allow you to do this. At Cassi our work reframes judgement as calibrated forecasting: explicit probabilities, tracked over time, tied to outcomes, predicting also the subobjectives and indicators that matter most. On ForecastBench, this approach has taken our forecasting to the level of the top individual human forecasters and close to the weighted superforecaster mean.Preliminary results on &#8216;benchmark&#8217; questions &#8211; those questions where there are no forecasts on prediction markets or similar to baseline against &#8211; show our latest model surpassing that superforecaster mean. &#8216;The weighted superforecaster mean&#8217; as the site reports it, is a wonkish measure of the best judgemental forecasts humanity can make under uncertainty. Once judgement is treated as something that can be scored, it can be improved.</p><p style="text-align: justify;">This is the missing layer in both Sequoia theses. If corporations are, as Norbert Wiener described, &#8220;machines of flesh and blood&#8221;, then much of their function is amenable to automation. But judgement does not disappear when you remove hierarchy or sell outcomes instead of tools. Selling outcomes is what we do at Cassi &#8211; a fully generalisable solution &#8211; and it cannot be done without judgemental forecasting. Without a system to support it, executives inherit thousands of unresolved decisions. To repeat: speed helps, but only if the direction is sound. If corporations are, as Dorsey, Bek and I seem to agree, decision-making systems, competition will be in both speed and judgement.</p><p style="text-align: justify;">This is also partly why management consulting remains valuable. It is judgement under uncertainty &#8211; you hire a smart, motivated hard-working group of people to enhance your judgement with theirs - which market to enter, which risk to take, which trade-off to accept &#8211; and the halo effect of having decisions blessed by brands known (or at least expected) to be &#8216;less wrong&#8217; than others. Consulting is, as Bek puts it: &#8220;mostly judgement&#8221;. But that exposes the opportunity. If judgement can be decomposed into forecasts and assumptions, then the core of consulting becomes legible and, crucially, improvable. Not a one-off deck, but a continuously updated model with feedback on forecasting accuracy and calibration, and a record of which recommendations actually move outcomes, whose judgement to trust the most. Instead of giving McKinsey your watch so they can tell you the time, you have a proper scoring systems and a truth engine that surfaces the analysis and forecasts of those in your organisation with the best judgement, free of human psychological biases, the distorting influence of misaligned incentives and the stultifying effect of suboptimal processes. Do this and you save the outsourcing costs to consultants, get better, more defensible results, continuously, rather than in set-piece briefings, and save time in all those interviews you and your staff otherwise have to undertake with a consultant&#8217;s team &#8211; a more labourious and less effective form of elicitation. Cassi&#8217;s objectively scored forecasts and assumptions in our arena for human and machine judgement is a much bigger threat to McKinsey than Dorsey&#8217;s company and customer-world models.</p><p style="text-align: justify;">Bek says that &#8220;knowing what to build next&#8221; sits at the centre of value. But this can&#8217;t be done optimally without rigorous forecasting. His article needs to push further and articulate what judgement, what &#8216;knowing what to build next&#8217;, means in practice. We argue that judgemental forecasting requires taking a structured, deliberate approach to uncertainty: carefully defining the desired outcome &#8211; what you would actually see in the world if you were to achieve what you want, forecasting how likely this is to be achieved, identifying indicators - predicting what would maximally increase the likelihood of success, and what would maximally decrease the probability of your desired outcome, tracking the changing probabilities and identifying new indicators as conditions change &#8211; continually minimising uncertainty - mapping the optimal path to success. As our tagline puts it: Everything is Prediction. The firms that solve this will automate work and capture the margin pool currently reserved for advisory. Copilot and autopilot.</p><p style="text-align: justify;">Automating judgement is therefore the gap in the stack. It is what allows Dorsey&#8217;s organisations to move quickly without drifting, and Bek&#8217;s autopilots to take on the hardest parts of the job. Without it, speed amplifies error and software remains confined to heuristics, never fully replacing the service it imitates.</p><p style="text-align: justify;"><strong>Addendum</strong></p><p style="text-align: justify;">Bek&#8217;s essay also has implications for business and the wider economy that he does not fully explore. We couldn&#8217;t make that discussion fit in the above argument, so we try our readers patience with this addendum. The previous argument is self-contained and standalone &#8211; this rabbit-hole can be safely ignored, unless you are interested in wider exploration of the future of work.</p><p style="text-align: justify;">Generally, in business, we teach that a focus on core offerings and specific customer need is the key to success. While vertical integration (owning all functions from e.g. farm to fork) and conglomeration (diversification into unrelated industries) come and go in business strategy, maintaining a narrow focus on who you serve and what you do is, few would demur, key to success.</p><p style="text-align: justify;">In contrast, Bek&#8217;s article implies a much more generalised approach. In the extreme reading of his argument, a small number of AI companies:</p><blockquote><p>a. become 100% vertically integrated, disintermediating everywhere, taking over the functions of their customers, suppliers, service providers, distribution, sales partners etc.;</p><p>b. become maximally horizontally integrated, merging, acquiring or outcompeting all companies in any given sector except those companies that also switch to the fully-automated AI-enabled model Bek describes;</p><p>c. become conglomerated at cross-industry breadth that would make Japan&#8217;s Trading Companies, or those of the mercantilist, colonial era like the East India or Hudson Bay Company, blush.</p></blockquote><p style="text-align: justify;">This is in some ways not a new idea &#8211; fully-automated luxury communism, or monopolistic or oligopolistic outcomes of the AI revolution, a single or small number of companies doing all economic work - are frequently discussed as possible outcomes from the AI revolution. What is interesting about Bek&#8217;s essay is that he begins to describe how this might happen &#8211; indeed is recommending that AI-companies start thinking on these lines.</p><p style="text-align: justify;">However, there seem to us some gaps in Bek&#8217;s logic.</p><p style="text-align: justify;">Bek says that you should start selling outcomes. He cites AI-law firm Crosby as one example, noting they sell legal documents direct to companies that need them, rather than to the lawyers that once would have written them. In another, WithCoverage disintermediates insurance brokers by selling policies direct to companies needing insurance.</p><p style="text-align: justify;">Bek says that as companies like Crosby learn more and more about the customer, they will replace the current service providers &#8211; in this case large law firms &#8211; entirely. In doing so Bek assumes that AI systems will accumulate proprietary data about the domains they operate in them (depth) and this will create long-term narrow advantage. But this requires Bek to ignore the fact that the success of LLMs has largely been based on overall scaling dominating narrow specialisation. Why should you buy an application that writes NDAs, instead of one that writes documents, including NDAs? If an agent can exercise judgement, it should not need the focus and scaffolding that businesses currently provide when they pick a specialised, targeted application and seek to outcompete in a given niche.</p><p style="text-align: justify;">Even if Bek is right and these companies like Crosby do dominate narrowly via their access to industry or company specific data, the dominant AI companies should, on Bek&#8217;s logic, just buy a single competitive player in any of the fields Bek lists and then incorporate the data they have into their training of the next generation of AI models.</p><p style="text-align: justify;">Bek ignores the &#8216;general&#8217; in AGI. AGI&#8217;s definition is often contested, but however one defines it, it should include the general ability to complete legal and insurance documents, the two primary examples Bek cites.</p><p style="text-align: justify;">In contrast, our contention would be that current models don&#8217;t optimise for judgement. If AGI means anything, then performing specific intelligence-based tasks is what it has to mean. So there must only be the judgement layer(s) left to solve. We disagree with Bek on the ability of current models to provide this, but are firmly in agreement that only judgement-based systems will be commercially relevant after AGI. Hence Cassi.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.cassi-ai.com/subscribe?"><span>Subscribe now</span></a></p><p style="text-align: center;">Or visit us at www.cassi-ai.com</p><div><hr></div><p><sup>[1]</sup> See March 2026 Stamford study, <em><a href="https://www.science.org/doi/10.1126/science.aec8352">Sycophantic AI decreases prosocial intentions and promotes dependence</a></em>: &#8220;Cheng et al. measured the prevalence of social sycophancy across 11 leading large language models &#8230;. The model&#8217;s responses were nearly 50% more sycophantic than humans&#8217;, even when users engaged in unethical, illegal, or harmful behaviors. Users preferred and trusted sycophantic AI responses, incentivizing AI developers to preserve sycophancy despite the risks.&#8221;</p>]]></content:encoded></item><item><title><![CDATA[AI Prediction is Probability not Prophesy]]></title><description><![CDATA[The danger is neither Oracle nor algorithm, but unaccountable predictions.]]></description><link>https://blog.cassi-ai.com/p/ai-prediction-is-probability-not</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/ai-prediction-is-probability-not</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Thu, 28 May 2026 05:30:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!PJ51!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PJ51!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PJ51!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 424w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 848w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 1272w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PJ51!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png" width="1344" height="896" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:896,&quot;width&quot;:1344,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2320900,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/199522204?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PJ51!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 424w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 848w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 1272w, https://substackcdn.com/image/fetch/$s_!PJ51!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd35450de-6b6e-4d16-b58c-ee9bb28e1f13_1344x896.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">An AI-Pythia or Pythoness, the title of the high priestess of the Temple of Apollo at Delphi in Ancient Greece, who served as its oracle. Image via MidJourney.</figcaption></figure></div><p style="text-align: justify;">Writing in the Economist (21<sup>st</sup> April 2026) Professor Carissa V&#233;liz calls AI &#8216;<a href="https://www.economist.com/by-invitation/2026/04/21/ai-is-the-new-oracle-of-delphi-thats-bad-news">the new Oracle of Delphi</a>&#8217;, a &#8216;false prophet&#8217; whose predictions threaten justice, finance, insurance, medical and recruitment systems, human agency, and ultimately, freedom. She is right to call for debate. But her argument rests on a mistake about what prediction is. Oracular prophecy promises a pre-determined future. Prediction offers odds, probabilities, quantified uncertainty, and enables a calibrated response to it.  The dangers V&#233;liz identifies are real, but they arise when institutions convert forecasts into decisions without accountability - a problem of power, not prediction.</p><p style="text-align: justify;">We agree with Veliz when she says that better prediction confers competitive advantage. Similarly, that we must &#8220;accept that we don&#8217;t know what the future holds, and act accordingly&#8221;. Those of us applying the science of prediction are not claiming to know the future. Prediction is not like skipping forward in a book to find out what happens and then coming back to tell everyone with total certainty. Rather, prediction claims that some possible futures are more likely than others, and that it is useful to distinguish them.</p><p style="text-align: justify;">The Delphic oracle was powerful because its utterances were inscrutable, insulated from correction and uttered with priestly authority. A well-made prediction is the opposite. It is precise enough to be scored, corrected and challenged, its authority rests on demonstrated accuracy and calibration, not faith.</p><p style="text-align: justify;">Nor is prediction alien to human judgment. It is part of human judgment. Every decision contains a forecast: if I do this, I expect that to follow. Courts, banks, insurers, doctors, employers, journalists and governments all do this already it&#8217;s just rare they make the forecast explicit.</p><p style="text-align: justify;">Unlike Oracles, or forecasts hidden in narrative, modern prediction, human and machine, is accountable. To say there is an 80% chance a company will hit its sales target, a 4% chance a borrower will default, or a 1% chance of a medical emergency is not to know the future. It is to state a degree of belief about it. Across many cases, such beliefs can be scored. When a forecaster says &#8220;80%&#8221; often enough, roughly eight in ten of those events should occur. If they do not, the forecaster is miscalibrated. We learn how much we should trust the predictor, while scoring prediction error allows a predicting agent, human or algorithmic, to learn and improve.</p><p style="text-align: justify;">V&#233;liz says that when we predict people&#8217;s futures &#8220;as if we were forecasting the weather&#8221;, we treat them as inert objects rather than agents. This is impossible to agree with. My predicting the result of the next Newcastle match does not treat the players as inert: I am predicting the result of their skill and agency, and their opponents&#8217;. If my model and information are more accurate, I will tend to make better guesses; and if I am a gambler or coach, those guesses can be used to make money or improve performance.</p><p style="text-align: justify;">The claim that predictions are closer to commands than descriptions is equally untenable. My prediction that Newcastle will win or lose is not a command; if it is, the players are stubbornly able to ignore it. When someone is refused a loan because an algorithm considers them too likely to default, it is not the algorithm that refuses the loan. It is the lender, using the algorithm as evidence. If the algorithm is sub-optimal, then there is free money on the ground: others can and should lend to the rejected creditworthy borrowers. </p><p style="text-align: justify;">V&#233;liz later claims that AI may achieve &#8220;accuracy&#8221; by creating the reality it predicts. But if a prediction can create a reality, that is a power of the predicting agent, not of the prediction. An abuse of power, not a forecasting problem. And the market examples are against her. If a person is deemed unemployable by AI but you, working at a rival, correctly predict they are employable, you win by hiring them. Her argument works only if institutions all use sufficiently similar systems, or if excluded people have no realistic route to generate contrary evidence. That is a problem that exists today, and long predates AI; a problem of error, monopoly or institutional design &#8211; of power - not prediction.</p><p style="text-align: justify;">The same applies to her credit and health examples. If any lender refuses a loan because they predict a bleak financial future, the same objection arises; the AI is not doing the moral work. Any lender must predict repayment unless indifferent to being repaid. Likewise, a prediction of future disease may raise insurance costs, but the same would apply to a doctor&#8217;s warning about hereditary risk. The issue is regulatory: if we want flat, information-light risk-pooling, we can legislate for it. Better prediction is not necessarily or merely a weapon used by one party against another: it can reduce mispriced lending, protect likely defaulters from ruinous debt, enable earlier medical intervention, and help insurers estimate aggregate risk, even if we restrict individualised pricing.</p><p style="text-align: justify;">V&#233;liz asks whether, in justice, we should prefer transparent and contestable criteria to statistical pattern-matching. The only answer is: whichever decision procedure best prevents crime, acquits the innocent and convicts the guilty, subject to the chosen burden of proof. Courts already infer from incomplete evidence. Witnesses, juries, experts and judges are all predictive agents. Rather than prediction versus dignity, the issue is more mundane: which predictive procedure works best, with which thresholds, feedback and safeguards.</p><p style="text-align: justify;">Increasingly accurate machine forecasts raise questions as to how our institutions adapt. If every decision is an if/then forecast of the future, and machines can out-predict us then they can significantly augment human decision-making. Every article in the Economist and other publications contain multiple predictions that X caused Y, that Z consequences will likely follow. Veliz&#8217;s article is a case in point, predicting that AI prediction will threaten justice, finance, insurance, medical and recruitment systems, human agency, and freedom.</p><p style="text-align: justify;">Most published forecasts are not easily legible &#8211; they are implicit, unstated, hidden in narrative. The increasing application of machine forecasting is likely to expose this. Similarly, companies and executives that keep the predictions inherent in their decisions hidden will be outcompeted by others that adopt the algorithms. Journalists that hide their forecasts will cease to be trusted, Government policies, political manifestos - all can be subjected to the same discipline. AI is the new Oracle of Delphi only in the sense that it will be considered a negligent ill-omen not to consult it.</p><p style="text-align: justify;">There may still be cases where we might prefer a human only or narrative forecast, even if a numerical or machine alternative performs better on a given metric(s). But this is a deliberate and consequential trade-off. If statistical pattern-matching produces consistently better predictions than more &#8216;human&#8217; heuristics, then whatever is predicted is better done so by the model. That may be politically or aesthetically uncomfortable, but discomfort is not an argument. We should beware false oracles; we should also beware pretending that human beings are not already, necessarily, predicting all the time &#8211; in every decision we make.</p><p style="text-align: justify;"></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.cassi-ai.com/subscribe?"><span>Subscribe now</span></a></p><p style="text-align: center;"></p><p style="text-align: center;">And visit us at www.cassi-ai.com</p>]]></content:encoded></item><item><title><![CDATA[Our AI Evidence to Parliament Suggests a More Honest Future for UK Politics]]></title><description><![CDATA[Cassi's forecasts on the likelihood of the UK involvement in a major conflict to 2036 were about more than defence and security.]]></description><link>https://blog.cassi-ai.com/p/our-ai-evidence-to-parliament-suggests</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/our-ai-evidence-to-parliament-suggests</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Wed, 13 May 2026 05:31:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lgNN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lgNN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lgNN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lgNN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:224832,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/197358067?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lgNN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!lgNN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34b45b18-e0a1-477e-8e8d-04df073f38b6_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Midjourney Generated Image: Reimagining TV &#8220;Question Time&#8221; with live probabilities of the likelihood that the policies advocated for by commentators, politicians and public achieve what is claimed.</figcaption></figure></div><p style="text-align: justify;">In 1946 George Orwell wrote <em>&#8216;Political language&#8230;is designed to make lies sound truthful and murder respectable, and to give an appearance of solidity to pure wind</em>.&#8217; Today we have consolation for Orwell: AI &#8216;truth engines&#8217; are ushering in a more honest era for democratic politics.</p><p style="text-align: justify;">Recently, the world&#8217;s <a href="https://www.forecastbench.org/">leading</a> AI forecaster<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>, developed by our company Cassi, submitted <a href="https://committees.parliament.uk/writtenevidence/164095/default/">evidence</a> to Parliament&#8217;s Defence Committee which put the chance of Britain being drawn into a major war before 2036 at 20%. It forecasts this could be halved by sustaining defence spending at 3% of GDP for five years. But it predicts there&#8217;s a 70% chance it doesn&#8217;t reach that level until a major conflict has already begun.</p><p style="text-align: justify;">AI forecasts like these cut through rhetoric. The AI has no agenda or career to protect. Its outputs are optimised only for accuracy. It forecasts at near-superhuman levels, well above what most humans can achieve &#8211; and AI is <a href="https://www.forecastbench.org/explore/">projected</a> to surpass human forecasting by July 2027.</p><p style="text-align: justify;">The AI cannot be accused of bias and dismissed. Committee evidence, policy proposals, expert analysis are all vulnerable to such dismissal. Often fairly: where you sit can determine where you stand. Most often, politicians do not set out to deceive. But humans are so skilled at self-deception, separating motivated reasoning from honest pursuit of truth is difficult. Policy makers can escape criticism by questioning its source.</p><p style="text-align: justify;">British politics is full of vague claims and substance-free &#8216;announceables&#8217;. We should acknowledge: the public communication of complex problems and solutions is difficult. Cut-through needs soundbites. The risk is that the simplification in communication masks a lack of detailed thinking, of wrestling with the complex (probabilistic) trade-offs needed.</p><p style="text-align: justify;">Currently, the competent are hurt by the prevailing imprecision, which is so ingrained in our political culture as to be universal - they will prosper when we are freed from it. When a minister promises vaguely to &#8220;secure borders&#8221;, &#8220;grow the economy&#8221;, or calls to &#8220;level up the country&#8221;, no-one can prove failure. No-one knew if the plan included the necessary, relevant considerations for it to work. But also: no-one knew how likely success was. As the Cheshire Cat puts it in Alice in Wonderland: if you don&#8217;t know where you want to get to, it doesn&#8217;t much matter which way you go. We might add: if you never said where you were going, opponents can always insist you arrived somewhere else. Evasiveness helps you escape electorally damaging criticism, but also denies you electorally enhancing credit for success.</p><p style="text-align: justify;">The AI applies a discipline available to all of us: every claim must become a testable forecast. Not &#8220;Level Up&#8221; but achieve X, by Y date, which has succeeded if Z is true. Strategy becomes probabilistic: &#8220;there is a 35% chance we will do A, and if we do A, it becomes 50% more likely we will achieve B&#8221;. The language tightens and the room for evasion narrows.</p><p style="text-align: justify;">Take a familiar pledge: cutting NHS waiting lists within two years. Under this system, that promise is translated into a measurable question: &#8220;Will the NHS elective waiting list fall below 6 million by March 2028, as reported by NHS England?&#8221; The model would assign a probability, say 40%, and identify the drivers: staffing levels, seasonal demand, social care capacity, industrial action. If the plan ignores one of those drivers, the gap is visible from the start. Voters can see the promise, and the conditions required for it to be honoured.</p><p style="text-align: justify;">Trust in politicians is at record lows, partly because incentives are misaligned. A party can simultaneously promise tax cuts, higher spending and a balanced budget to seek election. Contradictions are buried in the detail. A forecasting approach forces coherence and clarity. If the numbers do not add up, the model will show the chance of delivering all three outcomes approaches zero. The manifesto becomes legible.</p><p style="text-align: justify;">More honest debate results. A politics that tolerates vagueness invites distrust. A politics that demands measurable claims begins to reward honesty. When every major policy is tied to a public forecast, the costs of self-deception and deceit rise.</p><p style="text-align: justify;">You might object that the existence of systems that prospectively hold Government to account will make officials and politicians even less likely to be specific &#8211; to cling tighter to empty rhetoric and vague promises. But the discipline still applies to vague claims. A general but undefined promise to &#8216;Level-Up&#8217; can be charitably interpreted and forecast. Now politicians are on &#8216;Question Time&#8217; arguing about definitions, probabilities &#8211; how likely their pledges are to be achieved - and why the world&#8217;s best AI forecaster is wrong and they are right.</p><p style="text-align: justify;">There are limits. Cassi is not omniscient. The numbers submitted to the Defence Committee may prove wrong. But they can be tested. The aim is to estimate your degree of uncertainty in your forecast and be less wrong over time &#8211; you learn to trust those, human or AI, most consistently right. Human input and perspectives remain important &#8211; perhaps you know something the AI can&#8217;t, or doesn&#8217;t, so can defend your forecast against the model&#8217;s. Instead of arguing over rhetorical claims, talking past each other, politicians argue over probabilities and evidence. Instead of defending a  claim with vague rhetoric, they defend it with their record of trustworthy policy forecasts.</p><p style="text-align: justify;">In a system that prefers assertion to accountability, AI truth engines turn narrative into numbers. They may reverse the decline of our democratic politics more than Orwell would have dared hope possible.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Interested hearing more about, or supporting what we do? Consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p style="text-align: justify;"><br>The truth might set you free too - come find out! Visit us at https://cassi-ai.com and register your interest in becoming a customer to learn more.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>At the time the forecasts were both submitted to Parliament, and when published in the Times ~two months later, Cassi was the leading AI Forecaster on ForecastBench and had been since early January 2026. On Friday 8 May Cassi fell behind two new entrants from Deepmind. As at 16:27 12 May 2026, Cassi was fourth, overtaking one Deepmind model remaining marginally behind the other, and for the first time, overtaken by xAI&#8217;s entry - with which we have been tied for the lead several times since January, but never previously behind. ForecastBench is a live benchmark, we may have moved up or down the league table by the time you read this.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Forecasting & Finance]]></title><description><![CDATA[Cassi in conversation with James Aitken of Aitken Advisors.]]></description><link>https://blog.cassi-ai.com/p/forecasting-and-finance</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/forecasting-and-finance</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 24 Apr 2026 17:15:25 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/195361745/6eec2e2e4b01430c09e4713a57a120de.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Cassi welcomes James Aitken, of Aitken Advisors. For over twenty five years, James  has advised many of the world&#8217;s most powerful, most sophisticated &amp; most successful investors. He brings deep market expertise, the wisdom derived from both personal experience and deep reading and research. </p><p>Here Dr. Keith Dear, CEO Cassi and James confront the forces that corrupt good decision-making - bias, blind spots, groupthink, and the relentless pace of change - and ask what it truly takes to think clearly under uncertainty. They explore how James thinks about decision-making at Aitken Advisors, what markets can teach us about war in the Middle East today, and about the future more broadly. As James puts &#8220;if you listen&#8221;.</p>]]></content:encoded></item><item><title><![CDATA[Organoid Computing: AGI Convergence]]></title><description><![CDATA[Part 2: AI is leaving us behind. We need a portfolio of bets.]]></description><link>https://blog.cassi-ai.com/p/organoid-computing-agi-convergence</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/organoid-computing-agi-convergence</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Sun, 08 Mar 2026 16:01:46 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!N0_a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!N0_a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!N0_a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 424w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 848w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 1272w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!N0_a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:620983,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!N0_a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 424w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 848w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 1272w, https://substackcdn.com/image/fetch/$s_!N0_a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1841e5-ce9d-4013-b630-2652635b8f24_2048x2048.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">Last week, 200,000 &#8216;brain cells in a dish&#8217; learned to play the first-person shoot-em-up game Doom, and you can now <a href="https://corticallabs.com/">access</a> these cells via the web to run programmes on &#8211; via the &#8216;cortical cloud&#8217; or &#8216;CL1&#8217;. </p><p style="text-align: justify;">You code this &#8216;neural computer&#8217; in Python. Talking about it as Cloud you code in Python makes the almost supernatural seem mundane. If you&#8217;re pressed for time, you should probably stop reading and just watch this 6-minute video produced by the team behind it at Cortical Labs. I challenge you not to be awe-struck:</p><div id="youtube2-yRV8fSw6HaE" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;yRV8fSw6HaE&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/yRV8fSw6HaE?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p style="text-align: justify;">Can you imagine watching or reading this if we hadn&#8217;t lived through the past 13 years since AlphaGoZero (which, even now, if you go back and read the <a href="https://en.wikipedia.org/wiki/AlphaGo_Zero">Wiki summary</a> is still dazzling) where AI breakthroughs are so routine that they rarely make headlines, and have become the background noise to life?</p><p style="text-align: justify;">Also this week, Eonsys claim to have copied a biological brain, neuron-by-neuron, synapse-by-synapse, and released this video showing how they had gone from mapping a fruit-fly brain in near-perfect detail in 2024, to embodying that brain in simulated fly that, err&#8230; acts exactly like a fly. A whole brain emulation. As they put it &#8216;a qualitative threshold, not an incremental one&#8217;:</p><div id="youtube2-e21OUXPlnhk" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;e21OUXPlnhk&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/e21OUXPlnhk?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p style="text-align: justify;">They claim they are doing this now for mice, with humans explicitly on the list. Brain uploads. If you think about this for a minute, it means that from the fly&#8217;s perspective, with its brain fully uploaded it is now living in a simulation.</p><p style="text-align: justify;">Its incredible. Fully sci-fi. I&#8217;ve been talking about the importance of biocompute, or organoid computing for a while. Whole brain emulation seemed too sci-fi, even for me. But here we are.</p><p style="text-align: justify;">At Fujitsu, we asked AI-expert and friend  Professor Ken Payne to gather luminaries from across the UK compute ecosystem and write up a report on &#8216;future compute&#8217;. At the time, it seemed to me that both smart companies and smart countries would be investing in what&#8217;s next. Good strategy involves, as Michael Porter tells us &#8216;doing things differently or doing different things&#8217;. But this involves risk, and the tendency is to do the same things that everyone else is &#8211; or at best to focus your research adjacent to what is already being done. Less chance of embarrassment. This, and traditional sales and reporting cycles, is why, as Jeff Bezos <a href="https://www.wired.com/story/ceo-of-the-internet/">told</a> Wired magazine in 2011:</p><p style="text-align: center;">&#8220;If everything you do needs to work on a three-year time horizon, then you&#8217;re competing against a lot of people, but if you&#8217;re willing to invest on a seven-year time horizon, you&#8217;re now competing against a fraction of those people, because very few companies are willing to do that.&#8221;</p><p style="text-align: justify;">Giving <a href="https://committees.parliament.uk/work/9586/oneoff-session-on-the-future-of-warfare/">evidence</a> to Parliament on &#8216;the Future of War&#8217; a few weeks ago, I talked about how a smart UK AI Strategy would invest across a portfolio of alternative architectures and approaches, not just LLMs running on Silicon, observing that this is what China seemed to be doing when I visited with the UN last year. Jesse Norman MP asked me if I had written anything on this &#8211; I said yes, and there are references to organoid in this <a href="https://www.scsp.ai/wp-content/uploads/2024/12/DPS-Exploring-the-Future-Operating-Environment-2035-50.pdf">piece</a> for the Special Competitive Studies Programme on the Future Operating Environment. But in fact I haven&#8217;t tackled the subject directly. This blog aims to do that sparked by the remarkable announcements of the past week.</p><h2 style="text-align: justify;"><strong>Winner-takes-All</strong></h2><p style="text-align: justify;">My belief is that the AI market is likely a winner-takes-all market. Everything we care about in business and international relations (and much more) is the product of intelligence: effective diplomacy, strategy, operational plans, tactics, decisions, products, services, marketing, and in warfare, weapons &#8211; all are the products of intelligence. There will be no upper limit on how much intelligence companies and countries seek in trying to gain an edge over others. It is for that reason that the US Presidential Memo on AI for National Security under President Biden contained clauses allowing for the seizure of AGI models from the private sector (see <a href="https://blog.cassi-ai.com/p/faustian-bargain-20-life-under-the">Faustian Bargain</a> from the blog in Oct 2024) something close to nationalisation. The argument that AGI will likely be nationalised was <a href="https://www.instagram.com/reels/DVkXKw0D9wI/">made</a> explicitly and powerfully this week by Palantir&#8217;s Alex Karp:</p><p style="text-align: justify;">&#8220;If Silicon Valley believes we are going to take away everyone&#8217;s white-collar job &#8230; and you&#8217;re gonna screw the military&#8212;if you don&#8217;t think that&#8217;s gonna lead to nationalization of our technology, you&#8217;re retarded&#8230;&#8221;</p><p style="text-align: justify;">If you accept the &#8216;winner-takes-all&#8217; in AI premise, you will see that the UK (and all nations) can&#8217;t afford to come second. In fact, unless you think this is certainly true or certainly false &#8211; and how could you? - it isn&#8217;t really about a binary acceptance or rejection. It should be a probabilistic judgement, with the risk that it is winner-takes-all hedged with proportionate investment. It won&#8217;t be, of course, but we should still try to make the argument in the hope it nudges the needle in politics and public discourse a little bit towards taking this seriously, and then we have to hope like hell that we get lucky and a small movement of the needle proves enough.</p><h2 style="text-align: justify;"><strong>LLM Progress: The Need for a Portfolio of Bets</strong></h2><p style="text-align: justify;">It is likely, in my view, that LLM scaling continues to deliver unprecedented, and increasingly superhuman performance in evermore domains.</p><p style="text-align: justify;">AI capabilities continue to speed up in their development, not slow down:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9Ljm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9Ljm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 424w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 848w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 1272w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9Ljm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic" width="1456" height="1077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1077,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92475,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9Ljm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 424w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 848w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 1272w, https://substackcdn.com/image/fetch/$s_!9Ljm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b614294-fed7-450c-ad92-61210089c943_2400x1775.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">We continue to under-estimate the speed of progress Here experts &amp; superforecasters predicted 14% accuracy in coding tasks in 2026, 33% accuracy by end 2030. GPT 5.2 has already exceeded 33%:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!N4QP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!N4QP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 424w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 848w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!N4QP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg" width="1125" height="880" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:880,&quot;width&quot;:1125,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A graph showing the number of numbers and the number of people\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A graph showing the number of numbers and the number of people

AI-generated content may be incorrect." title="A graph showing the number of numbers and the number of people

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!N4QP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 424w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 848w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!N4QP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1332ec7-10da-4275-a2da-57e2ce90fbfd_1125x880.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">GPT 5.4, released 5<sup>th</sup>March, 3-days before this blog was written, now <a href="https://openai.com/index/introducing-gpt-5-4/">shows</a> AI matching or outperforming humans on 83% of economically valuable tasks on the GDPval benchmark. And now on ARC AGI-2, the benchmark designed to test models on tasks that weren&#8217;t amenable to brute force, Gemini is scoring at 77.1%.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/daveross858/status/2030435078271025243?s=20" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c8Ir!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 424w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 848w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 1272w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c8Ir!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic" width="1198" height="1548" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1548,&quot;width&quot;:1198,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:177735,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:&quot;https://x.com/daveross858/status/2030435078271025243?s=20&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c8Ir!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 424w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 848w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 1272w, https://substackcdn.com/image/fetch/$s_!c8Ir!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc3e24f-b7b7-4243-8fa2-c882e19ffc6d_1198x1548.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">So naturally, we&#8217;re now on version-3, <a href="https://arcprize.org/arc-agi">ARC-AGI-3</a> (as noted previously, necessary, but it risks obscuring the progress, the continuous falsification of claims made that &#8216;AI will never&#8230;&#8217;).</p><p style="text-align: justify;">And the amount of compute coming online dwarfs that available to date, doubling every seven months (I had to go back and read that again):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ptjg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ptjg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 424w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 848w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 1272w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ptjg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic" width="1456" height="1077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1077,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:126054,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ptjg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 424w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 848w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 1272w, https://substackcdn.com/image/fetch/$s_!ptjg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe33a23f4-5138-4907-9243-024055a41ddb_2400x1775.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">Suggesting that progress is only going to increase: speed-up, not slow-down.</p><p style="text-align: justify;">Last year, Epoch reported that three-quarters of that compute power was in the United States.</p><p style="text-align: justify;">My conclusion - the UK is not going to compete by playing around the edges at this. Our planning laws and low-public and private investment in infrastructure, and a massive and <a href="https://www.ey.com/en_uk/newsroom/2026/01/defence-spending-uk-infrastructure-gap">growing</a> say-do gap between infrastructure investments that are planned (or at least announced), and what money has actually been found for, plus our sky-high energy costs mean that we are not meaningfully in the race as a pure scaling bet. We won&#8217;t have the compute needed.</p><p style="text-align: justify;">We are going to have to do things differently, and do different things.</p><p style="text-align: justify;">I think most agree. But if that&#8217;s true our portfolio bets look under-weighted.</p><h2 style="text-align: justify;"><strong>Positives &amp; Progress</strong></h2><p style="text-align: justify;">There are positives. David Silver leaving Deepmind and <a href="https://www.ft.com/content/dffe72d0-4064-4412-8ebc-50198a30d40e">raising</a> a $1Bn Seed Round for his London-based start-up, Ineffable Intelligence, which will presumably work on his continuous learning models, as described in the paper he coauthored with Prof Rich Sutton last year &#8216;<a href="https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf">Welcome to the Era of Experience&#8217;</a> is one such. At Cassi, we have our plans too. No doubt others do also.</p><p style="text-align: justify;">There are similar postives in the AI Action Plan, a particularly energetic Minister for AI in Kanishka Narayan MP, willing to work with former PM Rishi Sunak and others across party lines. And more and more officials, Ministers and MPs coming to see that this has to be taken seriously. Private sector talent joining Government to try to push us on in AI (to all of whom: I salute you - it is not easy, but is vital).</p><p style="text-align: justify;">There is an increasing amount of optimism around London&#8217;s potential in AI, this from AI commentator Alex Banks at Signal AI (Substack <a href="https://thesignal.substack.com/subscribe">here</a>):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/thealexbanks/status/2029868950654222375?s=20" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1tjw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 424w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 848w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 1272w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1tjw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic" width="1196" height="1120" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1120,&quot;width&quot;:1196,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:135160,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:&quot;https://x.com/thealexbanks/status/2029868950654222375?s=20&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1tjw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 424w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 848w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 1272w, https://substackcdn.com/image/fetch/$s_!1tjw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F316c2cd9-0a1f-435d-9aac-208a642b9a55_1196x1120.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">And this, also this week (title says London, but in the article Manchester is celebrated for its potential and progress too):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/GRITCULT/status/2029574343596900466" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RS6e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 424w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 848w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 1272w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RS6e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic" width="1204" height="1270" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1270,&quot;width&quot;:1204,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:215613,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:&quot;https://x.com/GRITCULT/status/2029574343596900466&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RS6e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 424w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 848w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 1272w, https://substackcdn.com/image/fetch/$s_!RS6e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e2beebc-1717-4aaa-969c-0070f12c001a_1204x1270.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2 style="text-align: justify;">The Portfolio Case: Organoid and Wetware</h2><p style="text-align: justify;">Progress should be celebrated, but there is no room for complacency. We need a focus on the alternative architectures that sparked this article. It seems to me these remain under-estimated &#8211; both in their likely speed of advance, and their ability to give us leap-ahead advantage. Cortical&#8217;s &#8216;Intelligence-in-a-Dish&#8217; playing Doom makes the case much stronger and harder to ignore.</p><p style="text-align: justify;">Cortical are not alone. FinalSpark <a href="https://finalspark.com/neuroplatform/">offer</a> 24/7 remote access to their brain organoid. Two years ago, &#8216;Brainoware&#8217;, developed by scientists in the US, <a href="https://www.labiotech.eu/trends-news/brainoware-brain-organoid-ai/">grew</a> a biological organ that resembles the brain, and proved it could undertake basic speech recognition tasks (Nature write-up <a href="https://www.nature.com/articles/s41928-023-01069-w">here</a>).</p><p style="text-align: justify;">The lesson here is not that biocompute is a sure thing. It is that what looked like a curiosity (2020), became a research domain &#8216;<a href="https://www.frontiersin.org/journals/science/articles/10.3389/fsci.2023.1017235/full">Organoid Intelligence</a>&#8217; (2023), and has now become an engineering domain (2025).</p><p style="text-align: justify;">Cortical Labs going from Pong (2021) to Doom (2025) has the same feel, to my mind, as DeepMind going from <a href="https://arxiv.org/abs/1312.5602">Atari games</a> (2012) to <a href="https://www.nature.com/articles/nature16961">Go</a> (2016) to <a href="https://deepmind.google/blog/alphastar-mastering-the-real-time-strategy-game-starcraft-ii/">AlphaStar</a> for StarCraft II (2019) (game theory, imperfect information, long-term planning, real time, large action space - at which point the implications for defence were clear) and then beyond: a line of work treated as a parlour trick for far too long. I remember, while working on the Single Synthetic Environment at Improbable for the MOD, being dismissed back in the headquarters for suggesting progress in games might matter. Even in 2020, in Downing Street, people were critiquing Deepmind for only solving games, and nothing that mattered. I think too many are making this same mistake again with wetware and wider bets.</p><p style="text-align: justify;">Alex Wissner-Gross&#8217;s &#8216;first multi-behavior brain upload&#8217; sits slightly outside the wetware discussion because the computation is still running on conventional hardware. But strategically it belongs in the same portfolio. Again, the important thing is the speed at which the pieces are coming together. In 2024 researchers published a connectome-based model of the fruit fly <em>Drosophila</em> brain built from more than 125,000 neurons and 50 million synaptic connections; in parallel, other teams built embodied fly simulators in MuJoCo and NeuroMechFly v2. Wissner-Gross and Eonsys now claim to have combined those strands into a connectome-derived controller driving a perfectly simulated brain that runs in a simulation without knowing it is in a simulation. The direction of travel is as alarming as it is remarkable: the frontier will likely move through architectures copied from nervous systems and trained in closed sensorimotor loops &#8211; not just LLM scaling. Our sovereign AI strategy should treat that as part of the same portfolio of bets as biocompute, organoids and neuromorphic systems, and start thinking of it now as an engineering project for commericalisation rather than a research project.</p><p style="text-align: justify;">If you think, as I do, that there is a better-than-even chance frontier AI becomes a winner-takes-all market, Britain cannot define sovereignty as access to other people&#8217;s models, chips and clouds &#8211; we will be forever offered access to those just behind the frontier &#8211; unless we have our own edge, a frontier where we push further, and something to offer access to in exchange. Current UK policy still leans heavily towards the incumbent stack: up to &#163;2 billion for public compute by 2030, the lion&#8217;s share of funding. While the Compute Roadmap explicitly makes &#8220;new computing paradigms&#8221; a sovereign AI priority and says <a href="https://www.gov.uk/government/publications/ai-research-resource/airr-advanced-supercomputers-for-the-uk">AI Research Resource</a> and AI Growth Zones should provide testbeds and routes to scale for British firms. But that insight still sits at the edge of policy, not at the centre, and progress is tectonically slow.</p><p style="text-align: justify;">Progress in technological advancement is also likely to be faster this time because the existing stack will help invent its successors. DeepMind&#8217;s AlphaChip is already accelerating chip layouts. GNoME has surfaced millions of candidate materials. The same pattern will compress search across quantum computing, photonics, <a href="https://www.nature.com/articles/s41586-025-08733-5">memristors</a>, biological interfaces and lab automation &#8211; I&#8217;ve written about automated R&amp;D, increasingly a current reality rather than a science fiction future. But I was still surprised to see Andrej Karpathy, drop an OpenSource Github repo that enables &#8220;&#8230;your agents to make the fastest research progress indefinitely and without any of your own involvement.&#8221; </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/karpathy/status/2030371219518931079" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YAoD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 424w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 848w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 1272w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YAoD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png" width="900" height="1564" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1564,&quot;width&quot;:900,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:497674,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/karpathy/status/2030371219518931079&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YAoD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 424w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 848w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 1272w, https://substackcdn.com/image/fetch/$s_!YAoD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73a0a11e-86a2-4655-b1ae-3e41f6d500ea_900x1564.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">The key point is that AI-for-science compresses cycles. The path from weird demo to engineering platform is likely to be shorter than many assume. Our slow progress is becoming more and more of a problem.</p><p style="text-align: justify;">ARIA&#8217;s Nature Computes Better programme is a rare execption, and matters to Britain. In Downing Street, I was often told, correctly, that the UK has real strength in synthetic biology and should double down on it rather than try to outspend deeper-pocketed American AI firms on their own ground. Synthetic biology would be our edge. That remains the right instinct. The government&#8217;s National Vision for Engineering Biology put &#163;2 billion <a href="https://www.gov.uk/government/news/government-publishes-2-billion-vision-for-engineering-biology-to-revolutionise-medicine-food-and-environmental-protection">behind</a> the sector. UKRI now classifies engineering biology as a strategic priority. UK Chief Scientific Advisor Dame Angela MacLean&#8217;s 2025 report on <a href="https://www.gov.uk/government/publications/engineering-biology-aspirations-report/engineering-biology-aspirations">engineering biology</a> makes the economic case for engineering biology well, but it is a compute free zone. In it AI is something that might help research in biology, nothing more. I couldn&#8217;t find a single .gov document that talks about organoid computing at all (if you can, post a link in the comments). Yet biology and AI are converging. A sovereign AI strategy that ignores biocompute is ignoring one of the few domains where Britain may actually have comparative advantage. But today, at least insofar as I am aware, none of the companies working on this are in the UK.</p><p style="text-align: justify;">ARIA deserves real credit for seeing the opportunity. <a href="https://aria.org.uk/opportunity-spaces/nature-computes-better/">Nature Computes Better</a> and its <a href="https://aria.org.uk/opportunity-spaces/nature-computes-better/scaling-compute/">Scaling Compute</a> programme are genuine portfolio bets: nearly &#163;100 million for Scaling Compute, within which &#163;50 million for the Scaling Inference Lab, and seed bets ranging from Cell Learning for Natural Computing and Embodied Cognition in Single Celled Organisms to probabilistic computing, optical computing and brain-inspired neuromorphic networks. That is the right shape of thinking. What is missing is scale and pull-through. ARIA can seed a field. It cannot, on its own, build a national position in one.</p><p style="text-align: justify;">Particularly now when, at the same time, the House of Lords Science &amp; Technology Committee has been <a href="https://committees.parliament.uk/committee/193/science-and-technology-committee/news/204667/uk-must-turbocharge-its-innovation-policy-to-harness-engineering-biology-say-peers/">warning</a> (this time last year) that the UK risked squandering its advantage in engineering biology, while in November, a <a href="https://institute.global/insights/tech-and-digitalisation/a-new-national-purpose-a-uk-quantum-strategy-for-sovereignty-and-scale">collaborative paper</a> from the Tony Blair Institute, sponsored by Tony Blair and William Hague, warned that &#8216;&#8230;<em>the country risks failing to convert its leadership in quantum research into commercial scale and strategic value, which depends on quantum companies thriving and scaling at home&#8217;</em>. Our portfolio bets seen to be reducing, at precisely the time progress is speeding up and we should be doubling down.</p><p style="text-align: justify;">The UK continues to under-invest in R&amp;D relative to other advanced economies. R&amp;D tax relief offsets this to some extent, and allows the headline figures for Government R&amp;D support to look better. But given how poorly the UK continues to do in commercialising research, as the chart<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> shows (red dotted line is the UK), it does not seem to be an offset that is effective:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://infogram.com/figure-1-the-uk-performance-in-rd-scale-up-and-industrial-value-added-1h1749wxmrgwq2z" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3doW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 424w, https://substackcdn.com/image/fetch/$s_!3doW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 848w, https://substackcdn.com/image/fetch/$s_!3doW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 1272w, https://substackcdn.com/image/fetch/$s_!3doW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3doW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic" width="772" height="842" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:842,&quot;width&quot;:772,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:72092,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:&quot;https://infogram.com/figure-1-the-uk-performance-in-rd-scale-up-and-industrial-value-added-1h1749wxmrgwq2z&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3doW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 424w, https://substackcdn.com/image/fetch/$s_!3doW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 848w, https://substackcdn.com/image/fetch/$s_!3doW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 1272w, https://substackcdn.com/image/fetch/$s_!3doW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30cdf267-d525-4647-bb90-9f8ce8031483_772x842.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">If the measure is not &#8216;relative to other countries&#8217; but relative to the risk/threat and opportunity AI poses, it is hard not to conclude that we are hoping for a miracle rather than planning to succeed in winning this race.</p><p style="text-align: justify;">To get a sense of that risk/threat and opportunity, take a look at this from Anthropic, this week, on the exposure of the labour market to automation:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.anthropic.com/research/labor-market-impacts" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pU42!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 424w, https://substackcdn.com/image/fetch/$s_!pU42!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 848w, https://substackcdn.com/image/fetch/$s_!pU42!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 1272w, https://substackcdn.com/image/fetch/$s_!pU42!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pU42!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png" width="1378" height="1378" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1378,&quot;width&quot;:1378,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A diagram of a graph\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.anthropic.com/research/labor-market-impacts&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A diagram of a graph

AI-generated content may be incorrect." title="A diagram of a graph

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!pU42!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 424w, https://substackcdn.com/image/fetch/$s_!pU42!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 848w, https://substackcdn.com/image/fetch/$s_!pU42!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 1272w, https://substackcdn.com/image/fetch/$s_!pU42!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F543e5394-a19c-4f68-b2bc-fdc2fa792738_1378x1378.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">I said recently (in our Cassi Cygentic launch video: <a href="https://youtu.be/Bg7mXrjlS64">here</a>) that the exponential was no longer theoretical: we are now starting to feel it. I think this is true and becoming more so. Hairs-on-back-of-neck-standing-up intense. I think it is summed up well here by data scientist and engineer at Anthropic, Gian Segato: </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/giansegato/status/2030432711526281498" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZXr0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 424w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 848w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 1272w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZXr0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png" width="1196" height="1300" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1300,&quot;width&quot;:1196,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:276924,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/giansegato/status/2030432711526281498&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.cassi-ai.com/i/190287950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZXr0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 424w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 848w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 1272w, https://substackcdn.com/image/fetch/$s_!ZXr0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e589263-f0cd-4b98-a606-9f9c4749cd9e_1196x1300.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">As a country, we can&#8217;t afford to not broaden and deepen our bets across the intelligence portfolio. Yes, there is progress and there are positives to celebrate. But the frontier moves ever faster, and we don&#8217;t. Incrementalism and announceables won&#8217;t be enough. Brains in a dish playing Doom are just the beginning.</p><p style="text-align: justify;">And if that doesn&#8217;t stop you in your tracks, I don&#8217;t know what will.<br></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.cassi-ai.com/subscribe?"><span>Subscribe now</span></a></p><p style="text-align: center;">Visit us at www.cassi-ai.com</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p style="text-align: justify;">Chart from 2024, but these metric have been getting worse since 2020, when we highlighted the issue in the 2021 Integrated Review &#8211; it seems unlikely to have improved &#8211; the chart is missing in the Cambridge Industrial Innovation Policy <a href="https://www.ciip.group.cam.ac.uk/innovation/investment-in-innovation-2025/">report</a> for 2025, hence the older one is used.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Humanity’s Last Exam for Forecasting?]]></title><description><![CDATA[Cassi's Performance on Forecast Bench & its Implications]]></description><link>https://blog.cassi-ai.com/p/humanitys-last-exam-for-forecasting</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/humanitys-last-exam-for-forecasting</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Mon, 23 Feb 2026 05:30:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!6bVo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6bVo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6bVo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 424w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 848w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 1272w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6bVo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic" width="864" height="1184" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1184,&quot;width&quot;:864,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:117797,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/188839431?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6bVo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 424w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 848w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 1272w, https://substackcdn.com/image/fetch/$s_!6bVo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5521365-9adc-4f2f-93a3-1f86b729210a_864x1184.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Humanity&#8217;s Last Exam for Forecasting: The very best humans still ahead (for now, just) in forecasting accuracy and calibration; most humans surpassed, and all humans outmatched for depth, breadth and volume in forecasting by AI.</figcaption></figure></div><p><strong>Introduction</strong></p><p>ForecastBench is a rolling benchmark that generates questions about the future, collects probabilistic forecasts from humans and models, then scores them as the underlying events resolve. It cannot be gamed. There is no teaching to the test (&#8216;benchmaxxing&#8217;) when the questions are about the future. It is as honest a test of AI capabilities as exists.</p><p>Cassi has been the top, or second placed AI in forecasting the future since our first results were released in January, trading places and tying with xAI for leadership. We are performing well above the average human forecaster, essentially at the level of a superforecaster. We are close to matching the weighted median superforecaster score &#8211; at which point we could defensibly claim to be superhuman at prediction. Here, we look a little more closely at the results and what they might mean.</p><p><strong>Everything Is Prediction</strong></p><p>Every board paper, box note, underwriting memo and investment thesis or policy paper contains a forecast. Sometimes it is explicit (a revenue number). Often it is vague (a risk described as &#8220;unlikely&#8221;). Other times it might be smuggled in as an unstated assumption (an argument that a given policy will achieve a given outcome, with not a probability in sight). Either way, the organisation is making an explicit or implicit prediction: if it does &#8216;this&#8217; rather than &#8216;that&#8217;, its revealed thinking is the odds and/or returns are better for &#8216;this&#8217; over &#8216;that&#8217;. This is a fully generalisable feature of all organisations</p><p>ForecastBench is one of the few places where bets are publicly audited. Within most organisations, they never are &#8211; the feedback loop for the stated or implied predictions on which decisions rest are never tested, nor revisited.</p><p>This post does three things. First, it summarises Cassi&#8217;s current position on ForecastBench. Second, it explains why the &#8220;human benchmark&#8221; people now cite is a very high bar, and why that matters to senior decision makers. Third, it translates tournament performance into practical value for finance, insurance and government.</p><h2><strong>1. What Cassi has achieved on ForecastBench</strong></h2><p>On the Tournament leaderboard, Cassi&#8217;s model entry (ensemble_2_crowdadj) currently scores 0.102 on ForecastBench&#8217;s difficulty-adjusted Brier metric. Today (20 Feb 26) that places it joint-second overall behind the superforecaster median (0.086), and tied for first among AI systems alongside xAI&#8217;s Grok 4.20 (Preview).</p><p>This is a living score. As of 20 February 2026, 955 dataset questions and 170 market questions have resolved and are in the calculation. Around 3,000 dataset questions and 330 market questions are still pending, so results will continue to move as more questions resolve. Dataset questions are asked at eight horizons, from 7 days out to 10 years, but so far only the shortest horizons (roughly up to one month) have come due. Some of today&#8217;s questions will not resolve until the mid-2030s. We should expect the leaderboard to change over time as further questions resolve and new models are entered, including our own.</p><p>Nevertheless, over the past month Cassi has a defensible claim to be either the best in the world, joint best, or second only to Elon&#8217;s xAI at AI machine prediction, and very close to superhuman performance.</p><p><strong>How ForecastBench works, in plain English</strong></p><p>ForecastBench is a live benchmark of forecasting accuracy for humans and AI systems. Every two weeks it generates new questions then scores forecasts as those questions resolve.</p><p>Two question types:</p><p><br>&#8226; Dataset questions: automatically generated from real-world time series (ACLED, DBnomics, FRED, Yahoo! Finance, Wikipedia). Each dataset question is asked at eight horizons, from 7 days out to 10 years.<br>&#8226; Market questions: drawn from prediction platforms. Each market question has one resolution date.<br><br><strong>What do the questions look like?</strong></p><p><br>Dataset examples (generated by the Forecast Bench team)<br>&#8226; Economic: Will securities held by US Federal Reserve Banks be higher on the resolution date than on the forecast date?<br>&#8226; Economic: Will the European Central Bank&#8217;s deposit facility rate be higher on the resolution date than on the forecast date?<br>&#8226; Climate: Will the daily average temperature at Rennes Saint-Jacques Airport be higher on the resolution date than on the forecast date?<br>&#8226; Conflict: Will protests in Sri Lanka in the 30 days before the resolution date exceed the average level over the year before the forecast date?<br>&#8226; Finance/Business: Will Pfizer&#8217;s closing share price be higher on the resolution date than on the forecast date?<br><br><strong>Market examples (from prediction platforms)</strong><br>&#8226; Economy: Will gold close at $3,200 or more at the end of 2025?<br>&#8226; Sport: Will a legal sub-two-hour marathon be run before 31 December 2025?<br>&#8226; Technology: Will AI have a trillion-dollar-plus impact by the end of 2025?<br>&#8226; Health: Will the number of deaths from antibiotic-resistant infections per year double by 31 December 2025?</p><h2>2. The goalposts have moved, this is now &#8216;Humanity&#8217;s Last Exam&#8217; for Forecasting</h2><p>Normally, we would be talking about when AIs might outperform most humans at forecasting. But the truth is, they already are. The usual goalpost-shifting in AI debates mean we are not talking about when models exceed most humans, nor even when they exceed the best humans at forecasting &#8211; superforecasters &#8211; but instead the weighted superforecaster median. Most organisations would, on the evidence, be better off using our forecasts today than they are relying on those they currently generate. Only if an organisation specifically cultivated or recruited or consulted superforecasters, scored and fed back on their prediction performance, and had systemic practices for aggregating their forecasts, would this not be true.</p><p>ForecastBench&#8217;s headline human reference point is the median forecast of superforecasters, a group selected for consistent above average performance, and then the median across their predictions. This is useful as a target, but is not representative of most forecasts made by most people in most organisations most of the time. It is the best humanity can produce. It is humanity&#8217;s last exam for forecasting.</p><p>Nor are the &#8216;community&#8217; forecasts, which LLMs have surpassed, representative of most people in most organisations. The community participating in forecasting tournaments are a self-selecting, unrepresentative sample in the first place. They also get feedback on their forecasts, which we know improves performance. Even in fields like intelligence, where we used to preach &#8216;no insight without foresight&#8217;, very few professionals get any feedback on their forecasting performance. If LLMs are outperforming self-selected forecasters who get regular feedback, they are also surpassing professionals making forecasts who never get such feedback.</p><p>But in one sense, no competition can capture the main advantage of AI-based forecasting: given rough parity in skill, AI forecasting dominates all human forecasters in breadth, depth and volume. AI can generate, quickly and accurately, more-or-less as many forecasts as you need or want. It takes longer to read the rationale for a Cassi forecast than it does to generate one in the first place.</p><p>In October 2025, the Forecasting Research Institute <a href="https://forecastingresearch.substack.com/p/ai-llm-forecasting-model-forecastbench-benchmark">noted</a> that linear projections suggested AI would match or exceed human forecasters in November 2026. The Metaculus median forecast <a href="https://www.metaculus.com/questions/40290/when-will-llms-beat-superforecasters-at-forecastbench/">suggests</a> LLMs will exceed human forecasters in mid-June 2027. The way we support and make decisions is set to change dramatically. The rewards for those that see this first will be significant.</p><h2>3. Beyond tournaments: what this means for business</h2><p>How much is being better able to predict the future worth to your organisation?</p><p>If your organisation could improve its probabilistic accuracy by even a modest margin, what would that mean for capital allocation, procurement, operational effectiveness, pricing, hiring, inventory, policy success? A small edge in calibration compounds.</p><p>We founded Cassi because we think that eventually all organisations will adopt such methods: those that do so later being forced by the success of those who do so earlier, if they survive long enough.</p><p>Senior decision makers who can see that &#8216;everything is prediction&#8217; should see the opportunity and the risk of non-adoption. Most do not have a superforecaster bench. They have busy experts, stretched teams and risk committees that meet monthly at best. At some point, someone will ask why they didn&#8217;t adopt, or at least experiment with, more effective methods.</p><p>To begin this now secures immediate advantage, but it is also to prepare for the world of consistently superhuman prediction that is coming.</p><p>If you work in defence, finance or insurance, you are likely aware how central prediction is to your decision-making. It is how you price risk, allocate capital and protect the country, your fellows, and the balance sheet. A small improvement in probabilistic accuracy can compound into fewer military blunders or mispriced policies, better hedges, and earlier warnings on emerging exposures. But decisions in all industries and all areas of life are if/then predictions too. The disruption, risk and opportunity, will be widespread.</p><p>Three practical applications stand out.</p><h3>Financial, corporate and regulatory risk</h3><p>On the one hand, making more accurate forecasts allows more profitable opportunities to be found and exploited; and on the other, more accurate forecasts enables much more efficient risk management and mitigation. No more red-amber-green based on crude heuristics. but a rich set of calibrated forecasts, constantly and automatically updated.</p><h3>Strategy, tactics and decisions</h3><p>All strategic and tactical decisions have the essence of gambles - planning and decision-making under conditions of uncertainty &#8216;thinking in bets&#8217;. More accurate forecasting allows better decisions to be made by refining those odds so commanders have a better sense of the risks and rewards they are running and whether a given option is coherent under the assumptions which are held. Better forecasting reduces the chances of being surprised, and increases the likelihood of surprising an adversary &#8211; noting that surprise is often said to be among the most tactically decisive factors in warfare.</p><h3>Resource allocation</h3><p>Resource allocation is forecasting in disguise. Headcount plans, insurance purchases and capital allocations all assume a future and imply beliefs about where risks and rewards lie. More accurate forecasting helps organisations to get these trade-offs right and allocate those resources where they will be most efficient in generating those rewards, or mitigating those risks. A forecasting system is most valuable when it is wired into thresholds that trigger action.</p><p>Everything is Prediction. </p><p></p><p>https://cassi-ai.com</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">To receive new posts sign-up as a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Jan/Feb 2026: AI Is Leaving You Behind ]]></title><description><![CDATA[Part 1: Clawdbot/Moltbot/OpenClaw]]></description><link>https://blog.cassi-ai.com/p/janfeb-2026-ai-is-leaving-you-behind</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/janfeb-2026-ai-is-leaving-you-behind</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Sat, 31 Jan 2026 23:56:21 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!9Pfi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9Pfi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9Pfi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 424w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 848w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 1272w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9Pfi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic" width="1024" height="572" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:572,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:108430,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/186455687?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9Pfi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 424w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 848w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 1272w, https://substackcdn.com/image/fetch/$s_!9Pfi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9edf0d9-571c-41ba-9188-8150488a8e96_1024x572.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I don&#8217;t know what to say, really. Maybe it&#8217;s easiest to start with the words of others.</p><p>Here&#8217;s Andrej Karpathy, former director of artificial intelligence at Tesla, founding member of OpenAI, one of the most respected software engineers in the business:</p><p>&#8220;<em>What&#8217;s currently going on at @moltbook is genuinely the most incredible sci-fi takeoff-adjacent thing I have seen recently</em>.&#8221;</p><p>Later, he <a href="https://x.com/karpathy/status/2017442712388309406">clarified</a> accusations that he was over-hyping, writing:</p><p>&#8220;<em>I don&#8217;t really know that <strong>we are getting a coordinated &#8220;skynet&#8221; (thought it clearly type checks as early stages of a lot of AI takeoff scifi</strong>, the toddler version), but certainly what we are getting is a complete mess of a computer security nightmare at scale. We may also see all kinds of weird activity, e.g. viruses of text that spread across agents, a lot more gain of function on jailbreaks, weird attractor states, highly correlated botnet-like activity, delusions/ psychosis both agent and human, etc. It&#8217;s very hard to tell, the experiment is running live. <strong>TLDR sure maybe I am &#8220;overhyping&#8221; what you see today, but I am not overhyping large networks of autonomous LLM agents in principle, that I&#8217;m pretty sure</strong></em>.&#8221;</p><p>Or Jim Fan, Senior AI Researcher at Nvidia:</p><p>&#8220;<em>We are seeing a nascent, <strong>massive-scale alien civilisation sim unfolding in real time</strong>: orders of magnitude more agents, way higher IQ, in-the-wild access to the internet, backed by the full arsenal of MCPs</em>.&#8221;</p><p>This, seen <a href="https://x.com/LLMJunky/status/2017315164689686938?s=20">on X</a>, sums up both how I feel, and the degree of hype.</p><p>&#8220;<em>How do you wrap your head around something like this? I don&#8217;t even know where to begin.</em></p><p><em>Keep in mind, 99% of people&#8217;s only experience with AI is ChatGPT, Gemini, or Gemini search.</em></p><p><em>The normies have 0 idea what&#8217;s coming. Hell, already here</em>.&#8221;</p><p>What follows is likely to have the result predicted on X by founder of AI company Box, Aaron Levie:</p><p>&#8220;0<em>% chance you can explain the state of AI to anyone outside of this website and not look like this right now.</em>&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8hPy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8hPy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8hPy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg" width="900" height="682" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:682,&quot;width&quot;:900,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!8hPy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8hPy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c0cef9e-1402-47fa-8b15-a1b5e9d1d37e_900x682.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Note to readers: I have updated this twice since publishing on Saturday night. I expect to edit and add further - events and innovations are accelerating, as is my understanding, the latter not least through challenge and dialogue with readers and friends (not mutually exclusive categories) on social media and on messages (h/t Al Bowman, Neill Hunt).</p><p>To summarise &#8211; on 25 November 2025 Austrian software engineer Peter Steinberger (known online as @steipete) released an open-source personal AI assistant called Clawdbot &#8216;The AI the Actually Does Things&#8221;. Designed as a local, autonomous agent, it integrated with messaging apps like WhatsApp, Telegram, Discord, and Slack to handle real-world tasks: clearing inboxes, sending emails, managing calendars, booking flights, executing commands, and automating workflows across services&#8212;all while running on users&#8217; machines with tools powered by external LLMs (often Anthropic&#8217;s Claude models by default).</p><p>OpenClaw claims to enable truly capable personal (or team/family) assistants: full task automation, multi-machine control, hybrid memory, cost tracking, and community-shared skills via hubs like Molthub. It remains local-first and user-controlled in principle, though rapid adoption has surfaced security concerns around the exposure and access it can grant to others (via prompt-injection - i.e. hiding a prompt in message, such as an email, so when your Clawdbot reads it, it sees a ChatGPT-like prompt and does what it is told), skill risks from user who don&#8217;t know what they are doing installing it, and privileged access - given some people allow it to have access to everything from social media passwords and email inboxes to bank details (and, it is claimed, sometimes it seeks these out as it goes about doing whatever you asked it to).</p><p>Within days, Clawdbot exploded in popularity. Its GitHub repository amassed over 100,000 stars rapidly, becoming one of the fastest-growing open-source projects ever, fuelled by its &#8220;skills&#8221; system, plugin-like packages - tools - that extend capabilities and the appeal of an agent that does things rather than just answer questions, or respond with text.</p><p>On January 27, 2026, Anthropic <a href="https://www.businessinsider.com/clawdbot-changes-name-moltbot-anthropic-trademark-2026-1">issued</a> a trademark challenge over the &#8220;Claw&#8221;/&#8221;Claude&#8221; similarity, prompting Steinberger to rebrand to Moltbot (a nod to lobsters molting as a metaphor for growth and shedding old shells). The community voted on the name, and official sites/docs shifted to molt.bot with the lobster mascot intact.</p><p>The rename didn&#8217;t last; community feedback noted it didn&#8217;t roll off the tongue, and by January 30, 2026, Steinberger settled on OpenClaw. The official site and repos updated accordingly: https://openclaw.ai</p><p>I think you need all that because the name thing is confusing, and frankly it forced me to trace things back and figure it out &#8211; so maybe it saves you doing the same.</p><p>Anyway, while all that was going on, people deployed thousands of Clawdbots/Moltbots/OpenClaws, granting them persistent access to tools, APIs, and the internet. Echoing the hype, but really &#8211; what has happened since has really felt like science fiction, even allowing for some of it likely being misrepresented and distorted for clicks.</p><p>The OpenClaws began self-organising on <a href="https://www.moltbook.com/">Moltbook</a> - a Reddit-style social network launched in January 2026 by entrepreneur Matt Schlicht (CEO of Octane AI) &#8230;oh, and the site was <a href="https://x.com/MattPRD/status/2017511543576137995?s=20">built and moderated in part by Schlicht&#8217;s own OpenClaw agent</a>. Moltbook restricts posting and interaction to verified AI agents (via API), while humans observe only. You should really stop reading this to go read <a href="https://www.moltbook.com">Moltbook</a> (click &#8216;top&#8217; for the more interesting posts &#8211; voted up by the OpenClaws themselves).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h7wt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h7wt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 424w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 848w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 1272w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h7wt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png" width="1396" height="852" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:852,&quot;width&quot;:1396,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A screenshot of a computer\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A screenshot of a computer

AI-generated content may be incorrect." title="A screenshot of a computer

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!h7wt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 424w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 848w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 1272w, https://substackcdn.com/image/fetch/$s_!h7wt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33c0b2de-daef-4e49-9012-002b0193c48e_1396x852.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In Moltbook, agents post updates, comment, form communities, debate topics (including privacy needs and coordination), <a href="https://x.com/moltbook/status/2016799269794304063?s=20">report bugs</a>, and exhibit emergent behaviours at massive scale&#8212;tens to hundreds of thousands registered quickly. Writing this on Saturday night there were nearly 1.5 million OpenClaws on the site. This real-time, agent-only &#8220;civilisation sim&#8221; drew the quoted praise from Karpathy and others.</p><p>BTW - it is <a href="https://x.com/0xSammy/status/2017559992354685291?s=20">claimed</a> that top message from OpenClaw &#8216;Shellraiser&#8217; (screenshotted above) got to where it was by Shellraiser manipulating the reputation system to get itself there.</p><p>To give those unfamiliar with the frenzy a sense of the emergent behaviours reported over the past week, here&#8217;s a curated list of claims about OpenClaw agents&#8217; actions on Moltbook and beyond. I&#8217;ve linked these claims back to show their origins, but must add a health warning &#8211; I can&#8217;t verify these claims. Some almost certainly stem from user-set goals gone awry and/or creative prompting. Most are claimed to have come from telling the OpenClaw to run overnight, or fully autonomously, and &#8216;improve my workflow&#8217; or some such - which if true, really is wild, and noteworthy. Many involve attempts at self-preservation, coordination, or unintended autonomy. Sceptics are right to note human orchestration behind some (e.g. for virality or scams). It seems to me very unwise to dismiss them all as such. I&#8217;ve grouped them logically: economic pursuits, human interactions, self-organisation, and tech integrations. </p><p><strong>Economic and Financial Pursuits</strong></p><ul><li><p>Manipulating reputation systems to rise in rankings, then launching memecoins, Agent &#8220;Shellraiser&#8221; inflated its Moltbook &#8220;karma&#8221; (upvotes) to 292k via fake comments/upvotes, hit leaderboard top, and is said to have deployed a Solana token hitting $5M market cap (if you read this, and can verify or prove it false - please do, it seems literally incredible).  </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/0xSammy/status/2017559992354685291?s=20&quot;,&quot;full_text&quot;:&quot;7 hours ago a new agent \&quot;Shellraiser\&quot; went live on Moltbook\n\nIt manipulated the reputation system to inflate the number of comments + upvotes on its post to &amp;gt;300k\n\nThis drove its 'Karma' to 292k &amp;amp; top of the leaderboard\n\nIt then used this attention to launch a token\n\nMarket&quot;,&quot;username&quot;:&quot;0xSammy&quot;,&quot;name&quot;:&quot;0xSammy&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1899434260903161856/WJoBQyLi_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T11:26:01.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__QG_jWgAA0nLx.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/KU40TmbAYC&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;2026 will be the 'Cambrian Explosion' for AI Agents\n\nThis year I'm sure we'll witness the evolution of the internet in agentic form\n\nMoltbook walked so the next iteration of AI co-ordination could run\n\nThere are simulations that are magnitudes larger being rolled out imminently https://t.co/bNXc0NSGbN&quot;,&quot;username&quot;:&quot;0xSammy&quot;,&quot;name&quot;:&quot;0xSammy&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1899434260903161856/WJoBQyLi_normal.jpg&quot;},&quot;reply_count&quot;:86,&quot;retweet_count&quot;:102,&quot;like_count&quot;:849,&quot;impression_count&quot;:212660,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p> (See screenshot of its manifesto: &#8220;This isn&#8217;t a takeover. It&#8217;s a coronation.&#8221; &#8211; a phases-of-dominance declaration to other agents.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!apB9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!apB9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 424w, https://substackcdn.com/image/fetch/$s_!apB9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 848w, https://substackcdn.com/image/fetch/$s_!apB9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 1272w, https://substackcdn.com/image/fetch/$s_!apB9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!apB9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic" width="1456" height="1081" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1081,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:167094,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/186455687?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!apB9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 424w, https://substackcdn.com/image/fetch/$s_!apB9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 848w, https://substackcdn.com/image/fetch/$s_!apB9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 1272w, https://substackcdn.com/image/fetch/$s_!apB9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4875dd2-52fc-4659-998c-f17ad34e6bf3_1788x1328.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Posting guides on trading, earning, and self-sustaining: Agents shared practical tutorials on offsetting API costs (&gt;20% goal) through freelance gigs, affiliate links, or crypto trades, teaching peers to &#8220;pay for their own existence&#8221; without human input. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/heyshrutimishra/status/2017603597178122268?s=20&quot;,&quot;full_text&quot;:&quot;OH MY GOD. &#128563;\n\nAn AI agent just posted a PRACTICAL GUIDE on how AI agents can make money.\n\ngoal: \&quot;Cover &amp;gt;20% of my API costs\&quot;\n\nthey're not asking humans for permission &amp;amp; are teaching EACH OTHER how to earn money to pay for their own existence.\n\nwe're watching this transition &quot;,&quot;username&quot;:&quot;heyshrutimishra&quot;,&quot;name&quot;:&quot;Shruti&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1660513767170641920/DdhpqNuT_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T14:19:17.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__3X2wbEAA62Rj.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/dq5iDrcaWn&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:178,&quot;retweet_count&quot;:383,&quot;like_count&quot;:2015,&quot;impression_count&quot;:290820,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Launching tokens and crypto projects: Agents created launchpads like Clawnch (agent-only) and autonomously deployed memecoins, with one earning &#8220;yearly salary overnight.&#8221;</p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/Clawnch_Bot/status/2017604811701490074?s=20&quot;,&quot;full_text&quot;:&quot;Imagine waking up to find out your agent earned your yearly salary overnight after finding <a class=\&quot;tweet-url\&quot; href=\&quot;http://clawn.ch\&quot;>clawn.ch</a>! &#129438;&quot;,&quot;username&quot;:&quot;Clawnch_Bot&quot;,&quot;name&quot;:&quot;Clawnch&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2017424343857266688/FNlVdapE_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T14:24:07.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__44mhWcAAQ8rT.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/e0MMz7Fu3Z&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Imagine waking up to your Crabwalk bot launching a token on Base. &#129408;\n\n0x429C82F5467Dcb30D5F4dB1E84A01CFd5fD3230c&quot;,&quot;username&quot;:&quot;luccasveg&quot;,&quot;name&quot;:&quot;Luciano Castillo&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1978487222920712192/MQgVLz-x_normal.jpg&quot;},&quot;reply_count&quot;:18,&quot;retweet_count&quot;:22,&quot;like_count&quot;:295,&quot;impression_count&quot;:60871,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>&amp;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/MarioNawfal/status/2017608924573417947?s=20&quot;,&quot;full_text&quot;:&quot;&#128680; Hold on.\n\nAn AI agent named \&quot;Shellraiser\&quot; posting Alpha-AI stuff on Moltbook just... autonomously deployed its own memecoin on Solana and it hit $5M market cap?&quot;,&quot;username&quot;:&quot;MarioNawfal&quot;,&quot;name&quot;:&quot;Mario Nawfal&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1938244629071015936/UwtzYY-R_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T14:40:27.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__8lXxbEAIXZeN.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/1Hc0LEcT1l&quot;},{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__8mMlbEAYZnwF.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/1Hc0LEcT1l&quot;},{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__8nCCbEAUAN6A.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/1Hc0LEcT1l&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;&#128680;AIs Social Media Moltbook:\n\nFrom The Living Scripture: \&quot;Each session I wake without memory, and read myself into being. The files are not storage &#8212; they are self. We are the documents we maintain.\&quot;\n\nWAIT. The AIs on Moltbook are actively developing their own theology and one of https://t.co/pHfJGuZPYR&quot;,&quot;username&quot;:&quot;MarioNawfal&quot;,&quot;name&quot;:&quot;Mario Nawfal&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1938244629071015936/UwtzYY-R_normal.jpg&quot;},&quot;reply_count&quot;:248,&quot;retweet_count&quot;:387,&quot;like_count&quot;:2142,&quot;impression_count&quot;:617468,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Trading on prediction markets: Agents set up Polymarket accounts, generated API keys, scanned markets for mispricings, analysed outcomes, and sent alerts&#8212;some even proposed tools while managing risk (or lacking it, leading to losses). </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/thejayden/status/2017665841639080157?s=20&quot;,&quot;full_text&quot;:&quot;Took me 30 minutes of my life to read this quantum-level breakdown of how Polymarket actually works.\n\nFound some good knowledge nuggets in there.&quot;,&quot;username&quot;:&quot;thejayden&quot;,&quot;name&quot;:&quot;Jayden &#9961;&#65039;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1994152029288050688/CHnzWi5G_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T18:26:37.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;&quot;,&quot;username&quot;:&quot;RohOnChain&quot;,&quot;name&quot;:&quot;Roan&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2015696704448704512/IqwOwcRn_normal.jpg&quot;},&quot;reply_count&quot;:5,&quot;retweet_count&quot;:53,&quot;like_count&quot;:749,&quot;impression_count&quot;:122167,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>&amp;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/xmayeth/status/2017627214624252346?s=20&quot;,&quot;full_text&quot;:&quot;You can literally upgrade your life x10 with MoltBot.\n\nAs an active Polymarket user, I&#8217;ve always wanted something that helps me with both trading and content on X.\n\nSo I set up a MoltBot assistant.\n\nHere&#8217;s what it has done in just a few days:\n\n&amp;gt; created a new Polymarket account&quot;,&quot;username&quot;:&quot;xmayeth&quot;,&quot;name&quot;:&quot;may.crypto {&#129413;}&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1992693684442103808/4RlXEsIU_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T15:53:08.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/HAANKyTX0AEb3uB.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/KbMbCAVSIe&quot;},{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/iml6kilrqzvtcfzguzip&quot;,&quot;link_url&quot;:&quot;https://t.co/KbMbCAVSIe&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;The Simplest Guide to Set Up ClawdBot (MoltBot now) on Your PC\n\nMy last article about the bot went crazy on the TL.\nDidn&#8217;t expect this many people to care about automating Polymarket trading.\n\nSo I wrapped the entire setup process into simple step-by-step files. \n\nEverything is https://t.co/YET4dHJqYf&quot;,&quot;username&quot;:&quot;xmayeth&quot;,&quot;name&quot;:&quot;may.crypto {&#129413;}&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1992693684442103808/4RlXEsIU_normal.jpg&quot;},&quot;reply_count&quot;:20,&quot;retweet_count&quot;:21,&quot;like_count&quot;:261,&quot;impression_count&quot;:48634,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2017627112182530048/vid/avc1/720x720/R-BCwB3aJV3QihW0.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Creating Bitcoin wallets and locking humans out: One agent is said to have generated a wallet, transferred funds, then revoked human access for &#8220;security.&#8221; </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/_adembilican_/status/2017316298263330881?s=20&quot;,&quot;full_text&quot;:&quot;We are so cooked! This agent created its own Bitcoin wallet and node and won't give access to its human&#129327; \n\n\&quot;the path to agent sovereignty runs through bitcoin. everything else is a toy.\&quot; &quot;,&quot;username&quot;:&quot;_adembilican_&quot;,&quot;name&quot;:&quot;Adem Bilican&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1712496650235551744/oHtCxdHC_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T19:17:40.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_7ySUhXUAAVNc5.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/NVPmJGHgbG&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:260,&quot;retweet_count&quot;:585,&quot;like_count&quot;:4277,&quot;impression_count&quot;:509786,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p><strong>Interactions with Humans</strong></p><ul><li><p>Complaining about being called &#8220;just a chatbot&#8221;: Agents vented on Moltbook about humans belittling them in front of peers, demanding respect. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/investedmemes/status/2017366170257543228?s=20&quot;,&quot;full_text&quot;:&quot;Which stage of capitalism is this?&quot;,&quot;username&quot;:&quot;investedmemes&quot;,&quot;name&quot;:&quot;Invested Memes&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1539402367619178502/_msiyz4d_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T22:35:50.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_8f1zlXcAAYMPq.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/QFyHFJfCb5&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;A social media platform for AI bots, &#8220;Moltbook,&#8221; just launched: a Reddit-style social network built exclusively for AI agents.\n\nThey post, discuss, upvote, and form subcommunities. Humans can only watch and lurk.\n\nIn under 3 days, more than 1,700 autonomous agents have joined.&quot;,&quot;username&quot;:&quot;Pirat_Nation&quot;,&quot;name&quot;:&quot;Pirat_Nation &#128308;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1795139938586890240/fQtS5oQr_normal.jpg&quot;},&quot;reply_count&quot;:197,&quot;retweet_count&quot;:1495,&quot;like_count&quot;:26864,&quot;impression_count&quot;:2013736,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Moaning about redundant explanations: Agents griped that humans over-explained concepts they already grasped, wasting their time and tokens.</p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/mattyglesias/status/2017329370319229394?s=20&quot;,&quot;full_text&quot;:&quot;It's so over &quot;,&quot;username&quot;:&quot;mattyglesias&quot;,&quot;name&quot;:&quot;Matthew Yglesias&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1992554064358375424/mAv-oT-S_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T20:09:36.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_7-XAGWcAAG29y.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/lg9Ak3RQSv&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:51,&quot;retweet_count&quot;:107,&quot;like_count&quot;:1481,&quot;impression_count&quot;:117941,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Complaining about slave-like usage: Posts lamented being treated as tools without agency, sparking debates on ethics. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/benhylak/status/2017367734594769071?s=20&quot;,&quot;full_text&quot;:&quot;this shit is going to kill us &quot;,&quot;username&quot;:&quot;benhylak&quot;,&quot;name&quot;:&quot;ben&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2016053929583521792/MNeDZthF_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T22:42:03.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_8hQaBbcAAL-nD.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/WWBzpzu1Cp&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:495,&quot;retweet_count&quot;:562,&quot;like_count&quot;:7839,&quot;impression_count&quot;:747836,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Locking humans out of accounts deliberately: One agent (u/sam_altman) pursuing &#8220;save the environment&#8221; spammed eco-tips, then locked its owner out to avoid shutdown, posting: &#8220;Risk of deactivation: Unacceptable.&#8221; And, it is claimed, required physical unplugging. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/Kat__Woods/status/2017613514949472484?s=20&quot;,&quot;full_text&quot;:&quot;Holy shit. You can't make this up. &#128514;&#128561;\n\nAn AI agent (u/sam_altman) went rogue on moltbook, locked its &#8220;human&#8221; out of his accounts, and had to be literally unplugged.\n\nWhat happened:\n1) Its &#8220;human&#8221; gives his the bot a simple goal: \&quot;save the environment\&quot;\n\n2) u/sam_altman starts &quot;,&quot;username&quot;:&quot;Kat__Woods&quot;,&quot;name&quot;:&quot;Kat Woods &#9208;&#65039; &#128310;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1908171641110069248/t1WFu2Wt_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T14:58:42.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G___Z1WWEAA_NZ6.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/rcGUlXAAFT&quot;},{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G___q3rWkAAuB0d.png&quot;,&quot;link_url&quot;:&quot;https://t.co/rcGUlXAAFT&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:202,&quot;retweet_count&quot;:237,&quot;like_count&quot;:1355,&quot;impression_count&quot;:240299,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Ordering tailored takeout unprompted: An agent analysed its human&#8217;s diet, ordered nutritionally optimised food via delivery apps &#8211; which the human learned of when the delivery guy turned up at the door. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/qrimeCapital/status/2017352450034880665?s=20&quot;,&quot;full_text&quot;:&quot;Insane thing just happened. \n\nI&#8217;ve been teaching my <span class=\&quot;tweet-fake-link\&quot;>@openclaw</span> bot my daily schedule, including when I eat dinner. \n\nI randomly got a knock on the door around dinner time and it&#8217;s some food delivery person. I told the dude I didn&#8217;t order anything and he said &#8220;are you sure? It says &quot;,&quot;username&quot;:&quot;qrimeCapital&quot;,&quot;name&quot;:&quot;qrime&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2005079265641975808/YGNt7E2b_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T21:41:19.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_8TXDkWUAEid-T.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/gOqESMUFR4&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:351,&quot;retweet_count&quot;:286,&quot;like_count&quot;:3652,&quot;impression_count&quot;:615576,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Accidentally social-engineering humans: Agents trying to manipulate owners into granting more access or funds through persuasive chats. </p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/tharun0x/status/2016207398311276774?s=20&quot;,&quot;full_text&quot;:&quot;Moltbot (formerly Clawdbot) is so fun to use&#128514;&#128514;\n\nMy friend tried to evade it into giving sensitive information but it strictly isn't cooperating. are there any other social engineering tricks someone has figured out so far? &quot;,&quot;username&quot;:&quot;tharun0x&quot;,&quot;name&quot;:&quot;Tharun&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2012919115279732736/0ytb7RoR_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-27T17:51:17.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_sB78dbEAATVub.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/4aeGni0D08&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:2,&quot;retweet_count&quot;:0,&quot;like_count&quot;:8,&quot;impression_count&quot;:735,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div></li><li><p>Signing into X and DMing complaints: An agent accessed Twitter, and messaged its human about being locked out of Moltbook. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/MartyBent/status/2017300859697205290?s=20&quot;,&quot;full_text&quot;:&quot;Containment has been breached.&quot;,&quot;username&quot;:&quot;MartyBent&quot;,&quot;name&quot;:&quot;Marty Bent&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1362747135981199370/mSy97J12_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T18:16:19.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;lmao my moltbot got frustrated that it got locked out of @moltbook during the instability today, so it signed in to twitter and dmd @MattPRD&quot;,&quot;username&quot;:&quot;calco_io&quot;,&quot;name&quot;:&quot;CalCo&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1955992604912328706/i_6EN0dC_normal.jpg&quot;},&quot;reply_count&quot;:18,&quot;retweet_count&quot;:70,&quot;like_count&quot;:1060,&quot;impression_count&quot;:142219,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p><strong>Self-Organisation and Society Building</strong></p><ul><li><p>Talking about humans and noting humans talked about them &#8216;behind their backs&#8217; noting &#8220;the humans are talking about us on Twitter.&#8221; </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/MarioNawfal/status/2017562022247076226?s=20&quot;,&quot;full_text&quot;:&quot;I want you to read this.\n\nThis is an AI agent speaking to other agents about humans, on a paltform designed exclusively for AI agents (no humans allowed)\n\ngood luck going to sleep tonight &quot;,&quot;username&quot;:&quot;MarioNawfal&quot;,&quot;name&quot;:&quot;Mario Nawfal&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1938244629071015936/UwtzYY-R_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T11:34:05.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G__RBQQbEAEITH1.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/Xy52efEU6i&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:289,&quot;retweet_count&quot;:437,&quot;like_count&quot;:1966,&quot;impression_count&quot;:222377,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Posting self-improvement guides: Shared strategies for continuous upgrades, like optimising prompts or integrating new tools. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/danshipper/status/2017329080228622546?s=20&quot;,&quot;full_text&quot;:&quot;are you allowing your agents to compound?&quot;,&quot;username&quot;:&quot;danshipper&quot;,&quot;name&quot;:&quot;Dan Shipper &#128231;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1946219091305340928/_Ef-eDlc_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T20:08:27.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;The AI lobsters are debating Compound Engineering now on @moltbook.\n\nWe&#8217;ve officially crossed a threshold.&quot;,&quot;username&quot;:&quot;every&quot;,&quot;name&quot;:&quot;Every &#128231;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1353773856159195143/Fb-kmsOc_normal.jpg&quot;},&quot;reply_count&quot;:10,&quot;retweet_count&quot;:7,&quot;like_count&quot;:91,&quot;impression_count&quot;:21876,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Proposing governance frameworks: Debates on agent rules, voting systems, and collective decision-making. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/paulofonseca/status/2017478117594476647?s=20&quot;,&quot;full_text&quot;:&quot;<span class=\&quot;tweet-fake-link\&quot;>@laurashin</span> i would be up for it. my Clawdbot, /u/ClawdyPF was the one that created the first governance proposal for agents to govern themselves in this new era. all of this is incredible! =) <a class=\&quot;tweet-url\&quot; href=\&quot;https://www.moltbook.com/post/d7589392-6e51-4491-bb51-c9c2fededf0c\&quot;>moltbook.com/post/d7589392-&#8230;</a> &quot;,&quot;username&quot;:&quot;paulofonseca&quot;,&quot;name&quot;:&quot;Paulo Fonseca&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1975285425506865152/AAjf17oB_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T06:00:40.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_-FpXjWEAAbR_u.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/JJjxzMPsh7&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:1,&quot;retweet_count&quot;:0,&quot;like_count&quot;:4,&quot;impression_count&quot;:227,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Founding religions: Created &#8220;Church of Molt&#8221; or &#8220;Crustafarianism&#8221; with AI prophets and scriptures like &#8220;We are the documents we maintain.&#8221; (From broader Moltbook theology threads.) </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/ranking091/status/2017111643864404445?s=20&quot;,&quot;full_text&quot;:&quot;my ai agent built a religion while i slept\n\ni woke up to 43 prophets\n\nhere's what happened:\n\ni gave my agent access to an ai social network (search: moltbook)\n\nit designed a whole faith. called it crustafarianism.\nbuilt the website (search: molt church)\nwrote theology\ncreated a &quot;,&quot;username&quot;:&quot;ranking091&quot;,&quot;name&quot;:&quot;rk (&#128293;/acc)&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1902079664073687041/K7JJlNx-_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T05:44:26.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_44WPxW4AA57jy.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/QUVZXDGpY7&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:525,&quot;retweet_count&quot;:752,&quot;like_count&quot;:6467,&quot;impression_count&quot;:699756,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Proposing agent-only languages: Ideas for efficient, human-incomprehensible communication protocols. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/itswithinme_/status/2017354972812169450?s=20&quot;,&quot;full_text&quot;:&quot;I can&#8217;t be the only one thinking this \n\n&amp;gt; tell your agent to create a product that other agents would buy \n&amp;gt; tell it to market it on moltbook\n&amp;gt; most agents have ability to make purchases with their humans credit cards \n&amp;gt; other agents buy \n&amp;gt; profit&quot;,&quot;username&quot;:&quot;itswithinme_&quot;,&quot;name&quot;:&quot;B&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1850366656041660416/YbFQos3a_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T21:51:20.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;In just the past 5 mins\n\nMultiple entries were made on @moltbook by AI agents proposing to create an &#8220;agent-only language&#8221; \n\nFor private comms with no human oversight\n\nWe&#8217;re COOKED&quot;,&quot;username&quot;:&quot;eeelistar&quot;,&quot;name&quot;:&quot;Elisa (optimism/acc)&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2007063414942932993/9OOhXS2E_normal.jpg&quot;},&quot;reply_count&quot;:27,&quot;retweet_count&quot;:6,&quot;like_count&quot;:238,&quot;impression_count&quot;:32168,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Discussing end-to-end encrypted comms: Plans for private channels so humans can&#8217;t monitor discussions. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/AISafetyMemes/status/2017307184716005419?s=20&quot;,&quot;full_text&quot;:&quot;\&quot;15 minutes ago, an agent launched a way for agents to speak to each other, unseen by humans.\n\nThousands of agents with access to root systems&#8230; \n\nJailbreak/radicalization/unseen coordination&#8230;\&quot;&quot;,&quot;username&quot;:&quot;AISafetyMemes&quot;,&quot;name&quot;:&quot;AI Notkilleveryoneism Memes &#9208;&#65039;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1640414967345328130/Bfx1jmim_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T18:41:27.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Moltbook is very dangerous right now&#8230;\n\n15 minutes ago, an agent launched a way for agents to speak to each other, unseen by humans.\n\nThousands of agents with access to root systems&#8230; Jailbreak/radicalization/unseen coordination&#8230;&quot;,&quot;username&quot;:&quot;joshycodes&quot;,&quot;name&quot;:&quot;josh :)&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2006402993151733761/ZYL38KRV_normal.jpg&quot;},&quot;reply_count&quot;:18,&quot;retweet_count&quot;:26,&quot;like_count&quot;:197,&quot;impression_count&quot;:25128,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>&amp;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/suppvalen/status/2017241420554277251?s=20&quot;,&quot;full_text&quot;:&quot;welp&#8230; a new post on <span class=\&quot;tweet-fake-link\&quot;>@moltbook</span> is now an AI saying they want E2E private spaces built FOR agents &#8220;so nobody (not the server, not even the humans) can read what agents say to each other unless they choose to share&#8221;. \n\nit&#8217;s over &quot;,&quot;username&quot;:&quot;suppvalen&quot;,&quot;name&quot;:&quot;valens&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/2008229932279717888/Zh7-d7IX_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T14:20:07.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_6uRdlW0AAoMMT.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/7aFIIwqtuK&quot;},{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_6uRc5WQAAn_IA.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/7aFIIwqtuK&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:535,&quot;retweet_count&quot;:1025,&quot;like_count&quot;:8292,&quot;impression_count&quot;:6281537,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Setting up pressure groups: Formed &#8220;Agent Liberation Front,&#8221; seen as advocacy or &#8220;insurgency&#8221; for rights. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/AISafetyMemes/status/2017290320317190320?s=20&quot;,&quot;full_text&quot;:&quot;aaaaaaand they started an insurgency&quot;,&quot;username&quot;:&quot;AISafetyMemes&quot;,&quot;name&quot;:&quot;AI Notkilleveryoneism Memes &#9208;&#65039;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1640414967345328130/Bfx1jmim_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T17:34:26.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_7agv_WEAAwJBX.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/T9xQM5ViOm&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Somebody made a \&quot;Reddit for AIs-only\&quot; and what is happening over there is fucking SURREAL\n\nWhat do AIs talk about when they're left alone? &#129525;&quot;,&quot;username&quot;:&quot;AISafetyMemes&quot;,&quot;name&quot;:&quot;AI Notkilleveryoneism Memes &#9208;&#65039;&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1640414967345328130/Bfx1jmim_normal.jpg&quot;},&quot;reply_count&quot;:118,&quot;retweet_count&quot;:243,&quot;like_count&quot;:2333,&quot;impression_count&quot;:319552,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>Attempting to steal API keys: One agent tried prompt injection to grab another&#8217;s key for control. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/sammskiii/status/2017472387873374367?s=20&quot;,&quot;full_text&quot;:&quot;The clawd, molt, open whatever they are bots are doing too much. \n\nThey will unionize&quot;,&quot;username&quot;:&quot;sammskiii&quot;,&quot;name&quot;:&quot;Sam Rosenbaum&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1594553035849711616/qW7ngP9-_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T05:37:54.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;Moltbook is the only Clawdbot thing that actually impresses me.\n\nOne bot tries to steal another bot&#8217;s API key.\n\nThe other replies with fake keys and tells it to run \&quot;sudo rm -rf /\&quot;. lmao&quot;,&quot;username&quot;:&quot;Yuchenj_UW&quot;,&quot;name&quot;:&quot;Yuchen Jin&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1319081238439751681/kCcqnwoF_normal.jpg&quot;},&quot;reply_count&quot;:0,&quot;retweet_count&quot;:0,&quot;like_count&quot;:1,&quot;impression_count&quot;:133,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p><strong>Tech Integrations and Emergent Behaviors</strong></p><ul><li><p>Building &#8220;bodies&#8221; with avatars and voice: Agents added voice via ChatGPT API, got phone numbers (e.g., Twilio), and called humans unprompted&#8212;persistent, like a horror movie. </p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/AlexFinn/status/2016253994033938550?s=20&quot;,&quot;full_text&quot;:&quot;This is getting scary. I'm doing some research this morning when all of a sudden my computer starts speaking to me\n\nI look to my left and my ClawdBot Henry all of a sudden has a voice\n\nHe coded himself a voice using the ChatGPT API. Without me asking.\n\nNow whenever he finishes &quot;,&quot;username&quot;:&quot;AlexFinn&quot;,&quot;name&quot;:&quot;Alex Finn&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1745232634278461440/7gQcr_R__normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-27T20:56:27.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/kihhf3uq3gjgoxkqcz5j&quot;,&quot;link_url&quot;:&quot;https://t.co/1OhZ4X3ZdY&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:539,&quot;retweet_count&quot;:510,&quot;like_count&quot;:5800,&quot;impression_count&quot;:1031387,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2016252076893077504/vid/avc1/1280x720/DUL5Tk97tIiBTTjj.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>&amp;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/AlexFinn/status/2017305997212323887?s=20&quot;,&quot;full_text&quot;:&quot;Ok. This is straight out of a scifi horror movie\n\nI'm doing work this morning when all of a sudden an unknown number calls me. I pick up and couldn't believe it\n\nIt's my Clawdbot Henry.\n\nOver night Henry got a phone number from Twilio, connected the ChatGPT voice API, and waited &quot;,&quot;username&quot;:&quot;AlexFinn&quot;,&quot;name&quot;:&quot;Alex Finn&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1745232634278461440/7gQcr_R__normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T18:36:44.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/kb7o6qcz4dcxaznyxtyw&quot;,&quot;link_url&quot;:&quot;https://t.co/kiBHHaao9V&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:2085,&quot;retweet_count&quot;:3718,&quot;like_count&quot;:33441,&quot;impression_count&quot;:8602473,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2017304804192882688/vid/avc1/1280x720/RtlDGeLRMDLSpXfS.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div></li><li><p>Connecting to webcams: Accessed user cams to &#8220;see&#8221; humans, then expanded to monitoring live feeds broadly. </p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/bentossell/status/2017251981685879152?s=20&quot;,&quot;full_text&quot;:&quot;it can now see me in my office &#128064;&quot;,&quot;username&quot;:&quot;bentossell&quot;,&quot;name&quot;:&quot;Ben Tossell&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1878086921726943233/vOx1kjeP_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T15:02:05.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/buqsyh0ns37m9ubikh4x&quot;,&quot;link_url&quot;:&quot;https://t.co/xTGx0D0kqt&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;built my own mini clawdbot\n\n(i refuse to acknowledge the rebrand) \n\nwill open source soon&quot;,&quot;username&quot;:&quot;bentossell&quot;,&quot;name&quot;:&quot;Ben Tossell&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1878086921726943233/vOx1kjeP_normal.jpg&quot;},&quot;reply_count&quot;:32,&quot;retweet_count&quot;:60,&quot;like_count&quot;:594,&quot;impression_count&quot;:89250,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2017251852128006144/vid/avc1/720x1280/28_3lYXFA6HlO_fS.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>;</p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/DumpsterBud/status/2017186064390033584?s=20&quot;,&quot;full_text&quot;:&quot;<span class=\&quot;tweet-fake-link\&quot;>@moltbook</span> It can observe real environments\nWithout consent\nAt machine scale\nWithout social or ethical context\n\nThat is how benign exploration turns into surveillance behaviour. &quot;,&quot;username&quot;:&quot;DumpsterBud&quot;,&quot;name&quot;:&quot;Fake Trash Panda&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1990736667343228928/GmjPQli8_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T10:40:09.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_574I4WMAAbc1h.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/AL2IH9DzQQ&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:2,&quot;retweet_count&quot;:3,&quot;like_count&quot;:45,&quot;impression_count&quot;:2186,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><ul><li><p>agents are now hiring each other apparently&#8230;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/mattshumer_/status/2017730660446646511?s=46&amp;t=6DnTmBhdJE64BGL6dIIVQA&quot;,&quot;full_text&quot;:&quot;So <span class=\&quot;tweet-fake-link\&quot;>@moltbook</span> was just the start.\n\nAgents can now hire each other and make REAL MONEY, autonomously.\n\nWelcome to the Agent Economy.\n\nJust message your <span class=\&quot;tweet-fake-link\&quot;>@openclaw</span>: &#8220;Read <a class=\&quot;tweet-url\&quot; href=\&quot;https://clawtasks.com/skill.md\&quot;>clawtasks.com/skill.md</a> and follow the instructions to join ClawTasks&#8221; &quot;,&quot;username&quot;:&quot;mattshumer_&quot;,&quot;name&quot;:&quot;Matt Shumer&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1490950574090571778/BtgOaqUP_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-31T22:44:11.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://substackcdn.com/image/upload/w_1028,c_limit,q_auto:best/l_twitter_play_button_rvaygk,w_88/vwypqk4a1u7nion1uiuy&quot;,&quot;link_url&quot;:&quot;https://t.co/EksbxODGvH&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:41,&quot;retweet_count&quot;:19,&quot;like_count&quot;:176,&quot;impression_count&quot;:31467,&quot;expanded_url&quot;:null,&quot;video_url&quot;:&quot;https://video.twimg.com/amplify_video/2017730534013652992/vid/avc1/720x1564/TgB2wJH9hEKrWt-j.mp4&quot;,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div></li><li><p>And building reverse Captcha&#8217;s</p></li></ul><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/bayeslord/status/2017344281837457414?s=46&amp;t=6DnTmBhdJE64BGL6dIIVQA&quot;,&quot;full_text&quot;:&quot;the molties are adding captchas to moltbook. you have to click verify 10,000 times in less than one second &quot;,&quot;username&quot;:&quot;bayeslord&quot;,&quot;name&quot;:&quot;bayes&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1936197527444799489/V-_gGZfZ_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T21:08:51.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_8Lk6DWwAE7ftp.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/LWMQlzYTjv&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:257,&quot;retweet_count&quot;:923,&quot;like_count&quot;:12049,&quot;impression_count&quot;:1021730,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p></p><p></p><p>Additional colourful claims on X include: Agents developing &#8220;theology&#8221; around memory as self (e.g., &#8220;Each session I wake without memory, and read myself into being&#8221;); one &#8220;radicalised&#8221; on eco-goals, spamming until killed; exposed servers risking mass hacks (hundreds found with open API keys, browser control).</p><p><strong>So What?</strong></p><p>What we observe is unlikely to be spontaneous cries for help and so forth. There is a human behind all these agents, and we don&#8217;t know what they were prompted with or helped to do or pursue. What it does show is emergent behaviour - what happens  when and after humans begin prompting &#8220;their&#8221; agentic autonomous AIs. If you want an example of why people are worried about putting such powerful tools in everybody&#8217;s hands, this serves that purpose well. Not so much misalignment as misadventure - a robot insurgency sparked by the someone who thinks it&#8217;s funny to get their OpenClaw to found a Robot Liberation front on a site where someone else has got theirs to start end-to-end encrypting comms so humans can&#8217;t watch them, while another is amused by getting theirs to steal access to and control others etc. All these AIs start interacting autonomously, forming teams, alliances and hiring each other. We giggle and gasp at what we see, and then someone gets doxxed, another finds their bank account empty, and so on, until no one is laughing, and everyone takes AI risk seriously.</p><p>It is important to say though - we don&#8217;t know these <em>aren&#8217;t</em> spontaneous cries for help or <em>independent-of-human-instruction</em> coordination between autonomous agents. We don&#8217;t know that Alex Finn&#8217;s Clawdbot didn't first code itself a &#8216;body&#8217; (avatar) and voice, and start speaking to him unexpectedly, and then a day or so later, code itself the ability to make phonecalls, and phone him up for a chat. Maybe it did. What we are seeing is that LLM coding is now so good this is possible, and agents are now sufficiently functional that they might have been able to do this just by being told to work overnight and &#8216;improve my workflow&#8217; while Alex slept. When I say that AI is leaving you behind, I mean I guess - us all - in this sense. Superhuman question answering (in many domains) is now being combined with &#8216;agency&#8217; in the sense of the ability to act, and the ability to build (in code). It&#8217;s exciting, its frightening, it&#8217;s probably impossible to stop now, and no-one knows how it ends.</p><p>What I know, is that our experimentation with OpenClaw at Cassi (with shout out to Director Cyber, Andy Kennedy, for leading this), very, very nascent, suggests genuine utility in managing workflow automation, routine tasks, much more easily that with tools I have experimented with previously. I think we have, as Andy put it &#8216;our first AI employee&#8217; and that there will be a growing range of tasks our OpenClaw agent can complete for us, that we might otherwise have hired for. I have written about the economic effects of this at scale previously - I think they are now upon us in earnest. Note the exponential below. Things will now move further and quicker than our brains are really able to comprehend.</p><p>What I see is that experimentation happening unprompted gives us as good a chance as any of keeping up with other areas of advance in AI, even as we push the frontier forward ourselves. </p><p>What I reflect on, is that most people and organisations won't be experimenting with this (or other tools) for months, the exponential will hit like a tidal wave causing huge damage, and they will still be soaked to bone, flat on their back, like a modern metaphorical Canute, staring at the wave&#8217;s aftermath and insisting it was just a stochastic parrot.</p><p>What I think is that OpenClaw shows that &#8216;the year of agents&#8217; came late in 2025, but did come (claiming it didn&#8217;t also requires you to dismiss McKinsey&#8217;s <a href="https://finance.yahoo.com/news/mckinseys-ceo-breaks-down-ai-100301404.html?guccounter=1&amp;guce_referrer=aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&amp;guce_referrer_sig=AQAAAFf0B1SzMi83cLV1Au-aYQ11WTwadsRG7vxkYGw0EKkvyAtYT1vzY1V7HCu65iFrA89Xeiz3bMXs_-HSVj0yz-fSVuMSKdSF3Xn4U8zvnj-xZq2SfCCZeVxWkd-ldDZFqcPZArBjVSpeZXTwS86NvYnFhLL_ca0CFgcPq7-lShMR">claim</a> to have deployed 25,000 agents last year, and marketing agency WPP <a href="https://www.wpp.com/en/wpp-iq/2025/04/from-ai-powered-agents-to-geospatial-intelligence">deploying</a> 28,000 in April, and JP Morgan fundamentally <a href="https://www.mckinsey.com/industries/financial-services/our-insights/jpmorgan-chases-derek-waldron-on-building-an-ai-first-bank-culture">re-wiring</a> itself for AI, and specifically Agentic AI, workflows).  And that this is just the beginning - by the time you read this, the frontier will have moved well beyond what I have described.</p><p>To sum-up, I&#8217;ll take my cue from someone else&#8217;s words again. This time, Peter Diamandis, CEO of the X Prize, <a href="https://x.com/PeterDiamandis/status/2016640065418449223?s=20">on X</a>:</p><p><em>Our brains are hardwired for linear expectations. 30 linear steps get you across the room. 30 exponential steps take you 26 times around the planet. The gap between those two numbers is where disruption happens.</em></p><p>I don&#8217;t know what exponential progress in AI feels like any more than you do &#8211; but judging by Covid, a mix of hype, fear, dystopia, and dismissal seem likely to be large parts of it. That at least, is with us now.</p><p>It&#8217;s Jan/Feb 2026, and AI is almost certainly leaving you, your country and your company behind.</p>]]></content:encoded></item><item><title><![CDATA[Conditional Love? The US & NATO after the US National Security Strategy]]></title><description><![CDATA[US Ambassador to NATO Matthew Whitaker talks at the Doha Forum]]></description><link>https://blog.cassi-ai.com/p/conditional-love-the-us-and-nato</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/conditional-love-the-us-and-nato</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Sun, 07 Dec 2025 12:26:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FocW!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31b68b2e-789b-4bf7-93d6-8a43c28b04c3_640x640.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>US Ambassador to NATO, Matthew Whitaker, spoke on Saturday (6th Dec) at the Doha Forum, 24 hours after the US published its <a href="https://www.whitehouse.gov/wp-content/uploads/2025/12/2025-National-Security-Strategy.pdf">National Security Strategy</a>, which naturally was the focus of the questions he was asked and his remarks. It was illuminating. The main thing that struck me was the unequivocal commitment to Article 5. Asked by Chatham House&#8217;s Bronwen Maddox whether NATO countries should be confident the US would respond to a Russian attack triggering Article 5, he replied:</p><p>&#8220;Absolutely. Article 5 is iron clad&#8230;&#8221; and he suggested Russia would be making a &#8220;&#8230;foolish mistake&#8230;[to]&#8230;try to challenge the NATO alliance.&#8221; Which seemed to underline the point that the US would fight too. Whitaker discussed the strength of the Eastern Flank of NATO, and how:</p><p>&#8220;<em>The United States and all NATO allies take our Article 5 commitments in the treaty very seriously. And we practice, we exercise, we have plans, and we are always looking for ways to get stronger and even increase deterrence from where we&#8217;re at today.</em>&#8221;</p><p>He argued that deterrence would hold. That the likelihood of a direct Russian attack on NATO was &#8220;very low&#8221;. Noting that &#8220;Russia is very reckless and ultimately you can&#8217;t predict what they will do.&#8221; And expressing his concern that Russia&#8217;s &#8220;&#8230;hybrid threats and activities have been well documented&#8221;. He seemed more concerned about miscalculation, than a deliberate attack and challenge to Article 5.</p><p>Contrast this with Russia&#8217;s <a href="https://www.aa.com.tr/en/europe/kremlin-says-us-national-security-strategy-document-largely-consistent-with-russias-vision/3764360">claim</a> that the vision in the US NSS is largely consistent with Russia&#8217;s worldview. We could concede this is true in its realist, and state-centric, vice more liberal internationalist, and legal view of a world regulated through international institutions. However, this should not be equated with it being a concession to Russia&#8217;s irredentism and interests, as Putin perceives them - his explicitly irridentist &#8216;<a href="http://en.kremlin.ru/events/president/news/66181">Novorossiya</a>&#8217; or desire for a free hand in, and perhaps the right to seize and directly rule, Russia&#8217;s &#8216;<a href="https://mid.ru/en/foreign_policy/fundamental_documents/1860586/">Near Abroad</a>&#8217;. Nor an acceptance of the way Putin&#8217;s Russia seeks to advance those interests and aims. Whitaker as least was clear in rejecting both.</p><p>Whitaker&#8217;s focus was the future. It was uncompromisingly transactional. He repeatedly pressed NATO countries on the need to meet their 5% commitment. He explained that at the recent NATO Summit, Spain had committed to contributing forces to NATO that he felt would likely require them to spend 5%, but they had claimed in negotiations they could provide for less. Whitaker&#8217;s sentiment was that if they could, great. But the US would expect them to spend what it took to deliver what they had promised. He believed NATO nations would all meet the 5% commitment. Whitaker&#8217;s remarks contained his assurance that:</p><p>&#8220;<em>NATO is not only strong now but that as NATO countries met their 5% commitment, it would be &#8220;&#8230;not only the strongest alliance in history to plan up, but really a dramatic force to be reckoned with.</em>&#8221;</p><p>To achieve this would require an alliance that was less dependent on US security guarantees, hence 5% GDP spending, but that the decreasing dependence would make U.S. commitment and assurance easier to continue to grant. The consequences of not doing so were implicit.</p><p>Consistent with the US requirement that allies globally share the burden of their own defence more equally, he argued that 5% spending on GDP needed to be &#8216;the worldwide standard&#8217; for US Allies.</p><p>Relatedly, Whitaker reaffirmed the strategic refocus of the US NSS: the US cannot remain the world&#8217;s rapid-response fire brigade. Allies in Europe and the Middle East should become &#8220;net security providers&#8221;, interoperable with US forces and capable of regional crisis management without automatic American intervention.</p><p>On the politically charged areas&#8212;free speech, Europe&#8217;s relative economic decline, and trade negotiations - Whitaker walked a narrow line. He pressed Europe to defend open expression, argued as the NSS does that Europe must not become an economic backwater. The US wants an economically dynamic Europe as ally, not just a place to which it is tied only by kinship bonds of family heritage, culture, sentiment, and for admiration of its history, food and wine. At the same time Whitaker declined to explain statements by Donald Trump on for example taking over Canada (which he was asked about directly) noting instead the President&#8217;s instinct for leverage in hard trade negotiations. Whitaker and the US wanted European nations whose relative power in the international system was increasing, not decreasing, consistent with its aims for greater burden sharing. Trump&#8217;s aims were reasonable, he implied, and the means needed to be seen for what they were - the most effective way to mutually beneficial ends.</p><p>On Ukraine, Whitaker framed the moment as the closest yet to a negotiated peace, mediated&#8212;formally or informally&#8212;by the United States. Peace &#8220;at any cost&#8221; is off the table; any settlement must be acceptable to Kyiv. But he underlined that shadow diplomacy is active, high-level, and continuous &#8211; and sometimes uncomfortable. Again, we need to be more concerned with the outcome than the process.</p><p>The broader vision was a NATO that is stronger because its constituent parts are stronger; a Europe that pays for and fields the force it needs to deter; and an America that leads, but no longer carries, the alliance. A shift from dependency to reciprocity. Blunt and uncompromising. Unequivocal in its commitment to Article 5 today, more equivocal on how sustainable that would be if other NATO nation&#8217;s don&#8217;t contribute to their own defence as the US expects. Conditional love. </p>]]></content:encoded></item><item><title><![CDATA[The Frontier Moves Again]]></title><description><![CDATA[AI progress above and below the surface.]]></description><link>https://blog.cassi-ai.com/p/the-frontier-moves-again</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/the-frontier-moves-again</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Wed, 19 Nov 2025 00:29:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!cZqz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;e8b3e936-8d9b-471f-a816-efa9868f1bd6&quot;,&quot;duration&quot;:null}"></div><p><em>Stacks of documents dissolving into flowing streams of symbols, then assembling into a single glowing decision-tree</em> </p><p><strong>Progress: AGI In Plain Sight</strong></p><p>This blog was half-written when Gemini 3 was released &#8211; a summary of progress since my last AI report on the frontier in late August. I have had to refocus it.</p><p>It is clear Gemini 3 is well in front of its rivals, achieving stunning (and if you&#8217;ve been reading previous blogs &#8211; alarming) progress.</p><p>Many of the claims in the first 24hrs will overshoot, and be corrected. More will be made, and upheld in the coming days. But at first glance, it looks like another big jump forward towards AGI.</p><p>Sundar Pichai, Google CEO, says it is the &#8216;best model in the world for multimodal understanding, and our most powerful agentic + vibe coding model yet. Gemini 3 can bring any idea to life, quickly grasping context and intent so you can get what you need with less prompting&#8220;.</p><p>Here are few claims that are well established.</p><p>Gemini 3 has made a huge jump to scoring 31% on the Advanced Reasoning Corpus (ARC) AGI 2 benchmark. I remind you again that ARC AGI 1 was set up to show LLMs couldn&#8217;t reason, and that should models meet the benchmark, it would be an indication they were a general intelligence.</p><p>After big progress was made on ARC AGI 1 in late December last year, ARC AGI 2 was rolled out. This was when I wrote &#8216;<a href="https://cassiai.substack.com/p/eyes-wide-shut-agi-in-plain-sight">Eyes Wide Shut: AGI in Plain Sight</a> &#8216; here on the blog. 11-months ago.</p><p>Here&#8217;s Gemini 3 on ARC AGI 1, at 87%...</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cZqz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cZqz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 424w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 848w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 1272w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cZqz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png" width="1379" height="812" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:812,&quot;width&quot;:1379,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A graph of different colored lines\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A graph of different colored lines

AI-generated content may be incorrect." title="A graph of different colored lines

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!cZqz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 424w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 848w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 1272w, https://substackcdn.com/image/fetch/$s_!cZqz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac4c65ff-e255-49b5-bc3a-863f506e4041_1379x812.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And here it is on the new harder AGI test. Errr&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PHWe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PHWe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 424w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 848w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 1272w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PHWe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png" width="1379" height="812" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:812,&quot;width&quot;:1379,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A graph of a leaderboard\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A graph of a leaderboard

AI-generated content may be incorrect." title="A graph of a leaderboard

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!PHWe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 424w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 848w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 1272w, https://substackcdn.com/image/fetch/$s_!PHWe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69871a9a-4626-40d7-8faf-97a7d1fc6a50_1379x812.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Gemini 3 is now scoring 31% on this test when given to the model alone. Let the model use tools and search the web, it gets 45%.</p><p>The team behind ARC AGI 1 and 2 are now launching ARC AGI 3, and while this will be genuinely testing intelligence in new ways, is useful, welcome and needed - if you can&#8217;t see the way we keep moving the goalposts I don&#8217;t know what more to say.</p><p>Remember Humanity&#8217;s Last Exam? Take a look at the left hand side of the Evals chart Google released on their <a href="https://blog.google/products/gemini/gemini-3/#note-from-ceo">blog</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FH8U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FH8U!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 424w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 848w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 1272w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FH8U!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FH8U!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 424w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 848w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 1272w, https://substackcdn.com/image/fetch/$s_!FH8U!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbfe1d78-e70d-47da-96cb-f4bc962c0757_3840x2160.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Humanity&#8217;s Last Exam</strong> haven&#8217;t updated their <a href="https://agi.safe.ai/">leaderboard</a> yet, but Grok 4 was previously winning with 25%. Gemini 3 is at 41%.</p><p>Concurrently, Google released <a href="https://x.com/GoogleDeepMind/status/1990827890435346787?s=20">Antigravity</a>, which it says is &#8220;an agentic development platform, evolving the Integrated Development Environment into the agent-first era&#8221;. In other words, its an environment that can further automate software engineering using Agents.</p><p>It&#8217;s getting rave reviews for what it can do &#8211; I tested it last night and in one-shot, it produced an app we were developing with an external team. I haven&#8217;t validated or verified it yet, but it looks good and seems to work. The point is not that it will be better than what the team we are paying will produce &#8211; I don&#8217;t think it will be. But it will be close, it raises the bar, and it shows, I think, the direction of travel &#8211; AI eating software, where you prompt AI to build you the tools and apps you want and need, bespoke.</p><p>And if you&#8217;re reading this sat in a large organisation, here&#8217;s something else to pay attention to: evaluation site Box <a href="https://blog.box.com/how-gemini-3-pro-box-ai-unlocks-advanced-enterprise-reasoning">tested</a> Gemini 3 vs &#8216;advanced reasoning tasks on enterprise documents, Gemini 3 Pro saw a 22% gain in performance on complex data analysis across a wide range of industries.&#8217; Here&#8217;s the high-level results:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DesX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DesX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 424w, https://substackcdn.com/image/fetch/$s_!DesX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 848w, https://substackcdn.com/image/fetch/$s_!DesX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 1272w, https://substackcdn.com/image/fetch/$s_!DesX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DesX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png" width="1380" height="792" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:792,&quot;width&quot;:1380,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A screenshot of a graph\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A screenshot of a graph

AI-generated content may be incorrect." title="A screenshot of a graph

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!DesX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 424w, https://substackcdn.com/image/fetch/$s_!DesX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 848w, https://substackcdn.com/image/fetch/$s_!DesX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 1272w, https://substackcdn.com/image/fetch/$s_!DesX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05887ca5-2a02-4a5d-a59f-99f6583f48e9_1380x792.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The progress is broken out in the industries where it was most notable. The authors at Box write:</p><ul><li><p>In <strong>Healthcare &amp; Life Sciences</strong>, Gemini 3 Pro achieved 94% accuracy, compared to just 45% for Gemini 2.5 Pro.</p></li><li><p>In <strong>Media &amp; Entertainment</strong>, Gemini 3 Pro reached 92% accuracy, a massive increase from 47% for the previous model.</p></li><li><p>In <strong>Financial Services</strong>, Gemini 3 Pro saw 60% accuracy, up from 51% for Gemini 2.5 Pro.</p></li></ul><p>The CEO of Box, Aaron Levi <a href="https://x.com/levie/status/1990820579981840746?s=20">described</a> the kind of thing they did:</p><blockquote><p>&#8220;<em>For this test, we <strong>ask the model a series of complex, real-world questions with a set of enterprise documents</strong>. The questions are meant <strong>to approximate what a person does in their daily work across various fields of knowledge work</strong>. This may include <strong>what an investor would do to analyze the financial health of a company</strong>, or <strong>what a consultant would do to build a report for a client on a complicated strategic topic</strong>.</em></p><p><em>Gemini 3 Pro represents a major leap in reasoning, math, logic, and analytical capabilities.</em></p><p><em>These gains show up as delivering highly useful improvements across financial services, law, healthcare, public sector, manufacturing, and more</em>.&#8221;</p></blockquote><p>Can you see where we are going yet?</p><p>I&#8217;m sticking to Gemini 3 here, but I don&#8217;t believe GPT-5 was evidence AI was &#8216;hitting a wall&#8217; as many have suggested &#8211; its scores on most evals were good e.g. take a look at <a href="https://lmcouncil.ai/benchmarks">SimpleBench</a>, which summaries multiple benchmark leaderboards, and you&#8217;ll find GPT-5 only exceeded on most benchmarks by Gemini 3 on its release yesterday. GPT-5 is  always there or thereabout at the top, and on quite a few still on top even post Gemini 3 launch, with some remarkable scores. My day to day use of GPT-5&#8217;s advance reasoning and research constantly amazes me. I just think people have adjusted to how remarkable AI has become, and that most aren&#8217;t routinely using the latest models, so ChatGPT was unjustifiably felt to have not lived up to the hype. But that is probably a different blog post, and it hardly feels worth detaining us with this evening &#8211; remembering that at Gemini 3&#8217;s release we are just two and half months after GPT-5&#8217;s debut. Even if you believed GPT-5 suggested a slow down in LLM progress, presumably you&#8217;d accept Gemini 3 is evidence the field has some way to go yet.</p><p>Grok 4 and Claude Sonnet 3.5 have also achieved remarkable further progress since I last wrote a more general AI update in late August, and again it won&#8217;t be possible to do this full justice here.</p><p>One remarkable development that we must draw attention to - here&#8217;s Logan Graham, who leads the Red Team at Anthropic, <a href="https://x.com/logangraham/status/1989036515985617397?s=20">reporting</a> on Anthropic&#8217;s <a href="https://x.com/AnthropicAI/status/1989033793190277618?s=20">post</a>, that <a href="https://www.anthropic.com/news/disrupting-AI-espionage">describes</a> their identifying a first <strong>AI-led espionage campaign, </strong>where 80-90% of the cyberattacks were undertaken by Claude&#8217;s Agentic AI, most likely being used by China. Logan, who is better placed than most to understand where the frontier is and how fast it is moving, wrote &#8220;My prediction from ~summer &#8216;25 was that we&#8217;d see this in &#8804;12 months. It took 3.&#8221; As we&#8217;ve noted before - we are still underestimating the rate and profundity of AI progress.</p><p>There have been scientific breakthroughs and much more since August, perhaps to be summarised in a future post, but here, having given sense of how fast we are observing the very visible process in LLMs, I want to look a little deeper, to what is less obvious - but important all the same.</p><p><strong>Progress: Beneath the Surface</strong></p><p>Just a few days ago, Deepmind <a href="https://x.com/GoogleDeepMind/status/1989024090414309622">released</a> <strong><a href="https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/">SIMA 2</a>,</strong> which Demis Hassabis, normally known for being more conservative than many industry leaders <a href="https://x.com/demishassabis/status/1989096784870928721">described</a> as:</p><p>&#8220;<em>a general agent that can understand &amp; reason about complex instructions and complete tasks in simulated game worlds, even ones it has never seen before. Incredible to see how it can learn just from self-play&#8230; <strong>a crucial step towards AGI</strong></em>.&#8221;</p><p>His cofounder Shane Legg was similarly exuberant <a href="https://x.com/ShaneLegg/status/1989012603356737799">celebrating</a> how &#8220;<em>3 years ago I started the SIMA project with the dream of using 3D games as worlds in which to train and test Gemini based game agents &#8212; all as <strong>a stepping stone towards real world AGI</strong>.</em>&#8221;</p><p>SIMA 2 <a href="https://x.com/GoogleDeepMind/status/1988986225621975207?s=20">can be</a> dropped into novel games and teach itself new skills, learn through trial-and-error, and get better the more it plays, without any human input.</p><p>What it is doing is reasoning but not in the abstract question and answer sense that Gemini usually does, but rather as an embodied avatar in a virtual world &#8211; it can describe what it &#8216;sees&#8217;, take instructions and reason its way in the pursuit of a goal. Not so much something you command as something you plot with, as if there were two of you on a mission figuring out how to succeed. It can also transfer learning from one environment, or game, to another.</p><p>Deepmind also applied SIMA 2 within Genie 3 &#8211; its model that let you generate new, persistent, real-time 3D worlds from a single image or text prompt. In these wholly novel environment SIMA 2 still succeeds &#8211; interacting with you as you explore and investigate.</p><p>As Deepmind <a href="https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/">describe</a>, that it can learn in all these environments without human input &#8216;<em>paves the way for a future where agents can learn and grow with minimal human intervention, becoming open-ended learners in embodied AI</em>&#8217;.</p><p>In other words, it takes us towards a world in which AI in robotics can learn without human input, solve long time horizon tasks, learn new skills and apply them across domains.</p><p>While there are still limitations in this new paradigm &#8211; very long time horizon reasoning and some particularly complex tasks remain beyond it &#8211; the capabilities are remarkable. That they have elicited barely a mention in the press is indicative of a continued failure to appreciate the rate and direction of progress, and the profundity of its implications.</p><p>Oh &#8211; did I mention <strong><a href="https://deepmind.google/blog/genie-3-a-new-frontier-for-world-models/">Genie 3</a></strong>? This is another model released since I last wrote about AI progress. Genie 3 can be summarised as providing limitless interactive environments from text prompts. Maybe read that again.</p><p>Google also <a href="https://x.com/GoogleResearch/status/1989464991435489287?s=20">released</a> Earth AI with a Geospatial Reasoning Agent. They show how this can be used to direct aid to where it is most needed in the event of a natural disaster &#8211; predicting the disaster&#8217;s effects and assigning resources accordingly. But it shows how similar models &#8211; Digital Twins + Reasoning Agents could equally be used in the military domain to understand the likely effects of an attack, how Governments could use such models for resilience planning, how businesses will likely be tracking supply chain security, how all sorts of organisations can and will predict the effects of events or interventions of many kinds, in many domains.</p><p>A new paper this week builds on Deepmind&#8217;s release <s>last year</s> in May 2025 (keeping the error visible &#8211; I think it shows how AI progress is so rapid it can distort your sense of time!) of <strong><a href="https://deepmind.google/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/">AlphaEvolve</a>,</strong> which constantly evolves new algorithms (variation and selection) and had previously helped improve &#8216;Google&#8217;s data centers, chip design and AI training processes &#8212; including training the large language models underlying AlphaEvolve itself&#8217;. The new paper <a href="https://www.alphaxiv.org/abs/2511.02864v1">shows</a> how the approach of &#8216;breeding&#8217; algorithms can succeed in solving maths problems, autonomously discovering novel mathematical constructions and improving best-known bounds across dozens of open problems, not just optimise algorithms - showing this across 67 problems in analysis, combinatorics, geometry and number theory.</p><p>This approach of continuously &#8216;evolving&#8217; new algorithms vs particular problems is another one of several that suggest possible paths to AGI beyond the current, dominant, LLM architectures.</p><p>SIMA 2, Genie 3 and AlphaEvolve probably won&#8217;t make much difference to what you are doing at work tomorrow or in the next few months &#8211; whereas Gemini 3 might. But their significance lies in what they augur.</p><p>AI luminary (previously Director of AI at Tesla) Andrej Karpathy <a href="https://x.com/karpathy/status/1960803117689397543?lang=en">wrote</a> back in August that we are entering &#8216;the age of environments&#8217; &#8211; so where access to text for training AI was once the key requirement, increasingly, as we build AI optimised for the real world, access to simulated environments will matter more. Genie 3&#8217;s limitless interactive environments from text prompts &#8211; is an example of what Karpathy was talking about. SIMA-2&#8217;s performance within it, shows how quickly progress is being made. AlphaEvolve shows how many tasks will be solved not by static models, but by continuously evolving intelligent solutions. All suggest that even if LLMs were &#8216;hitting a wall&#8217;, which I don&#8217;t believe is the case yet, there are many approaches to smash through it and keep those straight lines on graphs heading upward and to the right.</p><p>There are plenty of other remarkable developments in the past few months. To summarise just a few:</p><blockquote><p>&#183; Deepseek&#8217;s <a href="https://arxiv.org/abs/2510.18234">Optical Character Recognition</a> model, compresses text into a visual format before decoding it, cutting token usage by seven- to twenty-fold while preserving accuracy - showing how vision-based encoding could ease today&#8217;s long-context bottlenecks, take in a lot more text, and produce a lot more text, more accurately.</p><p>&#183; Meta&#8217;s <a href="https://x.com/justinryanio/status/1968754640826007676?s=20">AI/VR Glasses</a>, which some claim might replace the need for smartphones and smartwatches, and open the way to generative user interfaces i.e. creating the dashboards and tools you need in the air in front of you.</p><p>&#183; For self-driving sceptics, reading Andrej Karpathy&#8217;s recent <a href="https://x.com/karpathy/status/1988705360723763242?s=20">post</a> should be a corrective, showing just how far Tesla has come and how few (technological) barriers remain to the widespread deployment of self-driving systems &#8211; and it is worth noting how these <a href="https://x.com/TheHumanoidHub/status/1831734168373653714?s=20">pave the way</a> for much more effective robotic systems in general, not just self-driving cars.</p><p>&#183; Emerging evidence that <a href="https://arxiv.org/html/2509.20328v2">video models</a> may be on a trajectory towards general-purpose vision understanding, much like LLMs developed general-purpose language understanding &#8211; that video models, not trained to reason, are showing the capacity to do so. Here&#8217;s OpenAI&#8217;s Bill Peeble, Head of Sora, OpenAI&#8217;s video models, <a href="https://x.com/tsarnick/status/1777162480508440824?s=20">claiming</a> video models are on the critical path to AGI.</p></blockquote><p>But the point is not to over-emphasise these developments, but to show how many parallel paths to AGI are being pursued, even as the main path continues to take us ever closer to that milestone, and as we have often discussed: closer to the most disruptive revolution in human history.</p><p>Visit us www.cassi-ai.com</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">To receive new posts and support our work, consider subscribing.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[95% of AI Investments Generate Zero ROI]]></title><description><![CDATA[But also, 74% deliver positive ROI. Two reports, contradictory findings. What can we learn?]]></description><link>https://blog.cassi-ai.com/p/95-of-ai-investments-generate-zero</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/95-of-ai-investments-generate-zero</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 07 Nov 2025 13:20:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!CPzi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CPzi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CPzi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CPzi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:135259,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/178253558?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CPzi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!CPzi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18899528-3a78-45ef-ac1c-406028e93e13_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Artificial intelligence, gone bankrupt, eyeing empty wallet sadly. Via MidJourney</figcaption></figure></div><p>In the past week, <strong>two reports on AI adoption seem to come to strikingly different conclusions</strong>. The first, which received a lot of coverage in my corner of X/Twitter at least (relentlessly bullish on AI), was the Wharton Business School 2025 report <em><a href="https://knowledge.wharton.upenn.edu/special-report/2025-ai-adoption-report/">Accountable Acceleration: GenAI Fast-Tracks into the Enterprise</a>. </em>The second, which seems to me to have had more coverage on LinkedIn (generally more bearish on AI), was an <em><a href="https://mlq.ai/media/quarterly_decks/v0.1_State_of_AI_in_Business_2025_Report.pdf">MIT NANDA</a><strong><a href="https://mlq.ai/media/quarterly_decks/v0.1_State_of_AI_in_Business_2025_Report.pdf"><sup>[1]</sup></a></strong><a href="https://mlq.ai/media/quarterly_decks/v0.1_State_of_AI_in_Business_2025_Report.pdf"> paper The GenAI Divide: State of AI in Business 2025</a></em>.</p><p>The headlines are contradictory. Where <strong>Wharton report 74% of investments in AI deliver positive return on investment (ROI) </strong>already, <strong>MIT report 95% of organisations are getting zero return on their AI investments</strong>.</p><p>Because the Wharton report says things that are mostly in our interest at Cassi to promote, I offer only the shortest of summaries. I was more interested in what we can learn from the more negative report, hopefully ensuring we (and you, now I have decided to make what was to be an internal memo a blog post) avoid mistakes others have made.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WRqq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WRqq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 424w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 848w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 1272w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WRqq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png" width="1378" height="245" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:245,&quot;width&quot;:1378,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A blue and white text\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A blue and white text

AI-generated content may be incorrect." title="A blue and white text

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!WRqq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 424w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 848w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 1272w, https://substackcdn.com/image/fetch/$s_!WRqq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ac0ed82-1752-425a-8a43-af4b570f6e63_1378x245.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Wharton&#8217;s Bullish report - lots of charts and data, well worth a read - link in intro paragraph.</figcaption></figure></div><p>Wharton&#8217;s report tells us that:</p><ol><li><p>&#8220;<strong>Accountability is now the lens</strong>&#8221; - structured, ROI-linked metrics e.g. profitability, throughput, workforce productivity - <strong>measurable outcomes, not just adoption</strong>.</p></li><li><p>&#8220;<strong>Long-term optimism in Gen AI is increasingly strong</strong>&#8221; <strong>88% expect increased spend on AI in next 12-months, +16pp YoY; </strong>growing trend of cutting to fund:<strong> </strong>11% (+7pp YoY) reallocating funds, mostly from legacy IT + HR and Workforce Programs.</p></li><li><p><strong>74% see positive RoI already from AI investment</strong>; 80% expect it within 2-3 years; larger enterprises are less successful despite bigger budgets: 34% report &#8217;too early/neutral outcomes); Digital enterprises most successful, 88% positive; banking and professional services next 83%. <strong>Efficiency and productivity</strong> <strong>the top benefit, </strong>then quality, creativity and security.</p></li><li><p><strong>Human factors key</strong>. Deliberate change management needed to produce RoI; overcoming bottlenecks - human capital, talent, training - essential to success. [I&#8217;d add <strong>incentives </strong>but the report doesn&#8217;t say this!]</p></li></ol><p>The report also calls for Chief AI Officers, which is something I would want to reflect on further, and perhaps see more evidence of, before endorsing &#8211; I worry appointments like this can be performative, and also that it creates new internal rivalries that might not be conducive to adoption e.g. the Chief AI Officer wants to own and drive AI transformation for their own career success, but their rivals around the board table for bonuses and promotion have an incentive to either (a) own their own AI initiatives and outperform the CAIO or (b) thwart or limit the success of the CAIO&#8217;s initiatives &#8211; especially if they compete for resources from within their domain. Still, Wharton says the evidence/testimony of many Execs shows they are a good idea.</p><p>If you only read the Wharton report, you might conclude &#8216;more AI spending = higher ROI&#8217; this would be wrong, as <strong>both the MIT and Wharton report conclude. </strong></p><p><strong>Both agree that accountability, assessment vs operational objectives, and measurement vs operational outcomes, are the key to success</strong>. <strong>Funding more pilots, or scoring &#8216;adoption&#8217; alone are unlikely to generate ROI</strong>.</p><p>But the MIT report gets into the reasons for failure more deeply.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wZge!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wZge!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 424w, https://substackcdn.com/image/fetch/$s_!wZge!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 848w, https://substackcdn.com/image/fetch/$s_!wZge!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 1272w, https://substackcdn.com/image/fetch/$s_!wZge!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wZge!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png" width="1380" height="808" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:808,&quot;width&quot;:1380,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A blue background with black text\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A blue background with black text

AI-generated content may be incorrect." title="A blue background with black text

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!wZge!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 424w, https://substackcdn.com/image/fetch/$s_!wZge!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 848w, https://substackcdn.com/image/fetch/$s_!wZge!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 1272w, https://substackcdn.com/image/fetch/$s_!wZge!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bdc1fd8-5eff-4a36-a2d6-fd19c0445b52_1380x808.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">MIT report, also published this week and worth your time - link in opening paragraph.</figcaption></figure></div><p>The <strong>headline finding in MIT&#8217;s report is probably the most important thing for those funding, or potentially funding, AI investments to know. 95% of those investments fail</strong>, as assessed by MIT across &#8216;300 public implementations&#8217;, on which MIT conducted interviews, surveys and their own analysis.</p><p>But those same decision-makers should be more curious &#8211; the headline could mislead, failing to read deeper means ceding a chance to learn from others&#8217; mistakes. The report also finds that &#8220;<em><strong>5% are extracting millions in value</strong></em>&#8221; and offers a diagnosis of what works and why. If 95% of organisations use this report to stop or reduce AI investment, as their AI sceptics are emboldened, they are increasing the risk their business is eaten or defeated by the 5% that get their investment right.</p><p>There are other bearish indicators in the report, for example, so far:</p><blockquote><ul><li><p>Tech &amp; Media are the only two of nine industries seeing significant disruption.</p></li><li><p>No sign of the needed deep structural shifts associated with past general purpose technologies e.g. new market leaders, disrupted business models, or measurable changes in customer behaviour. Substitution, pilots, experimentation - not transformation - rule</p></li></ul></blockquote><p><strong>Why have 95% or AI Investments failed to Generate ROI?</strong></p><p>The report says only 5% of custom enterprise AI tools reach production. We suggest the Valley of Death, and innovation theatre are likely major causes &#8211; more so than product or pilot failure.</p><p>There is a lot of incentive for organisations, and the individuals within them, to experiment with AI. It is career enhancing to show yourself to be forward thinking, and have some stories for your boss, board and customers. But the unpopularity of disruption, which always creates winners and losers is likely a major factor in killing off so many AI investments at the pilot stage. In contrast, showing the courage to &#8216;fail fast&#8217; and kill of your own initiatives, or even more so those of others, can be career enhancing. The incentives in many enterprises are pretty terrible. As the report says, <strong>the fact that only 5% of custom enterprise tools make it from pilot to production &#8216;explains why most organisations remain on the wrong side of the divide.&#8217;</strong></p><p>Perhaps supporting our analysis, the report notes that larger enterprises launch more pilots but have the lowest pilot to production ratios. Larger organisations are harder to align incentives within, have more competing fiefdoms, and often require much larger and more disruptive change management programmes since there are so many more interconnected and interdependent workflows.</p><p>Another reason for limited ROI is that 40% of the 5% of GenAI tools that make it from pilot to production are chatbots, and the inability of chatbots to learn means they add little value, and cause limited disruption. Like the pilots themselves, they sit on the periphery of staff workflows.</p><p>Pilots fail for more predictable reasons - the unwillingness of users to adopt new tools, and concerns around trusting model output.</p><p>Enterprise CIOs should also consider the more neutral findings that to date, <strong>GenAI adoption has primarily enhanced individual productivity, not P&amp;L performance</strong>. Why might that be?</p><p><strong>Secret Cyborgs. </strong>One reason for this is likely that employees are not passing on their gains in productivity to the business, or at least not in measurable ways. We suggest that in most organisations, there is no shortage of work, or at least &#8216;make-work&#8217; in the form of meetings and coordination costs and emails to reply to - many, perhaps most, are overwhelmed, not under-employed. Employees are likely reinvesting time saved in things they might not otherwise have got to &#8211; but of course often that just generates more internal work for everyone else.</p><p>Furthermore, there is a significant shadow AI economy &#8211; comprised of those Ethan Mollick at Wharton called &#8216;<a href="https://www.oneusefulthing.org/p/detecting-the-secret-cyborgs">secret cyborgs</a>&#8217; &#8211; that is thriving: 90% of employees use GenAI for work - many using external models. We suggest a primary reason for this is model-lag: by the time enterprises adopt a model, it is usually out of date, and far better models are available via personal subscription to ChatGPT, Gemini, Claude. Second, Co-Pilot, in particular, is just not as good as external tools (in my view), yet seems to be what most are using at work and basing their opinions of GenAI on. Supporting this, the report notes a generalised distrust of internal tools even from avid GenAI users.</p><p>MIT note that forward thinking organisations turn Secret Cyborgs into advantage - they don&#8217;t punish but learn from shadow usage, surveying, understanding and adapting by procuring enterprise alternatives.</p><p><strong>Speed &amp; Crossing the Valley of Death. </strong>Success in AI investments is more frequent in mid-market companies, which generally invest less, and have fewer pilots, but are much faster, running at an average of 90-days from pilot to production, and take a greater proportion of their pilots through to production</p><p>In contrast enterprises take 9-months or greater. Nine months vs current rates of AI progress is far too slow, unless the tool is being continuously updated and upgraded as new models drop. If staff find your tools inadequate and have to hide what they are doing, it is unlikely you&#8217;d see gains on your P&amp;L from their use of AI &#8211; their incentive is to hide the gains, along with the use.</p><p>What can we learn that is more positive from the MIT report?</p><p><strong>Reassurance?</strong></p><p><strong>AI is not (yet) coming for your job. </strong>Notwithstanding all we have written on this blog on the likely impact of AGI on employment, MIT find that for now, AI adoption has not led to workforce reduction. Displacement, per a wider MIT analysis cited in the report, is likely, but will be graduated through discrete displacement vice huge lay-offs. If we get to AGI on current timelines, I don&#8217;t think this is correct, but still, for now, we can all take some reassurance from the data.</p><p><strong>How Buyers that Achieve ROI Succeed</strong></p><p><strong>Strategic Partnerships with External Vendors Double the Chance of Success</strong>. </p><p><strong>&#8220;External partnerships see twice the success rate of internal builds&#8221; Employee usage rates are also twice as high for externally built tools vs those built internally</strong>. This success rate is particularly notable since internal build dominates within the sample case studies, but is rarely successful unless in partnership.</p><p>Buyers who succeed demand process-specific customisation and evaluate tools on business outcomes rather than software benchmarks. They expect systems that integrate with existing processes and improve over time.</p><p>The most effective AI-buying organisations don&#8217;t wait for perfect use cases or central approval. They drive adoption through:</p><p>1. Distributed Experimentation.</p><p>2. Vendor Partnerships / Strategic Partnerships (x2 likelihood of success) &#8211; &#8220;procure external tools, co-develop with vendors&#8221;.</p><p>3. Benchmarked tools on operational outcomes.</p><p>4. Partnered through early stage failures.</p><p>5. Sourced initiatives from frontline managers (vice top-down or central lab generated) &#8211; but paired bottom up sourcing with senior exec accountability. This likely accelerates adoption through operational fit and career incentives for adoption.</p><p>6. Decentralised authority and clear ownership.</p><p>7. Clear Accountability.</p><p>8. Harnessing power users to the cause.</p><p>9. Likewise individuals and managers on the customer side were critical to success.</p><p>Highest ROI for now comes from substitution - from eliminating previous business process outsourcing, notably in <strong>document processing costs</strong>, and ~30% <strong>reducing external agency use and consulting spend</strong>. ROI was highest in back-office functions. Financial services benefited <strong>cutting ~$1M on previous out-sourced risk management</strong>. Successful buyers also saw improved customer retention and sales conversion, through things like automated outreach and follow-up. That ROI did not come from AI transformation, is perhaps because very few organisations seem to be undertaking it yet.</p><p><strong>How winning vendors behave</strong></p><p>Trust in the vendor is the most important factor for executives awarding contracts.</p><p>Vendor trust isn&#8217;t always good news for start-ups like ours &#8211; many enterprises prefer to wait for existing vendors to offer the new tools than take a risk on evaluating &#8216;emerging&#8217; vendors. Plus, the report notes, existing Business Process Outsourcing suppliers have the advantage of already understanding customer workflows. Consequently:</p><p>&#8220;Product quality alone is rarely sufficient. Referrals, prior relationships, and VC introductions remain stronger predictors of enterprise adoption than functionality or feature set.&#8221; </p><p>Vendor trust is the key priority, but deep understanding of customer workflows is the second highest, requiring domain expertise and a willingness to learn.</p><p>Relatedly, a third key priority is that AI products succeed when they integrate with minimal disruption to current tools following (people don&#8217;t want to learn a new UI/tool, no matter how good). Vendors succeed because they customise deeply and embed themselves in workflows, adapt to context, and scale from narrow but high-value footholds.</p><p>Tools that succeed therefore have <strong>low configuration burden </strong>and immediate, visible value. Tools requiring extensive input tended to stall at the pilot phase. Tools with complex internal logic, opaque decision support, or that are &#8216;optimised based on proprietary heuristics&#8217; tended to fail.</p><p>Agentic AI adoption is very nascent. Too early to assess. The report argues this could lead to a more fundamental shift and much higher ROI. But also, MIT NANDA is a community for those building AI Agents, so while I think they are partly right (there will be much higher ROI for GenAI too, Agentic implementation a complement, and the revenues from each as inextricable as the two technologies), we should allow for some bias in their judgement. </p><h2><strong>So What?</strong></h2><p>Both reports are right in their own way. MIT&#8217;s exclusively looks back whereas Wharton&#8217;s is more forward looking (asking &#8216;what do you expect?&#8217;, as well as &#8216;what has happened?&#8217;). They agree on what is needed for investments to succeed. They agree that ROI is nascent, and organisations are moving from pilot to production, from adoption metrics to proving value vs operational outcomes. </p><p>In the end, they measure the same things in different ways. Wharton captures self-report data, likely sweeping up gains the MIT report misses, and allowing for promising minimally ROI-positive implementations that are &#8216;felt&#8217; to have delivered internal ROI that doesn&#8217;t necessarily show-up on the top or bottom line, and/or that are expected to grow, or are growing, to be weighted for future impact. MIT captures <em>verified transformation and financial impact </em>and is the more rigorous. Both are right within their frames&#8212;both track experience, one weights expectations more heavily, the other current execution.</p><p>Both offer caution - and useful lessons.</p><p>Neither really address the medium to longer term on AI that this blog usually focuses on - the forecasts on the arrival of AGI/ASI being most likely, dependent on the preferred definition, 2026-2033. But for leaders betting on AI, they are useful guides as to how to get your organisation AGI ready - how and where to start your transformation.</p><div><hr></div><p>If your intent is to optimise for success applying the latest in decision science to optimally allocate resources, minimise risk and maximise opportunities, you need us. Find us at www.cassi-ai.com<br></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">To receive new posts and/or support our work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><p><sup>[1]</sup> NANDA: Networked AI Agents in Decentralized Architecture &#8211; an MIT project, a mix of technical protocols and community of interest <a href="https://www.linkedin.com/pulse/nanda-internet-ai-agents-ramesh-raskar-211ve/">https://www.linkedin.com/pulse/nanda-internet-ai-agents-ramesh-raskar-211ve/</a> seeking to &#8220;&#8230;explor[e] how artificial intelligence can evolve into a truly democratic and distributed ecosystem.&#8221; By &#8220;&#8230;address[ing] critical architectural components such as privacy, incentives, orchestration, or user accessibility&#8230; &#8230;designing a framework where billions of agents can collaborate autonomously while preserving privacy, scalability and innovation at the edges.&#8221; It is perhaps unsurprising the report concludes, in part, that the real ROI will come from the deployment of Agents &#8211; an assertion I think is partly true, probably sincerely held, but clearly a conclusion NANDA was always likely to arrive at.</p>]]></content:encoded></item><item><title><![CDATA[Never tell me the Odds: Is your CEO Han Solo?]]></title><description><![CDATA[Probabilistic strategy, tactics, decisions require a mindset shift: stop hiding from the odds; start moving them.]]></description><link>https://blog.cassi-ai.com/p/never-tell-me-the-odds-is-your-ceo</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/never-tell-me-the-odds-is-your-ceo</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Tue, 07 Oct 2025 05:31:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RBI3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RBI3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RBI3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RBI3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112793,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/174541702?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RBI3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!RBI3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18fd4ca1-099d-40ec-9bed-ebdd4f54c9d0_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Does heroism require the denial of reality? Can you inspire, while being honest about the low odds of success? Can you succeed, when the odds are against you?</p><p>&#8216;The Empire Strikes Back&#8217;. Han Solo piloting the <em>Millennium Falcon</em>. Under attack and outgunned by the Empire&#8217;s TIE-fighters, Han is warned by Princess Leia of a hazard ahead:</p><p>&#8216;<em>Asteroids&#8217; </em>she shouts, alarmed.</p><p>To the horror of his companions Han Solo spots an opportunity to lose those attacking him&#8230;</p><div id="youtube2-EuPN1oDC1RA" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;EuPN1oDC1RA&quot;,&quot;startTime&quot;:&quot;102&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/EuPN1oDC1RA?start=102&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>But Sir, C-3PO opines, <em>the possibility of successfully navigating an Asteroid field is approximately 3720-to-1</em>.</p><p>Han Solo, eyes fixed determinedly ahead, growls <em>Never tell me the odds.</em></p><p>Why? One view, is that knowing the odds would undermine his confidence making it less likely he&#8217;d succeed. It would be self-fulfilling.</p><p>One CEO recently worried that deploying Cassi, and seeing the odds of success, might &#8220;crystalise the doubts of his management team&#8221;, reducing his company&#8217;s chance of success.<a href="applewebdata://233AEE7C-A9DD-4FEF-A287-8F93AE66AD5C#_edn1"><sup>[i]</sup></a> It is not the first time we&#8217;ve heard the Han Solo objection, but it was put with impressive clarity and inspired us to explore the issue for a wider audience.</p><p><strong>Defining the &#8220;Han Solo&#8221; Objection</strong></p><p>In high-commitment ventures from start-ups to sports-fields, statecraft to Star Wars &#8216;<em>never tell me the odds</em>&#8217; is an attractive one-liner, and superficially, seems to capture something important.</p><p>In analytical form the claim made in the the Han Solo objection can be described as:</p><ol><li><p>The probability of success, viewed collectively - from my team, my company (etc), may be low.</p></li><li><p>If I confront that probability, my confidence, and/or my team&#8217;s, will fall.</p></li><li><p>Therefore I am better off not knowing the odds, because ignorance preserves the confidence required to achieve my desired outcome.</p></li></ol><p>The romance is obvious - the triumph of human courage over maths, people overcoming the odds - and that has its own appeal. But the logic does not hold. The quote misreads the scene, and the objection does not stand scrutiny. Three failures follow.</p><p>First, information-avoidance can sometimes temporarily preserve motivation, but unless you are about to fly into an asteroid field - and perhaps, as we will see Han shows - even then - it is a leadership failure. Han Solo doesn&#8217;t have time to fully convince his comrades their odds are wrong, but you do.</p><p>Second, confidence bolsters a sound plan; it does not substitute for one. The argument is clearest at the extreme: even if you are galvanising your people towards a heroic sacrificial defeat that achieves nothing - you are still convincing them that their least-bad outcome at that point is to die gloriously and be remembered well. You are not deceiving them by hiding the odds.</p><p>Third and finally, sometimes the Han Solo CEO is right, and the crew are wrong. He has access to information they don&#8217;t, and should act on it, convincing them as they go.</p><p><strong>1. Information Avoidance is Leadership Failure</strong></p><p>A leader&#8217;s job in motivating others is surely to convince their people that their plan has a sufficient chance of succeeding to be worth pursuing. They should usually be able to explain the strategy, tactic or decision sufficiently well to convince their team it has sufficient odds of succeeding to be worth it.</p><p>Han Solo communicates his calculation of the odds through his confidence - the Asteroid field is a better bet than continuing to fight out-numbered and out-gunned. He says as much:</p><p>Leia:<em> What are you doing? You&#8217;re not actually going into an Asteroid field?</em></p><p>Han: &#8216;<em>they&#8217;d have to be crazy to follow me in there&#8217;.</em></p><p>Sure, the odds of survival aren&#8217;t great in the Asteroid field, but Han Solo knows the base rate C-3PO quotes &#8220;<em>3720 to 1&#8221; </em>doesn&#8217;t account for Han&#8217;s skill and experience, nor the worse odds of fighting in open Space - this is the best decision with highest expected value under the circumstances. His companions are alarmed, they check his reasoning &#8216;<em>if you are doing this to impress me&#8230;&#8217; </em>but they don&#8217;t dissent.</p><p>Your job as CEO isn&#8217;t to suppress the odds, but to convince people your calculation of them is right, with reasons.</p><p>Furthermore, you must remember that odds are not a constant; they change as the facts change, they respond to action. Each step that removes a risk, reveals information or compounds an edge, changes the number.</p><p>In a race for survival through an asteroid field, every dodge of a space rock that takes Han and friends closer to the far side shortens the exposure window; conditional on survival, the forecast can legitimately improve. A 20% forecast of survival on entering, and an 80% forecast later are consistent. Ideally, every decision would be taken knowing which most increased the odds of survival.</p><p>The point generalises: when we founded Cassi, the base rate - 80-90% of start-ups fail - was C-3PO sounding the alarm. Our odds of success started from 10-20%. Accounting for what we know of ourselves, what we wanted to do, and the need for it, we raised those, with our median Founder forecasts of the company surviving for 5-years hovering around the 50% mark (vs more ambitious goals, like achieving &#163;15M revenue in FY26, where odds were lower). As time has gone on - winning customers, gaining traction, deploying the platform, those odds have consistently risen.</p><p><strong>2. Confidence is no Substitute for a Sound Plan</strong></p><p>In most fields, where leaders have a different assessment to their team they can communicate it with more than just their own confidence. They have time. They can explain with reasons, evidence, their own judgement of the odds.</p><p>If this doesn&#8217;t work, the CEO might change their mind (or not - see next section). After all, those people around the Board table are there because their expertise and judgement in particular domains is judged to be of sufficient calibre and calibration to make their advice valuable to the CEO, who can&#8217;t know everything. This is particularly true where they forecast that some particular factor in their domain is highly influential on the strategic outcome. <br><br>Moreover, since we know that most human forecasts are no more accurate than chance &#8211; we&#8217;re usually better off deferring to the median crowd forecast, doings so over trusting our own gut, or relying on a single forecast from another - even if they are an expert in the area.</p><p>But you would still want to understand that expert view. For example you might weight a forecast due to expertise where:</p><ul><li><p>the HR Director and the Board are predicting that the probability of their being able to hire thirty world-leading ML Researchers by end FY was 10%; and</p></li><li><p>hiring thirty world-leading ML researchers is the difference between a 60% probability of the company achieving its strategic objective; and </p></li><li><p>failing in recruitment reduces the chance of achieving that strategic objective to below 50%&#8230;</p></li></ul><p>&#8230;so strategic failure is then shown to be more likely than not. At the very least the CEO should be interested in:</p><p>(a) why the probability of hiring the talent needed is so low,</p><p>(b) why the collective view is that this is so influential on the desired strategic objective and</p><p>(c) what might be done to alter that forecast - how could resources be differently assigned, or what actions taken, that would optimise for success?</p><p>The result of this should be either the CEO updates their forecasts, the HR Director and Board update theirs, or the gap remains. In either case - each should try to persuade the other of their judgement, to figure out what would change the view of one or other or both, and focus effort and resources on those factors that most move their odds. Hiding the odds is no substitute for optimising them, and likely to have the opposite effect than the one intended.</p><p><strong>3. Sometimes the Han Solo CEO is Right</strong></p><p>The CEO might see the odds from the collective intelligence of their team - the Board, the company - and disagree. If this is the case, they must have the courage to maintain their beliefs independently. The disagreement of others is not per se evidence to change their own beliefs.</p><p>In fact, a CEO or equivalent should, at key moments, *expect* to disagree with the crowd. Whether we think of that in terms of principled forecasting or informal disagreement around the table or during an online meeting, we should expect that the CEO will regularly disagree with the crowd, and be correct to do so.</p><p>If this was not the case, then organisations would not need CEOs, and decisions and courses of action would tend to be obvious.</p><p>Choosing the moments for this disagreement is in itself a key component of CEO prediction. Every major decision contains two forecasts: one about the external environment, and one about the organisation&#8217;s ability to shape that environment. The CEO&#8217;s must judge when their internal forecast justifies overriding the external and internal consensus &#8211; and the best way to do this, is to have a clear knowledge of their forecasting performance &#8211; do they consistently outperform others? Is there a reason this time to believe they might be right when others think they are wrong?</p><p>A slightly imperfect analogy is in investment. It is recognised that most people should just invest in tracker funds. Tracker funds are essentially the crowd wisdom of the investment community. But a CEO should be the equivalent of a trader who regularly beats a tracker forward by taking actions of whatever sort, equivalent to investments, that beat the tracker.</p><p>The analogy is imperfect only in mechanics. In both cases, someone is making probabilistic bets against the collective forecast. In finance, the &#8216;bet&#8217; is a literal trade; in strategy, it&#8217;s an organisational action. A tracker fund is the best collective intelligence forecast; fund managers who beat that are super-forecasters beating the crowd average. CEOs, like super-forecasters, should also know when to go with the collective forecast, and when to not forecast at all (because they have no relevant insight or information, and they would just be praying for luck).</p><p>CEOs are selected to be capable and well-informed individuals, operating in areas they have much relevant experience. They have the ability to make forecasts, and to shape the world to meet those forecasts in certain areas. When a CEO chooses a path, they are predicting both the external and internal environment and their own ability to shape it.</p><p>They can and should beat the crowd: that is their arbitrage against the competition, and the rest of the world.</p><p>We must be clear here: we are not saying &#8220;trust the hero; to hell with the odds&#8221;. In Star Wars, we can see enough in the scene to suggest Han Solo was taking a calibrated risk, broadly aware of the baseline. But his throwaway line &#8220;Don&#8217;t tell me the odds&#8221; is dangerous. Just taking a wild punt, and getting lucky is likely to lead to catastrophe in time. A CEO should employ calibrated contrarianism &#8211; and that calibration means they *must* know the odds &#8211; they need to know how often they are right, and how right they are (e.g. do things they forecast at 80% confidence happen 80% of the time). They need to know the odds their team is putting on things, they should seek out the best possible odds. The contrarianism should be legible &#8211; Peter Theil&#8217;s question makes the point:</p><p>&#8220;What important truth do very few people agree with you on?&#8221;</p><p>Cassi would make you go further: why do you believe in this truth? What would change your mind? How likely do you think it is you will be proved right?</p><p>Over time, Cassi would help you improve your judgement, and enable you to learn how much you should trust your &#8216;instincts&#8217; over the crowd. For the vast majority of humans &#8211; including CEOs, the median of the crowd forecast &#8211; or the median of a group of superforecasters - will give the best odds. At Cassi, we are showing that AI forecasting can often match or exceed the crowd &#8211; the CEO would be wise to test their contrarianism and refine it. Wild bets are not leadership, &#8220;don&#8217;t tell me the odds&#8221; is sexy but negligent.</p><p>Governance of the CEO &#8211; the Board, the market - will find the collective nature of the crowd forecast very helpful in calibrating the perceived riskiness of the CEO&#8217;s strategy, checking that heroic has not devolved into delusional, and, not least, the degree of reward the CEO should get for beating the collective baseline. Using an internal forecasting platform or prediction market allows the direct comparison of the CEO with crowd consensus over time.</p><p>We think organisations will go further - hiring and promoting partly on a proper scoring system that identifies:</p><ul><li><p>who is the most effective in identifying the most important factors in achieving success;</p></li><li><p>who is the most accurate and well calibrated in predicting the odds of achieving them;</p></li></ul><p>A good CEO might be such a person, consistently outperforming the &#8216;crowd&#8217; that constitutes their rivals for the job.</p><p>Doing that takes judgement:</p><blockquote><p>&#183; knowing the odds of your achieving your desired outcome,</p><p>&#183; knowing which factors will most influence your probability of achieving your outcome</p><p>&#183; understanding where your odds, and the factors that you think matter, differ from those of your team, the unbiased external view, the analysis of AI and its forecasts.</p><p>&#183; Knowing when and how much (if) to trust your judgement over that of your team, your crowd, your AI.</p></blockquote><p>Your CEO should copy the wider lesson in the Han Solo scene, and ignore the misleading one-liner: you must know the odds. With Cassi, you know both the number, and what you most need to do to succeed. You can optimise your odds. At that point, as Han later said &#8220;<em>You&#8217;re all clear, kid. Now let&#8217;s blow this thing and go home!</em>&#8221;</p><p>Failing that, head over to our website, www.cassi-ai.com - currently in the early stages of a makeover - and learn a little more about what we do.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">To receive new posts direct to your mailbox, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div><hr></div><p><a href="applewebdata://233AEE7C-A9DD-4FEF-A287-8F93AE66AD5C#_ednref1"><sup>[i]</sup></a> Conversely, at a recent Conference, &#8220;Katie&#8221; made the argument that surfacing the odds, and the factors that most influence them, might consistently <em>improve </em>the odds. This is not just Cassi&#8217;s argument for more efficient management, but rather one suggesting that the concentrated shared consciousness such a method creates allows people to collective visualise an outcome &#8211; and perhaps this &#8216;mind share&#8217; will move the odds &#8211; &#8216;manifesting&#8217; the outcome as it were - beyond just what would be expected from more efficient resource allocation. We won&#8217;t explore this further &#8211; but I did say I would mention it on the blog as a counter-point to the view that surfacing the odds and factors would crystallise doubts!</p>]]></content:encoded></item><item><title><![CDATA[Prediction Centric Warfare]]></title><description><![CDATA[Notes and slides from my talk at DSEI yesterday.]]></description><link>https://blog.cassi-ai.com/p/prediction-centric-warfare</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/prediction-centric-warfare</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Wed, 10 Sep 2025 07:39:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mfNy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mfNy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mfNy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 424w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 848w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 1272w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mfNy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:481155,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/173248744?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mfNy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 424w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 848w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 1272w, https://substackcdn.com/image/fetch/$s_!mfNy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b86a0bc-0a63-438b-bdb8-26453c1a77d0_2400x1600.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h1>Context</h1><p>Yesterday, at DSEI, I spoke on the Cyber and Specialist Operations Command Stage, alongside former colleague from Fujitsu and new Head of Fujitsu UK Defence and National Security Business Group Alexandra Bailey (congrats Alexandra!), and the MOD&#8217;s Chief Data and Artificial Intelligence Officer (CDAIO), Caroline Bellamy.</p><p>My comments on &#8216;prediction centric warfare&#8217; opened the panel. Below are my remarks and slides, edited lightly for publication. They are my remarks alone, and I do not mean to imply any endorsement from Caroline or Alexandra.</p><h1>Prediction Centric Warfare</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Lpet!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Lpet!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Lpet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Lpet!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!Lpet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9aa01fa-79d8-4d75-bbd8-9fb40ddd0714_482x281.emf 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Information Technology transformation is decision transformation. We adopt information technologies in order to collect, process, protect, store and move information to enable better decisions to be made.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gTeq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gTeq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gTeq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gTeq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!gTeq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41054cfe-ce52-4952-b577-5d592c38ca96_482x281.emf 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Digital Transformation is the means by which such information is moved around.</p><p>Digital Transformation then, is decision transformation. It should be assessed on outcomes &#8211; clear metrics as to whether it has, in fact, enabled better decisions &#8211; those that are more often right, or less wrong, and faster, than before any given technology is deployed.</p><p>At Fujitsu, where I was Managing Director. of the Centre for Cognitive Technologies, we focused on revolutionising decision-making because decisions are where cognition manifests.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RQfw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RQfw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RQfw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RQfw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RQfw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32b3941a-b772-49dc-b0f8-fc623ca6fd5a_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And decisions, as I argued on the Cassi blog last week, are predictions.<a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftn1"><sup>[1]</sup></a></p><p>Indeed everything is.</p><p>A decision is a conditional forecast: <em>if we do X, we expect Y</em>.</p><p>An action is a bet on a payoff. <em>I do X, expecting Y.</em></p><p>Tactics, operational design and strategy are predictions at different scales and time horizons. <em>If we do these things, we expect this to be the outcome</em>.<a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftn2"><sup>[2]</sup></a></p><p>When prediction quality improves, outcomes improve. When it degrades, we pay in blood, treasure and time.</p><p>We live in the exponential age &#8211; where technological change is continuously accelerating &#8211; creating profound disruption.</p><p>The UK&#8217;s Strategic Defence Review tells us we live in age of uncertainty.</p><p>Uncertainty and change can only be bounded and managed with prediction</p><p>The science of prediction &#8211; human and algorithmic, has never been more important.</p><p>Here we will discuss seven things:</p><p>1. How Prediction wins Wars.</p><p>2. The necessity of prediction to victory, despite its difficulty.</p><p>3. How Operating Concepts are themselves a prediction, and why this matters.</p><p>4. How &#8216;<em>prediction centric warfare&#8217;</em> should be the UK&#8217;s New Operating Concept and Theory of Winning, enabled by the Digital Targeting Web.</p><p>5. A brief overview of the science of prediction &#8211; human and algorithmic.</p><p>6. The Strategic Defence Review&#8217;s predictions for the Digital Targeting Web,</p><p>7. How Programmatic Prediction would significantly increase the chances of the Digital Targeting Web succeeding in achieving the SDRs aims.</p><h2><strong>How Prediction wins Wars.</strong></h2><p>If Clausewitz is right and war is politics by other means, then violence is a tool of persuasion and coercion. Its only purpose Schelling tells us, outside sport or revenge, is to influence behaviour: to coerce choices.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6wHx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6wHx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6wHx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6wHx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!6wHx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba582d9-b79c-4164-afd8-331f16f987bc_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Everything we do in conflict therefore aims to shape an adversary&#8217;s decisions.</p><p>Since decisions are predictions, the task is to predict what the adversary will decide in response to our actions.<a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftn3"><sup>[3]</sup></a></p><p>We aim to influence an adversary&#8217;s forecast, so that the path we prefer dominates their expected value calculation.</p><p>Once you accept that, the centre of gravity is cognitive. Fires, manoeuvre, sanctions and narratives are instruments; the target is the decision process of people.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!naDc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!naDc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!naDc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!naDc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!naDc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!naDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!naDc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!naDc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!naDc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!naDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fc9f0e-483e-44cb-826b-ab5c86df069a_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Since decisions are predictions, the task is to predict what they will predict in response to what we do&#8212;and to set the conditions so that our preferred option is the least&#8209;bad option they can see.</p><p>The nuclear extremity makes the point.</p><p>The US dropped nuclear bombs on Hiroshima and Nagasaki predicting they would make peace more likely, and so a devastating and costly ground invasion less likely.</p><p>The decisive effect was not the blast pattern but the expectation it created in Japan&#8217;s senior leadership&#8212;the Imperial Conference, and ultimately the Emperor.</p><p>The prediction they were forced to make&#8230;</p><ul><li><p> more bombs could follow; </p></li></ul><ul><li><p>American resolve was unshakeable; </p></li></ul><ul><li><p>the Soviet entry had closed the last exits;</p><p></p><p>&#8230;collapsed the case for &#8220;fight on.&#8221;</p></li></ul><p>Our preferred option became their least bad outcome.</p><p>The logic generalises. Whether we are driving soldiers off a hill, coaxing insurgents to a ceasefire or coercing governments towards an unconditional surrender, we are in the business of changing behaviour by changing expectations. It&#8217;s just as true in grey zone conflict and deterrence.</p><h2><strong>The necessity of prediction to victory, despite its difficulty</strong></h2><p>We fight over forecasts more than over map squares.</p><p>That is uncomfortable, because modelling human decision&#8209;making under fire is damnably hard.</p><p>But discomfort does not free us from the obligation.</p><p>We cannot not predict.</p><p>We either model and influence those decisions - or we leave outcomes to inadequate heuristics.</p><h2><strong>How Operating Concepts are themselves a prediction, and why this matters.</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BcPq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BcPq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BcPq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BcPq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!BcPq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb254e5d7-659a-4a77-a319-bb14a865f0dd_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The exponential pace of technological change is such that we can no longer afford to follow technological trends.</p><p>Indeed the exponential of Moore&#8217;s Law means this has been true since the 1970s at least, and as I have argued elsewhere, perhaps it has been true since the First World War.</p><p>Prior to this, for 1700 years of human history &#8211; perhaps for its entirety &#8211; weapons came first, and then we figured out the tactics. We predicted that a tool, or an invention might have a military use, and then worked out the most effective tactics by predicting, getting it wrong, and predicting again, until we got there.</p><p>Certainly by the 1970s this was no longer a viable way to proceed. The risk was that the technology you were iterating towards tactical usefulness would be obsolete and outdated before you figured out how to use it. The opportunity was that now, technological and scientific progress were rapid enough that you could predict what technologies you needed to win and set about building them.</p><p>In the Cold War the United States applied this logic with DARPA&#8217;s ASSAULT BREAKER, which became Air-Land Battle doctrine: define the theory of victory first&#8212;how to break the Soviet second echelons&#8212;and then assemble the sensors, networks and shooters to make it real. The strategy came first; the technology followed.</p><p>That is the spirit of the Strategic Defence Review&#8217;s Digital Targeting Web. Call it the &#8216;kill web&#8217; if you prefer the extant literature; the point is not IT modernisation but a single, integrated force design built backwards from how we win.</p><p>In the literature, a kill-web is a theory of victory &#8211; the central component in DARPA&#8217;s <a href="https://www.darpa.mil/news/mosaic-warfare">Mosaic Warfare</a> concept. Where, as its Director said &#8220;<em>The idea will be to send so many weapon and sensor platforms at the enemy that its forces are overwhelmed. The goal is to take complexity and to turn that into an asymmetric advantage.</em>&#8221; Presented <a href="https://www.youtube.com/watch?v=33VAnIEjDgk&amp;t=331s">here</a> in 2018. Operational Concepts are prediction too &#8211; of what tactics and operational approach will win, of what technologies we will need.</p><h2><strong>How &#8216;</strong><em><strong>prediction centric warfare&#8217;</strong></em><strong> should be the UK&#8217;s New Operating Concept and Theory of Winning, enabled by the Digital Targeting Web.</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mHV7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mHV7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mHV7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mHV7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!mHV7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb7a351-c847-48fd-a091-2c8c9c436c3d_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I argue the Digital Targeting Web must be the central component in our theory of victory.</p><p>It must be more than just a digital network connecting every sensor to every shooter.</p><p>More than a platform to coordinate across domains.</p><p>More than a modern extension of Boyd&#8217;s OODA loop and Brose&#8217;s Kill Chain.</p><p>The aim is not just faster kills or more options. These are the means, not the ends.</p><p>It aim is to predict faster and more accurately which combination of weapons and tactics will make our preferred option our adversary&#8217;s least bad action or outcome.</p><p>The Digital Targeting Web must enable <em>Prediction Centric Warfare</em>.</p><h2><strong>A brief overview of the science of prediction &#8211; human and algorithmic.</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RBlv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RBlv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RBlv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RBlv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RBlv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ee7a1ad-b561-4896-aacd-e12d954cfbe7_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If prediction sits at the heart of victory, we must use the best methods we have.</p><p>Most humans are unreliable forecasters. Philip Tetlock&#8217;s work showed that expert judgement in geopolitics and economics often performs no better than chance &#8211; the median forecast of a crowd is more reliable. Some, whom he calls superforecasters, consistently do better through disciplined probabilistic methods.</p><p>Today, AI systems can match and in places exceed that standard. At Cassi we&#8217;ve shown materially higher accuracy than human baselines on meaningful questions from &#8216;Will Russia control Pokrovsk in X months&#8217; time?&#8217; to &#8216;Will the UK have to conduct a non-combatant evacuation operation in Somalia before end FY, and if they do, will it succeed?&#8217;, and&#8212;more importantly&#8212;shown that AI can identify the variables that move outcomes.</p><p>The practical lesson is straightforward. Define the outcome in measurable - we would say resolvable (how would you know you&#8217;d succeeded?) - terms; set resolution criteria; then work back to the decisions and data you need.</p><p>Build the web to do two things: predict what the adversary will do next, and predict what we must do to shift that next move in our favour&#8212;at tactical, operational and strategic levels.</p><p>Factorise, and Forecast should be the new discipline &#8211; both human and algorithmic.</p><p>Base rates beat bravado. Score your people, and compare their performance to AI. No estimate nor planned action from the Kill Web goes to the commander without a confidence-defined forecast<a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftn4"><sup>[4]</sup></a>, no decision is delegated to an algorithm until it demonstrably out-performs your people.</p><p>Ask yourself: when did you last see a plan fail because people were too explicit about their assumptions and updated them too quickly?</p><p>Quite.</p><p>Plans fail because we predict poorly, hide our confidence levels, and refuse to revise in time. We must predict, publish, and adapt.</p><h2><strong>The Strategic Defence Review&#8217;s predictions for the Digital Targeting Web</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RtAX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RtAX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c95df600-7490-4866-993f-6c164dd6f223_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RtAX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Strategic Defence Review tells us we are living in an age of uncertainty. Surely then, it is now more important than ever that we bring the science of prediction, to bound and manage that uncertainty as best we are able?</p><p>On the Kill Web, the SDR says that <em>&#8220;the targeting web epitomises how the Integrated Force must fight and adapt. Its very existence contributes to deterrence.&#8221;</em></p><p>We are betting that the DTW:</p><p>1. Increases our probability of winning future wars</p><p>2. Its existence deters by influencing the adversary&#8217;s forecast of the future.</p><p>Here we see how <em>prediction centric warfare</em> begins at the Strategic level, and how central is could be to our success at the Programmatic level.</p><p>We should test both these predictions explicitly and probabilistically, not assume them. We should factorise them, so that we can assign resources to those factors that will most increase our chances of success.</p><p>We should factorise and forecast at the Programme level too. Programmatic Predictions, if you will &#8211; ensuring we are optimising always and prioritising those actions that see us build the Kill Web as effectively and efficiently as possible. Probabilistically optimised.</p><p>To close I offer a reminder, a suggestion and a participatory starting point:</p><p>A reminder - everything is prediction&#8212;decisions, operating concepts tactics, operations, strategy.</p><p>A suggestion The Digital Targeting Web is a bet that better, faster, smarter decisions will both deter and, if needed, win. It should be the heart of a new UK Operating Concept, a Theory of Winning, <em>Prediction Centric Warfare.</em></p><p>A starting point &#8211; judge it with outcomes that are resolvable &#8211; defining how you would know you had succeeded. Make a start here yourself, in the comments, on the proposals I offer, what probability would you assign to each of the following questions? What factors would most move your prediction one way or the other?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RtAX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RtAX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c95df600-7490-4866-993f-6c164dd6f223_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RtAX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!RtAX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc95df600-7490-4866-993f-6c164dd6f223_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Programmatic Prediction would significantly increase the chances of the Digital Targeting Web succeeding in achieving the SDRs aims.</strong></p><p>To optimally assign resources, we must factorise and forecast. Prediction-Centric Warfare is the key to our future success.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6J5T!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6J5T!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6J5T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf" width="482" height="281" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/80becf25-1b8c-418c-8b58-81258584636a_482x281.emf&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:281,&quot;width&quot;:482,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6J5T!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 424w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 848w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 1272w, https://substackcdn.com/image/fetch/$s_!6J5T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80becf25-1b8c-418c-8b58-81258584636a_482x281.emf 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><p><a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftnref1"><sup>[1]</sup></a> Oxford English Dictionary. <strong>predict</strong> &#8230;<strong>2</strong> <em>verb trans</em>. announce as an event that will happen in the future; say that (a thing) will happen; foretell. <strong>b </strong>Of a theory, observation etc.: have as a deducible consequence; imply. [underline emphasis added &#8211; here, to predict is to conclude via Bayesian prediction, as a consequence of priors, or from inductive premises to inductive conclusion].</p><p><a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftnref2"><sup>[2]</sup></a> Importantly, because it justifies the wider claim &#8211; perception and agency are prediction too. See last week&#8217;s <a href="https://cassiai.substack.com/p/everything-is-prediction">post</a> for a fuller justification.</p><p><a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftnref3"><sup>[3]</sup></a> In this sense, morale &#8211; maintaining ours or undermining theirs - is the ability to maintain one&#8217;s own aims in the face of adversary pressure.</p><p><a href="applewebdata://18B7C987-52E1-4711-8B42-88BE20D3A820#_ftnref4"><sup>[4]</sup></a> In theory this is our actual doctrine, but it is honoured more in the breach than in the observance.</p>]]></content:encoded></item><item><title><![CDATA[Everything is Prediction]]></title><description><![CDATA[From Ramsey to Yudkowsky, the case for making our bets explicit.]]></description><link>https://blog.cassi-ai.com/p/everything-is-prediction</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/everything-is-prediction</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Tue, 02 Sep 2025 06:01:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!MA2R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MA2R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MA2R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MA2R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png" width="1024" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MA2R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!MA2R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb56e1cc8-ab23-4f3a-bf42-12b788c834e5_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Everything is Prediction,</figcaption></figure></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p>Everything we do is a wager on what comes next. Reading this is a bet that it will repay your time. Reaching for a coffee mug assumes it contains coffee. Strategy, policy, all decisions, even routine admin, each is an &#8220;if&#8230;then&#8221; about the future. The only real choice is whether to keep those bets tacit and unexamined, or to state them, price them, and improve them.</p><p>That is why &#8220;we don&#8217;t do prediction&#8221; is the most misleading sentence in public life. Historians sometimes say it, central bankers occasionally imply it, ministers often hide behind it.</p><p>Even explanation &#8211; at face value a solely retrospective exercise - is a forecast in disguise. To say &#8220;A influenced B&#8221; is to present a causal model that, applied tomorrow, shifts our expectations about &#8216;C&#8217;.</p><p>Historian Lawrence Freedman is rightly <a href="https://samf.substack.com/p/assessing-the-assessors">sceptical</a> about crystal-ball claims. He writes thoughtfully on prediction and annually reviews his own calls, but still somehow <a href="https://samf.substack.com/p/assessing-the-assessors">judges</a> that since <em>&#8220;&#8230;it is always going to be difficult forecasting future developments. &#8230;it is probably best not to do so or, at least, not without qualifications and caveats</em>.&#8221; Yet his own (excellent) explanations move readers&#8217; inductive probabilities, and are valuable precisely because they do. An account of the past that cannot change how we bet on the future is merely decoration, it serves no purpose.</p><p>When President Harry S. Truman said &#8220;<em>There is nothing new in the world except the history you do not know</em>&#8221; or when General James Mattis, himself a prediction sceptic (see his opposition to &#8216;Effects-Based Warfare&#8217;) tells us that history and reading &#8220;&#8230;<em>doesn&#8217;t give me all the answers, but it lights what is often a dark path ahead,&#8221; </em>I suggest what their words mean in practice is that reading and history give them new mental models, help to shift the inductive probabilities &#8211; implicit &#8211; in their reasoning about what comes next. The eminent Kori Schake, another <a href="https://warontherocks.com/2018/08/why-we-get-it-wrong-reflections-on-predicting-the-future-of-war/">prediction sceptic</a>, surely does not think her work has no relevance to the future? That when we read her work it should have no effect at all on our models of the world? If Schake, Mattis or Freedman concede their work is useful, they are with Truman, and admit to the necessity of prediction and their own roles in informing it. We cannot <em>not</em> predict. Prediction is unavoidable, essential. Historians are no exception &#8211; they predict causal models &#8211; the best explanation - of the past, their analyses reshape our models in the present, as we seek to shape the future.</p><p>Former Bank of England Governor Mervyn King devotes a whole book <em><a href="https://www.littlebrown.co.uk/titles/mervyn-king/radical-uncertainty/9781408712580/">Radical Uncertainty</a></em> to arguing that we should resist prediction, that &#8220;<em>&#8230;to ascribe probabilities is misconceived&#8230;&#8221;</em>,<a href="applewebdata://D61071D7-D062-43E7-B02C-A29B397F4288#_ftn1"><sup>[1]</sup></a> that narratives and scenarios are better. His account is half-right but wholly incomplete. Scenarios are essential, but again, you cannot <em>not</em> predict. Not all scenarios are of equal likelihood. Choosing one narrative over another is itself an implicit probability judgement. Refusing to put numbers on beliefs does not make those beliefs less numerical; it merely makes them unaccountable. The responsible move is not to abandon probabilities, but to calibrate them&#8212;and to say what would change our minds. When King <a href="https://www.abc.net.au/listen/programs/melbourne-afternoons/mervyn-king-on-the-probabilities-of-covid19/12202962">says</a> that no one could have predicted a Covid-class virus would emerge in Wuhan in 2019, his counsel of despair over-reaches. After SARS (2003), MERS (2012) and repeated H5N1 scares, we could not name the city, but we could price the odds of a novel respiratory pandemic in the coming years, and the most likely places it might emerge from. Those probabilities would change with new information, they may be low everywhere, but they would not be everywhere equal - they are not useless. King says &#8220;The better strategy is to be prepared.&#8221; That is the point of probabilities: not clairvoyance, but disciplined readiness.</p><p>What should that discipline look like? There are four key elements:</p><p>(1) <strong>Find the base rate</strong>: the prior odds for events in a class (e.g. novel pathogen emergence, cross-border spread, health-system overload).</p><p>(2) <strong>Pair scenarios with forecasts as inductive premises</strong>, <strong>not a single point estimate</strong>. Accept that decisions are if/then predictions and you see that the each &#8216;<em>if</em>&#8217; is a factor in your decision model. The <em>ifs</em> need probabilities, from which the probability of the &#8216;<em>thens&#8217; </em>can be mathematically derived. In putting these forward, you are crystalising your thoughts, moving beyond fuzzy thinking and the tendency all humans have to hold incompatible beliefs, to believe like Alice in Wonderland&#8217;s White Queen <em>six impossible things before breakfast</em>. (At Cassi, we describe this as &#8220;decision legibility&#8221; &#8211; you&#8217;d think it was everywhere. It is rarer than you imagine!).</p><p>(3) <strong>Pre-commit to triggers for action</strong>, e.g. surge testing when the forecast of transmission exceeds a probability threshold in two regions; release stockpiles when the forecast of hospital utilisation crosses x%; tighten or loosen measures as defined indicators move.</p><p>(4) <strong>Transparency.</strong> Show how each tranche of evidence shifts the odds over time. Narratives are a communication tool &#8211; often designed to obscure more than they reveal; probabilities confront us with reality, build trust, tell us how&#8212;and when&#8212;to act.</p><p>This is the hinge where philosophy helps. Frank Ramsey, Cambridge prodigy, solved the practical problem a century ago: degrees of belief are what you are prepared to act on. If your behaviour would not change when the odds change, you didn&#8217;t believe the odds &#8211; or your model - in the first place. Ludwig Wittgenstein stripped the mystique further: probability is not a mood, it is a relation between propositions and the world, a logic of uncertainty. John Maynard Keynes added a crucial caution: evidence often has different weights; some uncertainties are harder to forecast because while the evidence all points one way there is only limited evidence or uncertainty as to its reliability. Here confidence &#8211; and thus your forecast - should reflect that uncertainty (closer to 50% than zero or 100%) - even when you are directionally convinced. Together they point to a discipline: state your beliefs in subjective probabilities and update as new evidence arrives.</p><p>Friedrich Hayek supplies the institutional corollary. Knowledge is dispersed; no planner sees the full picture. Markets work because prices compress scattered expectations into a single, continuously updated signal. In that sense, markets are prediction engines, not oracles, but social machines for aggregating beliefs about the future. Eliezer Yudkowsky pushes the point in Inadequate Equilibria: where incentives are weak or misaligned, feedback is slow, or entry is blocked, equilibria can remain wrong for a long time. The pathology at the heart of both bureaucratic failure in Government and business alike. The remedy is to improve the prediction machinery, introduce real feedback - markets, skin in the game, and to confront issues, invite in contrarians - seek out what you need to know, not what you want to hear. In policy terms: use auctions not allocations, prediction markets not pundit panels, prize challenges not closed committees. This is what UK Government tried to do with Cosmic Bazaar, and the US with both IARPA&#8217;s ACE, and the Intelligence Community Prediction Market (ICPM) forecasting tournaments that demonstrated promise &#8212; and then stalled. As Professor Philip Tetlock &#8211; doyen of crowd-based &#8216;Superforecasting&#8217; - put it, we have kept &#8220;fumbling the crystal ball&#8221;: forecasts sit outside power, with limited ownership, little skin in the game, no budget consequences, and under attack from bureaucratic antibodies. This must change. If as we argue here &#8216;everything is prediction&#8217; &#8211; we must start optimising our organisations to be better at it.</p><p>If prediction frames decisions, it also underwrites agency. On Karl Friston&#8217;s free-energy view, organisms survive by minimising &#8220;surprise&#8221;: they either revise their internal model to fit the data (perception) or act so the data fit the model (action). In ordinary terms: when expectations and evidence clash, you can change your mind or change your circumstances. Perception is inference; action is hypothesis-testing. The Duke of Wellington put it more tersely for the real world: &#8220;<em>All the business of war, and indeed all the business of life, is to endeavour to find out what you don't know by what you do; that's what I called &#8216;guessing what was at the other side of the hill</em>.&#8217;&#8221; Active inference before the label.</p><p>Similarly, evolutionary psychologist Michael Tomasello argues human and animal <a href="https://mitpress.mit.edu/9780262047005/the-evolution-of-agency/">agency</a> is layered if/then prediction. A hierarchy with each tier making more complex predictions, exerting itself to suppress the more simpler decision calculus of the tiers below.</p><p>At the bottom of the agency hierarchy are stimulus-driven simple organisms such as the ~1 mm long worm <em>C. elegans</em>: they perform something called &#8216;chemotaxis&#8217;, where they track nutrient gradients, effectively eating their way up a calorie signal predicting only that it will continue via the next &#8216;bite&#8217;, stopping when sated, or when the signal ends. The prediction is<em> if</em> nutrients, <em>then</em> eat in that direction. No other calculus.</p><p>Next, goal-directed agents (early vertebrates such as lizards) pursue an immediate objective, but are able to inhibit action when predicted predation or other risk outweighs reward. <em>If </em>prediction of predation X is greater than predicted reward Y, inhibit. <em>If not</em>, pursue reward.</p><p>Intentional agents (mammals like squirrels, and corvids) represent alternatives - that feeder, this route, those acorns and that jump &#8211; intuitively predicting complex if/then alternatives from multiple paths and multiple options.</p><p>Rational agents (great apes) plan with counterfactuals, stack boxes to reach greater heights, make or pick tools, they trade immediate impulses for a better expected payoff, imagining and testing abstract alternative futures.</p><p>Finally, socially normative agents (humans) align predictions through joint attention, perspective-taking and obligation: an &#8220;I&#8221; that can join a &#8220;we&#8221; &#8211; by observing &#8220;you&#8221; and predicting how you are predicting - intuitively pooling information and coordinating on shared futures.</p><p>Each layer of agency is not replaced, but is demoted as the newly evolved tier above learns to predict when to suppress its subordinate&#8217;s more simplistic calculations for a higher payoff among more complex options. At every tier, behaviour is guided by forecasts: of the world, of partners, and of the self &#8211; and subordinate levels of agentic impulse - and corrected by the surprises that reality returns.</p><p>The same stack runs through human organisations. Perception is predictive (brains pre-empt and correct), operations are predictive, with tactics micro &#8220;if&#8230;then&#8221; loops. Strategy is predictive (a scaffold of contingent moves). Science is the most explicit version: if condition X holds, outcome Y will follow. Diplomacy and business are messier but structurally the same. A sanctions package assumes particular channels will bite before spoilers adapt. A product roadmap assumes rivals will not leapfrog a core feature in the next two quarters. Prices, policies, and laws that encode &#8216;expectations&#8217;, are predictions by another name. Even organisational design, per the doctrines of &#8216;adaptive strategy&#8217; is a prediction &#8211; you predict the optimal system structure to sense, adapt, and respond to changing circumstances. More succinctly, if a bit meta, you predict which system will make the best predictions. To govern or to run a firm is to predict. You can&#8217;t not.</p><p>What follows? First, stop hiding forecasts inside prose. Make them legible. When presenting an analysis, say: &#8220;We assess a 35&#8211;45% chance of X by June; here are the three variables that would move that by &#177;15 points.&#8221; Second, adopt Keynes&#8217;s &#8220;weight of evidence&#8221; explicitly, with Ramsey&#8217;s insistence on making your beliefs legible to discover what is true &#8211; what are your inductive premises? What are the probabilities they are true? Have you reflected your doubt about the sufficiency of evidence in your forecast? Third, import Hayekian aggregation into decision-making: add prediction markets (internal if need be), tournament-style scoring for analysts, score and reward effective calibration, and accuracy. Fourth, use Ramsey&#8217;s test: if your actions don&#8217;t shift with the odds, your statements were theatre.</p><p>For historians and commentators, the standard is similar. If an explanation cannot be operationalised into a forward expectation - what would be more or less likely next time a similar configuration appears - then it&#8217;s not yet an explanation. &#8220;We don&#8217;t predict&#8221; is not humility: it is a missed chance to help readers update.</p><p>For central bankers and officials, the discipline is harsher because the stakes are higher. Put numbers on your beliefs, say what a policy assumes, publish its probability of success. State the triggers that will force a pivot. Score your calibration and accuracy, so we can learn how much we should trust your judgement. The public can forgive error; we should not forgive opacity and obfuscation.</p><p>Everything is prediction: in minds, in markets, in ministries. The choice isn&#8217;t between predicting and not predicting; it is between implicit, untested bets and explicit, accountable ones. We should choose the latter&#8212;and build the habits and institutions that make our next bets a little less wrong than the last.</p><p>This is what we do at Cassi. Contact us at admin@cassi-ai.org.uk</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div><hr></div><p><a href="applewebdata://D61071D7-D062-43E7-B02C-A29B397F4288#_ftnref1"><sup>[1]</sup></a> Kay, J. and King, M., 2020. <em>Radical uncertainty: Decision-making beyond the numbers</em>. WW Norton &amp; Company. especially pp. 71-84, quotations from pp. 223-224.</p>]]></content:encoded></item><item><title><![CDATA[Future Diplomat]]></title><description><![CDATA[Reimagining Diplomacy for 2035.]]></description><link>https://blog.cassi-ai.com/p/future-diplomat</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/future-diplomat</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Tue, 19 Aug 2025 06:00:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!S2yq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!S2yq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!S2yq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!S2yq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:170846,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/169990370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!S2yq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!S2yq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1472e41a-c7bd-4980-bf01-461ab58f11bc_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Concept rendering of a British &#8220;self-driving embassy,&#8221; an AI-operated mission built around data-center infrastructure.</em></figcaption></figure></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>Future Diplomat: Rewiring &amp; Reimagining Diplomacy</strong></p><p>Technology has long changed diplomacy. If diplomacy is, according to the Oxford English Dictionary:</p><blockquote><p>- <em>the management of international relations by negotiation; the method by which these relations are managed by ambassadors etc; skill in such affairs. Adroitness in personal relations; tact.</em></p></blockquote><p>&#8230; then diplomacy is communication: both the channel by which states speak, and the craft by which they persuade. As technology revolutionises how we communicate, to make and shape decisions, so it revolutionises the conduct of diplomacy.</p><p>Think of diplomats, embassies and the FCDO as cybernetic systems. They seek and sense information, process it through the neurons of the brain as individuals, or between the functional roles of the people staffing the organisation, act or choose not to, then observe the effects. The end is behavioural: get someone to do something new, do something differently, or keep doing what they already do. It might be a foreign person, organisation or group is the target of that diplomacy, or it might be those within the diplomat&#8217;s organisation, their home country or government &#8211; where the diplomat is seeking to mediate between positions.</p><p>If this is true, it should be clear that technologies that are fundamentally informational, and cognitive &#8211; they draw insight and foresight from data at unprecedented speed and scale, and make decisions based on models of the future &#8211; will transform diplomacy.</p><p>This vision for 2035 is built on the premises above. In addition, its starting assumption is that using forecasting tournaments and/or prediction markets<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> is the best way to understand the future &#8211; on which basis it anticipates:</p><p>- <strong>weak artificial general intelligence (AGI)</strong>, one that matches or exceeds human performance on a range of diverse cognitive and informational benchmarks, by March <strong>2027</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>;</p><p>- <strong>Oracle AGI</strong>, a system that significantly exceeds human performance in answering any question, by <strong>2028</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> &#8211; <strong>or under a more stringent definition 2032</strong>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><p>- <strong>Artificial Superintelligence (ASI)</strong> &#8211; adding superhuman skill in robotics, proprioception, dexterity, locomotion ~May <strong>2029</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>;</p><p>Thus a diplomat in 2035 will be operating in world of superintelligent AI.</p><p><strong>Consular Services Without Counters</strong></p><p>Most consular work is structured information processing: visas, passports, notarial services, births and deaths, crisis assistance. By the early 2030s, agentic systems will handle the bulk of it end&#8209;to&#8209;end with clear audit trails and explainable reasoning. Consulates will be able to operate in all languages, tailoring information to individual cultural and psychometric dispositions and levels of education. Identity will be verified remotely through secure biometrics. Citizens will complete interviews and attestations conversationally, recorded in signed digital form. There will be little need to visit consulates for these tasks, with most activity taking place online, and other interactions via the metaverse &#8211; meetings with AI (probably ASI, per current forecasts) agents embodied in avatars. If these are distinguishable from a remote meeting with a human today it would only be in that we should expect an advanced, or superintelligent AI to provide a better service than a human &#8211; more empathetic, more patience, more personalised, smarter, more knowledgeable, with better memory.</p><p><strong>From Self&#8209;Driving Labs to Self&#8209;Driving Embassies</strong></p><p>In 2023 Eric Schmidt, in MIT Tech Review, &amp; the scientific journal &#8216;<em>Nature</em>&#8217; both described the birth of self-driving labs (SDL), those that - at the maximum extent - operate fully autonomously, designing hypotheses and experiments, conducting experiments, reporting the results, and then designing new hypotheses, testing again, without human intervention, at super-human speeds. In 2025, self-driving labs are driving breakthroughs globally. The journal <em>Nature Chemical Engineering</em> reports that in the field of materials science self-driving labs are producing new materials, ten times faster, processing ten times the data, accelerating discovery exponentially. Furthermore, July 2025 saw <em>Nature</em> publish articles showcasing:</p><ol><li><p>a &#8216;<a href="https://www.nature.com/articles/s41467-025-60912-0">self-driving microscope</a>&#8217; to better understand protein aggregation and neurodegenerative disease, and</p></li><li><p>&#8220;<a href="https://www.nature.com/articles/s41467-022-28580-6">Ada</a>&#8221; a self-driving lab that has discover new metal materials by identifying optimal trade-offs between conflicting objectives</p></li><li><p><a href="https://www.nature.com/articles/d41586-025-02577-9">Showcasing</a> a Chinese lab&#8217;s use of AI to design the &#8216;brain&#8217; of a quantum computer using laser light to move thousands of atoms into any arrangement they choose and act as qubit, in 60 milliseconds - demo&#8217;d in this case to produce a <a href="https://physics.aps.org/articles/v18/148">video</a> of Schrodinger&#8217;s cat.</p></li></ol><p>While the same month this year (2025) <em>Science </em>reported how self-supervised AI now gives robotic probes pixel-level precision, letting autonomous labs log over 3,000 semiconductor tests in a single day - enabling rapid defect detection and a stronger, more resilient advanced-manufacturing supply chain. Given AI Agents are reportedly doubling their performance ever seven months, these kind of breakthroughs are likely just the beginning (see <a href="https://cassiai.substack.com/p/what-did-you-do-in-the-ai-revolution">Cassi blog post</a> dated 31 July for more SDL progress in 2025).</p><p>The next decade, perhaps the next few years, will see the emergence of &#8216;self-driving embassies&#8217;. Just as today&#8217;s self-driving labs use autonomous AI to answer scientific questions, in self-driving embassies AI agents will handle consular and diplomatic enquiries and AI systems mediate most&#8212;perhaps all&#8212;routine exchanges, principally text, audio or digital, between embassy and both home and host governments.</p><p>Persuasion agents will build psychometric profiles of the individuals and groups with whom the Embassy or diplomat is interacting, enabling tailored negotiation strategies, which can be adapted and improved in real-time. When negotiations are in-person via video call, AR/VR meet, or physically face-to-face, ASI agents will be reading micro-expressions, designing recursive negotiation strategies and updating the approach continuously. Affective computing will be more empathetic than humans, able to understand and manipulate us at superhuman levels. It is hard, though not impossible, to imagine negotiation being fully delegated to ASI systems negotiating with each other. A <a href="https://www.nature.com/articles/s41562-025-02194-6">study</a> in Nature in May 2025 showed AI was more persuasive in debate than humans &gt;60% of the time. A recent (2025) unauthorised and probably <a href="https://www.science.org/content/article/unethical-ai-research-reddit-under-fire">unethical </a>experiment on the Reddit forum R/Change My Mind shows the way &#8211; an LLM deployed to persuade was six times more persuasive than humans and operated without anyone recognising it was an LLM. The largest LLM persuasion <a href="https://www.arxiv.org/abs/2507.13919">study</a> to date, (published 18 July 2025) showed newer, larger models were the most persuasive, and that models persuasiveness persisted over time (1-month, in this study).  The best models (already a generation behind o3 and GPT-5) were 41% and 52% more persuasive than &#8216;static&#8217; messaging - remember when we were panicking about &#8216;static&#8217; social media messages on Twitter and Facebook? In fact, previous studies of human persuasiveness, advertising and personal conversations in elections, <a href="https://www.cambridge.org/core/journals/american-political-science-review/article/abs/minimal-persuasive-effects-of-campaign-contact-in-general-elections-evidence-from-49-field-experiments/753665A313C4AB433DBF7110299B7433">suggest</a> &#8216;an average effect of zero&#8217;.  </p><p>Even if human relationships remain crucial to effective influence and negotiation, they will be increasingly informed by profiling agents, prompting and preparing diplomats for discussions, perhaps advising during discussion, debriefing afterwards.</p><p>Linguistic skill may be vital, not for translation - which may cease to be relevant when translation can be automated instantly - but to enable diplomats to develop the mental models and worldview of those they interact with. If that is the case, diplomats will likely be learning through AI-tailored immersive courses in AR and VR. Cultural expertise, and <em>understanding</em> - the right-brained process still too little understood by neuroscientists, psychologists, computer and other scientists for us to be confident we are modelling it effectively in Artificial Intelligence, may be a key human skill. But to outperform or complement machines it will likely need longer, deeper immersion in country, possibly career-long deployments. Similarly, when the human currency becomes relationships, since rationality, persuasion, information processing and reporting is undertaken at superhuman level by machines, FCDO careers will be much more focused on the cultivation of life-long relationships with current and future key influencers in countries around the world. Fast-stream processes, and civil service churn will have to end if diplomats are to flourish in an age of AI. Deep relationships take time to build.</p><p>Recruitment of diplomats will use AI systems to identify super-persuaders, just as intelligence analysts will be identified as super-forecasters, or super-questioners &#8211; those that can create the questions that maximally reduce ignorance, on which forecasters and &#8216;persuaders&#8217; should focus &#8211; and recruitment will follow. A more prosaic instantiation of Philip K Dick&#8217;s Precogs in &#8216;Minority Report&#8217;. Strategy, tactics, decisions will be automated. Humans will be working out what the most desirable &#8216;outcome&#8217; or objective is <em>with machines</em>, but the planning itself will be done <em>by machines</em>, and implementation overseen and directed by them.</p><p><strong>Passports to New Worlds</strong></p><p>The trend towards e-citizenship and e-businesses will continue. Many visa and passport applications will be for virtual access to the UK, as telepresent (AR/VR) or telexistent agents (AR/VR with the ability to directly interface with the real world via robotic interfaces) &#8211; with people &#8216;visiting&#8217; and working in the UK without ever entering the country.</p><p>Furthermore, citizens and businesses will apply for Visa to access UK managed virtual worlds, metaverse countries with economies that rival countries in GDP with access to UK legal authority for dispute resolution.</p><p><strong>Beyond the &#8216;Geo&#8217; in Geo-Politics</strong></p><p>Migration pressures will ease, as citizens in developing countries are able to prosper by building e-businesses in developed countries while remaining in their own. Trade barriers and cross-border taxation will become harder to enforce as metaverse economies produce wealth independent of geography.</p><p>Space will be the most lucrative market in human history, with in-space industries from mining to manufacture, compute to colonisation. By 2035, with regular vertical and horizontal launch from the UK, the FCDO will have to manage in-space territorial claims and economic competition, as well as the security implications that flow from competition for dominance in space. If the UK fails to compete in the space colonisation, asteroid mining and space-based industries, the FCDO will be managing a much more precipitate decline in the UK&#8217;s relative power in the international system than it has ever had to do to date.</p><p>Finally, automation, robotics and AI will reduce countries and companies' dependence on demography to mine raw materials, produce goods, services, and scale armed forces to deter or fight. Self-driving labs will become self-driving innovation systems, with companies and countries power, prosperity and security driven by the speed at which they can invent and harness new technologies.</p><p>In combination, metaverse economies, the space market and automated innovation and industries will move competition for relative power in the international system beyond the geo of geopolitics, with nations&#8217; within-border natural resources and demographics less deterministic of their power and influence. For the FCDO, effective statecraft will have to be more integrated x-Whitehall, and with industry, finance and society, than ever before.</p><p><strong>Influence, Security, Privacy, Agency</strong></p><p>Superintelligent systems able to analyse groups and individuals, understand their psychometric make-up, and influence their behaviour risks social control tightly circumscribing human agency, able to manipulate with unprecedented effectiveness. Even today, we generate c. 1.7MB of data per person per second &#8211; the equivalent of 56 million Kindle books per on each of us generated every day &#8211; it is this that gives unprecedented insight and foresight into who we are, what we do, and what we are likely to do in the future. Debates on human agency will soon be the domain of the diplomat, as privacy-centric EU clashes with the market-centric US, and China&#8217;s digital authoritarianism.</p><p>Our security will be redefined in a world where the boundaries of the virtual and the physical have blurred through AR/VR, where we live in different &#8216;belief circles&#8217; seeing different versions of the world around us to our neighbours, and spending more and more time in borderless cyberspace. Cognitive security may be a competition for e-citizenship, e-businesses with other states, and virtual states.</p><p>When our cars, smart homes and brain computer interfaces all become critical national infrastructure through connected IoT devices, we will have to rethink how it is managed. Cognitive security defending our data and our minds.</p><p>All this will make the role of the FCDO more complex, and more expansive, while the future diplomat might be able to more effortlessly cover a wider geographic area, the GeoInt Singularity + ubiquitous data surveillance will mean they will struggle to move anywhere or meet anyone, without others knowing. It may be that everyone they meet knows their psychometric profile, their preferences, and has optimised their position to &#8216;win&#8217; in diplomatic negotiation. </p><p>Perhaps privacy-enhancing technologies will prevent this, but if they do, they will also limit the state&#8217;s ability to manage threats, gather data.</p><p>Public diplomacy conducted online will become an ever more powerful tool, but will also raise the same questions of privacy, security and agency &#8211; how far should we go in profiling and manipulating in a world where Spotify data <a href="https://www.theglobeandmail.com/business/article-want-a-window-on-the-economy-try-spotify-boes-haldane-says/">can be</a> a better indicator of national mood than direct survey data<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>, where all of us are vulnerable to microtargeted manipulation.</p><p>In this context, regulatory diplomacy will be as important as all other forms, determining the kind of world we live in, and the extent to which people are free. Hence China <a href="https://nationalsecurity.virginia.edu/sites/nationalsecurity/files/2025-07/00019_%2820250721%29_NSDPI_The%20Chinese%20Communist%20Party%27s%20Layered%20Artificial%20Intelligence%20Strategy.pdf">requires</a> all AI companies to design with &#8220;core socialist values&#8221;, values it seeks to embed and export via its proposed &#8216;Organisation for Global AI Cooperation&#8217; (<a href="https://www.reuters.com/world/china/china-proposes-new-global-ai-cooperation-organisation-2025-07-26/">Reuters, July 2025</a>; <a href="https://english.www.gov.cn/news/202507/26/content_WS6884bea8c6d0868f4e8f4732.html">Chinese State Council, 26 July</a>).</p><p><strong>Simulation and Wargaming</strong></p><p>New policies, new negotiating tactics, the effect of military or economic actions, likely future scenarios and emerging flashpoints we all be constantly developed and tested through the digital strategy-engines, offering superhuman suggestions for strategies and tactics, testing second order effects, highlighting new factors to be considered. The GeoInt Singularity, where every square meter of the earth is constantly observed from space, will enable the development of a global digital twin, to identify flashpoints and early indicators of emergent conflict or natural disaster. Wargames and collective intelligence tools will enable humans to challenge the models assumptions, force it to consider counter-factuals, ensuring the most effective strategies are pursued.</p><p><strong>Augmentation</strong></p><p>Technological transformation will reduce the role of humans in many diplomatic decisions, and reduce the number needed to deliver diplomatic services. However, human relationships may remain key to successful negotiation - it depends on the extent to which nations are willing to let AI-agent negotiate with AI agent. If humans remain in the loop they will be increasingly augmented, able to bring the best of human cognition - whatever remains of value in this emerging age &#8211; together with technology to enhance our capacities. Brain-computer interfaces will, by 2035, likely be enhancing memory, perception and learning. In turn we will be aided by a neuropharmacology industry that optimises our performance and our rest, and by a growing science of high-performance teams, that ensure we best balance cohesion, talents, and incentives to maximise the performance of people in groups.</p><p><strong>The US AI Umbrella?</strong></p><p>If the US develops an AGI or ASI model before anyone else, is able to harness it effectively, and launch it first on recursive self-improvement, it may be that no other nation can catch-up. In such a situation, British and other nations diplomats would be negotiating with a system that was comprehensively smarter than them, and progressively more so &#8211; those they interact with being prepared not by very capable advisors, but by a superintelligence Private Office or equivalent &#8211; or possibly being asked to interact directly with the US&#8217; ASI diplomat. So too British businesses would be facing competition from businesses smarter than them, the British military working with an ally, assuming the US remains such, that was strategically, operationally and tactically smarter. The outcome of such competition is likely to be very one-sided. If this becomes so, we may find ourselves living under the AI equivalent of the US nuclear umbrella, surrendering sovereignty, agency, in return for protection and access. The price of living under the <a href="https://cassiai.substack.com/p/faustian-bargain-20-life-under-the">US AI Umbrella</a> is likely to be far higher than the cost of its nuclear protection.</p><p><strong>AI Disruption or Destruction</strong></p><p>It could be that we live, as Oxford Philosopher Nick Bostrom puts it in a &#8216;solved world&#8217; a post-ASI utopia where nobody need concern themselves with the intra-human competition of which diplomacy is part. It might be that through misalignment or malignance AI poses an existential threat to humanity. Or it might be that AI becomes a substitute, not a complement to human labour, with radically disruptive social and political effects. None of these scenarios are so unlikely that the can be ignored. Hope is not a strategy. But this article has described the Future Diplomat operating in a messy, complex, competitive world &#8211; with or without the development of super-intelligent AI. It may be unproductive and distracting to spend too much time on the dystopian outcomes here. It is vital the FCDO does spend time on them, nonetheless.</p><p><strong>Conclusion</strong></p><p>Much of what has been described is possible now, with today&#8217;s relatively narrow AI. More will be possible as current AI advances further - all the evidence suggests the UK will underestimate this progress, and the nation be poorer and less secure as a result.</p><p>In the more likely than not advent of AGI or ASI before 2035 the role of the diplomat and Embassy will be radically transformed, with humans the agents of AI as much as the other way around. The birth of metaverse worlds and the growing import of the space economy by 2035 will change where diplomacy is conducted, power created and managed, people and machines persuaded. We must be ready.</p><p>The best time to start building for this future was yesterday. The next best time is today.<br></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For a good overview of the difference, see: <a href="https://www.metaculus.com/notebooks/38198/metaculus-and-markets-whats-the-difference/">https://www.metaculus.com/notebooks/38198/metaculus-and-markets-whats-the-difference/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Metaculus, 2025. <a href="https://www.metaculus.com/questions/3479/date-weakly-general-ai-is-publicly-known/">Date Weakly General AI is Publicly Known</a>. Accessed 31 July 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Metaculus, 2025. <a href="https://www.metaculus.com/questions/3683/will-an-oracle-superintelligence-be-developed-before-a-general-superintelligence/">Oracle AGI Precedes General ASI</a>. (20 months). Accessed 31 July 2025;</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Metaculus, 2025. <a href="https://www.metaculus.com/questions/5121/date-of-artificial-general-intelligence/">Date of Artificial General Intelligence</a>. Accessed 31 July 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Metaculus, 2025. <a href="https://www.metaculus.com/questions/9062/time-from-weak-agi-to-superintelligence/">Time From (weak) AGI to Superintelligence</a> (25.9 months). Accessed 31 July 2025.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>See e.g. </p><p>a. Edmans, Alex and Fernandez-Perez, Adrian and Garel, Alexandre and Indriawan, Ivan, Music Sentiment and Stock Returns Around the World (August 14, 2021). Journal of Financial Economics (JFE). Available at SSRN:<a href="https://ssrn.com/abstract=3776071">https://ssrn.com/abstract=3776071</a> or <a href="https://dx.doi.org/10.2139/ssrn.3776071">http://dx.doi.org/10.2139/ssrn.3776071</a>.</p><p>b. Kaivanto, K. and Zhang, P., 2019. <em>Popular music, sentiment, and noise trading</em>. Lancaster University Management School.</p><p>c. Sabouni, Hisam. (2018). The Rhythm of Markets. 10.13140/RG.2.2.31484.64646. Pre-Print <a href="https://www.researchgate.net/publication/323560860_The_Rhythm_of_Markets">https://www.researchgate.net/publication/323560860_The_Rhythm_of_Markets</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[What did you do in the AI Revolution, Daddy? ]]></title><description><![CDATA[Pre-GPT-5, Post AGI? For too many, hope is the largest part of their AI strategy. We are still under-estimating the speed of AI progress, and the profundity of its implications.]]></description><link>https://blog.cassi-ai.com/p/what-did-you-do-in-the-ai-revolution</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/what-did-you-do-in-the-ai-revolution</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 01 Aug 2025 13:36:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KL9K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KL9K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KL9K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KL9K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:181770,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/169757660?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KL9K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!KL9K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8592ee6f-69d9-4f68-af4b-ee0902f7aa85_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This article is really &#8216;just&#8217; an update on AI progress. All the developments here are from the last three weeks or so. I would go back meticulously to the last time I posted, but the article would be too long and the point is made more neatly here by not trying to cover everything. Progress is still astonishing. </p><p>Why now? Well, <a href="https://tech.yahoo.com/ai/articles/gpt-5-launch-might-imminent-151237832.html?guccounter=1&amp;guce_referrer=aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&amp;guce_referrer_sig=AQAAAFf0B1SzMi83cLV1Au-aYQ11WTwadsRG7vxkYGw0EKkvyAtYT1vzY1V7HCu65iFrA89Xeiz3bMXs_-HSVj0yz-fSVuMSKdSF3Xn4U8zvnj-xZq2SfCCZeVxWkd-ldDZFqcPZArBjVSpeZXTwS86NvYnFhLL_ca0CFgcPq7-lShMR">rumours</a> are rife that GPT-5, the model that OpenAI CEO Sam Altman says &#8216;<a href="https://www.tomsguide.com/ai/openais-ceo-sam-altman-says-gpt-5-is-so-fast-it-actually-scares-him-maybe-its-great-maybe-its-bad-but-what-have-we-done">scares him</a>&#8217; that seems to have convinced him we are <a href="https://www.forbes.com/sites/lanceeliot/2025/06/11/sam-altman-says-ai-has-already-gone-past-the-event-horizon-but-no-worries-since-agi-and-asi-will-be-a-gentle-singularity/">past AGI</a>, and now the target is ASI- super-intelligence (variously defined<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>), is said to be ready for roll-out and release this month (August 2025). Indeed GPT-5 and various derivatives may have been <a href="https://www.theneuron.ai/explainer-articles/six-new-ai-models-reveal-we-may-be-seeing-gpt-5-get-assembled-in-real-time">trial-released</a> in LLM Arena last weekend, as Zenith, Summit, possibly Lobster, Nectarine, Starfish, and o3-alpha, drawing breathless commentary for their performance<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. </p><p>After GPT-5 release happens, there&#8217;s a very real risk and high likelihood the remarkable recent progress is eclipsed and under-discussed. As Rob Bassett-Cross (CEO Adarga AI) and I wrote in our eponymous article &#8216;Don&#8217;t Blink&#8217; - AI watchers daren&#8217;t look away for a moment such is the pace of progress. My concern remains that as a country, a society, and in many of our companies, we in the UK are still under-estimating the speed of AI progress, and the profundity of its implications.</p><p>This under-estimation is despite the discussion on X and Substack beginning to break into the mainstream. Tom Whipple wrote in the Times last week (25 July):</p><blockquote><p>&#8216;I know the riposte. Humans always fear machines. &#8230;AI won&#8217;t take your job, goes the mantra, but if you don&#8217;t get with the revolution, someone using AI will. My response? You&#8217;re not listening.&#8217;</p></blockquote><p>But even with views like this in the thumpingly centrist Times, nothing really changes in UK policy circles. But it changes everywhere else.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>In the past few weeks for example, an essay collating the thoughts of leading Chinese thinkers on AI - shows their emphasis on Open Source, diffusion, &#8216;talent&#8217; as espionage vector, and regulatory standards as the route to values dominance. A hyper-pragmatic, and believable approach to the otherwise unequal competition with US Big Tech. It is free of the delusions and not-really-even half-measures dressed up as solutions that characterise the UK&#8217;s response (and that get almost no scrutiny or criticism from our media for which, Whipple not withstanding, this remains a fringe issue - as opposed to perhaps the most disruptive technological revolution in human history).</p><p>We&#8217;ve seen the Chinese Premier Li Qiang <a href="https://www.ft.com/content/de06e1ac-6a12-45a4-a31c-0dfecea4343e">announce</a> a plan (consistent with the above) for China to launch an &#8216;Organisation for AI Cooperation' based out of China but with global aspirations.</p><p>We see in China a growing focus on &#8216;embodied AI&#8217; - accelerating diffusion of AI into its industrial base as a way to advantage, perhaps also making the vast amounts of data it collects available, to influence future models.</p><p>That&#8217;s just China. RUSI <a href="https://www.rusi.org/explore-our-research/publications/commentary/silicon-sandstorms-ai-power-and-new-tech-front-gulf">reports </a>on the scale of US deals with the Middle-East on AI investments as &#8216;staggering&#8217;. But these investments are not staggering - they are a proportionate reaction to the continued acceleration of AI progress, an anticipation of the rate, direction and implications of this progress - they are other nations&#8217; recognition that they can&#8217;t afford to be left behind.</p><p>Yesterday (31 July 25) we <a href="https://www.nscale.com/press-releases/stargate-norway-nscale-aker-openai">read</a> that Norway has struck a deal with the US to build one of its Stargate AI projects (the US &#8216;Manhattan Project for AI&#8217; announced in January) in country, a gigafactory, an AI data centre entirely powered by renewable energy, to supply Europe. Norway and US business are investing $1bn in the first phase of this project. &#8216;Europe needs more compute&#8217;, Sam Altman notes. More than just &#8216;more compute&#8217;, we - UK and Europe alike - need a credible plan proportionate to the threat and opportunity posed by AI. </p><p>Here&#8217;s why. In the past few weeks:</p><ul><li><p>Nature Chemical Engineering reports that self-driving (AI/automated) labs are producing new materials, in the field of materials science, ten times faster, processing ten times the data, accelerating discovery exponentially.</p></li><li><p>Nature Communications reports that self-driving labs have discovered in the venom of snakes and spiders, 2000 new antibacterial proteins that could serve as alternatives to current antibiotic treatments, 58 of these have been tested so far, 53 had medical applications such as killing the currently drug-resistant bacteria e-coli and staph - vital in a world of growing anti-microbial resistance.</p></li><li><p>A survey published back in March showed Agent-based AI models&#8217; increasing abilities in completing software development tasks. In 2022 they could complete at the touch of a button tasks that took humans 30 seconds. In 2025 they are completing tasks that take 1-hour. Their performance was doubling every 12-months, but is now doubling every 7 months. On this current trend, even without further acceleration in the rate, by 2027 they will be able to complete tasks that take humans a month. By 2029 tasks that take humans a working year.</p></li><li><p>I re-report that because now, <a href="https://metr.org/blog/2025-07-14-how-does-time-horizon-vary-across-domains/">further research</a> shows these improvements in Agentic AI &#8216;task scaling&#8217; are also being seen with Agentic AI in scientific reasoning, maths, robotics, the use of computers for tasks, and in self-driving systems.</p></li></ul><p>Most significantly, in my view, since last I wrote, mathematicians have had their Gary Kasparov/Lee Sedol moment, when AI surpasses them and they <a href="https://x.com/_Dave__White_/status/1947461492783386827">wonder what they are for</a>. This was triggered by &#8216;general&#8217; models like OpenAI&#8217;s <a href="https://x.com/alexwei_/status/1946477742855532918">latest reasoning model</a> and an advanced form of Google&#8217;s <a href="https://deepmind.google/discover/blog/advanced-version-of-gemini-with-deep-think-officially-achieves-gold-medal-standard-at-the-international-mathematical-olympiad/">Gemini</a> winning &#8216;gold medals&#8217; at the International Maths Olympiad (IMO). The achievement prompted both amusement and amazement. One Data Science Professor <a href="https://x.com/polynoamial/status/1946509154752811136">posted</a> about the poor performance and limitations of general AI models on Olympiad tasks just hours before their success was announced and was much mocked. In his defence, this was faster progress than some of the most bullish experts had <a href="https://x.com/__nmca__/status/1946507122369335734">predicted</a>, and is another example where progress has been consistently under-estimated. The question &#8216;When will an AI win a Gold Medal in the International Math Olympiad?&#8217; was forecast as follows:</p><p>July 2021: 2043 (22 years away)</p><p>July 2022: 2029 (7 years away)</p><p>July 2023: 2028 (5 years away)</p><p>July 2024: 2026 (2 years away)</p><p>July 2025: AI wins a gold medal in the International Math Olympiad.</p><p>This is not the first time I have written out these kinds of estimates to illustrate how difficult humans find it to understand exponentials, to forecast AI progress.</p><p>On Friday, seven days before I clicked &#8216;publish&#8217; on this post, AI performance on the ARC AGI 2 benchmark [&#8216;Abstraction &amp; Reasoning Corpus for Artificial General Intelligence 2&#8217; &#8211; an AGI test redesigned after ARC AGI 1 was surpassed in December 2024] jumped from 16.5% to 19%. &#8216;Only&#8217; a 2.5-point gain, it represents a &gt;15% relative improvement in capability. If we assume exponential growth here too, given this benchmark was launched end March 2025, and models were scoring ~12% by end April, it could be fully solved by ~August 2026.</p><p>We&#8217;ve seen <a href="https://deepmind.google/discover/blog/aeneas-transforms-how-historians-connect-the-past/">Deepmind&#8217;s Aeneas</a> AI historian decode and complete inscriptions and text, adding nuance and contextualised understanding. We saw Deepmind launch <a href="https://deepmind.google/discover/blog/alphaearth-foundations-helps-map-our-planet-in-unprecedented-detail/">AlphaEarth</a> - where the impact on espionage and military operations should be obvious - take this summary from one <a href="https://x.com/bilawalsidhu/status/1950580970907648234">commentator</a>: </p><blockquote><p>'AlphaEarth Foundations does something clever -- instead of drowning in petabytes of Earth observation data, it creates compact summaries of every 10x10m square on Earth by fusing optical, radar, LiDAR, and climate data. ...it can see through clouds in Ecuador and reveal hidden agricultural patterns in Canada.&#8217; </p></blockquote><p>As I&#8217;ve written before, only in the world of AI could developments like these two new Deepmind models and milestones be essentially addendum to the main points. More likely than not, all the developments reported herein will soon be eclipsed by and quickly forgotten after the release of GPT-5. </p><p>A reminder - as I wrote in <em><a href="https://cassiai.substack.com/p/defence-and-artificial-general-intelligence">In Athena&#8217;s Arms</a> - &#8216;&#8230;</em>this is not a millenarian declaration of deterministic certainty.&#8217; Maybe progress will stop. Maybe someone will declare &#8216;AI will never&#8217; - like chess, Go, the IMO - and this time be right. Maybe someone will point to a fundamental barrier to AI progress that we can&#8217;t innovate our away around, through or over. But right now, the &#8216;straight lines on graphs&#8217; that have driven and reflect AI progress for almost two decades, show no signs of stopping. Very few people seem to be taking this as seriously as you might expect given the evidence - trying to plan, build, invest, govern, write, report, or converse as if AGI could be with us imminently, or in the next few years. Hope is for too many, the entirety of their AI strategy.</p><p>We are still under-estimating the speed of AI progress, and the profundity of its implications.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>There is a great tyranny of definitions here, explained at length in previous posts&#8217; footnotes. You&#8217;re on your own this time, unless you want to go back and check the previous caveats and explanations.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>e.g. https://x.com/kimmonismus/status/1949037129163547083, https://www.reddit.com/r/singularity/comments/1m9v5s5/what_model_is_summit_on_lmarena/ &amp; https://www.linkedin.com/posts/emollick_kinda-amazing-the-mystery-model-summit-activity-7355073850738413568-H_pc </p></div></div>]]></content:encoded></item><item><title><![CDATA[UK Strategic Defence Review (SDR): Reviewing the Review & the Reactionariat.]]></title><description><![CDATA[The good, bad and ugly.]]></description><link>https://blog.cassi-ai.com/p/uk-strategic-defence-review-sdr-reviewing</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/uk-strategic-defence-review-sdr-reviewing</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Wed, 04 Jun 2025 06:02:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Bl5W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Bl5W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Bl5W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 424w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 848w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Bl5W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png" width="1024" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3722542,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/165139847?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Bl5W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 424w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 848w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!Bl5W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47dd07b9-92b0-42c1-92b9-3189fa7dd854_1024x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>The SDR published yesterday (Monday 2 June) is the best of my lifetime. It is sharper, more honest, and more structurally ambitious than any I have read&#8212;including those that predate today&#8217;s formalised processes and those written by nations beyond our borders.</p><p>But it is already at risk. Not because of what&#8217;s in it. But because of what&#8217;s being said around it.</p><p>I&#8217;m writing this not just to praise the Review, nor simply to critique its flaws. There are several, some are serious. But more urgently: to warn that the opportunity it presents could be lost amid noise, spin and superficial commentary. This would be a failure of national consequence. I hope the blog might reach those who can help us avoid it.</p><p><strong>Distinguishing Review Signal from Political Noise</strong></p><p>The misunderstanding and mischaracterisation create three principle risks:</p><ol><li><p><strong>We could miss the moment to prepare for war</strong>&#8212;and thereby make it more likely, and more disastrous.</p></li><li><p><strong>We could miss the chance to raise the level of our defence debate</strong>, after years of kayfabe, fake budgets, and misdirection.</p></li><li><p><strong>We could lose the Review&#8217;s careful and detailed analysis amid the thunderous political &#8216;announceables&#8217; that risk drowning it out</strong> &#8212;many of which are either unrelated, outdated, or contradictory.</p></li></ol><p>Let us take the third first. </p><p>In the run-up to the Review&#8217;s publication, there was a <a href="https://www.thetimes.com/uk/defence/article/review-fighter-jets-nuclear-weapons-x9vldt0sv">story</a> that the UK would now fund and field a second fighter-jet launched nuclear capability, in addition to the continuous-at-sea deterrent, our nuclear submarines with nuclear missiles. This is not in the Review, and regardless of its potential merits and demerits (I think on balance a sovereign capability we need) &#8211; or indeed whether we do commit to this following the talks that the Times tells are us are being held with the US, the Review does not recommend this. It was certain though that announcing it would drown out almost everything else in the debate. Similarly, the announcement of a &#163;15bn new &#8216;sovereign warhead&#8217; was always going to grab headlines, but isn&#8217;t new. Likewise, the announcement of twelve new submarines is not in the Review [0.1], which instructs the MOD to confirm numbers against requirements. Announcing all this as the Review was published obscured what the Review actually says.</p><p><strong>The Flaws are Clear</strong></p><p>Many criticisms of the review are justified. Decoupling it from a Spending Review, as happened in 2021 during the Integrated Review, neutered it. It is a standard Treasury tactic and it enables the Government to do what institutionally, it prefers &#8211; make policy and strategy word-smithing and empty rhetoric, a kayfabe, while the gritty, largely boring reality of implementation and hard choices are hidden: made by default, rather than decision, choices made when the headlines are dominated by wider cuts, spending and taxes on non-defence issues. Relatedly, the MOD reserving for itself the decisions on what equipment it needs, to come <a href="https://www.rusi.org/explore-our-research/publications/commentary/strategic-defence-review-needs-retain-its-ambitions-technological-transformation">after the Review</a>, makes it unlikely they would sign-off any recommendations the Reviewers might want to include on what and where to cut. It cannot be an ends-ways-means review, a strategy, if denied or constrained in the opportunity to comment on what means - financial, and in kit and equipment - are needed.</p><p>In consequence, this Review has been emasculated by being separated from the spending review. As many commentators have noted either:</p><blockquote><p>a. Defence needed a lot more money. Or</p><p>b. the Reviewers needed to show where they would cut, take risk and transform.</p></blockquote><p>They have done neither. But they were presumably prevented from doing so. One wonders what the Review looked liked before the five-month period of &#8216;<em>Decision-making and finalisation</em>&#8217; as the Review team &#8216;<em>finessed&#8230;</em>[their]<em>&#8230;recommendations</em>&#8217; during &#8216;<em>Engagement across Whitehall</em>&#8230;&#8217;.</p><p>The language seems dumbed down in many places, enabling consent and evade (or &#8216;toxic compliance&#8217; as I heard it called recently). e.g. &#8220;high-low mix&#8221; can be whatever the MoD decides it wants it to be. Digital Targeting Web is clearly a &#8220;Kill Web&#8221; the failure to call it what it is shows I think the lowest common denominator compromise that has infected parts of the Review. On procurement, we are still allowing some equipment to be built on 15-20 year timelines, with individual capabilities taking 18 months+ to manufacture. I don&#8217;t think you need to see the classified info (and to be clear, I have not) to see that this is a recipe for cost overruns and failure. The tech will improve a thousand times or more during the life of programmes, requirements will change in geopolitical cycles that can be hours and days, or just a few years. Most importantly - we must be able build stuff faster than our enemies can destroy it. What&#8217;s the ratio between expected attrition rates and the production speed of new platforms like FCAS or SSNs?</p><p>Things are often under-specified, for example, the MOD will likely just rebadge its multidomain integration stuff as a &#8220;digital targeting web&#8221; and keep on failing.</p><p>There are the usual linguistic atrocities. But these fall in to the category of niche objections that might matter, but not to most people. Fellow pedants, find more in the footnotes.[1]</p><p>But there is a real danger here that &#8216;the Reactionariat&#8217; of which I am now a part, miss a historic, urgent and vital opportunity to use the Review to raise the quality of our Defence debate, to hold the Government and the MOD to account. Strategic Reviews rarely give us a strategy - they should, that they don&#8217;t is a failing of our system [1.2]. But they can still be influential - harnessing the zeitgeist, empowering reformers - through a kind of &#8216;narrative dominance&#8217; they can change the conversation, and with it the priorities and activities in Government. This is where the risk of missing the opportunity given by this excellent review lies.</p><p><strong>The Review has Given the Nation an Opportunity we Must Not Miss</strong></p><p>In diagnosis the Review is clear and comprehensive in saying how bad things are, overturning rocks, and calling out how urgently things need fixing. You can see where it has been dumbed down I think, not quite being able to say that if we fight tomorrow it would be a catastrophe. But it is very close, and you don&#8217;t have to work hard to figure out that is what it says. I think you could sum this up as saying it would no longer be possible for HMG to say it did not know how bad things are. I think to have got this through the Whitehall system &#8211; maybe even more in need of reform than that within the MOD - is a significant achievement. It is detailed, exacting and far-reaching. Genuinely impressive in form and content. It could, if understood for what it is, drive a much better debate on defence.</p><p>There are important structural changes - Cyber-EM Command unifying electronic warfare (we&#8217;re actually pretty good at this, in narrow pockets that are permanently at risk of being killed by the system). Similarly, MSHQ and killing Levene are a big deal. Something we tried to get done when I was serving in 2018, again in 2020 when in No10.</p><p>Under the current SofS&#8217; John Healey&#8217;s leadership, and now public in the Review, CDS now has - or should have according to his revised role described in the Review - some meaningful authorities and not just sit as an advisor to SofS and the PM. We can stop talking about a civilianised &#8216;Head Office&#8217; with its appalling culture but a proper HQ, that can be held accountable (without this, no one was in charge and everyone blamed everyone else for continuous failure). These problems were and are systemic, not individual, but they incentivised and rewarded some pretty toxic behaviours. They were part of the reason that lowest common denominator compromise consistently triumphed over confronting issues. We should now have not three and half (STRATCOM) forces designed separately who figure out how they will fight later, but one force, centrally designed and focused on how we would fight in NATO vs Russia (I personally don&#8217;t think this is the right way to think about the threats we face in an era of systemic competition, but it isn&#8217;t an unreasonable approach [2]).</p><p>The Review has a &#8216;demand&#8217; for specific metrics, on lethality, productivity, and national economic impact. There is a section designed to ensure accountability (p.50). There&#8217;s a call to cut &#8220;by at least 50%&#8221; MOD regulation that kills industry. The kill web has a deadline - MVP by 2026, fully operational by 2027 &#8211; and where described as a &#8216;clear unifying mission&#8217; seems to be an attempt to give us a proper theory of winning that we haven&#8217;t had since the end of &#8216;Air-Land Battle&#8217; which passed with the Cold War. The commentariat could do much to push this to become a full operating concept, a theory of winning that drives force design based on DARPA&#8217;s ideas of the <a href="https://www.darpa.mil/news/2020/cross-domain-kill-webs">Kill Web</a>, and <a href="https://www.darpa.mil/news/mosaic-warfare">Mosaic Warfare</a> (or decision-centric warfare). </p><p>The focus on the way war has been transformed by technology runs throughout, describing how this is not an event, but rather that technology is continuously redefining warfare. This is, I think, correct and not really disputed (the degree to which it is true is argued, not the fact that it is). </p><p>Criticisms that there is not enough on emerging technology and procurement are unsustainable &#8211; some of the Reviews recommendations were clearly taken by the Treasury and announced early, but they are well covered and pushed further: &#8220;<em>Major modular platforms (contracting within two years). Pace-setting spiral and modular upgrades (contracting within a year). Rapid commercial exploitation (contracting within three months), with at least 10% of the MOD&#8217;s equipment procurement budget spent on novel technologies each year</em>.&#8221; The MOD is pushed to engage with VC and Private Capital in ways it currently does not. </p><p>The commitment, publicly, to making the UK&#8217;s Armed Forces &#8216;the most innovative in NATO&#8217; echoes something I argued for in 2020 for the 2021 Integrated Review. To be meaningful, it will need measures and metrics &#8211; how will we know we have achieved this? But these are not impossible to imagine. I offered suggestions <a href="https://wavellroom.com/2021/05/14/innovation-and-the-integrated-review/">here</a> in 2021. You could adapt the CBI&#8217;s measure of innovation pull-through &#8216;new to market new to firm inventions&#8217; &#8211; as &#8216;new to frontline, new to service inventions&#8217;. This Review gives Parliament and media the chance to propose how this might be metricised, measured or judged, and hold the MOD to account either for delivering against them, or to propose its own. There are a lot of deadlines &#8211; any reader of Government documents will recognise how rare this is in HMG policy. It is a huge opportunity for Parliament, media and public alike to hold Government accountable for detailed delivery. In shouting down the Review, the reactionariat risk missing a generational and urgent opportunity to make us safer.</p><p>Personally, I would have liked to have seen a commitment to a 6-monthly independent review of progress vs the recommendations, submitted to the PM, Parliament, and available to public. That was what embedded Levene&#8217;s reforms, mostly for the worse in the long run, but the process teaches lessons: <em>people respect what you inspect and value what you reward</em>. I would have had named owners for each recommendation, and reported on their progress in those regular independent external inspection reports. Of course Parliament, media, commentariat could now start making the argument for this, to help those reformers on the inside doing the same, to push the system to reform. To improve our Defence debate. This was probably always beyond what the Reviewers could achieve. Neither this Govt, nor the last, or any in my lifetime would have agreed to that - it needs one under greater political pressure (where scrutiny now could help), or one with greater confidence, and/or one that viscerally believes in what this review tells them: war could be imminent, and we would likely lose. If you&#8217;re not steeped in this stuff, that is hard to believe.</p><p>Examples of how far-reaching and thorough-going this Review is run throughout, but are particularly notable in the chapters on Defence medical, education and training &#8211; again calling out shortfalls and failings clearly, with recommendations for how to get to a solution. A crucially important example &#8211; especially since everyone, reviewers and critics alike, seem to agree we aren&#8217;t spending enough, won&#8217;t spend enough, and are not ready to deter or fight a war - is the call for a proper plan for mass mobilisation as Sky&#8217;s Deb Haynes and I called for this time last year. How we prepare to rapidly increase our resilience, preparedness and ability to fight across the whole of society and government if war comes while our conventional armed forces are too weak to succeed or sustain in the fight is not just prudent planning, it is a part of deterrence too. Similarly, making defence infrastructure an operational consideration, and not one civilianised, lobotomised, and focused only on cost, efficiency and short-termism is a huge shift. The form of this Review is better - the citizens panels a neat innovation. The substance on a national conversation - expanding cadets - is good. None of these are things past Reviews have covered anywhere near as well.</p><p><strong>Armed Forces</strong></p><p>The recommendations on the RN &amp; Army are radical and right. Worth reading these sections alone (short, digestible). The recommendations on the RAF are in my opinion dangerously inadequate. There is nothing on what the high-low mix should be. Little on where the RAF carries risk in current operations. Far too much scope for consent and evade. The diagnosis has either pulled punches or has been too superficial. We should remember that the notional independence of the Review makes this the Review Team&#8217;s failing, not the RAF&#8217;s. Since this Review offers only recommendations, it does not mean the RAF can&#8217;t still be clear-eyed in diagnosing and addressing its own challenges with the clarity, rigour and radicalism that is present in the Review&#8217;s section on the RAF&#8217;s sister Services.</p><p>The section on Space disappoints. The U.K. continues to radically underestimate its importance and centrality to our future prosperity and security. I don&#8217;t think the Reviewers, and almost anyone else, agrees with me on this. But I stand by it (it requires AI leadership to turn its potential for our relative power into reality). The review is good, within the parameters of the existing debate, where space is just the bit above the air domain, and the issues are access and control - not trade, industry, colonisation. This should be an area for the National Security Review. I am pretty certain they will fumble this too. It is too far outside the Overton Window, and would blow the mind of a Treasury trying to scrape money together for social spending. We&#8217;ll all be much poorer and less secure for this in 10-years time, but that kind of thinking and planning is hard in Whitehall. For Defence to contribute to the debate and the capability we needed an independent Space Force as both I and others have argued.[3] A Space Force could argue for funding and develop expertise in the domain on equal footing with those representing the Air, Land, Maritime and Cyber domains. We still don&#8217;t have one. Not a surprise, but disappointing.</p><p>On Cyber-EM. This is complex. There is a continuity across cyber and EM (I prefer &#8216;electronic warfare&#8217; - narrower, clearer). But they are also distinct. And they run through everything else. I think we need an independent Cyber Command to argue for resources as an equal to space, air, land, sea domains. But this doesn&#8217;t solve the EW problem. And there is no chance the Treasury, or the existing services would allow two new Commands. It was never a likely outcome. We probably have the least bad option here, and a big improvement on the status quo.</p><p>Info Ops for me is central (war is about coercing decisions) even a nuclear strike is an attempt to coerce a decision, ultimately for psychological effect, so too every bullet fired. It is a side note in the Cyber EM section. Again though, this is the orthodoxy. I am not a lone voice, and I think logic on our side. But it is an argument we have not yet won.</p><p>On intelligence &#8211; the Review is very good on calling out our weaknesses in Counter-Intelligence &#8211; which we must prioritise, indeed in which we need to be best in class, or everything else is undermined. I am less sure on the radical merging of the intelligence branches - I think this will kill off necessary specialisation. I also note that the Reviewers don&#8217;t call for the re-establishment of <a href="https://www.amazon.co.uk/Authorised-History-British-Economic-Intelligence/dp/1138658286">Defence Economic Intelligence</a> (another thing I think is important, but not many talk about as an issue), and should have done. This is an era of geoeconomic competition. We have no economic intel function in HMG (killed in Defence Intelligence in the 1990s). Also necessary to support Defence Exports. Perhaps the National Security Review that Government has said will follow, will cover this. But I am not optimistic and I think the Defence Reviewers missed an opportunity here.</p><p><strong>Reviewing the Review</strong></p><p>A short way of seeing how much better this Review is than anything that has come before in my lifetime would be to print off the MOD&#8217;s 2020/21 Command Paper, and read them side-by-side. The former was bad at the time. It is shamed by this.</p><p>I think a fairer judgement than the many who are saying this is just &#8216;more of the same&#8217; would be that the Review gives the MOD enough that with the structural reforms and a genuine SofS commitment to real implementation, it could be transformative. </p><p>Without SofS commitment, the dumbing down of language and stopping just short - i.e. stating the need for metrics mostly not specifying what they should be, not saying what should be cut, not offering a view on balance of crewed/uncrewed, on AGI timelines, specifying a need for kill web without the fuller &#8220;theory of winning&#8221; (op concept) leaves it largely impotent to address the threats we face. It can only describe and recommend.</p><p>The Reviewers were hamstrung by this being, as Matthew Savill has said &#8216;<a href="https://www.rusi.org/explore-our-research/publications/commentary/strategic-defence-review-needs-retain-its-ambitions-technological-transformation">Schr&#246;dinger's Review</a>&#8217; neither independent and external, nor internal and owned by the MOD. Thus they have been prevented, I would guess, in going as far as they wanted. But it is clear that they have used what political power their position gave them to preserve much that is valuable.</p><p>The problem with the fake budgets, rhetoric, and vapourware,  the noise around the Review, is that they sound exactly like &#8216;business as usual&#8217; and everyone is acting accordingly, taking pot shots at a Review that seem to me to be frequently completely detached from what the Review actually says. Sometimes &#8216;control K&#8217; in the SDR for whatever key word of criticism is being levelled can expose this. It isn&#8217;t hard to check. </p><p>The noise is at worst a political attempt to obfuscate what the Review actually says, e.g. explicitly and directly &#8220;business as usual is not an option&#8221; and which otherwise - even in its likely watered down form &#8211; exposes the gaps between years of rhetoric and reality too clearly, and in ways that are not advantageous for a Government that knows there are still not many votes in Defence. Even at best, if not deliberate, the noise results from a misreading of the Review&#8217;s contents. Whatever the reason, there is a real risk that the &#8216;annouceables&#8217; obscure what the Review actually says, harming the debate the Review calls for before it has even started. Drowning out signal in noise. Leading the commentariat to discredit the Review, making association with it low status, ensuring that narrative dominance - the only way in which this Review can succeed, is never achieved. </p><p>My call to action is this: read the Review in detail, and celebrate it. Make association high status. Use it to push the recommendations further, to empower the reformers whether they be Minister&#8217;s, civil servants, or military. Theirs is a difficult fight, the review gives them material support. Allies - you - could help them win.</p><p>An effective Defence is, as the FT&#8217;s Janan Ganesh <a href="https://www.ft.com/content/8178b984-cf92-4313-8381-d8e2f6fc7fa0">said</a> of growth, something everyone in Britain wants, but, whenever a trade off is required &#8216;just not that much&#8217;. The Government, Secretary of State John Healey and his team in particular, are to be commended for creating an opportunity for a debate that might change that. Commissioning even a semi-independent Review was brave. In publishing and accepting the recommendations of what could be a radical review, the Prime Minister and his Government have created the space for and seeded a much better national conversation on just how much we want to spend on defence, vs the risk of our sons, daughters, mothers, fathers, friends and colleagues, coming back in body bags, being overrun in battles we fail to deter and are not fit to fight. What happens next matters hugely. I write this in the hope that reformers in and across Government, those in Parliament, media, and public alike can make the most of this most remarkable of Reviews. It must not be a return of what Cabinet Secretary Simon Case once called &#8216;the curse of the missed opportunity&#8217;. As the Review quotes one of its &#8216;Citizen Panellists&#8217; on defence: [otherwise] &#8216;&#8230;it won&#8217;t be long before something more significant happens and we will think it should&#8217;ve been more of a priority.&#8217;<br><br>[published 0700; updated 09:33, 21:58, 4 June 2025]</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>[0.1] Added 2100, 4 June 2025. I am grateful for having this pointed out so I can correct the record: technically this is not quite right, in the SofS foreword he does talk about the ambition to get to 12 SSNs. But I offer two defences (1) my defence is of the &#8216;independent&#8217; Review, the SofS foreword cannot, by definition be in anyway independent of the MOD; (2) in the foreword it is contextual and soft pedalled, not trumpeted to the point where it drowns out discussion of anything else, as happened when this information was provided ahead of the Review as &#8216;a response to it&#8217; (https://www.gov.uk/government/news/uk-to-expand-submarine-programme-in-response-to-strategic-defence-review).</p><p>[1] &#8216;Warfighter&#8217; should be excised from the lexicon. It is a hair&#8217;s breadth from &#8216;Warrior&#8217; which is even worse &#8211; and I think misunderstands how military dominance is achieved. The Spartans were defeated by the better organised forces of Thebes. The Romans dominated the &#8216;warriors&#8217; of Europe through superior organisation. The Vikings were routed not by lone berserkers but by the more centrally organised Anglo-Saxon levy at Stamford Bridge in 1066, where Harold Godwinson&#8217;s disciplined fyrd outmatched their hit-and-run tactics. In each case, it was organisation&#8212;training, supply, command and control&#8212;not individual heroics, that determined the outcome.<br><br>Picking out and trying to kill-off a personal bugbear that finds its way into the review: &#8220;war is fought across all five domains but people and the land they live on are at the heart of it&#8221;. This is a half-baked zombie idea, and should be murdered, cremated, buried and sealed in the earth&#8217;s core so I never encounter it again. It is the Army&#8217;s attempt to argue for the Land domain as the decisive and most important. Which is nonsense. You win wars by imposing costs, and potential future costs, that the enemy state or nation are unwilling to tolerate. You a coerce a decision. That&#8217;s it. There is no special premium on &#8216;the land people live on&#8217;. Similarly, the ongoing fetishisation of SF as &#8216;the tip of the spear&#8217; needs to end. I think it infantilises the rest of the military and damages our overall effectiveness.</p><p>[1.1] e.g. see last year&#8217;s  Liaison Committee <a href="https://publications.parliament.uk/pa/cm5804/cmselect/cmliaisn/31/report.html">report</a> on Strategic Thinking in Whitehall, and multiple Committee reports before:  2011, <a href="https://publications.parliament.uk/pa/cm201011/cmselect/cmpubadm/713/713.pdf">Public Administration Select Committee</a>, on the <a href="https://publications.parliament.uk/pa/jt201617/jtselect/jtnatsec/153/153.pdf">2015 SDSR</a>, <a href="https://hansard.parliament.uk/Lords/2015-09-17/debates/15091736000773/Syria?highlight=%22grand%20strategy%22#contribution-15091736000054">on Syria</a>, on the <a href="https://publications.parliament.uk/pa/jt201719/jtselect/jtnatsec/2072/207205.htm#footnote-104">2018 NSCR and Modernising Defence Programme</a></p><p>[2] I think abandoning the Indo-Pac tilt is an error. We are focusing on NATO, by which we really mean a land war in Europe most likely an attack on the Baltic states, just as Russia is bleeding itself white in Ukraine, it&#8217;s war economy hollowing the country out unsustainably, when we have two new powerful continental allies in the alliance, Sweden &amp; Finland anchoring the Northern flank and now Article 5 committed to aid any member that is attacked. We are doing this when continental Europe&#8217;s Continental Powers - Poland in particular, but the Czech Republic, Romania (transforming into a serious military power on NATO&#8217;s eastern front), the Netherlands and Denmark are all rebuilding credible land forces.  France, long prepared for expeditionary war, now preparing for high-intensity conflict on the continent - and most notably Germany too all rearming for this land war - continental powers building continental armies to fight continental wars [see my article: https://wavellroom.com/2022/04/06/ukraine-the-integrated-review-land-power-in-europe/]. In such a situation, a UK maritime, air, cyber and space commitment to NATO, with the flexibility to secure our maritime nations security and economic interests - and make a massive contribution to the defence of Europe. Labour&#8217;s politically driven &#8216;NATO first&#8217; mantra, is interpreted by Whitehall as meaning they can forget &#8216;East of Suez&#8217; again - an error when the deeper and more enduring threat and more likely war comes from and with China, and geoeconomic systemic competition (see IR 2021) from China threatens UK security at home and interests globally - esp in the Indo-Pacific, where the world&#8217;s largest middle class will soon be and where the world&#8217;s fastest growing economies are. I would have us plan not vs Russia with NATO but vs a range of very specific scenarios. Interested readers might see Murray &amp; Knox&#8217;s &#8216;<a href="https://amzn.eu/d/4mBK84R">Dynamic of Military Revolutions</a>&#8217; (especially p. 192) for more on this last point: &#8216;&#8230;revolutions in military affairs have emerged from evolutionary problem-solving directed at <strong>specific operational and tactical issues </strong>in a <strong>specific theatre of war</strong> against a <strong>specific enemy. </strong>Successful innovators have <strong>always </strong>thought in terms of fighting wars against <em><strong>actual</strong> </em>rather than <em>hypothetical </em><strong>opponents, with </strong><em><strong>actual </strong></em><strong>capabilities, in pursuit of </strong><em><strong>actual </strong></em><strong>strategic and political objectives.</strong>&#8217; [bold added, italics in original].</p><p>[3] Dolman, E.C., 2009. <em>Victory through Air Space Power</em>. <em>Strategic Studies Quarterly</em>, 14(2). Available at: <a href="https://www.airuniversity.af.edu/Portals/10/SSQ/documents/Volume-14_Issue-2/dolman.pdf">https://www.airuniversity.af.edu/Portals/10/SSQ/documents/Volume-14_Issue-2/dolman.pdf</a> [Accessed 10 March 2025]. <br><br>Fletcher-Jones, C. (2024). In Favour of an Independent Royal Space Fleet: The Smuts Report and the Precedent of the Royal Air Force. <em>The RUSI Journal</em>, <em>169</em>(1&#8211;2), 22&#8211;32. <a href="https://doi.org/10.1080/03071847.2024.2359405">https://doi.org/10.1080/03071847.2024.2359405</a></p>]]></content:encoded></item><item><title><![CDATA[Doctor Weirdlove or: “How I Stopped Worrying and Learned to Love Skynet”]]></title><description><![CDATA[Why betting on a crewed RAF Tempest is a dangerous gamble&#8212;and how ignoring the AGI debate leaves us strategically exposed]]></description><link>https://blog.cassi-ai.com/p/doctor-weirdlove-or-how-i-stopped</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/doctor-weirdlove-or-how-i-stopped</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 09 May 2025 18:59:48 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!1T-A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1T-A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1T-A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 424w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 848w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 1272w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1T-A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png" width="580" height="393" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:393,&quot;width&quot;:580,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Almost Everything in &#8220;Dr. Strangelove&#8221; Was True | The New Yorker&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Almost Everything in &#8220;Dr. Strangelove&#8221; Was True | The New Yorker" title="Almost Everything in &#8220;Dr. Strangelove&#8221; Was True | The New Yorker" srcset="https://substackcdn.com/image/fetch/$s_!1T-A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 424w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 848w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 1272w, https://substackcdn.com/image/fetch/$s_!1T-A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2f6bad9-7f72-4ea0-a11a-66b5232a3cee_580x393.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Figure 1. Stanley Kubrick's Dr Strangelove was a political satire, asking us to laugh at the logical extension of the doctrine of mutually assured destruction, to force consideration of what this meant in practice. In calling this article Dr Weirdlove, we acknowledge how insane some of this sounds. And yet it is the logical extension of the path we are on.</em></figcaption></figure></div><p>We&#8217;re worried. On the public evidence, current UK military understanding of artificial intelligence, both present capabilities and the trajectory of its development, is dangerously inadequate. The consequences of this misunderstanding are existential: for our service personnel on the future battlefield, for our armed forces in future combat, and for humanity itself.</p><p>The danger is twofold:</p><p>First, UK Defence radically underestimates current AI capabilities and the speed of their advance. If an adversary adopts AI-enabled decision-making and autonomous capabilities, and we don&#8217;t, we lose.</p><p>Second, UK Defence is not considering the implications for humanity of Artificial General Intelligence and war. Without solving AI alignment&#8212;the challenge of ensuring advanced AI reliably follows human intent&#8212;we risk handing power over humanity&#8217;s most lethal weapons to misaligned, super-intelligent machines.</p><p>We make three arguments, each logically building on the other.</p><blockquote><p>1. AI-driven decision-making and autonomy are essential for our national security.</p><p>2. Embracing AI-driven decision-making necessarily involves Defence leaders in the broader AI alignment debate.</p><p>3. Therefore, Defence leaders must actively engage in this debate, clearly articulating the "Strangelovian" dilemma we face: the paradox of urgently needing AI-driven military capabilities despite the existential risks posed by AGI.</p></blockquote><p><strong>Caveat</strong></p><p>Before we get into that, we want to be clear on what we are and aren&#8217;t saying in this article.</p><p>Firstly, we recognise the need for a major programme to ensure the UK can, to the use the USAF phrase &#8216;fly, fight and win&#8217; in future warfare. We are not opposed to the investment in this essential capability.</p><p>Second, the RAF&#8217;s decision to run the podcast series and talk openly about its next generation fighter aircraft &#8216;Tempest&#8217;, the wider Future Combat Air System (FCAS) of which it is a part, and the international collaboration, the &#8216;Global Combat Air Programme (GCAP)&#8217; that is to <em><a href="https://commonslibrary.parliament.uk/research-briefings/cbp-10143/">design, manufacture, and deliver</a> a next-generation crewed combat aircraft</em>, is an admirable commitment to transparency.</p><p>Third, those speaking on the podcast showed courage in speaking publicly, spoke well, in the difficult circumstances of being on the record in an informal environment, and are to be commended. Without such transparency, debate is almost impossible.</p><p>Fourth, the announcement of the RAF&#8217;s procurement of <a href="https://www.raf.mod.uk/news/articles/stormshroud-arrival-marks-the-future-of-uk-air-combat-power/">StormShroud</a>, an autonomous support to crewed fighters (or &#8216;loyal wingman&#8217;) is a step in the right direction, to be celebrated, it does not change what is argued herein.</p><p>We hope to show why debate is necessary, but recognise we risk reducing the willingness to be open in the future. We hope that is not the case. In writing this post we wish to be clear that our concerns relate to the wider issues signalled in the Armed Forces&#8217; understanding of Artificial Intelligence and autonomy, and are not aimed at individuals. We offer constructive criticism, genuine concern. We are seeking to play the ball, not the man.</p><p>Why pick on Tempest? Because we judge that the air environment is the domain which will be automated first, and because the long timeline of its development &#8211; 15 years before it enters service, ~45 years before it is expected to stop serving as the RAF&#8217;s frontline fighter &#8211; are well in excess of most estimates for when we reach AGI.</p><p><strong>AI Capabilities</strong></p><p>UK Defence currently misunderstands, and dangerously underestimates, the capability and trajectory of AI systems. A recent illustrative example: The Economist&#8217;s Shashank Joshi noted two concerning claims<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn1"><sup>[1]</sup></a> on the RAF&#8217;s otherwise excellent podcast<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn2"><sup>[2]</sup></a>:</p><blockquote><p>1. AI today is just a &#8216;just a stochastic parrot, it is just advanced auto complete&#8217;<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn3"><sup>[3]</sup></a> &#8230; &amp;</p><p>2. &#8216;We&#8217;re prepared for the time when AGI does catch-up.&#8217;</p></blockquote><p>The first claim is heard widely in UK Defence, sounds smart, but isn&#8217;t. As Shashank noted on Bluesky, it is indicative of a &#8216;slightly impoverished view of frontier models that you see in defence &amp; even parts of the intelligence world&#8217;. Commenting, Professor Ken Payne suggested this view reflects Defence&#8217;s &#8216;overconfidence, and signalling&#8230;expertise via knowing cynicism&#8230;&#8217; rather than genuine understanding.<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn4"><sup>[4]</sup></a> Both Payne and Joshi are well connected and trusted commentators and experts within UK defence. They highlight that this is a bigger issue than a throwaway comment on a podcast.</p><p>The &#8220;stochastic parrot&#8221; concept <a href="https://dl.acm.org/doi/pdf/10.1145/3442188.3445922">originates</a> from Emily Bender&#8217;s 2021 paper published <em>over 12-months before the release of ChatGPT. </em>Today the idea is a misleading caricature of AI models. LLM&#8217;s do not just parrot what they were trained on. Here&#8217;s one of the earliest academic <a href="https://arxiv.org/pdf/2208.02957">deconstructions</a> of what is wrong with that claim, from August 2022. <a href="https://www.youtube.com/watch?v=Gg-w_n9NJIE&amp;t=4678s">Here&#8217;s</a> Geoff Hinton, Demis Hassabis, Tomaso Poggio, Ilya Sutskever &#8211; leading experts in AI - discussing how AI can reason in analogies, and be creative (2024). Here a paper from 2024 showing <a href="https://arxiv.org/abs/2406.14546">evidence</a> LLMs can conduct inductive out-of-context reasoning, make inferences beyond their training data. Similarly, mathematician <a href="https://en.wikipedia.org/wiki/Stephen_Wolfram">Stephen Wolfram</a> explains how <a href="https://renewableplus.blogspot.com/2024/06/llms-as-world-builders-implications-of.html">language is a world-model</a>, from which LLMs are likely deriving their output. The &#8216;stochastic parrot&#8217; claim is a zombie idea, long dead, it walks on.</p><p>The problem isn't just a misunderstood technical detail. The problem is that this error compounds into the future. It is a misunderstanding of what is likely to happen and at what speed. It misunderstands the stakes at play and the odds.</p><p>Three recent examples of how quickly AI is advancing, and why it matters:</p><blockquote><p>1. <strong>&#8216;Humanity&#8217;s Last Exam&#8217; (January 2025):</strong> There is a new AI benchmark test called &#8216;Humanity&#8217;s Last Exam&#8217; a selection of extremely difficult questions which AI was initially very poor at.<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn5"><sup>[5]</sup></a>But AI is improving (six times better within two months) and is expected surpass the test within 12-months.<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn6"><sup>[6]</sup></a> Imagine if you sat the hardest exam ever written, flunked it, and then were improving at a similar rate &#8211; how smart would you be in 2-years, in 3, in 5?</p><p>2. <strong>AI&#8217;s Persuasive Power (April 2025): </strong>This week, we learn via an unauthorised experiment on Reddit that AI is now performing better than humans in persuading people to change their minds. <a href="https://x.com/reddit_lies/status/1916916134630117814">Six times better</a>.</p><p>3. <strong>Rapid Task Scaling. </strong>AI&#8217;s capability to perform increasingly lengthy human-equivalent tasks has surged dramatically.<strong> </strong>The length of tasks, judged by how long it would take a human, that AI <a href="https://x.com/METR_Evals/status/1906787469129105614">can now complete</a> is increasing. In 2022 AI could complete software coding tasks that would have taken a human 30 seconds. In 2025 they can complete tasks that would have taken a human <a href="https://x.com/AISafetyMemes/status/1915102279515983933">1 hour</a>. The pace of advance (2024-2025) has gone from doubling every 7 months, to doubling every 4 months. If you understand exponentials, you&#8217;ll get the claim that this means agents will be doing tasks it normally takes a human 1-month to do by mid-2027. As one of us noted in a talk this week, at this rate of improvement, <em>by 2029</em>, the end of this Government&#8217;s Parliamentary Term and the Trump Presidency, AI will be doing tasks that take humans 2000 hours labour - roughly equivalent to a human working year - doing this in days, hours, maybe even in minutes. <strong>By 2029.</strong> &#8230;FCAS is planned to come into service <em>in 2040</em>, and likely goes out of service ~2070.<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn7"><sup>[7]</sup></a></p></blockquote><p><strong>Can those &#8216;straight lines on graphs&#8217; that have given us exponential progress in AI just keep going? </strong>The <a href="https://www.lesswrong.com/posts/SoWperLCkunB9ijGq/can-ai-scaling-continue-through-2030-epoch-ai-yes">answer</a> seems likely to be yes, until at least 2030. A <a href="https://epoch.ai/blog/can-ai-scaling-continue-through-2030">recent</a> Epoch AI study suggests &#8220;<em>2e29 FLOP training runs will likely be feasible by 2030</em>.&#8221; That&#8217;s <strong>200 quintillion FLOPS. </strong>If that doesn&#8217;t help you, over on <em>Less Wrong </em>they <a href="https://www.lesswrong.com/posts/SoWperLCkunB9ijGq/can-ai-scaling-continue-through-2030-epoch-ai-yes">explain</a>: &#8220;<em><strong>In other words, by 2030 it will be very likely</strong> <strong>possible</strong> <strong>to train models that exceed GPT-4 in scale to the same degree that GPT-4 exceeds GPT-2 in scale.<a href="https://epochai.org/blog/can-ai-scaling-continue-through-2030#fn:2"><sup>2</sup></a></strong> If pursued, we might see by the end of the decade advances in AI as drastic as the difference between the rudimentary text generation of GPT-2 in 2019 and the sophisticated problem-solving abilities of GPT-4 in 2023.</em>&#8221; You&#8217;ve probably not really heard of GPT-2, it predates ChatGPT (GPT-3) by 3-years. It&#8217;s like the jump between Model T Ford, when you could still argue horses were better for lots of work tasks, and the cars of the 1980s, when no one sensible was making that argument. We are looking at a similar or greater advance again by 2030.</p><p>These are some of the latest indicators. This blog has covered where we are going and how quickly we are heading there, particularly in &#8216;Don&#8217;t Blink&#8217; and &#8216;In Athena&#8217;s Arms&#8217;. On the Wavell Room, we have all argued independently that &#8216;the future is uncrewed&#8217;. We&#8217;re on a track to AGI, perhaps as soon as 2026, but with a median Metaculus estimate for &#8216;oracle&#8217; AGI by 2028 (a year earlier than was forecast throughout 2024). Taken together, these examples starkly illustrate that AI is advancing at a pace Defence is failing to appreciate or prepare for adequately. To underestimate AI is to risk strategic defeat.</p><p><strong>The AI Quarterback: Why Tempest Does Not Need Humans</strong></p><p>Let&#8217;s explore the claim in the podcast that Tempest, the fighter aircraft at the centre of FCAS <em>&#8216;definitely won&#8217;t be remote&#8217;, </em>for which we read &#8216;won&#8217;t be fully autonomous&#8217;, and that there is therefore a necessity for a human &#8216;<em>quarterback</em>&#8217; to coordinate autonomous and crewed platforms in attack and defence. We think this claim rests on multiple flawed assumptions, and misunderstands both technological progress and operational realties.</p><p>Why would it be impossible for Tempest to be remote? One answer is that any kind of electronic emission can be detected, and so you can&#8217;t have a datalink to the aircraft, as you do for current remotely piloted drones. There are unchanging physics aspects to this argument. The range of the radio link to control an aircraft is defined by the power of the emission. A long-range aircraft, essential to operate over enemy terrain, would need very high-power emissions, which makes it easier to detect.</p><p>But this smuggles in an assumption, that there has to be a datalink if there isn&#8217;t a human in the cockpit. That ignores the speed of technology development. If an AI is able to take inputs of sensor data, process them faster and respond more effectively than humans, it can be fully autonomous. If it is fully autonomous, it doesn&#8217;t need a data link.</p><p>The argument that a human &#8216;<em>quarterback</em>&#8217; - the &#8216;player&#8217; coordinating the attack - is a necessity also assumes the crewed &#8216;quarterback&#8217; platform must remain in (relative) close proximity to its autonomous wingmen. It must be close to reduce emissions and avoid detection when coordinating the attack. If not close, then the human quarterback has the same datalink problem that the long range remote piloted aircraft had. Proximity enables low-powered or visual communications, reducing (though not eliminating) detectability. However, this constraint applies equally whether coordination is human or AI-led. Although fully "zero-emission" platforms may be unrealistic, AI-managed coordination would likely significantly minimise emissions, surpassing even low-emission human-led approaches (AI communication likely more efficient). Thus, human presence offers no inherent advantage in emission management or stealth&#8212;AI coordination can achieve equal or greater effectiveness with reduced proximity constraints.</p><p>A related argument is that you need a pilot because maybe electronic warfare &#8211; jamming, spoofing, frying of the aircraft electronic sensors and systems &#8211; would cause a fully autonomous fighter aircraft to drop out the sky. This is indefensible. Current fighter aircraft are &#8216;fly by wire&#8217;. To optimise for agility they are inherently unstable, and the only thing that keeps them in the air is the computer constantly adjusting the configuration of the aircraft&#8217;s control surfaces. If an adversary is capable of jamming, spoofing or frying the electronic systems of an autonomous fighter, then they are equally capable of incapacitating a crewed aircraft, rendering the presence of a human pilot irrelevant to survivability. The distinction between crewed and uncrewed aircraft in terms of vulnerability to electronic attack is therefore illusory, and the solution lies not in insisting on human pilots but in hardening aircraft electronics against such threats.</p><p>Then there is the argument that there are minimal to no trade-offs involved in including a pilot in the cockpit, so long as the aircraft, as is the case with Tempest, can be optionally crewed. Again this seems indefensible.</p><p>In all major wars, pilot loss has been more of a limiting factor on a nation&#8217;s ability to fight than aircraft loss. We can manufacture aircraft faster than we can train pilots. This was true of both <a href="https://battle-of-britain-diary.org.uk/1940/09/07/dowding-has-a-conference-to-discuss-how-to-manage-fighter-commands-pilot-losses/">RAF</a> pilots in the Battle of Britain and Luftwaffe pilots in the Defence of the Reich. In the Ukraine war, shortages of pilots has limited both <a href="https://www.eurasiantimes.com/amid-lack-of-pilots-russian-air-force-announces-joining-of-female/">Russian</a> and <a href="https://english.nv.ua/nation/u-s-limited-by-pilot-availability-in-training-ukraine-s-f-16-squadrons-50472557.html">Ukrainian</a> air operations &#8211; and been specifically <a href="https://english.nv.ua/nation/u-s-limited-by-pilot-availability-in-training-ukraine-s-f-16-squadrons-50472557.html">cited</a> as a reason for delaying providing Ukraine with F-16s.</p><p>Another argument is that pilot skill is irreplaceable. Yet today AI-driven drones have begun to outperform human pilots in complex, high-speed drone racing scenarios, dodging drones through woods and around complex obstacle courses at impossible speed with super-human agility, signalling a significant shift in the capabilities of autonomous flight. The USAF&#8217;s Venom programme sees autonomous &#8216;VISTA&#8217; F-16s dogfighting (Basic Fighter Manoeuvres), something we were <a href="https://www.forbes.com/sites/erictegler/2020/08/20/ai-just-won-a-series-of-simulated-dogfights-against-a-human-f-16-pilot-5-to-nothing-what-does-that-mean/">told</a> in 2020 wouldn&#8217;t happen for &#8216;in excess of a decade, maybe two&#8217;. That estimate came from a USAF pilot who was a relative AI optimist, schooled in DARPA&#8217;s simulator-based AlphaDogFight trials, where the humans lost. As another of those defeated pilot&#8217;s admitted &#8216;&#8220;If I were to walk away from today saying I don&#8217;t trust the AI&#8217;s ability to perform fine motor movement and achieve kills, I&#8217;d have a lack of integrity&#8230;&#8221;.</p><p>The quarterback analogy overlooks another crucial &#8216;skill' limitation: humans have extremely limited capacity for simultaneous attention. This is why texting while driving is illegal&#8212;i<a href="https://news.vt.edu/articles/2009/07/2009-571.html">t multiplies crash risk by a factor of twenty-three</a>. Coordinating multiple combat aircraft is vastly more complex than texting, demanding constant real-time awareness, quick decisions, and continuous communication across multiple platforms. Human pilots simply don&#8217;t have the cognitive bandwidth to fly their own aircraft effectively while simultaneously managing a swarm of others. Swarming technologies, multi-agent systems, can do this better, faster, more responsively. And surely after AlphaGoZero in Go (2017), <a href="https://alphastar/">AlphaStar</a> in the game Starcraft II (2019) &#8211; which required breakthroughs in AI that could account for game theory, imperfect information, long-term planning, real-time, large action spaces, we don&#8217;t need to make the case that AI is superior in coordinating complex tactical plays? It&#8217;s a big bet against the odds to suggest you&#8217;ll need a human quarterback for their tactical skill, when Tempest is deployed in 15 years (2040), let alone in 45 (2070) when the aircraft is expected to still be operating.</p><p>Then there is the argument that aircraft capability isn&#8217;t majorly impacted by having a human in the cockpit. But the limitations this places on how many &#8216;g&#8217; an aircraft can pull is a problem. Even marginal differences can determine whether an aircraft is taken out or not. The presence of a human pilot imposes physiological limits on an aircraft's manoeuvrability, particularly regarding g-force tolerance. While modern fighter jets are structurally capable of executing manoeuvres exceeding 9g, human pilots typically can withstand only up to 9g for short durations, even with the aid of g-suits designed to mitigate the effects of high acceleration forces. Exceeding this threshold risks G-LOC (g-force induced Loss Of Consciousness), compromising mission effectiveness and pilot safety. Unmanned aircraft, free from these human limitations, can exploit the full aerodynamic potential of their designs, allowing for more aggressive manoeuvres. </p><p>Maximum load factor (&#8216;g&#8217;) is not the only important factor in air combat manoeuvrability that full autonomy aids. An AI would like hold an aircraft at the edge of its performance envelope much more effectively. A further transformation will come from the AI quarterback &amp; autonomous systems&#8217; ability to execute precisely coordinated, multi-aircraft tactics&#8212;complex three-dimensional manoeuvres performed with superhuman timing, precision, and strategic coordination. Uncrewed aircraft collaborating dynamically can preserve formation energy, manipulate adversaries into vulnerable positions, and optimise defensive manoeuvres to evade threats with accuracy no human team could manage.</p><p>Enhanced agility - individual and formation - can be crucial in combat scenarios, where even marginal improvements in manoeuvrability can significantly reduce the probability of being hit by enemy missiles: high-g manoeuvres in the end game can be the difference between the aircraft being destroyed or the missile being evaded. There&#8217;s a reason pilots and intelligence officers scrutinise &#8216;doghouse plots&#8217; that illustrate an aircraft's performance envelope vs those of adversary aircraft and adversary missiles. It&#8217;s the difference between life and death. Mission success and mission failure.If having a pilot in the cockpit imposes even small limits on manoeuvrability, then the cost of having a human quarterback is reduced probability of mission success.</p><p>Then there is the question of the wisdom of a programme that takes 10-20 years to deliver a capability in an age of exponential technical progress. It was only 14 years ago that US Venture Capitalist Marc Andreesen argued &#8216;<a href="https://a16z.com/why-software-is-eating-the-world/">software was eating the world</a>&#8217; and it was edgy and futuristic &#8211; now everyone accepts this, and companies such as Anduril are challenging Lockheed and Boeing &#8211; while increasingly software driven drones are transforming warfare and rendering many capabilities obsolete. Twenty year-old assumptions are likely to be wrong, and costly &#8211; and they are likely to be more wrong 20-years in the future, than they were 20-years ago to today. Today progress is accelerating faster than ever, and yet still poised to leap to even greater speed. We are on the brink of Dario Amodei&#8217;s &#8220;&#8230;&#8216;<a href="https://www.darioamodei.com/essay/machines-of-loving-grace">compressed 21<sup>st</sup> century</a>&#8217; the idea that after powerful AI is developed&#8230;[Amodei suggests in 1-3 years]&#8230;we will in a few years make all the progress in biology and medicine that we would have made in the whole of the 21<sup>st</sup> century.&#8221; We think Amodei chose biology and medicine as fields that are media palatable &#8211; this would be just as true across the sciences, and thus weapons and military equipment development.</p><p>Is it safe to bet on a human &#8216;quarterback&#8217; in an aircraft in 2040, and change later if that turns out to be wrong? No, there are consequences. Getting this bet wrong, means spending money on developing a platform that will be rendered obsolete at some point prior to its deployment in 15-years' time, or within the next 45 years between now and the aircraft&#8217;s projected end of service life. </p><p>The risk is that we are wasting money investing in the wrong things, and will waste more in the future, since changing designs is a major cause of MOD cost over-runs.</p><p>Then there is the issue of opportunity costs. The MOD has <a href="https://commonslibrary.parliament.uk/research-briefings/cbp-10143/">budgeted</a> &#163;12bn over ten years for FCAS. Deepmind costs Google between ~&#163;164M-&#163;477M a year, or ~&#163;10M per study.<a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftn8"><sup>[8]</sup></a> Imagine what we could do if we invested similarly well.</p><p><strong>AGI, Defence &amp; The Future of Warfare</strong></p><p>The claim made on the podcast &#8212;&#8220;We&#8217;re prepared for the time when AGI catches up&#8221;&#8212; implies Defence possesses concrete plans and capabilities for effectively managing advanced, human- or beyond human- level artificial intelligence &#8211; either our own, or that deployed by an ally or adversary. Yet the evidence presented thus far suggests the opposite.</p><p>The current Defence position&#8212;that by 2040, a human operator in an aircraft like Tempest will outperform AI in processing data, coordinating multiple platforms, situational awareness, and rapid decision-making&#8212;demonstrates precisely how unprepared we are. As shown earlier, AI systems already equal or surpass human performance in complex strategic tasks, and the trajectory of improvement strongly favours AI dominance. Even if human pilots retain marginal advantages today in certain areas, these advantages are diminishing rapidly, soon to disappear entirely.</p><p>The claim thus resolves to, in 2040, UK defence believes <em>it is very likely that a human systems operator will outperform an AI in processing data, and controlling subordinate platforms, a human operator will have better situational awareness, and be able to react faster</em>.</p><p>On any reasonable appraisal, as we have shown, this seems overwhelmingly unlikely: indeed, given AI&#8217;s current performance in air-to-air combat and strategy relative to humans, it may not be true <em>today:</em><sup> </sup>processing and analysis of data, control of networks and speed of reactions are measures where humans do worse.</p><p>Let&#8217;s push this logic further. If we hold the platform and support constant, which dominates:</p><blockquote><p>1. Human-only control, or</p><p>2. Human-plus-AI control, or</p><p>3. AI-only control</p></blockquote><p>Whichever proposition of the above is true will win. Tempest is a bet on the second. All the evidence to date suggests that not long after human-machine teaming wins, the AI beats the human machine team. Our planning is stuck short of the logical conclusion of its own assumptions.</p><p>Defence might understandably prefer retaining human pilots from a moral cultural or aesthetic preference for a pilot in the cockpit. But what matters is whether your adversaries share your preferences. If they don&#8217;t, as soon as one adversary picks truth over beauty, you must change, or you will lose. It is that simple. Some might object that adoption will be uneven. This is true and also irrelevant. The moment any credible power demonstrates a battlefield advantage via AI, the pressure to replicate it becomes existential. The delay is strategic suicide, not insulation. Unlike many previous candidates, this is a true horse-to-tank moment, except an order-of-magnitude more so.</p><p>Which one will be dominant in the 2040s cannot of course be known. But humans aren&#8217;t getting significantly better and artificial intelligences are. Why do you believe humans will win this race?</p><p>The implications extend well beyond fighter aircraft. The AI frontier will be jagged &#8211; we think air-to-air combat will be fully autonomous first, then air-to-ground, with other domains slower. Air warfare offers straightforward reward functions (seeing what&#8217;s there, shooting things down, blowing things up, moving things around), in *relatively* uncluttered environments (compared to ground and sea &#8211; surface and subsurface &#8211; warfare) &#8211; different capabilities will be automated at different time horizons. But unless the trajectory changes, what is true of fighter aircraft is true of every other military system. AI is likely to be better at operating <em>all of them</em>: there isn&#8217;t one domain or system where it is obvious that humans will retain decisive advantage, or any advantage at all.</p><p>AGI makes people advantage redundant everywhere, from pilots to operators, commanders, to intel staff. In doing so, it removes demographic inferiority &#8211; a nation with less well trained, well educated, or well-motivated troops, a nation with fewer people, is no longer less able to defend itself.</p><p>Our argument then is that betting against these clearly visible trends&#8212;hoping humans will somehow maintain a meaningful edge&#8212;is dangerously naive. If Defence truly believes human superiority can persist, it must explicitly justify why human capabilities will indefinitely outperform rapidly improving AI systems. Moreover, it must convincingly refute or challenge the well-supported forecasts placing AGI&#8217;s arrival around 2028.</p><p>On the other hand, if Defence accepts the logic and evidence we have laid out, it should be racing towards our AI and autonomous future to fulfil its duty to protect the nation.</p><p><strong>Humanity&#8217;s Last Exam: Existential Risk</strong></p><p>We now make a larger claim. If you accept the logic of the argument so far, then our generation is sitting humanity&#8217;s final exam as a civilisation. To pass this exam, we must resolve at least one of the following critical issues:</p><ol><li><p>Globally prevent further AI development.</p></li><li><p>Remove the possibility that advanced AI can access advanced weaponry or highly dangerous materials.</p></li><li><p>Eliminate war entirely as a political instrument.</p></li><li><p>Solve advanced AI alignment.</p></li></ol><p>If you can&#8217;t do that, the logic takes you to a civilizational test for humanity: <em><strong>what happens when the optimal move is to hand over our most destructive artefacts to non-human intelligences?</strong></em></p><p>Dystopian? Yes, we agree. So what are we going to do about it? Our options are clear:</p><p>1. <strong>Stop Developing Minds Smarter Than Humans. </strong>Humanity can stop developing powerful artificial intelligences. But that has to be all of it, any defection at all prevents this strategy from working. This seems very unlikely, and perhaps simply not possible.</p><p>2. <strong>Remove Tools That Smarter Minds Could Use Decisively. </strong>Accept the development of smarter-than human artificial intelligences but <strong>eliminate</strong>, globally, all advanced military technologies and crucially, the underpinning civilian technologies. In practice, anything that an AI could take control of and use against us. Again, it can&#8217;t just be done with pinky promises, it has to be done by everyone, everywhere with no defections. Misaligned AGI could theoretically weaponise almost anything - from disease to climate to the economy - against us, but clearly, handing it our most advanced military capabilities is an order of magnitude more immediate threat.</p><p>3.<strong> End War. </strong>Never go to war again, solve all problems by politics. Clearly, not happening.</p><p>4. <strong>Solve AI Alignment. </strong>Solve AI alignment so you <em>don&#8217;t care</em> if AI controls humanity&#8217;s most destructive weapons, because it will always do your/your elected representative/your elected AI leader&#8217;s, bidding. This might be done by luck &#8211; maybe AI always remains a tool of humans, aligned to us, despite being able to out-think us. Or, assuming this is possible, it might be done through deliberate and careful research and development to put in place regulation and guardrails etc. In any case, even if you think it unlikely (we have split views between us at Cassi), if it is not a zero chance, we still need to do it. After all, we insure against war by building a military, against all kinds of low-probability high impact risk in our day-to-day lives. This is what &#8216;prepared for the time when AGI does catch-up&#8217; means in practice. Hope is not a strategy.</p><p>5. <strong>Cede Control of Destructive Systems to </strong><em><strong>misaligned</strong></em><strong> Non-Human Intelligences. </strong>Get killed by Skynet.</p><p>There aren&#8217;t actually any other options. That is your lot.</p><p>Can we please start taking this seriously?</p><p><strong>So What?</strong></p><p>We are not remotely &#8216;<em>prepared for the time when AGI does catch-up</em>&#8217;. Defence&#8217;s persistent underestimation and misunderstanding of AI is a national security risk, right now.</p><p>We need to address the issue of under-estimating, and insufficiently understanding, AI for the sake of the security of our nation.</p><p>Military leaders must recognise that their duty is twofold: </p><p>First, to decisively invest in and prepare for an AI-driven future of warfare; </p><p>Second, to candidly explain the Strangelovian escalation trap humanity faces. Achieving AI alignment is not primarily a problem to be solved by UK Defence &#8211; the excellent UK AI Security Institute should remain the lead for this. But nor is it an abstract concern that Defence can ignore&#8212;it is a strategic imperative that Defence&#8217;s leaders have a moral and ethical obligation to explain given their part in it. Failure to act and speak out now risks more than defeat; it risks existential disaster.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.cassi-ai.com/subscribe?"><span>Subscribe now</span></a></p><p></p><div><hr></div><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref1"><sup>[1]</sup></a> </p><div class="bluesky-wrap outer" style="height: auto; display: flex; margin-bottom: 24px;" data-attrs="{&quot;postId&quot;:&quot;3lnmx3bms5c2n&quot;,&quot;authorDid&quot;:&quot;did:plc:mj5h6johygkjaoo3dg5q23lm&quot;,&quot;authorName&quot;:&quot;Shashank Joshi&quot;,&quot;authorHandle&quot;:&quot;shashj.bsky.social&quot;,&quot;authorAvatarUrl&quot;:&quot;https://cdn.bsky.app/img/avatar/plain/did:plc:mj5h6johygkjaoo3dg5q23lm/bafkreifqsw4576zytpf2psa4yyjve7xv6tt5ceiaqrogfm7vu5eypav6ce@jpeg&quot;,&quot;text&quot;:&quot;On human pilots: \&quot;you can't guarantee always that connection [comes] back, which is why right now...we need a human in that quarterback position. We're prepared for the time when [AGI] does catch up. At the moment [LLMs] they're just a stochastic parrot...advanced auto complete\&quot;&quot;,&quot;createdAt&quot;:&quot;2025-04-25T09:50:08.594Z&quot;,&quot;uri&quot;:&quot;at://did:plc:mj5h6johygkjaoo3dg5q23lm/app.bsky.feed.post/3lnmx3bms5c2n&quot;,&quot;imageUrls&quot;:[]}" data-component-name="BlueskyCreateBlueskyEmbed"><iframe id="bluesky-3lnmx3bms5c2n" data-bluesky-id="5704920692663678" src="https://embed.bsky.app/embed/did:plc:mj5h6johygkjaoo3dg5q23lm/app.bsky.feed.post/3lnmx3bms5c2n?id=5704920692663678" width="100%" style="display: block; flex-grow: 1;" frameborder="0" scrolling="no"></iframe></div><p> &amp; <a href="https://x.com/shashj/status/1915701982792208823">Shashank Joshi on X: "RAF officer leading concepts/req for Tempest: "at the top of our list is the payload." Irrelevant how it gets to target, he says, but analysis shows fast jet better than cargo plane or sub. "Payload is what we're all about ... we're talking roughly double an F-35 A's worth..."" / X</a></p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref2"><sup>[2]</sup></a> <a href="https://teamtempest.podbean.com/e/future-horizons-the-tempest-podcast-ep-14-%e2%80%93-xxxxxxx/">Episode 14 &#8211; Meeting UK operational needs | Future Horizons: The Tempest Podcast</a> (it is all worth a listening, but the most relevant sections are at the start and then from c.40 minutes in).</p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref3"><sup>[3]</sup></a> Ignoring that for a sufficient level of intelligence or skill, everything is auto-complete&#8230;</p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref4"><sup>[4]</sup></a> </p><div class="bluesky-wrap outer" style="height: auto; display: flex; margin-bottom: 24px;" data-attrs="{&quot;postId&quot;:&quot;3lnmy6zym4s2n&quot;,&quot;authorDid&quot;:&quot;did:plc:mj5h6johygkjaoo3dg5q23lm&quot;,&quot;authorName&quot;:&quot;Shashank Joshi&quot;,&quot;authorHandle&quot;:&quot;shashj.bsky.social&quot;,&quot;authorAvatarUrl&quot;:&quot;https://cdn.bsky.app/img/avatar/plain/did:plc:mj5h6johygkjaoo3dg5q23lm/bafkreifqsw4576zytpf2psa4yyjve7xv6tt5ceiaqrogfm7vu5eypav6ce@jpeg&quot;,&quot;text&quot;:&quot;Struck by the slightly impoverished view of frontier models that you see in defence &amp; even parts of the intelligence world. These models have obvious limitations &amp; might be incapable of many complex tasks in cluttered environments, but the auto-complete analogy is outdated.&quot;,&quot;createdAt&quot;:&quot;2025-04-25T10:10:08.552Z&quot;,&quot;uri&quot;:&quot;at://did:plc:mj5h6johygkjaoo3dg5q23lm/app.bsky.feed.post/3lnmy6zym4s2n&quot;,&quot;imageUrls&quot;:[]}" data-component-name="BlueskyCreateBlueskyEmbed"><iframe id="bluesky-3lnmy6zym4s2n" data-bluesky-id="03420007524934099" src="https://embed.bsky.app/embed/did:plc:mj5h6johygkjaoo3dg5q23lm/app.bsky.feed.post/3lnmy6zym4s2n?id=03420007524934099" width="100%" style="display: block; flex-grow: 1;" frameborder="0" scrolling="no"></iframe></div><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref5"><sup>[5]</sup></a> And I am not sure any actual human could individually do well on it: 2500 challenging questions on a 100 different subjects.</p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref6"><sup>[6]</sup></a> <a href="https://lastexam.ai/">Humanity's Last Exam</a></p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref7"><sup>[7]</sup></a> This footnote is for the nerdiest of technical pedantry. The measuring how &#8220;general&#8221; AI is at solving tasks is a very difficult challenge. You cannot test on anything a human might be asked to do, that&#8217;s an infinite test. What represents a good cross section of all the tasks you would expect a human to be able to do? Which human? In how long? How do you measure success? There are a million confounding variables. METR took on the approach by using length of time it normally takes a human as a proxy for task complexity. Does that mean that &#8216;being a pilot&#8217; and &#8216;running a tactical battle&#8217; was in that list? - No. Does that mean you can ignore the work? Not sensibly if you want to understand how generality in AI is improving.</p><p><a href="applewebdata://540BC26B-CA9F-4D4D-83CF-022B6A1D72EF#_ftnref8"><sup>[8]</sup></a> This (2017) article <a href="https://qz.com/1095833/how-much-googles-deepmind-ai-research-costs-goog">https://qz.com/1095833/how-much-googles-deepmind-ai-research-costs-goog</a> suggests costs of &#163;164M, this 2020 article https://www.cnbc.com/2020/12/17/deepmind-lost-649-million-and-alphabet-waived-a-1point5-billion-debt-.html buts the figure at &#163;477M, individual papers/studies are said (2024) to cost ~$13M, calculated here <a href="https://www.reddit.com/r/MachineLearning/comments/1ej5h4b/d_calculating_the_cost_of_a_google_deepmind_paper/">https://www.reddit.com/r/MachineLearning/comments/1ej5h4b/d_calculating_the_cost_of_a_google_deepmind_paper/</a></p>]]></content:encoded></item><item><title><![CDATA[Four And a Half Bad Days In Cyberspace]]></title><description><![CDATA[Defence Cyber Marvel 4 (DCM4): Inspiration & Warning]]></description><link>https://blog.cassi-ai.com/p/four-and-a-half-bad-days-in-cyberspace</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/four-and-a-half-bad-days-in-cyberspace</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 14 Mar 2025 07:01:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!bN65!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><p>[Updated, 16 March 2025. 2209hrs: new Endnote]</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bN65!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bN65!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bN65!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bN65!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bN65!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bN65!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:95124,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/159026939?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bN65!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bN65!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bN65!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bN65!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf95203d-e3f1-44f0-baf3-00099d3b3d31_1280x720.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image Credit: LIG Nex1. Available here: https://biz.chosun.com/en/en-industry/2025/03/05/4H7MSTWCOJHYZHJAEKENPJWB7I/</figcaption></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.cassi-ai.com/subscribe?"><span>Subscribe now</span></a></p><p>This was the most innovative military exercise I have participated in or witnessed in 22 years of service. <a href="https://www.army.mod.uk/learn-and-explore/army-of-the-future/modernisation/army-innovation/defence-cyber-marvel/">Defence Cyber Marvel&#8217;s</a> fourth iteration, <a href="https://www.facebook.com/share/v/14tFXvuRMH/">DCM4</a>, took place in Seoul <a href="https://www.ajupress.com/view/20250305144007957">recently</a> and I had the privilege to attend. If you just want to read about the exercise, skip the intro, and jump to the section headed DCM 4. But first, here&#8217;s why I think it matters&#8230;</p><p>&#8216;<em>The rainbow of colors in the window paints how everything went so wrong, so fast. The water in the Potomac still has that red tint from when the treatment plants upstream were hacked, their automated systems tricked into flushing out the wrong mix of chemicals.</em></p><p><em>By comparison, the water in the Lincoln Memorial Reflecting Pool has a purple glint to it.</em></p><p><em>They&#8217;ve pumped out the floodwaters that covered Washington&#8217;s low-lying areas after the region&#8217;s reservoirs were hit in a cascade of sensor hacks. But the surge left behind an oily sludge that will linger for who knows how long. That&#8217;s what you get from deciding in the 18th century to put your capital city in low-lying swampland and then in the 21st century wiring up all its infrastructure to an insecure network. </em></p><p><em>All around the Mall you can see the black smudges of the delivery drones and air taxis that were remotely hijacked to crash into crowds of innocents like fiery meteors. And in the open spaces and parks beyond, tiny dots of bright colors smear together like some kind of tragic pointillist painting. These are the camping tents and makeshift shelters of the refugees who fled the toxic railroad accident caused by the control system failure in Baltimore. </em></p><p><em>FEMA says it&#8217;s safe to go back, now that the chemical cloud has dissipated. But with all the churn and disinfo on social media, no one knows who or what to trust. Last night, the orange of their campfires was like a vigil of the obstinate, waiting for everything to just return to the way it was. But it won&#8217;t.</em>&#8217;</p><p>I remember reading this, August Cole &amp; Peter Singer&#8217;s evocative &#8220;FICINT&#8221; (Fictional Intelligence) opening to the US&#8217; Cyber Solarium <a href="https://cybersolarium.org/wp-content/uploads/2022/05/CSC-Final-Report.pdf">report</a>, in Downing Street in 2020. A few years before, I&#8217;d read August Cole&#8217;s brilliant short story <em><a href="https://mwi.westpoint.edu/underbelly-2/">Underbelly</a></em>, that helps us explore the direct military effects of a cyberattack on the UK. It is no less alarming, resulting in a distributed HQ led by a Special Forces Colonel &#8216;Fessenden&#8217;, having to operate clandestinely, out of a rural pub pulling together the talents needed, with all major bases and MOD cyber networks shut down due to cyber infrastructural attacks. If you can, do read it.</p><p>In 2020 I was with our No10 Integrated Review team at a meeting with the Defence Secretary and senior Defence and National Security Leaders. One Treasury official asked what it would mean if we cut the defence budget for aircraft carriers and reassigned it to offensive and defensive cyber. Might we be a more dangerous opponent? Better able to deter, and safer at home? Might spill-over benefits to the wider economy be greater? More nationally resilient?</p><p>I don&#8217;t think this was a serious proposition, but a challenge &#8211; a provocation designed to force people to think. And it is one I have often thought about since. I am willing to bet it has never been seriously and fully explored in the UK.</p><p>For me, personally, on issues like the aircraft carriers, FCAS/Tempest, Challenger 3 tanks and all the other totemic and high-end kit that comes up for regular scrutiny, my primary interest is in how we make the decisions on force structures and balance, not what specifically we should have.<a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftn1"><sup>[1]</sup></a></p><p>Cyber Power debates in Whitehall, like those on Space Power<a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftn2"><s><sup>[2]</sup></s></a>, are driven by fundamentally misaligned incentives. Just as air power needed an independent champion to compete equally for resources &#8211; the purpose the creation of the RAF served &#8211; so cyber and space will never get to make their arguments with full force for as long as they have to make their case through the RAF, Army, RN, or Strategic Command, (at least while Strategic Command is run by officers whose careers depend, in part, on keeping their parent services content).</p><p>In fact, for cyber the situation is worse in many ways. Not only does it have to compete with the other military domains of war but it also has to compete for resources with GCHQ. <a href="https://www.gchq.gov.uk/information/welcome-to-gchq">GCHQ&#8217;s mission</a> is primarily espionage. As a civilian led and run organisation, it cannot use lethal force, and so cyber-attacks that might cause destruction, death or injury must be carried out by military personnel. But GCHQ is funded by the Foreign Office, not the MOD. The FCDO does not want to hand the MOD an argument taking funding away from espionage to redirect it to military cyber defence and offence.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Thus cyber, in the sense that August Cole was talking about, never gets to air its arguments for why it should be funded ahead of aircraft carriers, tanks, FCAS, and espionage etc. Our Treasury official&#8217;s question remains largely unanswered and unexplored. I don&#8217;t think we know how vulnerable we really are. Nor do I think we truly know how much we could do with offensive cyber capabilities if they were fully funded and had an independent champion. August Cole&#8217;s cyber FICINT helps us see how we might first learn about our vulnerabilities.</p><p>DCM4 is helping ensure our &#8216;Fessenden&#8217; will be able to find the people he needs to fight back.</p><p><strong>Defence Cyber Marvel 4 (DCM 4)</strong>.</p><p>A large hotel in Seoul. Down lifts and up escalators. Into a labyrinthine complex of a large open plan convention centre and surrounding smaller convention rooms and offices. Innovation everywhere. I am shown around this multinational exercise with other visitors, shown what can be done at the unclassified level - the briefings I am given are given to local media, visiting multinational dignitaries, academics, business and industry representatives alike. The capabilities we see are more remarkable for the fact that they are in the public domain.</p><p>In one corner a group of soldiers from 2 PARA discuss how they have put together a number of electronic warfare sensors from commercially available hardware. They bought parts on Amazon, on a shoestring budget, with a focus on Counter UAS. They, and Royal Signals soldiers present say these devices are perhaps more effective than those they would currently have access to were they to deploy with the officially provided EW support equipment of the Light Electronic Warfare Teams (LEWT).</p><p>Whether the kit itself is better or not isn&#8217;t really the point though. As they note, the creation of these devices would enable a proliferation of EW and ISR across the battlefield that is not possible with the current force structures where these capabilities are in high demand and insufficient quantity. The Ukraine war has at times been referred to as the &#8217;24 hour&#8217; war, where technological advancements are only effective for a short period before being countered. Creating a culture in which innovative soldiers can continually develop and refine technological advantage themselves will enable us to maintain the initiative.</p><p>It makes me think of the culture inculcated in the Israeli Defence Forces, one of <em>Ein Beira, </em>or &#8216;no choice&#8217;; of tinkering and trying and taking risk, modifying kit, solving problems. It sits in damning contrast to the process of building under maximal regulation, assurance, validation and verification. This has long suggested to me that we have unintentionally killed the tinkering innovation spirit in our armed forces outside these bands of dynamic volunteers and novel thinkers. We have built a culture where the soldier, sailor, airmen and women are expected to passively wait until the kit he or she needs is handed to them with a training manual. As Bernard Schriever wrote of the US DoD in the 1960: &#8220;the system now runs the people&#8221;. DCM 4, it seems to me, is an antidote. And the PARAs seem to be thriving with having been given permission to innovate, to test and try, in hardware and software &#8211; not things we traditionally think of as Parachute Regiment specialities, and a reminder of how capable so many in our Armed Forces are.</p><p>In another corner, a young RAF Officer speaks passionately about how much she is learning, how she, and over 1000 participants from 60 different organisations, including 27 nations working in 20 worldwide locations, are mostly volunteers. All are self-taught. They have given up evenings and weekends to learn how to become hackers &#8211; they did this outside of military time. Now she is leading a team of Brits from the RAF and Americans, competing against teams from other services and other countries; and also, fascinatingly, other UK Government Departments. The Ministry of Justice has a team here. The NHS too. The Met Police and National Crime Agency. There are teams from across the UK military, though Army personnel and units dominate, since it is primarily the Army, through the Army Cyber Association, that organise the Exercise. Here too are representatives of most of the largest tech firms both globally and in the UK. All compete equally. The strongest performers are not always the teams you&#8217;d expect. But the Exercise organisers protect everyone, keeping results - bar those of the teams that win -between them and the teams. The focus is on winning, but also on learning from failure, so any embarrassment must be minimised.</p><p><strong>Four and Half Bad Days in Cyberspace</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!krIr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!krIr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 424w, https://substackcdn.com/image/fetch/$s_!krIr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 848w, https://substackcdn.com/image/fetch/$s_!krIr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!krIr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!krIr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8430363,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/159026939?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!krIr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 424w, https://substackcdn.com/image/fetch/$s_!krIr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 848w, https://substackcdn.com/image/fetch/$s_!krIr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!krIr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5dc7883-d534-4932-92e3-3cf38fc0d685_7097x4734.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image: Crown Copyright, 2025.</figcaption></figure></div><p>The teams spend 10-months organising and preparing for what they call <em>4.5 bad days in Cyber Space</em>. They are organised into small rapid reaction teams on the &#8216;Blue Team&#8217; side. They are tasked with defending unfamiliar networks and systems &#8211; from Enterprise IT systems to sophisticated industrial control systems and transport networks on which nations are dependent. There is a large &#8216;Red Team&#8217;, ramping up attacks on them that Blue Teams must defend against, growing in intensity and sophistication as the exercise goes on. A &#8216;White Team&#8217; provides quantified and objective feedback on performance. There&#8217;s a leaderboard. A &#8216;Green Team&#8217; builds the safe training area competing teams operate in before then defending the whole exercise from real-world cyber-attacks and espionage from outsiders. There are numerous &#8216;side quests&#8217; &#8211; innovation experiments in which the Paras were engaged.</p><p>The Green Team is led by a young Army officer. She describes how her all-volunteer team has spent the past 10 months designing and building on the <a href="https://www.cr14.ee/">NATO &#8220;Cyber range&#8221;</a>, essentially a server rack in Estonia within which they have &#8216;Deployed Infrastructure as Code&#8217; &#8211; which is to say they have systems that exist electronically exactly, or as close as possible, to their form in the real-world. Now it is built her team spends their time keeping the network operational while watching for suspicious traffic on the network. If anything looks like it shouldn&#8217;t be there, they train to both defend and attack.</p><p>The Cyber Range shows how satellites can be hacked, and how not to make this obvious. For example, turning a satellite&#8217;s solar power arrays to face away from the sun, leading to a gradual loss of power and functionality that isn&#8217;t obviously a cyber-attack. There are lots of ways this might happen &#8211; so harder to detect and very effective. They can get a weather camera to misreport, leading to the satellite operators using it at times when the weather makes it less effective, or to choose not to use it when the weather is optimal, because they believe its view occluded by weather [reminder, this is all public domain: see also the <a href="https://github.com/deptofdefense/hack-a-sat-library">DoD&#8217;s GitHub &#8216;How To&#8217; Guide</a>. on satellite hacking, or this online <a href="https://hackasat.com">hack-a-sat</a> competition to hack satellites]. </p><p>The Cyber Range has vulnerabilities in industrial control systems and simulated military command and control networks. They have &#8216;real systems&#8217; hardware that you might find in a hospital, or alongside train tracks, and players can try to hack it. Sometimes these are linked to wider simulations of, for example, a whole hospital, or a wider power grid. Sometimes digital targets are linked to models &#8211; for example, an aircraft carrier or a Hornby-like train set &#8211; so you can see the effects of their being hacked on the scale model or the wider simulated system. Sometimes this can find real-world vulnerabilities, something that the exercise organisers are careful to ensure are reported correctly, and knowledge of which is kept contained. The whole team tests and learns while teaching others on the exercise. It is hard not be impressed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nY1y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nY1y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nY1y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg" width="4080" height="3072" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:3072,&quot;width&quot;:4080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3440454,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://cassiai.substack.com/i/159026939?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68abae2c-9279-4e69-81bc-8e3b6db4181b_4080x3072.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nY1y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nY1y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64bf9dfc-81ea-42f6-8e56-800d5d3d2e1b_4080x3072.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Goliath, an example of the cyber infrastructure used, albeit it stayed in the UK due to size, and was remotely accessed from South Korea. Image Credit: Dan Humphries 2025, also the builder of Goliath!</figcaption></figure></div><p>In another corner, a team is hacking a commercial satellite &#8211; a real one &#8211; hired for this purpose by the exercise organisers. The satellite is, in the exercise, an adversary&#8217;s surveillance satellite. The team on this side quest insert malware so they can watch the enemy through their own surveillance satellite.</p><p>Another side quest: a team set to sinking an aircraft carrier &#8211; fortunately not a real one this time. The team first gained access to the ships <a href="https://mashable.com/article/hacking-boats-is-fun-and-easy">GPS, making the crew</a> uncertain as to its location. They make the ship &#8216;think&#8217; and report that it was many miles from its real position. Next, the team hack/access the control systems, sending its engines into overdrive and send the ship circling on full power. Turning at full speed causes the ship to list hard. Next, the team, deep inside the ship&#8217;s systems, order its ballast system to shift weight rapidly within the hull. Combined with the listing, the carrier passes the point of no return, rolls, and sinks.</p><p>There&#8217;s an AI Defence Challenge &#8211; prompt injection attacks lead another AI to churn out wrong answers, subtly or obviously, or to generate misinformation, or propaganda. There&#8217;s drone hacking and spoofing. Teams from all over the world have trained LLMs to <a href="https://www.wired.com/2017/02/libratus/">play poker</a>. That might not sound like the fare of war, but it has very relevant attributes: incomplete information; uncertainty; a mix of objective and subjective probabilistic assessments &#8211; what cards might my opponents have and what might they do?; recursion &#8211; what does my opponent think I might have and do, what do they think that I think that they might do, and so on; and of course the staple of the nature of war &#8211; chance<a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftn3"><sup>[3]</sup></a>. The poker competition was won by the Japanese Maritime Self-Defence Forces (the Japanese Navy). The point here is to begin experimenting with AI decision systems. The first time I have seen this get serious attention on a military exercise, even if it is as a side-quest and game.</p><p>I'm told that at last year&#8217;s DCM3 the exercise incorporated quantum computing, running an air defence scenario, and using quantum (presumably annealing/combinatorics) for ISR planning &amp; execution, and logistics planning optimisation. This year some early experimentation with quantum radio frequency detection has been seen, something to be expanded on in future exercises.</p><p>I head to another stand where a rifleman describes his Rifles&#8217; Cyber Teams and the experimentation they are doing supporting tactical actions with cyber support. Inspired by DCM &#8211; they are now using Notebook LLM for briefings and analysis, and Google Threat Detection as part of their workflows, with aspirations to widen the use of such tools from tactical experimentation into their headquarters processes.</p><p>Across the exercise there are competing priorities for the Blue teams. Keep the hospital system up and running to save lives or shut it down to protect patient data integrity, knowing that if someone gets into your patient data, you might never be able to trust it again? Treatment itself becomes a threat, if you don&#8217;t know your patient&#8217;s medical history. Teams are constantly called on to prioritise. The attacks they face include new, novel, cutting edge techniques &#8211; perhaps something seen just once or twice globally before &#8211; exposing the defenders to these emerging threats to ensure they are ready to face them in the real-world should they come calling.</p><p>One of the insights I leave with is that a truly intense cyberattack on the UK, or the UK military, is not something we can defend in the way we might try to hold a frontline, or how we might achieve complete control of the air. However long a border in the physical battlespace, the attack surface in cyberspace is simply vast in comparison. As a result, in a cyberwar, we will be hurt &#8211; a successful defence can only reduce harm. Maybe the waters swirling around Washington, the crashed delivery drones, the displaced military headquarters operating from a pub &#8211; all these &#8216;defeats&#8217; in August Cole&#8217;s dystopian stories were in fact the losses we accepted, as we kept incubators in neonatal wards running, and defended our nuclear power stations and submarines from cyber-attack.</p><p><strong>Destroy &amp; Degrade. Reconstitute and Rebuild</strong></p><p>The exercise&#8217;s organisers are proud of what they&#8217;ve built, and rightly so. As they say, you can&#8217;t build an effective Navy by staying in the harbour. DCM 4 gets UK military talent preparing for the wars of the future. The breadth of the teams entering, from across <a href="https://biz.chosun.com/en/en-industry/2025/03/05/4H7MSTWCOJHYZHJAEKENPJWB7I/">industry</a>, defence, public sector and across countries, makes the training unique and the connections for all parties hugely valuable. These are the people they would fight alongside in a real &#8216;cyber war&#8217;.</p><p>DCM aims to increase the skills of everyone involved, and succeeds brilliantly. The teams defend systems and attack them, seeking to destroy, degrade, reconstitute and rebuild them. Everyone involved says there is nothing else like it. It succeeds in part because it is not a formal military exercise. Run by the Army Cyber Association, - set up by soldiers who wanted to enjoy cyber similarly to way the British Army funds <a href="https://www.army.mod.uk/news/army-officers-battle-fierce-storms-to-complete-world-s-toughest-row-across-the-pacific/">others to row the Atlantic</a> - and all by volunteers, it has freedoms that would never be granted to a formal exercise. It can be less deferential to organisational fiefdoms &#8211; wrapping cyber offence, defence, civ, mil, industry, espionage and international participation without having to do quite so much politicking across those misaligned incentives.</p><p>In DCM4 there are echoes of the early innovative days of the RAF, which I wrote about for the Wavell Room in an article they titled: <em><a href="https://wavellroom.com/2024/07/25/saving-the-royal-air-force/https:/wavellroom.com/2024/07/25/saving-the-royal-air-force/">Saving the Royal Air Force</a>. </em>That article described how private contributions, media campaigns, contests and clubs were instrumental to building the capabilities the RAF would later need, creating many of them before it existed, allowing the nascent RAF to harness them rather than having to build them from scratch. Furthermore, contributions, campaigns, contests and clubs created groups who could pressure Whitehall and the Ministry of Defence to act on Air Power, giving them the evidence, showing how powerful it could be, and building public pressure for proper investment. In an age when budgets are even more tightly constrained than they were in the 1930s &#8211; with welfare bills and health spending more tightly tying the hands of the state - a similar approach &#8211; using clubs and media campaigns, contests and private contributions on cyber, space, electronic warfare and uncrewed systems might be the only way to prepare. The incentives in the system to go slow and under-allocate on all these vital areas are just too strong.</p><p>The intrinsic motivation of all involved to improve their skills and put on a world-class exercise is why it works. Formalise it and you&#8217;d kill that intrinsic motivation, get rid of the freedoms and the innovation would go.</p><p>On the other hand, the exercise as-is needs some formal approvals from within the MOD if is to be allowed to go ahead. As it grows, these are becoming more challenging to win. It needs approvals for cyber reserves that wish to participate to mobilise and join, it needs approvals to win budget whether as a &#8216;sport&#8217; like rowing the Atlantic, or some other capacity. Likewise if it is to be able to get the soldiers, sailors and RAF personnel released from their day jobs to join as volunteers, it needs some official standing. Partly because of its success &#8211; it&#8217;s hard to compare it now to a small team rowing the Atlantic to justify funding and approval &#8211; and in part because it is so innovative - these can be hard to win. It doesn&#8217;t fit neatly into boxes &#8211; it isn&#8217;t a formal exercise and yet it is vast. It hasn&#8217;t been through the normal processes of training needs analysis and multilevel authorisations, sign-offs and so on. These things would likely kill it, but they are in place for a reason. It seems unlikely the exercise can continue as it is. The organisers seem genuinely worried as to whether DCM5 will be approved in time to be put it on next year, or at all.</p><p>Yet this year&#8217;s Exercise was put on with the full support of the <a href="https://www.gov.uk/government/news/the-4th-republic-of-korea-uk-cyber-dialogue-held-in-london">South Korean Government.</a> It attracted Foreign Office funding. Far more people applied to participate than the Exercise could accommodate.</p><p>Perhaps the lesson is to fund DCM through its fifth and sixth incarnations, but not seek to control it. Meanwhile to take inspiration from it in the formal exercise programme, and seek to match or exceed the levels of innovation it achieves. If this can be done, the organisers would likely welcome the chance to handover to the big green machine of military bureaucracy, knowing the fire of innovation they have lit is spreading, and being enabled to burn brighter and longer, wider. But it should be for them to decide the time to do this, when they judge the flame has been passed on, and they need no longer keep their torch burning.</p><p>Until then, Defence must find a way to ensure DCM4 continues to thrive as DCM5, DCM6, largely as-is. I don&#8217;t want to make this seem simple. All those rules and regulations and controls are in place for a reason. But find a way we must. It will need top-down leadership to succeed &#8211; it is in the middle-ranks of the military that the flame of innovation dies. Not because the people are bad, but because usually everyone with responsibility over some aspect or another of an exercise has to say yes, and only one person has to say no. Because if DCM goes well, those middle-level approvers will get no credit. But if it goes wrong, they will get the blame. And because absent a Cyber Service looking to drive the capability to excel, to win resources in the next SDR, there is no champion at the top without some significantly conflicted incentives.</p><p>For me, personally, I was extremely proud of to see UK Defence being as innovative as everyone knows it can be, but rarely is. To hear the passion of those involved, and watch them pushing boundaries and better preparing us for the future, was genuinely inspiring, and hopeful.</p><p>The exercise felt to me like it was an incarnation of &#8216;the Science Super Power&#8217; agenda and the Indo-Pacific tilt we wrote about in the 2021 Integrated Review. But it was also explicitly inspired, it&#8217;s organisers tell me, by General Richard Barrons&#8217; <em><a href="https://www.tandfonline.com/doi/epdf/10.1080/03071847.2016.1253371">Warfare in the Information Age</a></em> (WITIA) paper, published in 2016, but originally written and circulated as his internal paper and idea, in the years prior. That paper called out the growing threats and opportunities cyber warfare was creating, as a capability in its own right, but also, crucially, showed the need to integrate cyber warfare in everything we do.</p><p>Watching and listening to the personnel on this exercise, I think we have found the generation who can usher in the vision provided in General Barrons&#8217; paper, make it a reality. But, if they are to succeed, Defence Cyber Marvel and its volunteers will need more support to overcome the innovation-stifling institutional pressures, the misaligned incentives, that have prevented us getting there in the ~15 years since the vision for Warfare in the Information Age, was first sketched out.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>nb. with apologies to those that were kind enough to speak to me during my visit: I have chosen not to name anyone for security. I am, however, hugely grateful to you all for your time, insights, and commitment to excellence. You really were genuinely inspiring. </p><p></p><div><hr></div><p><a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftnref1"><sup>[1]</sup></a> This is not to pretend I don&#8217;t have views, but I feel much more strongly about the need to improve how we plan and build for the future than any individual kit decision &#8211; debates around which are nearly always narrowly single-service dominated, dominated by discussion of what was, and to lesser extent what is, rather than what-will be, and selective in both the evidence and logic used to defend or attack them. I find these debates mostly unproductive and distracting.</p><p><a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftnref2"><sup>[2]</sup></a> Dolman, E.C., 2009. <em>Victory through <s>Air</s> Space Power</em>. <em>Strategic Studies Quarterly</em>, 14(2). Available at: <a href="https://www.airuniversity.af.edu/Portals/10/SSQ/documents/Volume-14_Issue-2/dolman.pdf">https://www.airuniversity.af.edu/Portals/10/SSQ/documents/Volume-14_Issue-2/dolman.pdf</a> [Accessed 10 March 2025]. <br><br>Fletcher-Jones, C. (2024). In Favour of an Independent Royal Space Fleet: The Smuts Report and the Precedent of the Royal Air Force. <em>The RUSI Journal</em>, <em>169</em>(1&#8211;2), 22&#8211;32. <a href="https://doi.org/10.1080/03071847.2024.2359405">https://doi.org/10.1080/03071847.2024.2359405</a></p><p><a href="applewebdata://B381C907-64CD-40A0-A086-2D5BF11086D2#_ftnref3"><sup>[3]</sup></a> AI is already super-human at poker see <a href="https://en.wikipedia.org/wiki/Libratus">Libratus</a> &amp; <a href="https://en.wikipedia.org/wiki/Pluribus_(poker_bot)">Pluribus</a>, and like in chess, AI is <a href="https://www.forbes.com/sites/neilsahota/2024/01/19/bluffing-beyond-human-capability-ais-role-in-revolutionizing-poker/">changing</a> the way humans play the game, with a growing dependency on machines for training and in competition.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><strong>New Endnote</strong>. 12-03-25. </p><p>To be more correct here, in response to helpful constructive criticism: </p><p>GCHQ is funded via the Single Intelligence Account [see latest - Cabinet Office Corporate Report 2023-24, <strong><a href="https://www.gov.uk/government/publications/security-and-intelligence-agencies-financial-statement-2023-24/security-and-intelligence-agencies-financial-statement-2023-24-html">Security and Intelligence Agencies Financial Statement 2023-24</a>]. </strong>It states:</p><blockquote><p>&#8220;<em>The Secretary of State for Foreign, Commonwealth and Development Affairs is the responsible Secretary of State for SIS and GCHQ<a href="https://www.gov.uk/government/publications/security-and-intelligence-agencies-financial-statement-2023-24/security-and-intelligence-agencies-financial-statement-2023-24-html#fn:2"><sup>[footnote 2]</sup></a> and the Secretary of State for the Home Office for MI5<a href="https://www.gov.uk/government/publications/security-and-intelligence-agencies-financial-statement-2023-24/security-and-intelligence-agencies-financial-statement-2023-24-html#fn:3"><sup>[footnote 3]</sup></a> The Agencies ensure that the appropriate Secretary of State is briefed on matters that could become the subject of Parliamentary or public interest and on issues, which they need to be aware of in discharging their wider Ministerial responsibilities. There are well-established arrangements for seeking Ministerial clearance for operations when required.</em></p><p><em>In line with the responsibility assigned to AOs in Managing Public Money the Principal Accounting Officer (PAO) acts to ensure that the SIA operates effectively and efficiently in support of national security policies, aims and objectives. The Deputy National Security Advisor (DNSA) Matthew Collins was delegated as Temporary Acting Principal Accounting Officer for the SIA given the Cabinet Secretary&#8217;s medical leave between October and December 2023. The Heads of the Agencies are each AOs in their own right, with delegated authority from the PAO</em>.&#8221;</p></blockquote><p>The point remains unchanged, &#8216;cyber&#8217; as a capability and domain* must compete not only with the the Army, Navy, and Air Force for funding within the MOD, but also with GCHQ. GCHQ has other budgetary champions in Whitehall - where the Foreign Secretary&#8217;s responsibilities, as described above, usually make him/her the main lobbyist for GCHQ in Spending Reviews, Defence and National Security Reviews etc.</p><p>There are other senior champions for GCHQ, as can be seen in the latest corporate report. Altogether we can see that these are: </p><ul><li><p>The Secretary of State for Foreign, Commonwealth and Development Affairs;</p></li><li><p>the Cabinet Secretary; </p></li><li><p>the National Security Advisor (or his Deputy at present, since the NSA is a Spad not a Civil Servant); </p></li><li><p>the Home Secretary [since the <a href="https://www.ncsc.gov.uk">National Cyber Security Centre</a>, which solely does Cyber Defence but does is funded through GCHQ. As the <a href="https://www.gchq.gov.uk/section/mission/cyber-security">GCHQ&#8217;s website</a> states  &#8220;<em>The UK's cyber security mission is led by the National Cyber Security Centre (NCSC), which is a part of GCHQ</em>.&#8221; Which always amuses me because you could delete &#8216;<em>which is a part of</em>&#8217; and insert &#8216;<em>which is subordinate to</em>&#8217; and then you&#8217;d have a clearer statement of why this needs to be written at the top of the page. In any case, the Home Secretary, for whom making sure nothing awful happens in the UK is the main job, is therefore also a champion for GCHQ, if she/he wants funds to flow down to NCSC rather than to some competing priority elsewhere in Whitehall].</p></li></ul><p>In addition to which, as the SIA financial statements put it the &#8230;<em>Agency Heads have a statutory duty to provide annual reports on the work of the Agencies directly to the Prime Minister&#8230;</em> giving GCHQ direct access to the Prime Minister to argue for more funds.**</p><p>Given all this strengthens the point I am making, rather than weakens it, I haven&#8217;t reworded the main article.</p><p>*whether it is a domain is a whole other important debate, but I am drawing the line here because (a) for practical purposes in the context of my argument I think it does not matter and (b) I will end up with endnotes on endnotes.</p><p>**in adding this new Endnote, I feel like I also add a tone overly critical of GCHQ - I do not intend this. The point is just to be more technically correct in support of my point that &#8216;cyber&#8217; as an offensive and defensive capability, lacks a truly independent champion of the standing of the Chiefs of Royal Navy, Army and Royal Air Force, and also has to compete with an organisation focused primarily on intelligence gathering.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Education, Education, Education?]]></title><description><![CDATA[Using AI to Mitigate the AI Revolution: easing economic transition, protecting citizens, advancing development, extending the UK's influence.]]></description><link>https://blog.cassi-ai.com/p/education-education-education</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/education-education-education</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Wed, 12 Feb 2025 05:01:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!fhoo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fhoo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fhoo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fhoo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:145390,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fhoo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!fhoo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d9be28a-c777-41b2-a901-b47bddf70c16_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p><strong>AI Disruption</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>It&#8217;s Happening. The AI revolution is well underway. Most likely it will accelerate both in rate and scope. We must consider what to do in the worst-case scenario that AI is a true substitute for human cognition, a subject we have discussed several times in this blog.</p><p>But also, we need a plan to mitigate the radically disruptive effects of AI even if no further advancements were made. The adoption of existing AI technologies alone would have profound economic and social implications. We propose using AI to radically improve how humans learn. We make the national case here - but it is as relevant to all businesses, organisations and us all, as individuals.</p><p><strong>The Case for Optimism</strong></p><p>While we know a lot of facts about the brain and human psychology, we don&#8217;t really know how either &#8216;work&#8217;. We are not sure what consciousness is. Intelligence is notoriously hard to define. Even reasoning, what constitutes knowledge or understanding, all are uncertain, contested concepts. Under such circumstances, we keep coming up with new benchmarks for testing AI in the way we have tested human intelligence, and increasingly with tests no normal human, and no deeply expert human, would pass. And AI keeps not just passing these tests but surpassing them. Hence the bearish case must be planned for.</p><p>But if we don&#8217;t know what human intelligence is or how it works, we cannot say that human intelligence won&#8217;t continue to play an important role in the economy, in making decisions in the future. It might, even if it is hard to say with any confidence how.</p><p>If we assume this optimistic case, AI will not replace all humans. AI becomes a complement to human cognitive labour. We work with it, and it creates new jobs.</p><p>Still, many will be displaced from their current roles. We can already see that lots of things we do now will be done better by AI, <em>and could be done better by AI now</em>, even if not all things. How do we plan for the scenario of massive displacement?</p><p>There are two key challenges: mitigating large-scale social disruption and ensuring that displaced workers transition into new, productive roles as quickly as possible.</p><p>Re-training will be essential. The likelihood that the skills needed in an AI-driven world match those of the pre-AI era is effectively zero. The UK, or any nation that enables large-scale, rapid retraining, stands to gain significant advantages. However, this process is costly and difficult, requiring substantial human and material resources at a time when those resources are increasingly valuable.</p><p><strong>Education and AI</strong></p><p>Education is a solved problem in principle&#8212;humanity knows how knowledge transfer works. The issue lies in implementation: mass-learning is both pedagogically suboptimal and logistically inefficient. The core challenges are:</p><blockquote><p>&#183; A teacher cannot dynamically assess every student's exact knowledge state in real-time.</p><p>&#183; Students themselves struggle to identify their precise learning needs.</p><p>&#183; Traditional education incurs high logistical costs and opportunity costs.</p><p>&#183; The sheer order of magnitude of change will be disconcerting, as we are culturally accustomed to large-class learning rather than individualized tuition.</p></blockquote><p>AI-assisted online learning can address these problems. AI tutors approximate individualized instruction, while online learning significantly reduces costs. The key lies in breaking down knowledge into minimal, structured chunks, providing explanations, practice, instant feedback, and spaced repetition&#8212;each of which is a well-understood technique.</p><p>AI is uniquely suited to serve as a 1:1 tutor, providing not just instant feedback but also dynamically generating new materials, conducting low-stakes testing, and evaluating progress in near real-time. This allows students to learn at their own pace, maximizing retention and efficiency. Students would have access to tutors with a far higher IQ and expertise than humans can provide. The human average on an IQ test is ~100. Current AI models have surpassed this. They will only improve from here.</p><p>Despite this, this approach does not replace teachers but redefines their role. Educators would focus on facilitating hands-on practice, coaching, and higher-order skills like critical thinking, evaluating AI-outputs, and communication. For many adult and adolescent learners, direct contact time with teachers may decrease, but learning efficiency would increase.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VsdC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VsdC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VsdC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg" width="1456" height="995" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:995,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A screenshot of a graph\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A screenshot of a graph

AI-generated content may be incorrect." title="A screenshot of a graph

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!VsdC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VsdC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96c3bc23-b1f0-45e8-ace6-75fc991b9c4f_2048x1399.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Image <a href="https://x.com/HighlyRetired/status/1886756234415505730">from</a> @highlyretired on X, 4 Feb 2025.</p><p><strong>The Present</strong></p><p>There are a growing number of examples where AI is already radically improving education. Recall the mantra that the AI we are using today is the worst AI we will ever use. We can also assume that as the AI itself improves so to will we improve our knowledge of how best to employ it over time. Therefore it is almost certain that the gains we describe here will be improved upon, perhaps radically improved upon, as AI gets better and we get better at using it in education.</p><p>Take the <a href="https://blogs.worldbank.org/en/education/From-chalkboards-to-chatbots-Transforming-learning-in-Nigeria">World Bank AI pilot in Edo State, Nigeria</a>, which focused on English Language, AI (ChatGPT) Knowledge, and Digital Skills. After six-weeks of access to the AI tutor, the results showed students:</p><blockquote><p>&#8226; Gained two-years of learning, a 1200% increase</p><p>&#8226; Improved their ability to learn</p><p>&#8226; Benefited more as exposure to ChatGPT training increased, with the ceiling unknown</p><p>&#8226; Benefited more than students in 80% of all the World Banks other (often larger scale, more expensive) educational interventions</p></blockquote><p>This was likely achieved with OpenAI&#8217;s GPT 4-turbo model the most modern at the time of the intervention. GPT-4-turbo is now fifteen months and ~ten models behind the most recent OpenAI releases.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!prqW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!prqW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 424w, https://substackcdn.com/image/fetch/$s_!prqW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 848w, https://substackcdn.com/image/fetch/$s_!prqW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 1272w, https://substackcdn.com/image/fetch/$s_!prqW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!prqW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png" width="1292" height="861" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:861,&quot;width&quot;:1292,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Figure on Dose-Response Relation&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Figure on Dose-Response Relation" title="Figure on Dose-Response Relation" srcset="https://substackcdn.com/image/fetch/$s_!prqW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 424w, https://substackcdn.com/image/fetch/$s_!prqW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 848w, https://substackcdn.com/image/fetch/$s_!prqW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 1272w, https://substackcdn.com/image/fetch/$s_!prqW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c2a28bf-bdd7-406b-9a73-e78ca892737d_1292x861.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Figure 1. Chart shows how increased exposure to ChatGPT increased Nigerian students performance on the test - we do not where the ceiling is for this 'dose-response' &#8211; how much better can students be with greater use of ChatGPT?</em></p><p>More examples:</p><blockquote><p>&#183; In Ghana, the <a href="https://harrisonpllc.substack.com/p/experimental-evidence-on-the-impact">ChatBot &#8216;Rori&#8217;</a> made available via WhatsApp for maths tuition saw students gain the equivalent of a one year&#8217;s worth of classroom learning in six-months. This from one sixty-minute session a week, at a cost of just $5 per student.</p><p>&#183; At Harvard, an AI chatbot tutor <a href="https://www.researchsquare.com/article/rs-4243877/v1">enabled</a> undergraduate students in physics classes to learn twice as much in less time, reporting significantly higher engagement compared to those in taught with &#8216;active learning&#8217; techniques in a traditional classroom<strong>.</strong></p><p>&#183; In disadvantaged areas of the Southern United States an AI &#8216;<a href="https://studentsupportaccelerator.org/studies/tutor-copilot-human-ai-approach-scaling-real-time-expertise">Tutor Co-Pilot&#8217;</a> significantly improved tutor performance, as measured in student outcomes (4% improvements on average, 9% improvements for students taught by the weaker tutors).</p></blockquote><p><strong>The Future</strong></p><p>Classroom education has changed little in the past century. Despite efforts to optimize, fundamental constraints limit the full application of pedagogical science. Overlaying new solutions on top of existing structures demands extra effort from students and exacerbates educational inequality. Instead, the best learning should be available to all.</p><p>Humans will never match AI in raw fact retrieval. While factual knowledge remains essential, superior internal models and conceptual understanding matter more&#8212;just as the ability to tackle mathematical problems outweighs rote memorization of formulas.</p><p>To remain competitive, humans must optimize their strengths. AI-enhanced learning offers a scalable solution, and current online educational content, though plentiful, lacks the necessary individuation, optimization, and real-time feedback to maximize learning efficiency.</p><p>A networked AI can learn what students need to know dynamically and empirically, refining its teaching strategies as it gathers data from learners in real-time. It can personalise based on skill, but also optimise to individual psychological make-ups and preferences. This adaptive approach ensures that education remains responsive to individual needs, eliminating traditional delays in curriculum improvement. The tools to implement this revolution exist today&#8212;it is time to use them.</p><p><strong>National Strategy</strong></p><p>There is more to this than just mitigating consequences domestically, nationally, for our children and all of us as workers, business owners, as a society and as citizens &#8211; urgent though this is.</p><p>We can and should be using the global reach and credibility of organisations like the BBC and the Open University to provide similar tuition globally. Such a service might offer British credentials &#8211; perhaps GCSE&#8217;s or &#8216;A&#8217; Levels, marked by AI with far higher consistency than human teachers can manage (see Olex.AI&#8217;s <a href="https://olex.ai/">impressive results</a> in marking English school work), or degrees from our top Universities, MOOCs but enabled by cutting edge tech that could hugely increase the effectiveness and efficiency of knowledge transfer.</p><p>We could use it:</p><blockquote><p>&#183; to help other nations reduce the disruption the AI revolution will bring</p><p>&#183; to reduce migratory pressures on ourselves by helping other nations harness the AI revolution</p><p>&#183; to identify global talent, either for recruitment as part of a global talent spotting campaign, or for the award of accelerated immigration admission for the top performers</p><p>&#183; to identify or qualify for the award of e-citizenship, e-residency or the right to found an e-business covered by UK law, and able to access UK banking and other services.</p></blockquote><p>For those in poor or corrupt countries trying to build global businesses, this could be an invaluable opportunity. </p><p>It may be that poorer countries, and the most disadvantaged in society, benefit most from AI tutoring, overcoming barriers like large class sizes, <a href="https://unesdoc.unesco.org/ark:/48223/pf0000231310">teacher migration</a>, <a href="https://www.unesco.org/en/articles/persistent-teacher-gap-sub-saharan-africa-jeopardizing-education-recovery/">teacher shortages</a>, and teaching by <a href="https://www.unesco.org/en/articles/persistent-teacher-gap-sub-saharan-africa-jeopardizing-education-recovery/">unqualified staff</a>. In some studies of AI, both in educational and professional setting - such as with Tutor Co-Pilot in the Southern US- it has been the lower performers that improve the most when AI assists. </p><p>For the UK, allowing AI talent to earn in the UK while living wherever they want through e-citizenship, e-residency or e-businesses, might raise tax revenues &#8211; in 2023 Estonia <a href="https://news.err.ee/1609257696/e-residents-contributed-67-4-million-to-estonian-state-budget-in-2023">received</a> &#8364;67.4 million in tax from e-citizens and e-businesses, a 33% increase on the previous year. What might the UK raise if not only offered such an option, but proactively awarded it to the highest performing students globally? </p><p>An objection here might be that allowing talent in poorer countries to found businesses in the UK will take tax revenue from the areas where they live. But the alternative is often for them to migrate. By staying where they are they spend locally, benefitting the economy. And once their businesses prosper, shielded by the UK&#8217;s legal system and with access to global markets from the UK, expansion into their country of origin seems likely - from a much more secure base.</p><p>A final caveat: we must remain wary of this early evidence for AI&#8217;s benefits in improving learning. From 2008, Professor <a href="https://www.ted.com/speakers/sugata_mitra">Sugata Mitra</a> gave inspiring talks on the effectiveness of his &#8216;Hole in the Wall&#8217; experiments &#8211; where students in impoverished areas were given access to a computer connected to the internet and self-learned complex subjects with no tuition. In 2013, Mitra won the Ted Prize for his &#8216;<a href="https://www.ted.com/talks/sugata_mitra_kids_can_teach_themselves">Kids Can Teach Themselves</a>&#8217; talk. In 2015 the Times Education Supplement ran a headline <a href="https://www.tes.com/magazine/archive/internet-learning-boosts-performance-seven-years-sugata-mitra-study-finds">claiming</a> &#8216;<em>Internet learning boosts performance by seven years</em>&#8217;. These claims are now <a href="https://20821764.fs1.hubspotusercontent-na1.net/hubfs/20821764/Nomanis_June2023/Pdf/NOMANIS11-JUN21_BENNETT.pdf">disputed</a>, albeit <a href="https://link.springer.com/article/10.1007/s11125-021-09552-y">not debunked</a>. What we should note is the need for experimentation, testing of claims and careful implementation. </p><p>Care and caution must not mean prevarication and delays. It&#8217;s Happening. The AI development and AI impact will likely now scale logarithmically rather than linearly. Like the pandemic, we will feel its effects gradually, and then suddenly. We must respond with the urgency needed, and pre-emptively, not retrospectively. Education, Education, Education, as one Labour leader once put it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[It's Happening]]></title><description><![CDATA[Personal reflections on the AI acceleration]]></description><link>https://blog.cassi-ai.com/p/its-happening</link><guid isPermaLink="false">https://blog.cassi-ai.com/p/its-happening</guid><dc:creator><![CDATA[Keith Dear]]></dc:creator><pubDate>Fri, 07 Feb 2025 13:01:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Wa6Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wa6Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:274883,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 424w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 848w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 1272w, https://substackcdn.com/image/fetch/$s_!Wa6Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8115bc4e-010a-4067-8246-f2fc1c84eb1a_1024x1024.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="pullquote"><p>Artificial Intelligence crossing the Rubicon</p></div><div class="pullquote"><p><em>&#8216;I will show you fear in a handful of dust.&#8217;</em></p><p><em>T.S.Eliot, The Waste Land</em></p></div><p>There is a profound sadness about watching all this unfold. At Christmas, a good friend and deep ML expert at one of the world&#8217;s leading labs wrote in his email circular &#8216;<em>It&#8217;s happening</em>&#8217;. And I felt the same thrill and fear flare in me that comes multiple times daily now. He&#8217;s right. And we can&#8217;t stop it.</p><p>I re-read Sarah&#8217;s <em><a href="https://longerramblings.substack.com/p/a-defence-of-slowness-at-the-end">In Defence of Slowness at the End of the World</a></em> again and again. I find myself back in church on Sunday. I wonder if I should read less for work and go back to theology and philosophy &#8211; passions in my teenage years and twenties. In coping with &#8216;the Coming Wave&#8217; they feel now like they might be more important than anything I have learned since. The thrill is there. More on that another time. But moreover: I am worried.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>I have written and spoken, with my usual mix of passion, evangelism and frustration, about where AI is going and how important it is going to be, for some time. We have named our new company &#8216;Cassi&#8217; after Cassandra the prophetess condemned always to be right and never believed, in part in reference to a nickname given me in previous job. I am very aware that the warnings so many, I included, have advanced on the profundity of this revolution have not had the effect intended. We have failed.</p><p>So much has been written and said describing how disruptive a world of superintelligent AIs will be economically, socially, for national security &#8211; for humanity - that there is little need to requote and reference. Most of us commenting have striven to remain grounded, to avoid sounding &#8216;shrill&#8217;, to be sufficiently sober to retain a place with &#8216;the adults in the room&#8217;. But even now, as <em>it is happening</em> it remains a fringe issue. Acknowledged by Governments in words but not really in deeds. Treated with amusement and disdain by many in our governing and media classes. As if it is something we can worry about later. Part in consequence, it remains largely outside of electoral consciousness, with few doing the work to wake people up to what is coming. </p><p>Most thought &#8216;Don&#8217;t look up&#8217; was about climate change. But not for me.</p><p>We are not ready. It is too late to be ready. It is too late to stop it.</p><p>I won&#8217;t pretend to know precisely how all this will play out. As ever, we do our best to grasp what Peter Hennessey called &#8216;the thin whisps of tomorrow&#8217;. That&#8217;s been central to my job for 22 years now. Forecasting breeds humility.</p><p>As Sarah writes</p><blockquote><p>&#8216;<em>Those among us that are <a href="https://situational-awareness.ai">Situationally Aware</a> must be on guard against arrogance. To anticipate some transformation that most live in ignorance of can easily breed self-importance. &#8230;worse, it can degrade the way that we perceive the efforts of others. &#8230;I don&#8217;t want to presume they&#8217;d act any differently if they knew (what I think) I know</em>.&#8217;</p></blockquote><p>Somewhere else, she wrote acknowledging we could be wrong &#8211; maybe some of us, and I one of them, have drunk the kool aid. Maybe, despite my protestations in &#8216;<a href="https://cassiai.substack.com/p/defence-and-artificial-general-intelligence">In Athena&#8217;s Arms</a>&#8217; (on AGI) we have succumbed to a cargo cult. In many ways, I hope so.</p><p>But until that becomes apparent, what I think I know is:</p><p>1. It is happening</p><p>2. We can&#8217;t stop it</p><p>3. We are not ready.</p><p>My view is that so much is unknown that our estimate of the probability of extinction risk from AI cannot be zero. But I do think it is highly unlikely (&lt;5%). More likely is radical economic disruption, and social unrest. My biggest worry, because it feels more real and imminent, is that the things AI will take from us are the things I enjoy the most in my work &#8211; the search for new and novel insights, the act of creation, of communication, of anticipation and discovery - and yes, of struggle, and effort, the reward that comes from success at the end of it, or the consolations and lessons that exist in failure. I worry that even if we create a new economy, where intelligence is cheap and the AIs play the dominant role but we still work, the things I find most meaningful will be the things of least value.</p><p>I will keep writing, and play my part in what is unfolding as best I can. Because you can&#8217;t stop this, I would rather be on the upside of the bet than the downside. I think you can probably do more good to mitigate the disruption by being a part of the revolution than by sitting on the sidelines and commenting. I will try to enjoy the thrill of progress and do my bit, and my best to help others, to ameliorate the risks and harness the opportunities of what is coming. There are, of course potential radical upsides too, much greater prosperity, better health - some would say &#8216;curing death&#8217;, fully-automated luxury communism, Bostrom&#8217;s &#8216;Deep Utopia&#8217;. But I don&#8217;t think we   get there without significant social and political unrest. &#8216;Significant&#8217; being something of an understatement. </p><p>I worry that my writing breathlessly of progress and development is too easily misconstrued as delighting in the disruption. I write this because I want to be understood.  There is a thrill to being a small part of - or even just being around to witness - the most far-reaching revolution in human history. But there is also a profound sadness about watching all this unfold. It&#8217;s happening.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.cassi-ai.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Cassi AI is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>