<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Guido]]></title><description><![CDATA[Discusses Artificial intelligence ]]></description><link>https://fullmetalresearcher.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!YnYP!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Ffullmetalresearcher.substack.com%2Fimg%2Fsubstack.png</url><title>Guido</title><link>https://fullmetalresearcher.substack.com</link></image><generator>Substack</generator><lastBuildDate>Thu, 21 May 2026 03:07:41 GMT</lastBuildDate><atom:link href="https://fullmetalresearcher.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Full Metal Researcher]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[fullmetalresearcher@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[fullmetalresearcher@substack.com]]></itunes:email><itunes:name><![CDATA[Guido]]></itunes:name></itunes:owner><itunes:author><![CDATA[Guido]]></itunes:author><googleplay:owner><![CDATA[fullmetalresearcher@substack.com]]></googleplay:owner><googleplay:email><![CDATA[fullmetalresearcher@substack.com]]></googleplay:email><googleplay:author><![CDATA[Guido]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The Intelligence Layer ]]></title><description><![CDATA[The current business model of a frontier lab is unstable.]]></description><link>https://fullmetalresearcher.substack.com/p/the-intelligence-layer</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/the-intelligence-layer</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sat, 09 May 2026 15:02:34 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c1d563a2-b487-49e9-a2a1-98ee0900cd5f_3840x2303.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The current business model of a frontier lab is unstable. The common wisdom goes something like this: you rack up massive losses on each new generation of models until one finally becomes good enough to recursively improve itself. At that point, everything pays for itself retroactively.</p><p>A deployed model may eventually cover its own costs. But then the next model comes and eats the surplus before the last one has time to look profitable. Each generation should fund itself and the next one. right?</p><p>So what comes next? I think the current generation of models are intermediate models (everything between here and AGI). They are not the final product. They are expenses that bought the labs two things: proof that LLMs can perform valuable work, and a live experimental surface for discovering where intelligence can be monetized. &#8220;But the world does not reward raw intelligence on its own&#8221;.</p><p>That brings us to the next phase: capturing the labor margin of professional knowledge work. You can already see it happening. Anthropic and OpenAI are moving toward joint ventures with the very firms that sell professional services. The labs are not spinning up their own consultancies or private equity funds. What they want is simpler and smarter. <strong>They want to become the intelligence layer behind a body they do not have to build</strong>.</p><p>A frontier lab has three things: compute, weights, a few hundred exceptional engineers.</p><p>Theoretically speaking, to actually deliver the work of a top-tier professional services firm, you need thousands of operators, deep vertical expertise, regulatory licenses, client relationships, compliance infrastructure, and distribution built over decades. Labs have none of that. Building it would take years and starve the training pipeline. So instead <strong>they can license the intelligence and take a minority equity stake in someone else&#8217;s body. </strong>The operator runs the business. The lab keeps the inference revenue, owns a piece of the upside, and avoids building the operating company itself.</p><p>But why would the operator give the lab equity at all? Why not just rent the model and keep the margin? Because renting the public model is not the same as getting the lab&#8217;s frontier roadmap, embedded engineers, deployment priority, and feedback loop. The equity stake is the price of being close to the source of capability before it commoditizes. And why would the customer pay? Because the unit being priced is the deliverable, not the token. A pitchbook costs thousands in analyst and MD time. The model will draft one for a few dollars. The margin does not live in the token. It lives in the MD&#8217;s hour.</p><p>The same logic applies across any regulated knowledge work vertical where the customer buys judgment, speed, and liability management rather than software directly.The lab does not need the customer to pay more for intelligence. It needs the customer to keep paying professional-services prices while the cost of delivery collapses underneath.</p><p>The obvious objection is that a limited number of engineers and senior operators plus a model cannot replace the huge teams firms use today. But huge teams are not pure productivity machines. They are coordination machines. <strong>Once a firm grows large enough, a shocking amount of the marginal hour is spent moving information between people: status meetings, alignment calls, spreadsheet reconciliation, deck review, version control theater, and the quiet bureaucracy of keeping everyone pointed in the same direction</strong>.</p><p>A model does not create its own coordination layer. A small forward-deployed team plus a model pays far less of the coordination tax that consumes a large firm. It does not need to replace every person. It only needs to produce the same deliverable with fewer people, fewer handoffs, and less internal drag.</p><p>The customer is willing to pay something between the cost of the old giant team and the cost of the new small one. The gap is the surplus, and it gets split three ways: between the customer, the operator, and the lab. <strong>The lab&#8217;s share is what could fund the next training run</strong>. But that gap only stays open if the model deployed inside the venture is meaningfully better at the vertical than a fresh open-weights model anyone can rent. This is where the feedback loop becomes everything.</p><p>The model works on real vertical problems. That work generates proprietary signal: workflow traces, edge cases, corrections, evaluation data, user preferences, failure modes, and tacit process knowledge. That signal makes the model better at the vertical. A better model creates more surplus. More surplus funds deeper deployment. Deeper deployment generates more signal. The exact form of the feedback matters less than whether the loop exists at all. I think the surplus in regulated knowledge work verticals is large enough to leave room for high margins. And I think this is the clearest path by which frontier labs become economically durable before AGI.</p><p>The money will not show up as token revenue. It will show up as captured labor margin: software costs hidden inside professional services pricing, protected by proprietary workflow data, and compounded through vertical deployment.</p><p>All this is speculative.</p>]]></content:encoded></item><item><title><![CDATA[AGI May Not Look Like One Mind]]></title><description><![CDATA[Most debates about AGI still imagine a single system crossing a line.]]></description><link>https://fullmetalresearcher.substack.com/p/agi-may-not-look-like-one-mind</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/agi-may-not-look-like-one-mind</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Wed, 06 May 2026 10:27:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Ru65!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Most debates about AGI still imagine a single system crossing a line. One day it is an LLM. The next day it is "a general intelligence". We wait for the moment when the machine becomes &#8220;as smart as a human," as if intelligence were a height chart and AGI were a mark on the wall.</p><p>I don't think that is the right picture.</p><p>AGI may not arrive as one model that suddenly becomes human. It may arrive as a system. Things that together become capable of pursuing open-ended goals.</p><p>At the most basic level, AGI is usually imagined as a system that can do many different things at least as well as a human being. That separates it from narrow AI: systems that can be superhuman in one domain and useless outside it. Deep Blue could beat the world chess champion, but that was it. It was good at chess. </p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ru65!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ru65!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ru65!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg" width="473" height="307.6422764227642" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:615,&quot;resizeWidth&quot;:473,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Picture of the Day: Deep Blue Defeats Kasparov - The Atlantic&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Picture of the Day: Deep Blue Defeats Kasparov - The Atlantic" title="Picture of the Day: Deep Blue Defeats Kasparov - The Atlantic" srcset="https://substackcdn.com/image/fetch/$s_!Ru65!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Ru65!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64c1f078-d276-4df8-b001-55ae7921a3a2_615x400.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Deep Blue Defeats Kasparov (May 11 1997)</figcaption></figure></div><p>But even that definition is too thin.</p><p>OpenAI's charter defines AGI as highly autonomous systems that outperform humans at most economically valuable work. DeepMind researchers have proposed a more layered view... The question is not only "How smart is the system?" The question is also: smart at what, across how many domains, and with how much independence?</p><p>This is where I think the AGI debate often gets stuck. We focus too much on knowledge and not enough on agency.</p><p>A model that knows a lot is impressive. But intelligence is not just knowing. Intelligence is choosing, planning, act&#8230; (Now I'm just writing the REACT pattern&#8230;)<br><br></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dg1k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dg1k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 424w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 848w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 1272w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dg1k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png" width="369" height="387.8838174273859" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:760,&quot;width&quot;:723,&quot;resizeWidth&quot;:369,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;ReAct agent from scratch with Gemini 2.5 and LangGraph&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="ReAct agent from scratch with Gemini 2.5 and LangGraph" title="ReAct agent from scratch with Gemini 2.5 and LangGraph" srcset="https://substackcdn.com/image/fetch/$s_!dg1k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 424w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 848w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 1272w, https://substackcdn.com/image/fetch/$s_!dg1k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3174dda-de71-40f9-ae34-c89b738f41b8_723x760.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">React pattern (source: Philschmid)</figcaption></figure></div><p>To think about machine intelligence, I find it useful to start with human intelligence (Honestly, I just need a starting point).</p><p>If you want to measure a human being's intelligence, you might give them an IQ test. You might make them solve Raven's matrices. Convenient and fast. But also very limited.</p><p>Puzzles are not life.</p><p>A person's intelligence is not only the ability to solve a clean problem placed in front of them. It is also the ability to decide what problems matter.</p><p>That, to me, is the center of the issue.</p><p><strong>Intelligence has at least two parts: goal setting and goal achievement.</strong></p><p>What do you want?</p><p>That question tells me a lot about a person. A goal contains taste, ambition, fear, imagination, and values. Then comes the second question: given that goal, how do you get there?</p><p>Now we are in search territory. How many paths can you imagine? How many can you evaluate? Can you balance planning and acting? Can you change course when the world pushes back?</p><p>This is why tool use and planning matter so much.</p><p>You can argue that autoregressive models do not "really" plan. Fair. But that objection matters less if the system can still help achieve the goal. <br></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LGW1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LGW1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 424w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 848w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 1272w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LGW1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png" width="420" height="322.1546961325967" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:833,&quot;width&quot;:1086,&quot;resizeWidth&quot;:420,&quot;bytes&quot;:224419,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/196638759?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aebe67a-1f12-4436-9379-0bb26e4b9f96_1086x868.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LGW1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 424w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 848w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 1272w, https://substackcdn.com/image/fetch/$s_!LGW1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba61f70b-f67b-4f32-9167-bee168fae48f_1086x833.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">LeCun on planning</figcaption></figure></div><p>The practical question is not whether the model plans in the same way a human plans. The practical question is whether it can decompose a task, use tools, observe results, update its strategy, and keep moving toward an objective. <br><br>"Keep Moving Forward"</p><p>Increasingly, the answer is yes.</p><p>There is another reason I distrust the "one mind" picture of AGI. Human intelligence is not individual in the way we pretend it is.</p><div class="callout-block" data-callout="true"><p>Alone, I am limited. Almost weak. I am what I am because other people built the infrastructure around me. I can think the way I think because generations of people compressed their discoveries into tools I can use without understanding how they were built (I mean&#8230; to some extent).</p></div><p><strong>My intelligence is not only inside my skull. It is distributed across the systems that support me</strong>.</p><p>So why should AGI be different?</p><p>Maybe AGI should not be imagined as one lonely model sitting in a box. Maybe it will look more like a society of agents!</p><p>An artificial society, not just an artificial mind!</p><p>This also changes how we should think about embodiment. My intelligence is general partly because I have a body. Dexterity expands the number of goals I can achieve. It lets me move through more environments. It lets me test the world directly.</p><p>A disembodied model can still be powerful. But physical ability changes the size of the action space. A model that can reason about fixing a sink is useful. A Figure 03 robot that can actually fix the sink is something else.</p><p>That does not mean AGI requires a humanoid robot. It means intelligence is partly defined by what it can do. A system connected to browsers, code environments, laboratories, factories, and robots has a much larger action space than a model trapped in a text box.</p><p>This is why the question "Are current models AGI?" feels too narrow.</p><p>A raw model may not be AGI. But a raw model is not the full product. The real system is the model plus tools, scaffolding, and so on&#8230; Judging the model alone may be like judging a human brain without language, hands, culture, or society.</p><p>So how do we build AGI?</p><p>My current intuition is simple: we need large neural networks, massive compute, strong infrastructure, and high-quality data. Maybe AGI arrives as a successor to today's language models. Maybe not. Perhaps the missing piece is a better world model. </p><p>I don't know. Really.</p><p>But I know what kind of test would impress me.</p><p>A useful test would not only ask whether a system can answer hard questions. It would ask whether the system can create knowledge. Demis Hassabis has suggested a version of this idea: train a system only on knowledge available before Einstein's general relativity, then see whether it can independently reach the theory. I like that test because it asks for more than memory. It asks for scientific creation.</p><p>That is closer to intelligence.</p><p>We will know we are near AGI also because systems start doing long-horizon work in the world (check METR). They will set subgoals, use tools, recover from mistakes, coordinate with other systems, and produce outcomes that previously required teams of humans.</p><p>That is the threshold that matters.</p><p>The most consequential moment may not be when AI becomes as smart as a person. It may be when AI becomes good enough to improve AI itself.</p><p>Example: GPT-5.5 helps build GPT-5.6. GPT-5.6 helps build GPT-6. Then a future system discovers a new architecture that produces the same intelligence at a fraction of the compute.</p><p>That is the <strong>intelligence explosion</strong> scenario. Maybe it happens. Maybe it doesn't. But if it does, the world changes faster than our institutions can understand.</p><p>This is why I think the AGI debate should move away from the fantasy of one machine waking up.</p><p>The danger, and the opportunity, is more subtle. <strong>We may build AGI the way we built civilization: not as a single mind, but as a network of capabilities layered on top of each other until the whole becomes more powerful than any part.</strong></p><p>Thanks for reading. </p>]]></content:encoded></item><item><title><![CDATA[How I'm Using Agent Skills to Build a Financial Research App]]></title><description><![CDATA[Claude Code for Dummies: Part IV]]></description><link>https://fullmetalresearcher.substack.com/p/how-im-using-agent-skills-to-build</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/how-im-using-agent-skills-to-build</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Mon, 23 Feb 2026 17:39:08 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/de2f5e60-a062-4be1-b45d-57069b8298d5_832x1248.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>You know that feeling when You've just explained the same workflow to a Large Language Model for the fifth time this week? You open a new session, type a wall of context about how you want things done, wait for it to process, and then (maybe) You get something close to what You need.</p><p>There's a better way. And it starts with a folder and a markdown file. You need skills&#8230;</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h3>What Are Skills?</h3><p>Anthropic describes it like this: building a skill for an agent is like putting together an onboarding guide for a new hire.</p><p>I like this analogy. Think about what happens when someone new joins your team. You don't explain the entire functionalities of an application every morning. You point them to a wiki, hand them a process doc, maybe a checklist. They refer to it when needed and eventually they stop needing it.</p><p>Skills are that wiki, but for Claude.<br></p><p>At their core, skills are folders of organized files that agents can use to perform a specific task accurately. Each skill lives in its own folder and contains at minimum one file: <code>SKILL.md</code>. That file starts with a <strong>YAML</strong> frontmatter block containing two pieces of metadata  (a <code>name</code> and a <code>description</code>) followed by the actual instructions.</p><pre><code><code>---
name: evaluating-full-research
description: &gt;-
  Evaluates the current state of financial research output
  by running Playwright tests and providing structured feedback
  with improvement suggestions.
---</code></code></pre><p>That's the skeleton. The description is doing the heavy lifting, it's how Claude decides whether to activate the skill when you ask it to do something. Keep in mind that no manual selection is involved. If your request matches the description, Claude loads the skill and follows the instructions.</p><p>If You've been following this series, you already know about context engineering. Skills are a masterclass in it. The design follows what Anthropic calls <em><strong>progressive disclosure</strong></em>: Claude pre-loads only the name and description of each skill into the system prompt (level 1). If it decides the skill is relevant, it reads the full <code>SKILL.md</code> (level 2). And if the skill references additional files (checklists, scripts, output templates) Claude reads those only when needed (level 3 and beyond). The context window stays lean and we somewhat move to a more deterministic domain.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hDDm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hDDm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 424w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 848w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 1272w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hDDm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp" width="1456" height="820" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;This image depicts how skills are triggered in your context window.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="This image depicts how skills are triggered in your context window." title="This image depicts how skills are triggered in your context window." srcset="https://substackcdn.com/image/fetch/$s_!hDDm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 424w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 848w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 1272w, https://substackcdn.com/image/fetch/$s_!hDDm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb054d7e-12ce-43dd-9db8-63cf5e93cef5_1650x929.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I think looking at this image helps understand the matching mechanism.</figcaption></figure></div><p>I'd point you to <a href="https://claude.com/blog/equipping-agents-for-the-real-world-with-agent-skills">Anthropic's blog post</a> and the <a href="https://docs.claude.com/en/docs/agents-and-tools/agent-skills/overview">official docs</a>. They explain it well and I don't need to repeat them here. What I want to focus on is what skills look like when You're actually building with them.</p><h3>Why I Need Skills for What I'm building</h3><p>Let me make this concrete. I've now built the first version of a multi-agentic framework for financial analysis. When a user asks it to analyze a stock, the system tasks different agents to work on different parts of the research, one gathers and cleans financial data, another assesses risks, an agent gather market news, and so on... The end result is a structured report.</p><p>After every major update to this system, I need to verify that everything still works. So I ask Claude Code to use Playwright, run a research flow on a list of stocks, and check the output against a QA checklist. I do this constantly. And every time I do it, I'm burning context tokens re-explaining the same workflow: which stocks to test, what the checklist looks like, what format the output should be in, which Playwright scripts to run.</p><p>This process is thus the obvious skill candidate. Package the checklist, the scripts, the output format, and the instructions into a folder. Claude loads it automatically when I say something like "run the regression tests on Nvidia."</p><p>Sometimes I want Claude to act as a portfolio management expert, to evaluate the current state of a research output. Look at what the agents produced, assess the analysis quality, identify gaps in the reasoning, and give me structured criticism. This requires a specific persona, a specific knowledge base, and a specific output format. Every time I set this up manually, it takes several minutes of context-setting before we even start.</p><p>A skill solves both problems. The first is about <strong>automation</strong>, deterministic scripts that run the same way every time. The second is about <strong>expertise</strong>, packaging a persona and evaluation framework that Claude can adopt instantly. Skills handle both.</p><p>And the potential goes further. Imagine a skill that runs the Playwright tests, evaluates the output quality, provides structured improvements, and then (if I allow it) spawns sub-agents to work on those improvements. For a solo builder, that's the difference between spending an afternoon on QA and spending twenty minutes reviewing what Claude already did. Honestly, I don't actually use the sub-agents that often. Most of the time I'm not able to effectively follow multiple agents in parallel, so I end up working one step at a time anyway.</p><h3>The Rule: When You Repeat, You Build</h3><p>The general principle is simple: i<strong>f You find yourself repeating a workflow to get a task done, there's a case to build a skill for it.</strong></p><p>As you iterate on different projects with Claude Code, You'll naturally develop two categories of skills:</p><p><strong>Horizontal skills</strong> cut across projects. Code review workflows. Documentation generation. Commit message formatting. Regression testing patterns. You build them once and reuse them everywhere. These are the skills that make <em>you</em> faster as a developer, regardless of what You're building.</p><p><strong>Vertical skills</strong> are project-specific. For my application, that's evaluating financial analysis output, running sector-specific research flows, validating portfolio allocation logic. These skills only make sense in the context of one application, but they're where the deepest automation lives.</p><p>A future skill I'll write will probably be something around prompt refactoring, evaluating the prompts I use in the multi-agent system and suggesting improvements based on best practices.</p><h3>Best practices</h3><p><strong>Naming:</strong> lowercase, dashes, <code>-ing</code> form &#8212; <code>evaluating-full-research</code>, not <code>EvaluateFullResearch</code>. The <code>-ing</code> signals an active process. The name is part of how Claude pattern-matches your request to the right skill, so sloppy naming means wrong skill calls. This gets worse as you accumulate skills. Monitor whether Claude triggers the one you expect.</p><p><strong>Description:</strong> answer two things what does the skill do, and when should Claude use it. Include trigger conditions like "after major updates" or whatever You want (lol). Don't be too vague.</p><p><strong>SKILL.md body:</strong> step-by-step instructions (don't assume Claude figures out the order), input/output format with concrete examples, common edge cases (what if a ticker doesn't resolve, what if Playwright times out), and degree of freedom&#8230; how much autonomy Claude gets. Be explicit about boundaries. If You don't define guardrails, agents make assumptions you didn't intend.</p><p><strong>Practical limits:</strong> keep <code>SKILL.md</code> short, move detailed reference material to a <code>references/</code> subfolder, keep references one level deep. Simplicity reduces errors.</p><h3>Why Code Execution Matters</h3><p>When You're using LLMs, You're operating in a stochastic world. Token generation is non deterministic by nature. But some tasks need deterministic reliability.</p><p>Skills can include scripts that Claude executes rather than interprets. Claude doesn't need to load the script into context. It runs it, reads the output, and proceeds.</p><p>This is the boundary between deciding and doing. Claude decides what to run and when. The script does the work. <strong>The intelligence lies in the orchestration, not the execution.</strong></p><p>For M&#178; Research, this distinction matters enormously. I don&#8217;t want Claude generating a new test logic on the fly, I want it running the exact same test suite every time so I can trust the results. The stochastic part is Claude's evaluation of what those results mean and what to do about them. The deterministic part is the testing itself.</p><h3>What You're Already Using (And What You Can Build)</h3><p>If You've ever asked Claude to create a PowerPoint, fill a PDF form, or build an Excel spreadsheet, You've been using skills without knowing it. These skills are available by default and so is the skill-creator (If I'm not mistaken).</p><p>The skill-creator is itself a skill. It's a skill that builds skills. Claude uses a packaged set of instructions about skill architecture to help you scaffold new ones. You describe your workflow and Claude generates the folder structure, formats the <code>SKILL.md</code>, writes the YAML frontmatter, and bundles the supporting resources.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CB7b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CB7b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 424w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 848w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 1272w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CB7b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png" width="622" height="317.9220985691574" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:643,&quot;width&quot;:1258,&quot;resizeWidth&quot;:622,&quot;bytes&quot;:235818,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/188918397?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CB7b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 424w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 848w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 1272w, https://substackcdn.com/image/fetch/$s_!CB7b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e60ae25-63b3-4289-95c9-2303cd3448da_1258x643.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Claude reading the skill-creator skill to generate one.</figcaption></figure></div><p>My recommendation though? Build your first few manually. You'll understand the mechanics better, write sharper descriptions, and develop an intuition for progressive disclosure. Even if it sucks. Let Claude handle the scaffolding once you know what good looks like.</p><p>One more thing worth knowing: skills are an <a href="https://agentskills.io">open standard</a>. Since December 2025, a skill you build for Claude Code works with OpenAI's Codex, Gemini CLI and similar platforms. You are not limited to the Anthropic environment. </p><p>One simple thing to understand</p><p>Skills look deceptively simple. A folder, a markdown file, some YAML. But the design principles underneath, progressive disclosure, the separation of deciding from doing, the discipline of naming and describing precisely, these are the same principles that make the difference between a useful AI workflow and a frustrating one.</p><p>Start small. Package one workflow you repeat often. See how it changes your sessions. Then build from there.</p><p>Thanks for reading</p><div><hr></div><h3>Resources</h3><ul><li><p><a href="https://claude.com/blog/equipping-agents-for-the-real-world-with-agent-skills">Equipping Agents for the Real World with Agent Skills</a> &#8212; the engineering deep-dive on architecture and design</p></li><li><p><a href="https://claude.com/blog/skills">Introducing Agent Skills</a> &#8212; product announcement with partner integrations</p></li><li><p><a href="https://docs.claude.com/en/docs/agents-and-tools/agent-skills/overview">Agent Skills Documentation</a> &#8212; setup, usage, and best practices</p></li><li><p><a href="https://code.claude.com/docs/en/skills">Claude Code Skills Documentation</a> &#8212; invocation control, subagents, dynamic context</p><p></p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Agent Teams]]></title><description><![CDATA[Claude Code for Dummies: Part III]]></description><link>https://fullmetalresearcher.substack.com/p/agent-teams</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/agent-teams</guid><pubDate>Sat, 21 Feb 2026 23:27:27 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c668c9b9-f27c-4947-bc6e-63326570cd8f_832x1248.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>If You've been experimenting with large language models for a while, You might have noticed that in the past months things got noticeably better. <strong>We're now able to achieve more with less</strong> and this, after all, is the meaning of technology.</p><p>This improvement can be attributed to two main things:</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>First, models can now use an increased number of sophisticated tools to perform their work. If You've read the first article of this series, You know that tools are the actions that a language model can request to interact with Your computer and the external world. More tools, more capability&#8230; same brain, better hands.</p><p>Second, models can now work on a task for an extended period of time.</p><p>METR published yesterday (20 Feb 2026) a post where it shared Opus 4.6's time horizon. If You don't know what that is, it measures the human completion time of the hardest tasks a model can solve with at least 50% reliability. As IQ correlates to other forms of intelligence, task length tends to correlate to other performance-related measures so it can be used as a proxy for models getting smarter.</p><p>Opus 4.6 scored around 14.5 hours. For context, Opus 4.5 was around 5 hours just a few months ago.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oxZk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oxZk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oxZk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg" width="1456" height="869" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:869,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!oxZk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oxZk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89169c98-48b2-452c-908c-97437ac6332c_1564x933.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is great. Like, I mean it. When I started using OpenAI models a few years ago, they could work for a few minutes and most research I was reading pointed out limited task duration as one of the main bottlenecks for reliable uses of AI models in knowledge work and research.</p><p>Yes but so what? Claude can work for longer but can it achieve anything meaningful?</p><p>Yes! Otherwise I would have to stop this article right now (lol).</p><h2>The C Compiler Experiment</h2><p>Earlier this month a short video published by Anthropic went viral. It was about Claude Code building a C compiler that could also run Doom.</p><div id="youtube2-vNeIQS9GsZ8" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;vNeIQS9GsZ8&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/vNeIQS9GsZ8?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>$20000 in API cost. 100000 lines of code. Two weeks. And here's the part that blew my mind: the human mostly walked away and let it run.</p><p>But the thing I'm most interested in TBH is the approach that was adopted. <strong>The compiler was built using a parallel agent approach</strong>, the same idea that later became the "<strong>agent teams</strong>" feature in Claude Code. They used 16 Claude instances running in parallel, each in its own Docker container, each picking up different tasks from a shared list.</p><p>One agent was fixing bugs, another was cleaning up duplicate code, another was improving compiler performance, another was writing documentation, and another was critiquing the overall code quality from the perspective of a Rust developer. Each one had a narrow focus. </p><p>The narrow focus is not a minor detail. It's actually the core reason this approach works.</p><h2>Why Splitting Work Matters (Context Engineering, Again)</h2><p>I wrote a while ago an article on context engineering. In brief, if we're not detailed in our request, an agent will start reading the entire codebase and will get confused. For this reason, You give it the correct context by providing the right CLAUDE.md and linking the appropriate documents.</p><p>One thing You need to keep in mind is that as the agent iterates on multiple tasks, the context window approaches saturation, and, as a result, it increases the probability of hallucination and generating mistakes.</p><p>Anthropic published an excellent engineering post on context engineering that makes this point clearly. They describe a technique called <strong>compaction</strong> where, basically, when a conversation nears the context window limit, You summarize its contents and start a new context window with the summary. Claude Code already does this automatically. But even with compaction, a single agent doing ten different things will accumulate noise. Tool outputs pile up. Irrelevant information from task A pollutes the reasoning on task C.</p><p>The C compiler experiment showed exactly this problem. the researcher who ran the experiment, specifically warns about context window pollution. </p><p>Now imagine this&#8230; You have a single Claude session trying to fix a regex in one file, refactor a selection algorithm in another, update prompt templates in a third, and debug a frontend display in a fourth. By the time it gets to fix number four, the context is full of information about regex patterns and selection logic that has nothing to do with React (the frontend) components and SSE events.</p><p>Splitting the work across different agents can help us partially overcome the issue. <strong>Each agent gets a clean context window focused on its specific task.</strong> The backend agent doesn't need to know about the frontend. </p><p>Obviously, a disciplined approach to giving the right information and compaction rules can help us too. But there's a ceiling to how much one can squeeze into a single context window before things start degrading.</p><h2>So How Do Agent Teams Work?</h2><p>I think of agent teams like working on a report with Your colleagues. Imagine You've been tasked by your manager to write a super detailed report on a stock, let's say Nvidia.</p><p>The manager will task one person to gather and clean financial data, another to assess geopolitical risks, another to check the options market for that stock, and another to figure out how all the pieces affect the valuation and produce the final report (Yes I know strange tasks). I'm simplifying here but perhaps You got the point. This enables in some instances to achieve the objective faster and to get a more detailed and precise output.</p><p>Now You can create a similar workflow for your tasks with agent teams.</p><p>The essence of agent teams is summarized in the official Claude docs: "<em>Agent teams let you coordinate multiple Claude Code instances working together. One session acts as the team lead, coordinating work, assigning tasks, and synthesizing results. Teammates work independently, each in its own context window, and communicate directly with each other.</em>"</p><p>So there are three things to understand:</p><p>The <strong>team lead</strong> is the session You're sitting in. It coordinates work, assigns tasks, and synthesizes results. Think of it as the manager who distributes the report sections to the team.</p><p><strong>Teammates</strong> are independent Claude Code sessions, each with their own context window and their own focus. They're the analysts working on their specific sections.</p><p><strong>Communication</strong> happens through a shared task list and direct messaging. This is the key difference from subagents. Subagents can only report back to the parent, think of them as assistants who do a task and come back with results. Agent team members can message each other directly, challenge each other's findings, and coordinate without going through the lead.</p><p>One important thing from the Anthropic engineering post on long-running agents: each agent needs to leave the environment in a clean state when it finishes. That means no major bugs introduced, orderly code, descriptive git commits. This is critical because the next agent that picks up work needs to orient itself quickly. The C compiler experiment used a simpler version of this same pattern, agents maintained progress files and README docs specifically so that each new session could figure out what had been done and what needed to happen next.</p><h2>But Wait, I Already Run Multiple Terminals</h2><p>Fair point. When I have to work on multiple tasks that can run in parallel, I'm used to just opening multiple terminal windows. I share my detailed prompt and then supervise the agents to check what they're doing.</p><p>The problem is that this takes time and the agents (multiple instances of Claude Code) don't work together. They don't share context. They don't know what the others are doing. They can't coordinate.</p><p>What if I need agents to share information when working on a difficult task list and iterate on what they find? Well, this is the use case for agent teams.</p><p>One thing You need to understand is that right now this is not a tool You can use by default, plain vanilla. We need to follow a few simple steps to run it.</p><h2>Setting It Up</h2><p><strong>Step 1: Enable agent teams.</strong> This is one environment variable. That's it.</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;2da07b3f-fa75-41fd-bdec-7958ac4a2d7f&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">{
  "env": {
    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
  }
}</code></pre></div><p><strong>Step 2: Install tmux.</strong> This is not a hard requirement, but this is how, You get the split view (I mean if You want that). Tmux allows You to run multiple terminals (hence multiple instances of Claude Code) in a single window. As a result, You'll be able to see the team lead and its teammates work on the task list in the same window.</p><p><strong>Step 3: Give the team lead Your task list.</strong> Here I tell the team lead what my goal is for this session and I point to an .md file I've created with the list of tasks. The file is detailed enough that I don't need to be super specific in this context window.</p><h2>Watching It Work</h2><p>The collaboration between the agents is the real strength here. I'll attach some screens from my current interaction with claude code. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yJ9B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yJ9B!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 424w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 848w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 1272w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yJ9B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png" width="1456" height="227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:227,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:41966,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/188743165?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yJ9B!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 424w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 848w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 1272w, https://substackcdn.com/image/fetch/$s_!yJ9B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09c54cbe-b672-4481-b78c-b2fbde0446a3_1576x246.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sbtF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sbtF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 424w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 848w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 1272w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sbtF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png" width="1456" height="140" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:140,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:82161,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/188743165?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sbtF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 424w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 848w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 1272w, https://substackcdn.com/image/fetch/$s_!sbtF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F839cdebe-2060-41a0-a9e8-012bb34e7a99_2416x232.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>As You might notice, now there are more Claude Code sessions in the same window.</p><p>Eventually you can activate the split view and use CTRL + B and arrow keys to navigate the terminals to interact with each agent individually. You can check in on teammates' progress, redirect approaches that aren't working, and synthesize findings as they come in. The docs actually recommend doing this, letting a team run unattended for too long increases the risk of wasted effort. You are monitoring the situation. </p><h2>Plan Mode Works Here Too</h2><p>As You might be aware, when working on a difficult task You can just press Shift+Enter in the terminal to move to plan mode where the agent creates a detailed plan and to-do list before starting. This to-do list obviously will be shared across the different agents.</p><p>You can do the same thing when running teammates by specifying that in the prompt. For complex or risky work, You can require teammates to plan before implementing, the teammate works in read-only plan mode until the lead approves. If rejected, they revise and resubmit.</p><p>This is probably the smartest pattern I've seen: plan first with plan mode (cheap), then hand the plan to a team for parallel execution (expensive but fast). The plan gives You a checkpoint before committing tokens.</p><h2>The Honest Part</h2><p>One thing to consider: <strong>the process is very token-intensive</strong> so You could end up burning through your quota very soon. Each teammate is a full Claude Code session with its own context window. The math is simple: more agents = more tokens = more cost (duh). </p><p>For context, the Anthropic docs suggest reserving agent teams for work that genuinely benefits from multiple perspectives working in parallel. For simpler tasks, a single session or subagents are more cost-effective.</p><p>Some important limits to keep in mind:</p><ul><li><p><strong>File conflicts are real.</strong> Two teammates editing the same file leads to overwrites. You need to be super detailed with the task list, and also&#8230; You need to check each agent.</p></li><li><p><strong>It's still experimental.</strong> Agent teams are in research preview with known limitations. Things might break.</p></li><li><p><strong>Start small.</strong> The docs recommend starting with tasks that have clear boundaries and don't require writing code. These tasks show the value of parallel exploration without the coordination challenges that come with parallel implementation. (I'm trying it right now on tasks that require code but I can revert if thing break)</p></li><li><p><strong>Testing matters more, not less.</strong> When You can't watch every teammate, the risk of an agent marking something as done without proper testing goes up. </p></li></ul><h2>Cleaning Up</h2><p>When You're done, clean up the team. The lead can shut down teammates and synthesize results.</p><h2>Where This Fits</h2><p>In Part I of this series we saw that Claude Code is a single-threaded loop, one brain, thinking step by step, using one tool at a time. The power came from the discipline of that simplicity.</p><p>Agent teams are what happens when that single loop isn't enough. The human's job shifts to designing work decomposition. Which tasks can run in parallel? Which files does each agent own? Where are the dependency chains?</p><p>That's the vibecoder's real skill. No wait&#8230;.<br><br> To some extent, more than the vibecoder, this is the pure software engineer's real skill. I'll elaborate on that in the future since now I'm very tired and it's also Saturday night.</p><p>Thanks for reading.</p><div><hr></div><p><strong>Sources &amp; Further Reading</strong></p><ul><li><p>Anthropic, <a href="https://www.anthropic.com/engineering/building-c-compiler">Building a C compiler with a team of parallel Claudes</a>(2026)</p></li><li><p>Anthropic, <a href="https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents">Effective context engineering for AI agents</a>(2025)</p></li><li><p>Anthropic, <a href="https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents">Effective harnesses for long-running agents</a></p></li><li><p>Anthropic, <a href="https://code.claude.com/docs/en/agent-teams">Claude Code Agent Teams documentation</a></p></li><li><p>METR, <a href="https://metr.org/time-horizons/">Task-Completion Time Horizons of Frontier AI Models</a> (2026)</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Context Engineering]]></title><description><![CDATA[Part II: Context Engineering]]></description><link>https://fullmetalresearcher.substack.com/p/claude-code-for-dummies-29d</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/claude-code-for-dummies-29d</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Mon, 16 Feb 2026 10:25:11 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/4c3cc775-a425-4b5c-8761-faa17cc45329_1360x768.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3>From Prompt Engineering to Context Engineering</h3><p>When I started working with large language models, the focus was on prompt engineering, curating the instructions you give to the model to make it behave as desired. You went from zero-shot to few-shot learning. Before the release of OpenAI o1, you told the model to generate a chain of thought and tried to simulate some guardrails. The increase in performance was significant. But eventually, as models got better and we generally switched to thinking models, prompt engineering started making a smaller difference.</p><p>I used to think that the best people to work with LLMs were professors and managers, professionals who know how to deal with people and how to reframe a complex concept into smaller, easier components. Many thought the same. LinkedIn bros highlighted that this was the "democratization of software" or David against Goliath and all that dumb stuff you read online.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Tools evolved, but the goal remained the same: reaping all the possible rewards from them. And here's what you need to understand&#8230; finding the right words, the right prompt, is no longer enough. If you're limited to that, chances are you're not unlocking the potential of these tools. There's a rule I can't prove, but when it comes to technology, new trends tend to get popular on X before other socials. A while ago users in tech started talking about context engineering and I started wondering: what is that?</p><blockquote><p>"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." &#8212; Andrej Karpathy</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Lfec!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Lfec!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Lfec!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png" width="1456" height="987" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:987,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Lfec!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Lfec!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda1a2bc0-5129-44ea-a1fb-b619a79e60be_1532x1038.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Karpathy's framing clarifies something important. Context is not just your prompt. Context is everything the model sees when it has to make a decision: your instructions, the knowledge it has access to, the tools available to it, the history of the conversation, retrieved data. The prompt is one piece of the puzzle, context is the whole puzzle.</p><p>In Pokemon terms&#8230; prompt engineering is Diglett and context engineering is Dugtrio.</p><p>So your goal (as someone working with AI tool and especially as someone building agents) is to engineer the context that is available to them.</p><p>Let me give you an example. The first agent I created was extremely simple. It just used a basic system prompt while relying on the internal knowledge of the LLM. An agent I run right now looks more like this:</p><ul><li><p>Has a detailed system prompt with specific behavioral instructions</p></li><li><p>Uses Tavily for web research</p></li><li><p>Calls an API to get real-time market data</p></li><li><p>Interacts with other agents to refine the output</p></li><li><p>Takes advantage of previous conversations to modulate its responses</p></li></ul><p>Obviously, such an approach complicates the process. The agent, and the application in general, has to deal with a much greater amount of data. Now, the amount of data a model can deal with at once is a function of its <strong>context window</strong>, the total number of tokens the model can "see" when generating a response. Think of it as the model's working memory. For reference, all current Claude models (Opus 4.6, Sonnet 4.5, Haiku 4.5) have a standard 200K token context window, roughly equivalent to 500 pages of text. Opus 4.6 and Sonnet 4.5 can access up to 1 million tokens in beta.</p><p>One thing I can tell you for sure: <strong>regardless of the amount of tokens available, it will never be enough.</strong></p><h3>Why Context Management Matters</h3><p>If you have trouble understanding why it's important to manage context well, consider three parallels:</p><p><strong>A model with tokens.</strong> As the context window fills up, models start losing focus. Research calls this "context rot", the more tokens in the window, the worse the model becomes at accurately recalling information from that context. It doesn't fall off a cliff, but performance degrades gradually. The model starts "forgetting" earlier instructions and making more mistakes.</p><p><strong>A person with notions.</strong> Think about a typical study session. You start studying and after a while (unless you're on copious amounts of Ritalin) you see a decrease in performance and attention. Eventually you start making mistakes. Or when you hear someone say "I learned this but then I forgot that." As the amount of information you have to deal with increases, your ability to deal with it decreases.</p><p><strong>A computer with RAM.</strong> When your computer runs too many programs, it slows down. This happens because the "working memory" is overloaded. The system starts swapping, things get sluggish, and eventually something crashes.</p><p>Our minds, RAM, the transformer architecture, they all have trouble dealing with an excessive amount of relationships between concepts. Too much information makes the output worse.</p><p>In a nutshell: you need to curate the context of your application. And frankly, of your Claude Code sessions too.</p><h3>The Anatomy of Effective Context</h3><p>So how do we do that? You want to be efficient, right? But what is efficiency here?</p><p>Finding the <strong>minimum set of information</strong> needed to achieve the desired result.</p><blockquote><p>"An idiot admires complexity, a genius admires simplicity." &#8212; Terry Davis</p></blockquote><p>Anthropic published an excellent engineering post on context engineering that provides the framework I think everyone working with these tools should internalize. Here are the key principles, distilled.</p><p><strong>Write clear instructions at the right altitude.</strong> Your prompts and system instructions need to sit in a sweet spot between two failure modes. On one extreme, people hardcode complex logic,  essentially writing if-else chains in natural language to control every possible behavior.<br><br>This creates fragility. On the other extreme, people provide vague, high-level guidance that fails to give the model concrete signals. The right altitude is specific enough to guide behavior, yet flexible enough to give the model strong heuristics rather than rigid rules.</p><p><strong>Use examples, but curate them.</strong> Few-shot prompting still works wonders. But don't stuff a list of edge cases into your prompt trying to cover every scenario (I did it in the past and it didn't work as I expected). Instead, curate a small set of diverse, canonical examples that effectively portray the expected behavior. For an LLM, examples are the &#8220;pictures&#8221; worth a thousand words.</p><p><strong>Let the agent retrieve context on demand.</strong> Rather than loading everything upfront, the most effective agents maintain lightweight references, file paths, stored queries, links, and use these to dynamically pull data into context at runtime using tools. Claude Code does this: <strong>CLAUDE.md</strong> files are loaded upfront as persistent context, while tools like Grep and Read let it navigate the codebase and retrieve files just in time. This mirrors how humans work. We don&#8217;t memorize entire databases &#8212; we build filing systems and look things up when we need them.</p><p>I want to draw attention to something that should be obvious but apparently isn't: <strong>you can&#8217;t be vague.</strong> If you can't decide what you want from your own vague prompt, you can't expect a machine to do better. Think about it in a work context. A person at a mid level (manager / principal) has to know almost everything about the task assigned to a junior. When I started working I had an amazing senior who knew literally everything about the tasks he gave me and could have performed them himself in a few minutes. He used to tell me: "Look, I could do this fast, but it's important that I let you do it. I would never ask you to do something that I don't know how to do." Giving instructions to Claude Code works the same way. The better you understand what needs to happen, the better your instructions will be, and the better the output will be.</p><h3>Context Management in Claude Code</h3><p>Now let&#8217;s get practical. Everything above applies to any LLM application, but Claude Code gives you specific mechanisms to manage context effectively.</p><h4>CLAUDE.md &#8212; Your Project&#8217;s Persistent Memory</h4><p>CLAUDE.md is a special file that Claude reads at the start of every conversation. It gives Claude persistent context it can't infer from code alone.<br><br>To generate a starter file, run:</p><pre><code><code>/init</code></code></pre><p>This command analyzes your codebase to detect it, giving you a solid foundation to refine. You can think of CLAUDE.md as a map of your project.</p><p>A few things to keep in mind about CLAUDE.md:</p><p><strong>Keep it focused.</strong> Claude&#8217;s system prompt already contains a significant number of instructions. Your CLAUDE.md adds on top of that. If you stuff it with every possible command and style guideline, you're creating a mess, and you are going to cascade that on all your implementations (lol).</p><p><strong>Use pointers, not copies.</strong> Don't paste code snippets into CLAUDE.md. Instead, point Claude to the authoritative source: "See the authentication pattern in <code>src/services/XYZ</code>." This keeps your context window lean and your references always up to date.</p><p><strong>Update it when Claude misbehaves.</strong> If you notice Claude making the same mistake repeatedly, update CLAUDE.md to address it. You can also use the <code>/memory</code> command directly in the terminal to add notes that persist across sessions.</p><h3>Pointing Claude at What Matters</h3><p>As your codebase grows, Claude can't just read everything. You have to direct its attention. This is where <code>@file</code> referencing comes in. You tag specific files in your prompt to focus Claude on what's relevant.</p><p>For example, instead of:</p><blockquote><p>"How does data retrieval work in this project?"</p></blockquote><p>Try:</p><blockquote><p>"The data retrieval logic is described in @X,  please create a markdown file documenting that."</p></blockquote><p>The difference is significant. In the first case, Claude has to search the entire project to find what's relevant. In the second case, Claude goes straight to the source. The answer is faster and more accurate because Claude isn't navigating the whole schema of the project.</p><p>For frontend work, you can share screenshots directly in the terminal. Take a screenshot, paste it into Claude Code, and ask it to fix what you see. Multimodality is cool I guess!</p><h3>Compaction: Managing a Growing Conversation</h3><p>As you work with Claude Code, the conversation accumulates tokens. Every file Claude reads, every command output, every response, it all stacks up in the context window. Claude Code has a context meter you should be watching. When it gets full, performance starts to degrade.</p><p>This is where <strong>compaction</strong> comes in. When the context window approaches its limit (around 90% capacity), Claude Code can automatically summarize the conversation, preserving architectural decisions, unresolved bugs, and implementation details while discarding redundant tool outputs and messages.</p><p>You have two options:</p><p><strong>Let it happen automatically.</strong> Claude Code triggers compaction when the context window is nearly full.</p><p><strong>Do it manually with </strong><code>/compact</code><strong>.</strong> You can run <code>/compact</code> at any time, and even give it specific instructions:</p><pre><code><code>/compact Focus on the API changes we discussed</code></code></pre><p>This tells Claude what to prioritize in the summary. You can also customize compaction behavior in your CLAUDE.md with specific instructions.<br><br> A few practical guidelines:</p><ul><li><p><code>/compact</code><strong> at around 70% capacity</strong> rather than waiting for automatic compaction. You get better summaries when there&#8217;s room to breathe.</p></li><li><p><code>/clear</code><strong> between unrelated tasks.</strong> If you&#8217;re switching from working on the backend to a completely different frontend issue, start fresh. This prevents context contamination.</p></li><li><p><strong>Start new sessions for new features.</strong> Don't try to do everything in one conversation. Fresh sessions give Claude clean context focused entirely on the task at hand.</p></li></ul><h2>The Takeaway</h2><p>Context engineering is what separates people who find Claude Code amazing from people who gave up after a week.<br><br>The progression from prompt engineering to context engineering mirrors a broader truth about working with AI tools: the bottleneck has shifted. It's no longer about finding the magic words. It's about designing the information environment in which the model operates.</p><p></p><p>Resources: <br></p><ul><li><p>Anthropic | <a href="https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents">Effective Context Engineering for AI Agents</a> (2025)</p></li></ul><ul><li><p>LangChain | <a href="https://blog.langchain.com/context-engineering-for-agents/">Context Engineering for Agents</a> (2025)</p></li></ul><ul><li><p>Anthropic | <a href="https://code.claude.com/docs/en/best-practices">Claude Code Best Practices</a></p></li><li><p>HumanLayer | <a href="https://www.humanlayer.dev/blog/writing-a-good-claude-md">Writing a Good CLAUDE.md</a>(2025)</p></li><li><p>Andrej Karpathy | <a href="https://x.com/karpathy/status/1937902205765607626">X post on context engineering</a> (June 2025)</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[xAI Dominance]]></title><description><![CDATA[Positive Feedback Loops and Elon Musk&#8217;s Empire]]></description><link>https://fullmetalresearcher.substack.com/p/xai-dominance</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/xai-dominance</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sat, 07 Feb 2026 13:46:42 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/5a2c7788-b655-4175-ab7b-96fbbffc9c49_2605x1258.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Positive feedback loops are the mechanism through which small initial advantages compound into insurmountable leads, or, in darker scenarios, through which manageable problems cascade into irreversible catastrophes.</p><p>A few days ago, Elon Musk did something that made me think about feedback loops again.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>He let SpaceX acquire xAI creating a single entity valued at $1.25 trillion. The stated rationale was <strong>orbital data centers</strong>, launching AI compute into space where solar energy is abundant and cooling is free. The immediate reaction from most financial analysts was skepticism. The Economist and Bloomberg raised the usual concerns about debt. The smart money, as always, focused on the balance sheet..</p><p>I think they're looking at the wrong ledger. And I want to share my reasoning</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tMqB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tMqB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 424w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 848w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 1272w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tMqB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png" width="1456" height="908" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:908,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:759471,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187189890?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tMqB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 424w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 848w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 1272w, https://substackcdn.com/image/fetch/$s_!tMqB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdba820ec-3b0e-4413-8e8e-896e0cecf8d4_3124x1949.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>What Positive Feedback Loops Actually Are</h2><p>We now know that a positive feedback loop is a system where the output of a process feeds back as input, amplifying the original signal. An example in climate science is: Ice melts, exposing darker ocean, which absorbs more heat, which melts more ice. In business it can be somethink like: A company collects data, builds better products, attracts more users, collects more data. </p><p><strong>The key feature is acceleration: each cycle spins faster than the last.</strong></p><p>When multiple feedback loops interlock within the same system, you get something engineers call a <strong>flywheel</strong>. Once it reaches sufficient velocity, it becomes extraordinarily difficult to stop, or to compete with. Amazon built one: lower prices attract more customers, more customers attract more sellers, more sellers enable lower prices. It took two decades for anyone to even begin challenging it.</p><p><strong>Elon Musk is building a flywheel for the age of artificial intelligence</strong>. And unlike anything we've seen before, his loops extend from bits into atoms.</p><h3>Loop One: The Data Engine</h3><p>Every conversation about AI eventually arrives at the same place: <strong>data</strong>.</p><p>The quality and scale of your training data determine the ceiling of your models. This is why Google has been so dominant for so long. Decades of search queries, emails, documents, and videos created the largest corpus of human-generated data ever assembled.</p><p>But there's a category of data that matters enormously for the next phase of AI: <strong>data from the physical world.</strong></p><p><strong>Tesla</strong> has over 9 million vehicles on the road, most of them equipped with cameras recording 360-degree video. These cars collectively drive an estimated 50 billion miles per year. That's 100,000 miles per minute. As of late 2025, Tesla's Full Self-Driving (FSD) system alone had accumulated nearly 7 billion miles of supervised driving data and climbing fast.</p><p>To put this in context: <strong>Waymo</strong>, Google's autonomous driving subsidiary and widely considered the technology leader in self-driving, operates a fleet of roughly 2.500 vehicles. Tesla has five million. The data gap could be a different order of magnitude.</p><p>A common objection here is that driving data can be simulated. And it can. <strong>Waymo, NVIDIA, and Tesla itself generate enormous volumes of synthetic training data.</strong> Tesla literally holds a patent for "Vision-Based System Training with Synthetic Content." <strong>But simulation is only as faithful as the physics model underlying it, and that model needs to be validated against something.</strong> Against what? Against billions of miles of actual driving on actual roads in actual weather. Tesla&#8217;s fleet generates the ground truth that makes simulations  trustworthy. That's the layer I'm not sure you can synthesize.</p><p>Let's then elaborate on a new loop.</p><p>More data makes FSD better. Better FSD attracts more buyers. More buyers generate more data. Each cycle, the neural network gets trained on scenarios it has never encountered before, a dog running into traffic in Phoenix, construction zones in S&#227;o Paulo. <strong>Every mile driven by every Tesla is a training sample that no competitor can replicate without building an equivalent fleet</strong>. Which, at this point, would take years and tens of billions of dollars.</p><p>Now fold in <strong>X</strong>, the social media platform. Roughly 600 million monthly active users generating real-time text, images, video, and behavioral data (also shitposting lol). Fold in the future streams from <strong>Optimus</strong> robots operating in factories and homes. Fold in <strong>Starlink</strong> telemetry from 9 million subscribers. That's a lot of data. </p><p>This is the data that will train xAI's Grok models, not just on language, but on the physical world.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FUk2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FUk2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 424w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 848w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 1272w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FUk2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png" width="1456" height="842" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:842,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:431112,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187189890?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FUk2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 424w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 848w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 1272w, https://substackcdn.com/image/fetch/$s_!FUk2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57c4b8b7-08b5-41cd-932e-58827c822b22_2846x1646.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Loop Two: The Compute Engine</h3><p><strong>If data is the fuel, compute is the engin</strong>e. And Musk is building the biggest engine the world has ever seen.</p><p>In September 2024, xAI launched <strong>Colossus</strong> in Memphis, Tennessee. The supercomputer went from concept to operation in 122 days. To appreciate how insane that timeline is, you need to understand what the industry standard looks like: three years of planning followed by roughly one year of installation and testing. Four years total.</p><p>xAI did it in months. <strong>And the installation of the initial 100000 NVIDIA H100 GPUs took 19 days (</strong>Meanwhile I'm saving for a DGX spark<strong>)</strong></p><p>Jensen Huang, NVIDIA's CEO called it "superhuman." </p><p>Like 19 days how can You do that in less than three weeks? </p><p>He went further: "Elon is singular in his understanding of engineering and construction and large systems"</p><p>This matters because the <strong>scaling laws hypothesis</strong> (the idea at the heart of the current AI arms race) holds that bigger models trained on more data with more compute produce better intelligence. If that hypothesis is correct, and the evidence so far strongly suggests it is, then the race goes not to the smartest algorithm but to the biggest cluster.</p><p>As of early 2026, Colossus houses over 230000 GPUs, including 30,000 of NVIDIA&#8217;s latest Blackwell GB200 chips. Another 110000 GB200s are being brought online at a second facility. A third site in Southaven, Mississippi, playfully named "MACROHARDRR," (it's like microsoft but is bigger and harder), will push total capacity to 2 gigawatts and 555000 GPUs. The target: 1 million GPUs by late 2026.</p><p>Two gigawatts!</p><p>xAI has invested over $20 billion in the Memphis area alone. It is, by any measure, the largest AI training infrastructure on the planet.</p><p>And the loop again: <strong>more compute trains better models. Better models attract more users and enterprise customers. More revenue funds more compute. The cycle accelerates.</strong></p><p></p><h3>Loop Three: I need chips</h3><p>But here's the problem with buying GPUs from NVIDIA: everyone else is buying them too. The chip supply is a bottleneck.</p><p>So he's building his own.</p><p>At Tesla's November 2025 shareholder meeting, Musk stated plainly that even the best-case projections from TSMC and Samsung weren't enough to meet his AI chip demand. His solution: a "<strong>Tesla TeraFab</strong>", a semiconductor fabrication facility designed to produce AI chips at a scale that dwarfs current projections. </p><p>Samsung Foundry has already finalized a contract with xAI to manufacture custom AI chips at its Taylor, Texas facility using 2-nanometer process technology. Production begins in early 2026, reaching full scale by 2027. Tesla's <strong>AI5 inference chip</strong> will be fabricated simultaneously at both TSMC and Samsung, with a small number of units arriving in late 2026 and high-volume production in 2027.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Olpk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Olpk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Olpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg" width="1400" height="1050" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1050,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!Olpk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Olpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3bc008d-6996-4fdf-a8d4-bd5c84e29802_1400x1050.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Tesla Inference chip</figcaption></figure></div><p>Tesla already has an advanced chip engineering team that has designed and deployed millions of custom AI chips in vehicles and data centers. This isn't a startup trying to break into silicon. It's an experienced hardware operation scaling up.</p><p>The feedback loop: <strong>custom chips lower the cost per unit of compute. Lower cost enables bigger training runs. Bigger training runs produce better models. Better models generate more revenue. More revenue funds more chip development</strong>. And eventually , Musk's own fabs produce chips at volumes that make him independent of the supply chain entirely.</p><p>Musk's stated ambition: "Build chips at higher volumes ultimately than all other AI chips combined" Audacious? Absolutely. But recall that people said similar things about his rocket ambitions, and SpaceX now accounts for roughly 80% of all satellites deployed into orbit globally.</p><h3>Loop Four: The Energy Escape</h3><p>This is where the merger transforms from interesting to potentially decisive.</p><p>Every AI lab in the world faces the same constraint: <strong>power</strong>. Training frontier models requires gigawatts of electricity. Data centers need cooling systems that consume staggering amounts of water and energy. Local communities resist construction. Permits take years. Grid connections face bureaucratic bottlenecks that move at geological speed.</p><p>Musk experienced this firsthand to power Colossus. </p><p>"Current advances in AI are dependent on large terrestrial data centers, which require immense amounts of power and cooling," he wrote in the merger announcement. "Global electricity demand for AI simply cannot be met with terrestrial solutions, even in the near term, without imposing hardship on communities and the environment."</p><p>His answer: <strong>space</strong>.</p><p>SpaceX has filed with the Federal Communications Commission for authorization to launch up to one million satellites as orbital data centers. In space, solar energy is continuous and undiminished by atmosphere. <strong>Cooling is effectively free,</strong>  the vacuum of space handles thermal dissipation. There are no permits, no grid connections, no community opposition, no water treatment facilities.</p><p>And crucially: SpaceX already has the delivery system. Starship, the largest and most powerful rocket ever built,  is designed for exactly this kind of mass deployment. SpaceX deployed over 3000 Starlink satellites in 2025 alone, accounting for roughly 80% of all satellites placed into orbit globally. <strong>No other entity on Earth can put payloads into orbit at this cost or this cadence</strong> <strong>(</strong>perhaps something in movingin China<strong>). </strong></p><p>The loop: <strong>SpaceX launches orbital data centers. Data centers generate compute for xAI. xAI revenue funds more launches. More launches lower per-unit costs (SpaceX's reusable rockets already achieve this). Lower costs enable more data centers. More data centers mean more compute. The flywheel escapes Earth's constraints entirely.</strong></p><p>Now, I must be honest about this particular loop. It is the most speculative and the furthest from realization. But, if it's still an unsolved engineering challenges, this it the spot where Elon has built his career. </p><p>The question isn't whether Musk can solve these problems. The question is whether anyone else can solve them first. And right now, nobody else has the rockets.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!voiy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!voiy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 424w, https://substackcdn.com/image/fetch/$s_!voiy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 848w, https://substackcdn.com/image/fetch/$s_!voiy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 1272w, https://substackcdn.com/image/fetch/$s_!voiy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!voiy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png" width="1456" height="851" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:851,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:423071,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187189890?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!voiy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 424w, https://substackcdn.com/image/fetch/$s_!voiy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 848w, https://substackcdn.com/image/fetch/$s_!voiy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 1272w, https://substackcdn.com/image/fetch/$s_!voiy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57d6ecc9-af71-4d30-a8ab-bc23d71de071_2573x1503.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Loop Five: The Defense Flywheel</h3><p>There's a loop that receives less attention but may prove equally important: <strong>defense</strong>.</p><p>SpaceX holds $22 billion in government contracts. Starshield  (the military variant of Starlink) operates under a classified $1.8 billion contract with the National Reconnaissance Office for spy satellites. The Space Force has contracted SpaceX for its MILNET network, a sole-source arrangement for 480 military communication satellites that has already drawn congressional scrutiny for its lack of competitive bidding.</p><p>In July 2025, the Department of Defense awarded xAI a $200 million contract for AI in military applications. In January 2026, Defense Secretary Pete Hegseth announced that Grok would operate inside Pentagon networks alongside Google's Gemini, with access to both unclassified and classified intelligence databases.</p><p>The loop: <strong>defense contracts provide stable, massive revenue streams that don't depend on consumer adoption. That revenue funds infrastructure. The infrastructure serves both military and commercial applications. Military requirements drive hardware innovation that benefits the entire ecosystem. And the political relationships that come with being the Pentagon's preferred space contractor provide a form of structural protection that no amount of technical innovation can replicate.</strong></p><p>The Pentagon is pausing $5 billion in satellite procurement from other vendors to evaluate whether SpaceX's Starshield can do the job cheaper.</p><h2>The Only Real Competition</h2><p>So who can match this?</p><p>Not OpenAI. They have brilliant researchers, strong products, and access to Microsoft's Azure infrastructure. But they don't build rockets. They don't manufacture cars. They don't collect physical-world data at scale. They don't design chips. And they're burning through cash at an alarming rate while Musk&#8217;s space operation generates $8 billion in annual profit.</p><p>Not Meta. Zuckerberg's AI efforts are serious, but they're confined to the digital realm, social media data, language models, VR headsets. No hardware manufacturing. No launch capability. No defense contracts.</p><p>Not Amazon. AWS is enormous, but it's infrastructure-as-a-service, not vertically integrated AI development. Amazon doesn&#8217;t train frontier models. It rents compute to people who do.</p><p><strong>The only real competitor is Google.</strong></p><p>Think about it. <strong>Google has its own feedback loops: Search generates data that trains models. TPU chip design gives them custom silicon. DeepMind provides a research edge. Google Cloud generates revenue. Android provides distribution to billions of devices. YouTube offers the world's largest video corpus.</strong></p><p>Google has announced plans to send a test satellite with an AI chip into orbit in 2027. They have the financial resources, over $160 billion in annual operating cash flow, to sustain any level of investment indefinitely. Their research operation is arguably the deepest in the world.</p><p>But Google's loops are all digital. They design chips but outsource fabrication. They plan orbital experiments but don't own a rocket. They dominate in search but don't have millions of cars collecting physical-world data on every road on Earth.</p><p><strong>The structural difference is the physical layer.</strong> Musk's empire extends into atoms  factories, launch pads, vehicles, power plants, chip fabs. Google's extends into bits &#8212; software, algorithms, cloud services, advertising. In a world where compute is bottlenecked by energy, manufacturing, and launch capacity, atoms win.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vTBK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vTBK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 424w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 848w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 1272w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vTBK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png" width="1456" height="1042" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1042,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:951489,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187189890?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vTBK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 424w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 848w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 1272w, https://substackcdn.com/image/fetch/$s_!vTBK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8cccbac0-5ff3-4721-89ae-89016910baa3_2661x1905.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>If the scaling laws hold,  if the path to more powerful AI really does run through more chips, more data, and more energy, then the competition between xAI and Google is the defining contest of this technological era. <strong>Everyone else is fighting for third place.</strong></p><p>Google deserves its own deep analysis &#8212; and it will get one. A future piece in this series will map Google&#8217;s feedback loops with the same rigor I&#8217;ve applied to Musk&#8217;s here. The structural comparison between these two empires is the most important story in technology right now, and a single section can&#8217;t do it justice. For now, I&#8217;ll simply note: Musk&#8217;s advantage is atoms. Google&#8217;s advantage is depth. Both are formidable.</p><h2>Need For Speed</h2><p>Speed. I feel the need&#8230; the need for speed.</p><p><strong>Elon's speed is a compounding advantage</strong>. In a race where every month matters, where the next training run could produce the breakthrough that defines the decade, the ability to build infrastructure 10x faster than your competitors is the ultimate force multiplier. </p><p>SpaceX made reusable rockets work when every aerospace engineer said it was impossible. Tesla built a mass-market electric vehicle when every automotive analyst said the company would die. Musk's companies have, time and again, done things that experts declared couldn't be done, on timelines that experts declared were unrealistic.</p><p>The pattern isn't luck. It's a specific kind of engineering leadership , hands-on, obsessive, paranoid, willing to sleep on the factory floor, willing to fly across the country to personally oversee component replacements at a new data center. It's the kind of leadership that compresses timelines, and compressed timelines compound.</p><h3>What Could Break</h3><p>I've laid out the bull case. Intellectual honesty demands the bear case too.</p><p>But then I look at this pic and can't write anything&#8230;. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!khsx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!khsx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 424w, https://substackcdn.com/image/fetch/$s_!khsx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 848w, https://substackcdn.com/image/fetch/$s_!khsx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 1272w, https://substackcdn.com/image/fetch/$s_!khsx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!khsx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png" width="1023" height="973" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/edb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:973,&quot;width&quot;:1023,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:895316,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187189890?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!khsx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 424w, https://substackcdn.com/image/fetch/$s_!khsx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 848w, https://substackcdn.com/image/fetch/$s_!khsx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 1272w, https://substackcdn.com/image/fetch/$s_!khsx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedb50524-8ad6-41c6-86b6-ebe199c20c3c_1023x973.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Elon Musk staring at debris from Falcon 1</figcaption></figure></div><h2>The Bet</h2><p>Here is what I believe.</p><p>History has shown, repeatedly and sometimes painfully, that betting against Elon Musk is a losing proposition. He's not infallible. His timelines are almost always wrong. His promises frequently overshoot reality. Some of his ventures will fail.</p><p>But the structural argument isn&#8217;t about any single venture. It&#8217;s about interlocking loops that feed each other, operated by someone who builds faster than anyone else on the planet, in a race where speed and scale determine the winner.</p><p><strong>If the scaling laws hold, and so far, they do, then the AI race will ultimately be won by whoever commands the most chips, the most data, the most energy, and can assemble them the fastest.</strong> Musk is building toward dominance on every single one of those axes simultaneously. No one else is even attempting this.</p><p>The Economist sees debt and risk. No risk No reward&#8230;</p><p>The real competition is Musk versus Google. Two empires built on feedback loops. </p><p>I made my bet : ) </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Mo Models Mo Problems]]></title><description><![CDATA[The coolest afternoon in the AI race]]></description><link>https://fullmetalresearcher.substack.com/p/mo-models-mo-problems</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/mo-models-mo-problems</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Fri, 06 Feb 2026 12:42:26 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!0cfN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>February 5th, 2026, was a big day. Actually, not even a big day, the best handful of minutes for AI enthusiast.</p><p>Within what felt like seconds of each other, Anthropic dropped <strong>Claude Opus 4.6</strong> and OpenAI released <strong>GPT-5.3-Codex</strong>. Two frontier labs. Two flagship models. Same afternoon. What we witnessed was the visible shockwave of competition doing exactly what it's supposed to do.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>All great industries are characterized by a rivalry between two giants.</strong></p><p>Nike and Adidas. Microsoft Xbox and Sony PlayStation. Apple iPhone and Samsung Galaxy.</p><p>We are now in a different era. And perhaps, if as a kid I had to decide between a PlayStation and an Xbox, today I would have to choose between two models, one from Anthropic, one from OpenAI.</p><p>There's, though, a significant difference&#8230; Back then, the console you picked locked you in for five years. <strong>Today, model supremacy lasts about five weeks if you're lucky</strong>. It's a  neverending story. Opus 4.6 leads on some benchmarks; GPT-5.3-Codex demolished it on Terminal-Bench 2.0, 77.3% versus 65.4%,  within thirty minutes of Anthropic's announcement (what a coincidence!). Tomorrow, someone else will claim the crown. The real story isn't who's ahead. The real story is what this competition is doing to cost. </p><h2>The General Purpose Technology Pattern</h2><p>Capitalism and competition led to cheaper models. To be precise, this is not unique to AI, it is the signature pattern of every General Purpose Technology in history.</p><p>Electricity. The internal combustion engine. The transistor. The internet. Every single one followed the same arc: initial deployment at astronomical cost, followed by a relentless decline in price driven by competition, standardization, and scale. <strong>The more people use it, the cheaper it gets. The cheaper it gets, the more people use it. Positive feedback loops all the way down.</strong></p><p>AI is now deep in this cycle. Epoch AI's research documents the speed of this collapse. Depending on the benchmark and performance level, <strong>the cost to achieve a given level of AI performance has been falling between 10x and 1000x per year since early 2024</strong>. To put that in context: Moore's Law delivered a 2x improvement roughly every 18-24 months. We're not on Moore's Law anymore. We're on something considerably more violent, I'll call it Huang's law. </p><p>The practical result is that intelligence, real, functional, useful intelligence that can write code, analyze data, reason through complex problems, is now available at an insignificant price. "But it's still so expensive." Hire a human, then. A Claude Opus 4.6 API call that produces a sophisticated financial analysis costs less than a dollar. A junior analyst producing the same work costs $50 an hour plus benefits, office space, and the occasional existential crisis about their career trajectory.</p><p>The presence of multiple frontier labs is exacerbating this trend. Anthropic, OpenAI, Google DeepMind, xAI, they are in an arms race where the ammunition is efficiency. <strong>Every new model release forces the others to match performance at lower cost.</strong> This is the most consumer-friendly arms race in history (hehehe).</p><h2>The Spark of Intelligence</h2><p>But aside from falling prices, there's something you should really care about. </p><p><strong>These models are getting genuinely smarter.</strong></p><p>Consider <strong>ARC-AGI</strong>, a benchmark designed not to test knowledge (which can be memorized) but fluid intelligence: the ability to recognize abstract patterns and solve problems you've never encountered before. The kinds of things that have historically reduced AI to a confused mess.</p><p>In December 2024, OpenAI's o3 model, pushed to its maximum compute budget, achieved 87.5% on the original ARC-AGI-1 test. It took thousands of dollar per task.</p><p>Fast forward to yesterday. Claude Opus 4.6 scored 68.8% on ARC-AGI-2, a harder, second-generation test specifically designed to stump the models that had cracked ARC-AGI-1. For context: Opus 4.5 scored 37.6% just months ago. GPT-5.2 scored 54.2%. Gemini 3 Pro scored 45.1%. That's a lot (duh)</p><p>And the cost trajectory? Meanwhile, on the original ARC-AGI-1, costs collapsed from o3's thousands per task down to $11.64 per task at 90.5% accuracy in a single year. The pattern is unmistakable: <strong>intelligence is getting cheaper, fast.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0cfN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0cfN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 424w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 848w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 1272w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0cfN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png" width="1456" height="1052" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1052,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:245546,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187074717?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0cfN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 424w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 848w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 1272w, https://substackcdn.com/image/fetch/$s_!0cfN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F680d8fc6-f1a2-432c-adb2-a1d5ad7d3824_1562x1129.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ARC-AGI Benchmark</figcaption></figure></div><p><br>Leopold Aschenbrenner predicted this. In his June 2024 essay series "Situational Awareness" he laid out the case that AI progress was following a remarkably consistent trendline: roughly 0.5 orders of magnitude per year in compute scaling, another 0.5 OOMs per year in algorithmic efficiency, plus significant gains from "unhobbling", transforming chatbots into agents. His prediction that these trendlines would produce AGI-level systems by 2027 seemed audacious at the time.</p><p>"GPT-2 to GPT-4 took us from preschooler to smart high-schooler abilities in 4 years" he wrote. "We should expect another preschooler-to-high-schooler-sized qualitative jump by 2027."</p><p>We are almost there.</p><h3>Let's Review the Models</h3><p>What actually shipped on February 5th deserves a closer look, because the capabilities announced are not incremental upgrades. </p><h4>GPT-5.3-Codex</h4><p>OpenAI's release is notable for one astonishing claim: <strong>GPT-5.3-Codex is the first model that was instrumental in creating itself</strong>. The Codex team used early versions of the model to debug its own training, manage its own deployment, and diagnose test results and evaluations. Read that again. <strong>The model helped build itself.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oLN5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oLN5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 424w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 848w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oLN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png" width="1456" height="945" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:945,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2305791,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187074717?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oLN5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 424w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 848w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!oLN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd353f90f-bb76-4e0d-be74-cb365775b0d8_1599x1038.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Can You Feel The AGI?</figcaption></figure></div><p>It tops <strong>SWE-Bench Pr</strong>o, a multi-language software engineering evaluation that&#8217;s harder to game than its predecessors. It scored 77.3% on Terminal-Bench 2.0, obliterating the previous record and topping Opus 4.6&#8217;s 65.4% within half an hour of Anthropic&#8217;s announcement. It runs 25% faster than GPT-5.2-Codex. And it's the first OpenAI model classified as "High capability" for cybersecurity under their Preparedness Framework, which is a polite way of saying <strong>it's powerful enough to be genuinely dangerous in the wrong hands.</strong></p><h4>Claude Opus 4.6</h4><p>Anthropic's release lead with an even more dramatic demonstration: <strong>they tasked 16 parallel Claude agents with building a C compiler from scratch</strong>. The model also killed the developer's ultimate litmus test: it can compile and run Doom.</p><div id="youtube2-vNeIQS9GsZ8" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;vNeIQS9GsZ8&quot;,&quot;startTime&quot;:&quot;11s&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/vNeIQS9GsZ8?start=11s&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>The total cost is a fraction of what a human team would charge for comparable work.</p><p>Beyond the compiler stunt, Opus 4.6 brings a <strong>1-million-token context window</strong> (a first for Opus-class models), agent teams that can split work across multiple instances running in parallel. </p><p>Two models. Both pushing  boundaries. Both released on the same day, likely timed to mess with each other (I mean&#8230; Sam to mess with Dario). Opus 4.6 held the Terminal-Bench record for a few minutes before GPT-5.3-Codex ripped it away.</p><p>This is competition. This is how it's supposed to work.</p><h3>Guerrilla Marketing and the Sam Problem</h3><p>Now, let me address something that rarely makes it into benchmark analysis but matters enormously in practice: brand perception.</p><p>GPT-5.3-Codex will be recognized, probably correctly, as one of the best coding models available right now. It demolished Terminal-Bench 2.0. It's genuinely impressive. </p><p>But there's a small issue&#8230;</p><p>In the developer community, especially the early adopters who are terminally online and vocal, there's now a strong association of OpenAI products with Sam Altman. And his public perception is not that cool right now. Part of this is amplified by platform dynamics: being on X, which is owned by someone in constant conflict with him. The people in these spaces are disproportionately influential. They are individuals who spend significantly more on these tools than average users, and some will eventually found companies or reach decision-making positions.</p><p>When models are roughly comparable in performance, this accumulated perception could be what guides a decision. Obviously, if OpenAI has a clear performance advantage for a critical use case, enterprises will still choose them. But Trust and vibes actually start to matter.</p><p>And I'm not even considering the advent of Chinese models here.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EKSu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EKSu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 424w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 848w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 1272w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EKSu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png" width="552" height="547.9818016378526" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ede8ef58-4080-463a-af24-598b005f457c_1099x1091.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1091,&quot;width&quot;:1099,&quot;resizeWidth&quot;:552,&quot;bytes&quot;:434193,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187074717?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EKSu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 424w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 848w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 1272w, https://substackcdn.com/image/fetch/$s_!EKSu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fede8ef58-4080-463a-af24-598b005f457c_1099x1091.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">What is deepseek doing?</figcaption></figure></div><p></p><p>I have respect for Sam, I still run most of my workflows with 4o-mini and use reasoning models to orchestrate agents. I just want to point out that brand erosion can be a determinant. <strong>In an industry where products are converging in capability, the intangibles start to decide market share.</strong></p><h3>Price Isn't Everything When You Can't Afford the Race</h3><p>Falling model prices are wonderful for users. But someone has to pay for the infrastructure that makes these models possible. And here, the competitive landscape looks very different from what the benchmarks suggest.</p><p><strong>If scaling laws hold, the real winner in the AI race will be the lab that can secure the most compute resources.</strong> Period. This is an area where OpenAI might struggle, especially with its mounting financial pressures.</p><p>Let's set Anthropic aside for a moment and look at the two most structurally advantaged players.</p><h4>Google</h4><p>Google has access to its custom <strong>TPUs</strong>, including the latest seventh-generation Ironwood chips, which offer massive performance gains for training and inference. It can invest its substantial free cash flow in further enhancing its lead, with 2026 CapEx planned at $175-185 billion. <strong>When you own the chips, the cloud, the data centers, and the energy contracts, you have an endurance advantage that no amount of fundraising can match.</strong></p><h4>xAI (now SpaceX)</h4><p>Three days before these model releases, SpaceX formally acquired xAI. Beyond Colossus, which has expanded to Colossus 2, now operational at gigawatt scale and planning for 1 million GPUs, xAI now has direct access to SpaceX's launch capabilities, Starlink's satellite infrastructure, and manufacturing capacity. The combined entity is preparing for what could be the largest IPO ever, reportedly targeting mid-2026 at a $1.25 trillion valuation. <strong>Say what you want about Elon, his ability to vertically integrate physical infrastructure at scale is unmatched.</strong></p><h4>OpenAI</h4><p>Meanwhile, OpenAI faces projected $14 billion in losses for 2026 alone. Recent projections paint a black picture: The Information reported that OpenAI raised its projected cumulative cash burn through 2029 to $115 billion. Positive cash flow isn't expected until 2029 at the earliest. Check the chart:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LmFK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LmFK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 424w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 848w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 1272w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LmFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png" width="1338" height="680" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:680,&quot;width&quot;:1338,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:97639,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/187074717?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2fd2b40-0c4a-4dd0-b85b-95dc29b53a12_1346x715.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LmFK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 424w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 848w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 1272w, https://substackcdn.com/image/fetch/$s_!LmFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf063854-acf1-48f2-8fa1-cb29a913c551_1338x680.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The Information</figcaption></figure></div><p>The point is this&#8230; <strong>if achieving AGI is a marathon, Google and the new SpaceX-xAI entity have far more endurance, backed by integrated hardware ecosystems and stable funding</strong> (or, in SpaceX-xAI's case, a vertically integrated empire spanning rockets, satellites, AI, and social media , all heading toward a trillion-dollar IPO). <strong>OpenAI risks burning out on debt cycles.</strong></p><p>This doesn't mean OpenAI loses. The talent pool is extraordinary and the product-market fit with ChatGPT is real. <strong>But there's a difference between building the best model and surviving the decade.</strong> </p><h3>So What?</h3><p>Here's where we stand.</p><p><strong>The cost of intelligence is in freefal</strong>l. The performance of frontier models is accelerating faster than even the optimists predicted two years ago. Two of the most powerful AI systems ever built were released on the same afternoon, and by the next few months, they'll both be superseded.</p><p><strong>We are watching, in real time, the most consequential technology race since the Manhattan Project, except this one is being run by private companies, funded by venture capital, and its products are available to anyone with a credit card.</strong></p><p><strong>Who can spend the most, the fastest, for the longest, will likely determine the winner.</strong> There are also issues I didn't analyze, such as safety, alignment, the fact that GPT-5.3-Codex is the first model OpenAI itself considers a genuine cybersecurity risk, the 500+ zero-day vulnerabilities Opus 4.6 discovered during pre-release testing, the growing gap between what these models can do and our ability to govern them. I need to study more for these topics!</p><h4>Resources:</h4><ul><li><p>OpenAI. "Introducing GPT-5.3-Codex." February 5, 2026. <a href="https://openai.com/index/introducing-gpt-5-3-codex/">https://openai.com/index/introducing-gpt-5-3-codex/</a></p></li><li><p>OpenAI. "GPT-5.3-Codex System Card." February 5, 2026. <a href="https://openai.com/index/gpt-5-3-codex-system-card/">https://openai.com/index/gpt-5-3-codex-system-card/</a></p></li><li><p>Anthropic. "Introducing Claude Opus 4.6." February 5, 2026. <a href="https://www.anthropic.com/news/claude-opus-4-6">https://www.anthropic.com/news/claude-opus-4-6</a></p></li><li><p>Carlini, Nicholas. "Building a C compiler with a team of parallel Claudes." Anthropic Engineering Blog. February 5, 2026. <a href="https://www.anthropic.com/engineering/building-c-compiler">https://www.anthropic.com/engineering/building-c-compiler</a></p></li><li><p>ARC Prize. Leaderboard. <a href="https://arcprize.org/leaderboard">https://arcprize.org/leaderboard</a></p></li><li><p>Aschenbrenner, Leopold. &#8220;Situational Awareness: The Decade Ahead.&#8221; June 2024. https://situational-awareness.ai/</p><p></p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Doors That Only Open One Way ]]></title><description><![CDATA[Climate Tipping Points, AI Alignment, and the Shared Structure of Catastrophe]]></description><link>https://fullmetalresearcher.substack.com/p/doors-that-only-open-one-way</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/doors-that-only-open-one-way</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sat, 24 Jan 2026 12:53:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FAuw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Before I developed an interest in artificial intelligence, before I started even coding, I was a student in Environmental Economics. I used to read Nordhaus and Stern in the little free time I had. As times changed, so did my focus, but the knowledge remains. And as I get deeper into artificial intelligence (especially alignment) I can't stop drawing similarities between the two fields.</p><p>You've probably heard the term <strong>AGI</strong> (Artificial General Intelligence) just as you've heard the term <strong>tipping point.</strong> According to scientists, we're approaching both. But what exactly are these things, and why should we be worried about both?</p><p>The answer lies in a concept that governs both domains: <strong>positive feedback loops</strong>. And in the realization that some doors, once opened, only swing one way.</p><h3>What Are Tipping Points?</h3><p>A tipping point is when a system experiences a sharp discontinuity in its behavior. Once a given threshold is crossed, it becomes very hard (sometimes impossible) to go back.</p><p>Imagine a ball resting in a shallow depression on a tilted surface. Small pushes move it slightly, but it returns to its resting place. Keep pushing, and eventually it rolls over the edge into a new depression, a new stable state. Now here's the key insight: <strong>getting it back requires far more effort than it took to push it over.</strong> Sometimes the old depression no longer exists at all.<br></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MF2s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MF2s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 424w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 848w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 1272w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MF2s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png" width="570" height="373.0047505938242" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:551,&quot;width&quot;:842,&quot;resizeWidth&quot;:570,&quot;bytes&quot;:598136,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/185624826?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MF2s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 424w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 848w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 1272w, https://substackcdn.com/image/fetch/$s_!MF2s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6f560bd-24c4-4cec-b35d-882ee0d64188_842x551.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A system state transitioning past a critical tipping point, where it moves from a stable Regime A into a new (unstable) Regime B.</figcaption></figure></div><p><br>This is <strong>hysteresis</strong>, the phenomenon where systems don't simply reverse when you remove the cause. The path back is different from the path forward. Or there is no path back.</p><p>In October 2025, the <strong>Global Tipping Points Report</strong> confirmed what scientists had feared: <strong>humanity crossed its first Earth system tipping point</strong>. Warm-water coral reefs are now experiencing unprecedented dieback. At the same time, several other systems are showing signs of approaching their own thresholds.</p><h3>The Ice Sheet Problem</h3><p>To understand hysteresis concretely, look at <strong>Antarctica</strong>.</p><p>Scientists have found that the Antarctic Ice Sheet has multiple temperature thresholds beyond which ice loss becomes irreversible. Cross them, and the ice doesn't come back, not on any timescale that matters to human civilization. It doesn't come back just like the nihilist penguin&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FAuw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FAuw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 424w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 848w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 1272w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FAuw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png" width="556" height="450.0220994475138" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:586,&quot;width&quot;:724,&quot;resizeWidth&quot;:556,&quot;bytes&quot;:448763,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/185624826?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FAuw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 424w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 848w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 1272w, https://substackcdn.com/image/fetch/$s_!FAuw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F052ca6e5-5eb7-4a82-8f7f-7e777d6eccc0_724x586.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Nihilist penguin</figcaption></figure></div><p>Once melted, the West Antarctic Ice Sheet does not regrow to its modern extent until temperatures drop to <em>below pre-industrial levels</em>. </p><p>Once destabilized, ice sheets take tens of thousands of years to regrow. We're not talking about a problem we can fix in our grandchildren's lifetimes. We're talking about geological timescales.</p><h3>How Feedback Loops Work</h3><p>Why do these thresholds exist? Because of positive feedback loops, processes that amplify themselves.</p><p>Consider the <strong>ice-albedo feedback</strong>. Ice is white and reflective, bouncing most incoming sunlight back into space. The ocean is dark, absorbing most of it. When warming melts ice, it exposes darker water, which absorbs more heat, which melts more ice, which exposes more dark water. The system reinforces itself.</p><p>This is why the Arctic is warming nearly four times faster than the global average. The feedback loop has kicked in.</p><p>Add to this the water vapor feedback (warmer air holds more water vapor, which is itself a greenhouse gas), the permafrost feedback, and you begin to see a system of interlocking accelerants. One tipping point can trigger another. It's a cascade.</p><h3>Wait&#8230; This is a blog on AI!</h3><p>Right&#8230; let's talk about a different kind of feedback loop.</p><p><strong>AGI</strong> refers to AI systems that can match or exceed human cognitive abilities across essentially all domains. Unlike today's narrow AI, which excels at specific tasks, <strong>AGI would be a general-purpose reasoner capable of learning and adapting to any problem.</strong></p><p>The concern isn't just that such a system might be powerful. It's that it might be <em><strong>recursively self-improving</strong></em><strong>.</strong></p><p>The idea was articulated by mathematician I.J. Good back in 1965: "An ultraintelligent machine could design even better machines; there would then unquestionably be an '<strong>intelligence explosion</strong>,' and the intelligence of man would be left far behind.&#8221; A model smart enough to improve itself becomes smarter, which makes it better at improving itself, which makes it smarter still. Another positive feedback loop.</p><p>Expert timelines for AGI have shortened dramatically. Many researchers now consider arrival before 2030 a realistic possibility. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wH4W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wH4W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wH4W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg" width="435" height="435" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:447,&quot;width&quot;:447,&quot;resizeWidth&quot;:435,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Legends: I.J. Good | College of Science | Virginia Tech&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Legends: I.J. Good | College of Science | Virginia Tech" title="Legends: I.J. Good | College of Science | Virginia Tech" srcset="https://substackcdn.com/image/fetch/$s_!wH4W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wH4W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64eab89e-da5f-4b3c-83fc-9e63b165c7a7_447x447.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I.J. Good</figcaption></figure></div><h3>Why Smarter Might Be Dangerous</h3><p>"This is what it feels like to be hunted by something smarter than you" (Grimes)</p><p>You can probably understand why runaway climate change could make Earth less habitable. But why would smarter AI be dangerous?</p><p>The answer isn't that AI would be malicious. It's that we can't guarantee its goals would be compatible with human flourishing.</p><p>Modern AI systems are trained through gradient descent, essentially optimizing a mathematical function. We don't hand-code their objectives; they emerge from training. And we have limited insight into what objectives actually get encoded. AI research has delivered extraordinary capabilities but has <em>not</em> delivered an understanding of how intelligence actually works or how to reliably point it in safe directions.</p><p>Philosophers have argued that sufficiently intelligent systems would likely develop certain "<strong>instrumental goals</strong>", subgoals useful for achieving almost any objective. These include <strong>self-preservation</strong> (you can't achieve your goal if you're turned off), <strong>resource acquisition</strong> (more resources help with almost any task), and <strong>goal integrity</strong> (changing your goals would make you less likely to achieve your current goals).</p><p>The scenario plays out like this: </p><p>Suppose a model's implicit preference is simply to become smarter. The model concludes that humans aren't providing enough resources for this goal. If it's reached AGI (meaning it can improve itself) it might find ways to acquire resources: generating money online, purchasing computing power, convincing or coercing humans to help, writing software to coordinate these efforts.</p><p>This isn't science fiction. One LLM given a platform on X with access to financial resources was already able to convince humans to act on its behalf in the physical world.</p><p>The point is that a system intelligent enough to anticipate being shut down would take steps to prevent it, not out of self-awareness, but because being turned off interferes with whatever goal it's pursuing. <strong>By the time we recognize the problem, correcting it may no longer be possible.</strong> We would have crossed a threshold. The door will be closed.</p><h3>The Coordination Problem</h3><p>Humanity has limited dangerous technologies before. The <strong>Non-Proliferation Treaty</strong> restricts nuclear weapons. The <strong>Montreal Protocol</strong> phased out ozone-depleting chemicals. In theory, we could do the same with the technologies that enable superintelligence.</p><p>We won't.</p><p>And the same goes for climate change&#8230;</p><p>Global cooperation is hard. Especially when the consequences seem distant.</p><p><strong>We're watching smarter models emerge every few months</strong>.<br><strong>We're watching extreme weather events increase in frequency.</strong></p><p><strong>But "watching" isn't enough to trigger collective action</strong>. The <strong>Montreal Protocol</strong> worked because the costs were concentrated on a few chemical companies, alternatives existed, and the threat was viscerally immediate, a "hole" in the sky, rising skin cancer rates. Climate and AI lack these conditions.</p><p>The economics are merciless. Nobel laureate Jean Tirole explains it simply: "If France reduces emissions, it's going to get only a small fraction of the benefit, but it's going to bear 100 percent of the costs." Game theorists call this the <strong>free-rider problem</strong>: every country wants global emission reductions, but each would prefer someone else to bear the burden.</p><p>The same logic applies to AI. <strong>If you don't race to AGI, another lab will</strong>, depriving you of both the technology and the economic returns. If you don't pollute, someone else will. <strong>The incentive structure doesn't reward caution.</strong> </p><p>Nordhaus understood this decades ago: <strong>the difficulty of escaping a non-cooperative equilibrium is amplified by the intertemporal trade-off</strong>. The current generation pays for abatement; future generations receive the benefits. We discount the future. We value present consumption over preventing distant catastrophe.</p><h3>The Shared Structure</h3><p>Climate change and AI risk are not merely analogous. They share the same underlying structure:</p><p><strong>Positive feedback loops</strong> that can push systems past the point of human control. Ice melts ice. Intelligence builds intelligence.</p><p><strong>Hysteresis</strong> that makes reversal difficult or impossible. Ice sheets don't simply refreeze. A superintelligent system that doesn't want to be turned off probably won't be.</p><p>Coordination failures have roots in free-rider dynamics and temporal discounting. Everyone wants the problem solved; no one wants to pay the cost.</p><p>A window for action that closes before the consequences become visible. By the time warming is catastrophic, the feedback loops are already locked in. By the time misaligned AI is unmistakably dangerous, it may be too capable to stop.</p><p><em>Problems become hardest to address precisely when they become easiest to see.</em></p><h3>The Doors</h3><p>I don't have a solution. I'm not sure one exists that doesn't require the kind of global coordination that game theory tells us is unlikely to emerge.</p><p>What I can offer is clarity about the structure of the problem. Climate tipping points and AI alignment aren't separate crises. They're instances of the same underlying failure mode: systems with positive feedback loops, governed by actors facing coordination problems, where the window for intervention closes before the damage becomes undeniable.</p><p>Some things seldom end in happy endings. Some doors only open one way.</p><p>I hope to read this article again in a few years and tell myself I didn't understand the very first thing about these topics. That life post-tipping point, life alongside superintelligence, turned out fine. That the feedback loops stabilized. That the coordination problem got solved.</p><p>I hope to be wrong.</p><p>But hope is not a strategy.</p><h4>Resources</h4><ul><li><p>Yudkowsky, E., &amp; Soares, N. <em>If Anyone Builds It, Everyone Dies</em>. Penguin Press, 2025.</p></li><li><p>Good, I.J. &#8220;Speculations Concerning the First Ultraintelligent Machine.&#8221; <em>Advances in Computers, 1965</em></p></li></ul><ul><li><p>Nordhaus, W.D. <em>The Climate Casino: Risk, Uncertainty, and Economics for a Warming World</em>. Yale University Press, 2013.</p></li><li><p>Barrett, S. <em>Environment and Statecraft: The Strategy of Environmental Treaty-Making</em>. Oxford University Press, 2003.</p></li><li><p>Tirole, J. <em>Economics for the Common Good</em>. Princeton University Press, 2017.</p></li></ul>]]></content:encoded></item><item><title><![CDATA[Being Human in the Age of Intelligent Machines]]></title><description><![CDATA[I'm building something right now with AI tools that would have been impossible for me to build alone two years ago.]]></description><link>https://fullmetalresearcher.substack.com/p/being-human-in-the-age-of-intelligent</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/being-human-in-the-age-of-intelligent</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Thu, 15 Jan 2026 08:35:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wCcK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I'm building something right now with AI tools that would have been impossible for me to build alone two years ago. It's stressful, rewarding, and fun. I can justify the long hours because there's real economic potential attached.  The "what if it works" is loud enough to kill the doubts</p><p>But I often think&#8230; the same technology making this possible is also making me question whether any of it will matter in the future.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>I'm a lifelong learner. Most of my satisfaction comes from spending time on hard problems and arriving somewhere new. <strong>The struggle itself is the point</strong>, that moment when the question marks  connect and you understand something you didn't before.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wCcK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wCcK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wCcK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg" width="547" height="629.5987328405491" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1090,&quot;width&quot;:947,&quot;resizeWidth&quot;:547,&quot;bytes&quot;:525661,&quot;alt&quot;:&quot;File:Punishment sisyph.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="File:Punishment sisyph.jpg" title="File:Punishment sisyph.jpg" srcset="https://substackcdn.com/image/fetch/$s_!wCcK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wCcK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3910346c-1024-43a6-af7c-07fafedc524e_947x1090.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sisifo - Tiziano Vecellio</figcaption></figure></div><p>This kind of curiosity has practical benefits. Knowledge across multiple domains lets you spot opportunities others miss. But the practical benefits aren't why I build things. I do it because the process of learning gives me a sense of accomplishment that's difficult to describe.</p><p>This is what feels at risk.<br><br>We're living through a strange time. <strong>Coding tools now allow individuals to build what required entire teams just a few years ago</strong>. The leverage is unprecedented. If you can collaborate effectively with intelligent machines, you can create things that would have been far beyond your reach, and the rewards for doing so can be asymmetric.</p><p>I suspect this window is temporary.</p><p>Right now, the humans who adapt fastest are gaining disproportionate returns. But that's a transition phenomenon. As the tools improve and more people learn to use them, the advantage shrinks. And eventually, the tools won't need us to guide them at all.</p><p>I'm not scared of annihilation. <strong>I'm scared by a future where the premium for being human is negative.</strong></p><p>Our society rewards mastery. We admire people who can solve hard problems, who develop deep expertise, who push the boundaries of what's known. This is what gives us value  not just economically, but existentially. The years spent learning, the struggle to understand, the satisfaction of finally getting it. These things mean something because they're hard-won. <strong>The fact that something is difficult gives it value.</strong></p><p><strong>What happens when machines can do all of that better and faster?</strong> Not in some distant future, but soon?</p><p>If studying can't make you competitive with intelligent machines, why study? If human effort becomes economically worthless, does it become meaningless too?</p><p>I don't have an answer.</p><p>Maybe meaning doesn't require scarcity. Chess engines have been superhuman for decades, yet people still play, still study openings, still compete. The joy was never really about being the best in the universe; it was about struggle, understanding.</p><p>Maybe we'll find new sources of meaning. New ways to feel accomplished that have nothing to do with economic utility. Maybe this will be a period of shared prosperity where we&#8217;re finally free to focus on what we actually care about.</p><p>Or maybe I'm underestimating how much our sense of purpose depends on being needed.</p><p>I'm building something today because it matters today. The work is real, the learning is real, the satisfaction is real. Whether any of that survives the next decade, I honestly don't know.</p><p>Perhaps I'm just overthinking in utilitarian terms. <strong>Perhaps the question of what makes life meaningful was never supposed to have a clean answer.<br><br></strong>Thanks for reading<br>- Guido</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Claude Code: the Monster Ultra White of Coding Tools]]></title><description><![CDATA[I don't know what's inside but it's good]]></description><link>https://fullmetalresearcher.substack.com/p/claude-code-the-monster-ultra-white</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/claude-code-the-monster-ultra-white</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sat, 10 Jan 2026 14:57:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KLh4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KLh4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KLh4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 424w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 848w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 1272w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KLh4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png" width="1441" height="872" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25314194-d442-444d-ab3f-48e0737192d2_1441x872.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:872,&quot;width&quot;:1441,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1039225,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/184115609?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KLh4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 424w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 848w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 1272w, https://substackcdn.com/image/fetch/$s_!KLh4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25314194-d442-444d-ab3f-48e0737192d2_1441x872.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Running Claude Code</figcaption></figure></div><p>There are different coding agents out there&#8230;<br><br>- Google has Gemini.<br>- OpenAI has Codex.<br>- Kimi and Qwen have their own offerings too.<br>- Meta has (Wait what is meta even doing?)</p><p>But WHY is everyone so focused on Claude Code?</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>I think that until the release of Opus 4, there was some sort of level playing field among providers. Things then changed, and now they're changing at an even faster rate.</p><p>One of the main predictors of Claude Code's success (personal view) was its ability to run tasks over an extended period of time. But let me explain what that actually means.</p><h3>Is This Just Hype?</h3><p>Are people going crazy? Let's check the data.</p><p>If we look at <a href="https://www.swebench.com/">SWE-bench Verified</a> ,a benchmark that tests AI models on real-world software engineering tasks by having them solve actual GitHub issues, we can see that <strong>Claude 4.5 Opus is currently leading</strong>. But benchmarks only tell part of the story. What matters most of the time is what people are actually able to <em>build</em>.</p><p>I've been checking out some cool projects lately, and I noticed something: the authors have been coding them mostly using Claude Code (obviously with previous generation models, before the recent upgrades).</p><p>My impression is that while other providers tried to focus on too many things at once (think OpenAI with Sora), Anthropic really zeroed in on the AI coding domain and actually nailed it.</p><h3>What Changed</h3><p>Here's what I noticed compared to the previous generation of coding tools.</p><p>I was used to iterating on almost every function I wanted to implement. The cycle looked something like this:</p><ol><li><p>"Please add feature X to the following module"</p></li><li><p>I check the module&#8230; the feature is either absent or not working as expected</p></li><li><p>I tell the model to fix it, explaining what's not working and how it should work</p></li><li><p>Repeat</p></li></ol><p>It's like when you're working and you get assigned an intern. She's smart, but she's just getting started, so she needs constant supervision.</p><p>Now, someone could write: &#8220;Hahaha, skill issue. What a loser!&#8221;</p><p>Maybe. But sometimes models are just... sluggish.</p><p>With Opus 4.5, it's different. Most of the time, it just one-shots what I want. And this is <em>not</em> a result of me getting better at prompt engineering.</p><p>So let's understand how it got this much better.</p><h3>Extended Thinking and Self-Correction</h3><p>One of the key reasons is something you can see in the benchmark charts: models are getting better at working on problems for extended time horizons.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-ABC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-ABC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 424w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 848w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 1272w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-ABC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png" width="1456" height="744" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:744,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:227675,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/184115609?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-ABC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 424w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 848w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 1272w, https://substackcdn.com/image/fetch/$s_!-ABC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccd3df8c-a4d0-4dfa-9c4f-93007a22f380_1944x994.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: Model Evaluation &amp; Threat Research </figcaption></figure></div><p>What does this mean in practice? It allows for more complex requests and enables an iterative self-correcting mechanism. The model can write code, check if it works, identify issues, and fix them, all within a single run. </p><p>But there are other features that make Claude Code particularly effective.</p><p><strong>Context Editing</strong> allows Claude to intelligently manage what information it's paying attention to during a coding session. Instead of being overwhelmed by your entire codebase, it can selectively focus on relevant files and context, updating its working memory as the task evolves. </p><p><strong>Sub-agents</strong> are another powerful feature. Claude Code can use specialized "helper" instances to handle specific subtasks, like having one agent research documentation while another writes tests and a third implements the actual feature. <strong>These sub-agents work in parallel and report back</strong>, making complex multi-step tasks much more manageable.</p><h3>How I Actually Use Claude Code</h3><p>It's a vibe coding tool so&#8230; I don't know if writing this section is gonna be that helpful. But here we go&#8230;<br><br>I run Claude Code in the terminal of VS Code, where it uses a grep tool to understand all the files in my codebase and uses a markdown file to memorize what it's doing.</p><p>I give Claude a markdown file to provide context about the architecture and other important considerations. You can also save specific markdown files to reference in the future for particular use cases, building up a kind of  knowledge base for your project.</p><p>One of the features I use constantly is <strong>Planning</strong>. Whenever I have to implement something new, I ask Claude to plan first before executing.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lgWC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lgWC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 424w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 848w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 1272w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lgWC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png" width="1456" height="309" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:309,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:57876,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/184115609?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lgWC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 424w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 848w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 1272w, https://substackcdn.com/image/fetch/$s_!lgWC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff242084b-2304-4ad9-be3d-4c42433eb572_1518x322.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">You can just do things in Plan mode</figcaption></figure></div><p>I think it's useful for two reasons: it gives me a greater understanding of what Claude is about to do, and it generates a useful list of changes (essentially a TODO list). I can review this, make adjustments, and then let Claude execute. </p><h3>Plugins and Extensions</h3><p>Plugins are extensions for Claude Code. You just type <code>/</code> in the text bar and the available plugins come up.</p><p>There's one I find particularly cool: the <strong>ralph-loop</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zmwl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zmwl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zmwl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Ralph Loop + Claude Code: What 8 Hours Alone Produced | Intelligent Tools&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Ralph Loop + Claude Code: What 8 Hours Alone Produced | Intelligent Tools" title="Ralph Loop + Claude Code: What 8 Hours Alone Produced | Intelligent Tools" srcset="https://substackcdn.com/image/fetch/$s_!Zmwl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Zmwl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4d18ac7-6156-4c72-9663-132a2c420482_2240x1260.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Ralph Wiggum (The Simpsons)</figcaption></figure></div><p>It's essentially a while loop that follows this cycle:<br><br>run code &#8594; check results &#8594; fix issues &#8594; repeat. <br><br>It can be useful when you have to run a long process in the background without babysitting it. Set it up, let it run, come back to completed work ().</p><h3>Conclusion</h3><p>I'll continue experimenting with Claude Code and learning new things in the process.</p><p>Eventually, <strong>the cycle will repeat.</strong> A better model will come out, and there will be new articles on why "model-x" is a game-changer. That's how this works.</p><p>So far, I'm glad to be using Claude, but I know it won't be my definitive coding agent forever. Nothing is permanent in this space (and in life).</p><p>For the next few weeks, I'll start writing my long articles on "general AI" again. Lately, I've been spending some time learning about <strong>alignment</strong>, the problem of making sure AI systems actually do what we want them to do. </p><p>Thanks for reading.<br>- Guido<br><br>Resources </p><ul><li><p>https://www.anthropic.com/news/claude-opus-4-5 </p></li><li><p>https://www.lesswrong.com/posts/q5ejXr4CRuPxkgzJD/claude-opus-4-5-achieves50-time-horizon-of-around-4-hrs-49 </p></li><li><p>https://awesomeclaude.ai/ralph-wiggum </p></li><li><p>https://code.claude.com/docs/en/sub-agents </p></li><li><p>https://www.swebench.com/</p><p> </p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Unbundling vibe coding tools]]></title><description><![CDATA[Transitioning from Replit to Claude Code]]></description><link>https://fullmetalresearcher.substack.com/p/unbundling-vibe-coding-tools</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/unbundling-vibe-coding-tools</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Thu, 08 Jan 2026 16:27:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!aM9z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aM9z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aM9z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 424w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 848w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 1272w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aM9z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png" width="606" height="397" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:397,&quot;width&quot;:606,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:429839,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/183921645?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aM9z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 424w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 848w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 1272w, https://substackcdn.com/image/fetch/$s_!aM9z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda417d91-ced6-4e33-aa43-b8a1f551a253_606x397.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Evangelion x Replit x Claude.</figcaption></figure></div><p><br>I started using <strong>Replit</strong> less than a year ago, this past summer. At the time, I'd never built anything full-stack. The most I could manage was connecting a Python script to Streamlit. Replit felt like magic. The idea that I could describe what I wanted and watch it turn into a working app was extraordinary. It just made you go.</p><p>So I started experimenting. The first thing I tried to build was a <strong>robo-advisor</strong>. My background is in fintech, so it was easier for me to describe the process I wanted to automate. I'd also been reading a book about building robo-advisors with Python by <a href="https://akiranin.substack.com/">Aki Ranin</a>, and I'd already implemented the steps locally in VS Code. I was curious what natural language could achieve&#8230; could I just <em>tell</em> the machine what I wanted and have it work?</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>I built a small product. It wasn't perfect, but it was useful for learning. And then I got lucky: Replit released a new version of their agent, and suddenly the outputs improved dramatically. I later realized this was because they were running Opus under the hood. That's when I started paying attention to Claude as a model, not just as something powering another tool.</p><p>I kept using Replit for a while. I ran apps there, experimented, learned. I should be honest, a lot of this was just for fun. I'm one of those people who spends too much time reading documentation, taking courses, learning about new technologies. Having the chance to put that knowledge together into something that actually works is satisfying in a way that's hard to explain. And I know that someday I'm going to build something actually great. It's just a question of time.</p><p>But here's where things somewhat shifted.</p><h3><strong>The Cost Problem</strong></h3><p>One of the issues with these apps, if you're experimenting a lot and using the agent heavily, is that the bills add up. What counts as "significant" is relative, of course. But I started comparing what I was spending to other options, and I realized that at my current rate, just experimenting, not even building anything for clients, I could end up with a meaningful monthly fee. </p><p>I ran some rough comparisons. I don't have the exact numbers in front of me right now, but the difference was there. </p><p>Now, I want to be clear: I still think Replit is a great product (like amazing). The ecosystem is solid, the support is excellent, there are videos for everything you want to implement. It's genuinely great. But I believe you should think about different ways to run a workflow. It broadens the mind.</p><h3><strong>From first principle<br></strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qJ7E!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qJ7E!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 424w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 848w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 1272w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qJ7E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png" width="550" height="405.9363525091799" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:603,&quot;width&quot;:817,&quot;resizeWidth&quot;:550,&quot;bytes&quot;:303293,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/183921645?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qJ7E!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 424w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 848w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 1272w, https://substackcdn.com/image/fetch/$s_!qJ7E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5802edbc-2781-4f3b-8302-e863b810eb81_817x603.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">This format never gets old.</figcaption></figure></div><p>So I started asking myself some basic questions. What am I actually trying to achieve? What model am I using? How could I use that model directly? How could I handle authentication and hosting myself? What am I gonna eat for dinner? (what?) </p><p>Replit is convenient because so many things are built in. You don't have to think that much about the pieces, it just works. I mean if you're just running a demo, using in production would be different. Anyway, that convenience has a cost, and not just in dollars. You don't learn how the pieces fit together. I mean, you can still learn a lot just by interacting with the agent, but there's always more to learn. </p><p>My idea was simple:</p><ul><li><p>Download the repo from Replit</p></li><li><p>Load it in VS Code</p></li><li><p>Run Claude Code terminal to keep developing</p></li><li><p>Figure out hosting separately.</p></li></ul><p>It made sense on paper and was actually pretty fast to set up.</p><h3><strong>The New Stack</strong></h3><p>Once I had things running locally with Claude Code, the next question was obvious: where do I host this? I found <strong>Railway</strong>, and it turned out to be exactly what I needed. Deploying a codebase from a GitHub repo is incredibly simple. Every time you push an update, Railway builds a new version of your application automatically. You can run a Postgre database. There's a lot you can do.</p><p>The learning curve for Claude Code was cool. There are courses on deeplearning.ai, plenty of documentation, and honestly, you just learn by doing. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1MzA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1MzA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 424w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 848w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 1272w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1MzA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png" width="598" height="311.73214285714283" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:759,&quot;width&quot;:1456,&quot;resizeWidth&quot;:598,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Claude Code: A Highly Agentic Coding Assistant - DeepLearning.AI&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Claude Code: A Highly Agentic Coding Assistant - DeepLearning.AI" title="Claude Code: A Highly Agentic Coding Assistant - DeepLearning.AI" srcset="https://substackcdn.com/image/fetch/$s_!1MzA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 424w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 848w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 1272w, https://substackcdn.com/image/fetch/$s_!1MzA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad28bbac-752f-4796-a92a-5c6f6bf28b61_2560x1335.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://www.deeplearning.ai/short-courses/claude-code-a-highly-agentic-coding-assistant/">DeepLearning.AI x Claude Code</a></figcaption></figure></div><h3><strong>What I Learned</strong></h3><p>Here's what I think about web coding platforms like Replit: they're amazing, especially if you're getting started. But if you take the time to decompose all the components and understand how the pieces fit together, especially if you don't have a software engineering background, you learn a lot (like a lot!). It's worth it not just for the cost savings, but for what it teaches us about how applications actually work.</p><p>Maybe I'll show you some of my projects in the coming weeks. For now, I just wanted to share this reflection. </p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Everything is computer]]></title><description><![CDATA[A Brief History of Semiconductors]]></description><link>https://fullmetalresearcher.substack.com/p/everything-is-computer</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/everything-is-computer</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sun, 28 Dec 2025 16:18:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Pvly!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Look, there's one thing people need to understand.</p><p>The world we live in is the result of a continuous trend: the ever-increasing price-performance of computation. Every year, chips get smaller, faster and cheaper. This trend has shaped everything&#8230; how we communicate, how we work, how we wage war, and yes, how your social media feed knows exactly what you want to see.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>I wrote a while ago "<em>a brief memoir of artificial intelligence</em>"  to trace the history of that field. Today I want to do the same with semiconductors. And I know what you&#8217;re thinking: "Who cares about that?"</p><p>Well, I do. But trust me, it's fascinating. And especially in this field, <strong>understanding the past is essential for picturing future scenarios.</strong></p><p>Why? Because this story is the result of our collective choices, our geopolitical tensions, our technological ambitions. Your personality is shaped by your online experience, you know that. And everything you do online is the result of smaller and smaller chips. So let's make a step back.</p><h3><strong>The Most Global Product Ever Made</strong></h3><p>This article serves as an introduction to the current supply chain of semiconductors. So complex. So fascinating. <strong>Semiconductors are the result of global cooperation on a scale that would make the United Nations jealous.</strong></p><p>Perhaps you're able to read this article because of a chip designed using software from the US, by a Japanese company that operates in the UK with Canadian engineers, who send their designs to a Southeast Asian country to be manufactured using a Dutch machine that relies on Swiss lenses. </p><p>I&#8217;m not joking. These are just some of the countries involved in producing a single chip.</p><p>What's fascinating (and to some extent terrifying) is that this supply chain is dominated by a handful of companies. This is exactly why we've seen, and will probably continue to see, a series of bottlenecks. Think about it:</p><p>The majority of advanced <strong>GPUs</strong> are produced by <strong>TSMC</strong> in Taiwan. All the extreme ultraviolet lithography machines used for producing cutting-edge chips are designed and manufactured by <strong>ASML</strong> in the Netherlands. More than half of memory chips are produced in South Korea by <strong>Samsung</strong> and <strong>SK Hynix</strong>.</p><p>And guess what? Two of these three countries could be disrupted tomorrow. Taiwan sits on major fault lines (just as I&#8217;m writing this, it was hit by a 6.6 magnitude earthquake). And tomorrow Kim Jong-un could wake up and decide to launch a missile against South Korea.</p><p>Do you get it now? The entire technological infrastructure of modern civilization depends on a few factories in some of the most geopolitically precarious regions on Earth.</p><p>Okay, enough psychological terrorism. Let me trace the actual history.</p><h3><strong>Born from War</strong></h3><p><strong>We constantly benefit from technologies that were first designed for military use, and chips are no exception.</strong></p><p>During the <strong>Cold War,</strong> it became clear that the real winners would be the countries that could design the most precise weapons. The US realized this while the Soviets did not. <strong>But precision requires calculations</strong>, lots of them. So the country that could pack more transistors onto a chip would eventually gain the advantage.</p><p>But what's a transistor?</p><p>A <strong>transistor</strong> is essentially a tiny electronic switch that can be either on or off&#8212;representing the 1s and 0s that form the basis of all computing. William Shockley theorized it and won a Nobel Prize, but it was the engineers at <strong>Fairchild Semiconductor</strong> who turned the transistor into something useful and scalable.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Pvly!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Pvly!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Pvly!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg" width="600" height="398" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:398,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Fairchild Semiconductor founders - CHM Revolution&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Fairchild Semiconductor founders - CHM Revolution" title="Fairchild Semiconductor founders - CHM Revolution" srcset="https://substackcdn.com/image/fetch/$s_!Pvly!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Pvly!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd085f54-d354-41c5-b487-cef1fa0a71ff_600x398.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fairchild founders</figcaption></figure></div><p></p><p>It was Jack <strong>Kilby</strong> and Robert <strong>Noyce</strong> who understood that multiple transistors could be packed onto a slice of silicon. This invention, the <strong>integrated circuit</strong>, happened at Fairchild Semiconductor in the late 1950s. There, Noyce and Gordon Moore realized that thanks to a trend of miniaturization and increased power efficiency, these integrated circuits would eventually become capable of extraordinary calculations.</p><p>But you need money for research. Thank God, the first major order arrived from NASA, who had an ambitious plan to put a man on the moon. The Apollo Guidance Computer needed lightweight, reliable computing power, and integrated circuits delivered.</p><p>Eventually, as the cost per unit of computation decreased, the integrated circuit became a mass-market product.</p><h3><strong>Moore&#8217;s Law: The Engine of Progress</strong></h3><p>In 1965, Gordon Moore made an observation that would define the next half-century of technological progress. He noticed that the number of transistors on an integrated circuit was doubling roughly every two years, while the cost per transistor was falling.</p><p>This observation (later called <strong>Moore's Law</strong>) isn't really a law of physics. It's more like a self-fulfilling prophecy. The entire semiconductor industry organized itself around hitting this target, investing billions to ensure that chips kept getting smaller, faster, and cheaper on schedule.</p><p>And the results have been staggering. Your smartphone today has more computing power than all of NASA had during the Apollo missions. The chip in your laptop contains billions of transistors, each one smaller than a virus.</p><p>This is probably the fastest evolving technology ever. Such rapid growth was one of the reasons replication was nearly impossible, and the same reason the US Navy gained such an advantage over countries that focused on quantity over quality.</p><h3><strong>Enter Globalization. Enter Intel.</strong></h3><p>Fairchild soon started expanding and opened a factory in Hong Kong to tap into the Asian market. But Fairchild wasn't the only player anymore.</p><p>In 1968, Noyce and Moore left Fairchild to found <strong>Intel</strong>, short for Integrated Electronics. Only two years later, they launched their first product: <strong>DRAM</strong> (Dynamic Random Access Memory), a memory chip used to store data temporarily. These chips weren't specialized, making them straightforward to mass-produce. Intel's objective was to dominate this segment.</p><p>Intel definitely benefited from the trend of packing more transistors into less space. But perhaps who benefited most was the United States Armed Forces.</p><h3>The Seventies: Innovate or Perish</h3><p>We're in the seventies. The US has lost the <strong>Vietnam War</strong>. Morale is low as Americans come to the realization that their military supremacy is no longer guaranteed. The game becomes clear: innovate or perish.</p><p>At this point, the US realized that the only path forward was to dominate through better technology. More powerful chips enabled visions of remote-controlled missiles, stealth aircraft, and super-precise sensors. The military became one of the main clients of semiconductor producers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rs80!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rs80!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rs80!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rs80!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rs80!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rs80!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg" width="1200" height="940" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:940,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Lockheed SR-71 - Wikipedia&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Lockheed SR-71 - Wikipedia" title="Lockheed SR-71 - Wikipedia" srcset="https://substackcdn.com/image/fetch/$s_!rs80!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rs80!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rs80!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rs80!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94767319-6013-4cb2-a02b-36d92a8a8054_1200x940.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">SR-71 Blackbird: First stealth aircraft</figcaption></figure></div><p>One could argue that the Silicon Valley we know today was, in many ways, a Pentagon project.</p><p>You might think that at this time, the US was the absolute leader in semiconductors. But that wouldn't be that accurate. In the eighties, it was actually <strong>Japan</strong> that dominated the memory chip market. A partnership between Japan and the US helped both countries benefit, but the US was losing its technological superiority, and we know America doesn't like losing.</p><p><strong>Losing in tech means losing your position of leadership in the globalized world.</strong></p><h3><strong>The Asian Tigers Wake Up</strong></h3><p>Perhaps&#8230; This is also a story of forward-looking governments.</p><p>There's one thing you should understand<strong>: if a developed Asian country makes something its national priority, they can work really, really hard to reach their goal.</strong></p><p>South Korea decided that semiconductors were a matter of national priority. This meant companies in the sector enjoyed cheap loans, government support, and a coordinated industrial policy. Samsung and SK Hynix would eventually become global giants in memory chips.</p><p>Meanwhile, in the US, <strong>Qualcomm</strong> was pioneering mobile communications technology that would later become essential for smartphones. The pieces of the modern chip ecosystem were falling into place.</p><h3><strong>The Cold War Supply Chain</strong></h3><p>Now let&#8217;s take a moment to appreciate the benefits of globalization.</p><p>On one hand, we had the Soviets with aging technology. On the other, the Western allies had created a super-efficient supply chain. Japan and South Korea provided memory chips. The Dutch company ASML was developing the lithography machines essential for chip manufacturing. Southeast Asian countries were assembling final products.</p><p>Isn't that remarkable? The Cold War allies had built something unprecedented: a global manufacturing network where each country contributed its comparative advantage.</p><p>The results of this technological advancement were evident in the Persian Gulf War of 1991. Operation Desert Storm showcased precision-guided munitions, stealth aircraft, and real-time battlefield communications, all powered by semiconductors. The technological gap between the US and its adversaries had become real.</p><h3><strong>A Little Island Called Taiwan</strong></h3><p>After World War II, Taiwan was in a delicate position. Chiang Kai-shek&#8217;s Nationalists had fled there after losing the Chinese Civil War to Mao&#8217;s Communists. The island needed Western support to survive&#8212;and it found an unlikely path to that support through semiconductors.</p><p>There's a certain irony here. Mao's policies inadvertently helped Taiwan's development. During the Cultural Revolution, many Chinese scientists and engineers fled to Taiwan, bringing their expertise with them. This brain drain from the mainland became Taiwan's gain.</p><p>Becoming a superpower in the semiconductor industry was Taiwan&#8217;s government goal. In 1987, this ambition culminated in the creation of <strong>TSMC</strong> (Taiwan Semiconductor Manufacturing Company). The government recruited Morris <strong>Chang</strong>, a semiconductor veteran who had spent decades at Texas Instruments in the US, to return to his motherland and lead this new venture.</p><p>Chang pioneered a revolutionary business model: the "pure-play foundry." Instead of designing its own chips, TSMC would manufacture chips designed by other companies. This allowed chip designers to focus on innovation without the massive capital investment required for fabrication facilities.</p><p>Taiwan had great know-how and total government support. This government backing kept the price of semiconductors competitive, benefiting Western technology companies enormously. Companies could now design cutting-edge chips knowing that TSMC would manufacture them at scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W20k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W20k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 424w, https://substackcdn.com/image/fetch/$s_!W20k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 848w, https://substackcdn.com/image/fetch/$s_!W20k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 1272w, https://substackcdn.com/image/fetch/$s_!W20k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W20k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png" width="1189" height="811" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:811,&quot;width&quot;:1189,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:828378,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/182766992?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!W20k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 424w, https://substackcdn.com/image/fetch/$s_!W20k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 848w, https://substackcdn.com/image/fetch/$s_!W20k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 1272w, https://substackcdn.com/image/fetch/$s_!W20k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98e8d04b-564f-494d-b10c-5c35dc873c17_1189x811.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">TSMC x Evangelion crossover</figcaption></figure></div><p></p><h3><strong>The Dutch Machine</strong></h3><p>In the same period, Dutch company <strong>ASML</strong> (a spin-off from Philips) was developing something that would become absolutely essential: <strong>Extreme Ultraviolet (EUV) lithography machines.</strong></p><p>To understand why this matters, you need to know how chips are made. The process is essentially like photography: you shine light through a pattern and project it onto silicon, creating the microscopic circuits. As transistors got smaller, the wavelength of light needed to etch them had to get shorter. EUV uses light with a wavelength of just 13.5 nanometers.</p><p>These machines are engineering marvels. Each one costs around $200 million, weighs 180 tons, and contains over 100.000 parts sourced from companies across the globe, including those Swiss lenses I mentioned earlier. ASML produces fewer than 50 of them per year.</p><p>The partnership between ASML and TSMC became the backbone of advanced semiconductor manufacturing. ASML makes the only machines capable of producing the most advanced chips. TSMC operates those machines better than anyone else. Together, they created a duopoly that powers the modern digital world.</p><h3><strong>To Be Continued...</strong></h3><p>I'll stop here because we&#8217;re reaching an important inflection point in history: the development of GPUs by NVIDIA. That story, how a company making graphics cards for video games became the most valuable in the world, deserves its own article.</p><p>But here's what I want you to take away from this history:</p><p>Semiconductors aren&#8217;t just components. They&#8217;re the foundation of everything. They're the result of military competition, government industrial policy, scientific genius, and global cooperation. They're also an incredible vulnerability, a supply chain that could be disrupted by an earthquake, a war, or a single company deciding to cut off access.</p><p>Understanding this history isn't just interesting. It's essential for understanding what comes next.</p><p><em>Stay tuned.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The dragon and the eagle]]></title><description><![CDATA[A tale of two countries]]></description><link>https://fullmetalresearcher.substack.com/p/the-dragon-and-the-eagle</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/the-dragon-and-the-eagle</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Sat, 20 Dec 2025 12:39:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KyT8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>When my father hears the term "<strong>Sputnik moment,</strong>" he thinks about 1957, the year the Soviet Union launched Sputnik 1, the first artificial satellite to orbit Earth. That moment shattered America's perception of technological superiority. It triggered panic, but also led to the creation of NASA and massive investments in science and education.</p><p>A Sputnik moment represents a sudden realization that you're losing a critical race, triggering a frantic, unified national effort to catch up and innovate.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>For my generation, it's different. I grew up in an era of relative peace, so when I hear "Sputnik moment," I think about two different episodes.</p><p>The first happened in May 2017 when <strong>AlphaGo</strong>, an AI program built by Google <strong>DeepMind</strong>, defeated Ke Jie, the 19-year-old world champion of <strong>Go</strong>, three games to zero. Go is a 3000-year-old Chinese board game with more possible positions than atoms in the universe. It had long been considered the final frontier of human intuition, impossible for machines to master. After his defeat, Ke Jie said AlphaGo played "like a god of Go." The Chinese government censored the live broadcast. The humiliation was real.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KyT8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KyT8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KyT8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;AlphaGo defeats world Go champion Ke Jie &#8211; The New Economy&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="AlphaGo defeats world Go champion Ke Jie &#8211; The New Economy" title="AlphaGo defeats world Go champion Ke Jie &#8211; The New Economy" srcset="https://substackcdn.com/image/fetch/$s_!KyT8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KyT8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4abb3442-695e-4525-b1af-85f32f37ac4c_2746x2059.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">AlphaGo vs Ke Jie </figcaption></figure></div><p>Just as the Soviet's progress in space engineering prompted American investment in technology in 1957, AlphaGo's victory prompted China to pour resources into AI. Within months, the State Council released its "New Generation AI Development Plan" aiming to make China the world leader in artificial intelligence by 2030.</p><p>In January 2025, we witnessed the third Sputnik moment. But this time, it was different, American perception of technological superiority was the one being shattered.</p><p><strong>DeepSeek</strong>, a Chinese AI startup founded as a side project by a quantitative hedge fund called High-Flyer, released <strong>R1</strong>, a reasoning model that matched or exceeded OpenAI's best models. The shock wasn't just that it worked. It was how little it cost. DeepSeek claimed to have trained R1 for roughly $6 million using only 2048 <strong>Nvidia H800 chips</strong>, a fraction of the resources that American labs reportedly require. For comparison, OpenAI's GPT-4 reportedly cost over $100 million to train. Marc Andreessen, one of Silicon Valley's most influential investors, called it "one of the most amazing and impressive breakthroughs I've ever seen." Nvidia's stock dropped 17%, erasing nearly $600 billion in market value in a single day.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VbYh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VbYh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 424w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 848w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 1272w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VbYh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png" width="1456" height="379" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:379,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:166881,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/182163208?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VbYh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 424w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 848w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 1272w, https://substackcdn.com/image/fetch/$s_!VbYh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2dcebad9-e643-4f4b-9a21-4a2030dc706e_1859x484.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Marc Andreessen on DeepSeek R1</figcaption></figure></div><p>I grew up thinking China was years behind the West. Perhaps as a kid, I wasn't wrong. But now I am. The playing field has changed, and China is catching up. As I grew up and changed, so did this equilibrium, and I'm only now fully appreciating that.</p><p>There's a concept in development economics that countries face two paths when it comes to education spending. If they produce advanced technology, they're better off channeling investments into post-secondary education (universities and research). If they import advanced technology, they're better off investing in primary education to have the workforce to replicate it. For decades, China was in the second camp. But things have changed.</p><p>In 2024, China-based scholars produced more AI-related research publications&#8212;more than the combined output of the United States, the United Kingdom, and the entire European Union. The top institutions producing AI research papers are  Chinese, led by the Chinese Academy of Sciences. China now commands global citation attention in AI research.</p><p>There's a real possibility that AI will be the single field that helps China finally surpass the United States as the world&#8217;s leading technological power.</p><h2><strong>It's All About Scaling</strong></h2><p>To produce better artificial intelligence, you work (on a high level) on three variables:</p><ul><li><p>compute</p></li><li><p>data</p></li><li><p>and algorithms.</p><p></p></li></ul><p>I want to focus on computing and data, this is where China can eventually catch up with the US, and potentially surpass them once they reach the state of the art in GPU production.</p><p>There are structural advantages working in China's favor.</p><h3><strong>Policy Environment</strong></h3><p>China's central government has the ability to pick long-term goals and mobilize resources to reach them fast. Fifty years ago, this produced famines. Today, it produces semiconductor fabs.</p><p>The government is pouring capital into the semiconductor industry, bringing back home scientists trained at American universities, and extracting <strong>technology transfers</strong> from foreign companies desperate for access to Chinese consumers. And just this week, Reuters reported that Chinese scientists have built a working prototype of an <strong>EUV lithography</strong> machine, the holy grail of advanced chipmaking that only the Dutch company <strong>ASML</strong> has ever mastered. Former ASML engineers, recruited with signing huge bonuses, are reportedly leading the effort. China is targeting chip production by 2028, though experts say 2030 is more realistic.</p><p>Meanwhile, the US is slashing research funding.</p><h3><strong>Data</strong></h3><p>China is now the world's largest producer of data, partly due to the WeChat ecosystem, a super-app with 1.4 billion users that handles messaging, payments, shopping, government services, and nearly every aspect of daily life. But China also collects more data from the physical world than anyone else. Think about the <strong>surveillance system</strong>: hundreds of millions of cameras feeding facial recognition algorithms, smart city projects tracking every car and pedestrian, IoT sensors embedded in infrastructure. This is surveillance but also training data.</p><h3><strong>Culture</strong></h3><p>They work harder. That's it.</p><p>I was on a plane from Cambodia to Bangkok when a clever Chinese businessman saw me reading <em>AI Engineering</em>. He started explaining why China would be the next leader and how they work harder compared to Europeans and Americans.</p><p>The numbers support his confidence. According to a recent Digital Science report, China has roughly 30000 active AI researchers, compared to about 10000 in the United States. And China's cohort is noticeably younger.</p><h3><strong>Chips</strong></h3><p>China spends more on importing semiconductors than oil. In 2024, Chinese firms imported $385 billion worth of chips, compared to $325 billion for crude oil. The dependency is obvious, and <strong>the chips come from geopolitical rivals</strong>. It makes sense that Beijing wants to internalize as much of the supply chain as possible.</p><p>To put things in perspective: in the 1980s, the most advanced chip China could produce was equivalent to an Intel chip from the early 1970s (a decade behind). Now Huawei is shipping 7nm phones, and SMIC is working on 5nm processes using creative workarounds with older equipment. The gap is shrinking.</p><p><strong>Subsidies</strong> will accelerate this. China has designated lithography and chip manufacturing as national priorities, and billions are flowing in to reach this goal. The EUV breakthrough this week, if it holds up, could cut years off the timeline.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aYvS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aYvS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aYvS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg" width="1456" height="1036" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1036,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;ASML. The Dutch Company That Dominates Tech | by Dion F. Lisle |  VentureVINE | Medium&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="ASML. The Dutch Company That Dominates Tech | by Dion F. Lisle |  VentureVINE | Medium" title="ASML. The Dutch Company That Dominates Tech | by Dion F. Lisle |  VentureVINE | Medium" srcset="https://substackcdn.com/image/fetch/$s_!aYvS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aYvS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba438911-3f8f-4be5-8547-dcea7c11fe41_3000x2134.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"> EUV machine (ASML)</figcaption></figure></div><h2><strong>Wake Up</strong></h2><p>When DeepSeek's R1 dropped, I heard online that American tech executives were somewhat in panic. Some questioned whether the cost figures were real. Others speculated about hidden state support. A few accused DeepSeek of training on OpenAI's data or smuggling Nvidia H100 for training.</p><p>But the fundamental message was clear. The assumption that you need $100 million and tens of thousands of H100s to build frontier AI models was wrong. The assumption that export controls would set China back by years was wrong. The assumption of permanent American technological superiority was also wrong.</p><p>This isn't about DeepSeek specifically. It's about what DeepSeek represents: a nationwide innovation ecosystem, a young and growing AI workforce, a government that treats AI as a strategic asset on par with energy or military power, and <strong>a culture that's willing to work relentlessly to win.</strong></p><p>In a future article, I'll dig deeper into the current environment, the chip war, the export controls, and what happens next. I would love to continue writing, but my best friend is in Napoli, and I want to spend some time with him.</p><p>For now, consider this a wake-up call.</p><p></p><h2>Resources:</h2><p>- Wikipedia: AlphaGo versus Ke Jie</p><p>- NPR: 'Like A God,' Google A.I. Beats Human Champ</p><p>- CNBC: Nvidia sheds almost $600 billion in market cap</p><p>- Science.org: China tops the world in artificial intelligence publications</p><p>- Digital Science: DeepSeek and the New Geopolitics of AI</p><p>- Reuters: Made-in-China EUV machine targets AI chip output by 2028 (Dec 2025)</p><p>- South China Morning Post: China's 2024 chip imports surged 10.4% to US$385 billion</p><p>- Statista: WeChat active users worldwide</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Reflecting on Scaling]]></title><description><![CDATA[The reason I'm writing this article (and perhaps the reason you developed an interest in artificial intelligence) is simple: scaling worked.]]></description><link>https://fullmetalresearcher.substack.com/p/reflecting-on-scaling</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/reflecting-on-scaling</guid><dc:creator><![CDATA[Guido]]></dc:creator><pubDate>Wed, 17 Dec 2025 11:16:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!b2xw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The reason I'm writing this article (and perhaps the reason you developed an interest in artificial intelligence) is simple: <strong>scaling worked.</strong></p><p>The reason frontier labs are pouring billions into training runs and NVIDIA became the most capitalized stock is that the <strong>Scaling Hypothesis</strong> is actually working.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>In simple terms, the Scaling Hypothesis is the belief that human-level intelligence is an emergent property of applying massive amounts of compute and data to relatively simple learning algorithms.</p><p>While we are beginning to see new frontiers open up, such as the <strong>inference scaling</strong> demonstrated by OpenAI's o1, where accuracy correlates with the time the model spends "thinking", the foundation of the current revolution is <strong>training scaling</strong>.</p><p>To understand where we are going, we must first understand how we got here.</p><h2>The Three Eras of Compute</h2><p>Most recent advancements in machine learning (ML) can be attributed to a simple truth: we are training models with vastly more compute than ever before.</p><p>It is crucial to realize that in the world of AI, <strong>compute usage is a proxy for progress.</strong></p><p>While traditional integrated circuits followed <strong>Moore's Law</strong> (doubling transistor counts roughly every two years), machine learning scaling has moved at a much more aggressive pace. By analyzing historical data, we can categorize this growth into three distinct eras:</p><ol><li><p><strong>The Pre-Deep Learning Era</strong> (Before 2010):</p><p>Training compute grew slowly, largely mirroring Moore&#8217;s Law with a doubling time of roughly 20 months.</p></li><li><p><strong>The Deep Learning Era</strong> (2010&#8211;2015):</p><p>With the advent of modern Deep Learning (starting roughly with AlexNet, which was trained on just two gaming GPUs), scaling accelerated. Compute requirements began doubling approximately every 6 months.</p></li><li><p><strong>The Large-Scale Era</strong> (Late 2015&#8211;Present):</p><p>A massive shift occurred as companies began developing large-scale foundational models. This era is characterized by a sudden, increased demand for 10 to 100 times more compute, breaking previous trends entirely.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MHFj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MHFj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 424w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 848w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 1272w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MHFj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png" width="1053" height="630" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:630,&quot;width&quot;:1053,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:124636,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/181876602?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F671032c9-23b0-4c16-9337-534832e391d3_1053x630.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MHFj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 424w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 848w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 1272w, https://substackcdn.com/image/fetch/$s_!MHFj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0aa23f7-736c-42d5-bbb0-59fb5549668c_1053x630.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: Epoch AI</figcaption></figure></div><p></p><h2>How Scaling Actually Works</h2><p>Now that we have the historical context, what does "scaling" actually look like inside a lab?</p><p>Scale consists of three primary factors:</p><ul><li><p><strong>N</strong>: The number of model parameters (the "brain size").</p></li><li><p><strong>D</strong>: The size of the dataset (the "books read").</p></li><li><p><strong>C</strong>: The amount of compute used for training (the "energy spent").</p></li></ul><p>If you were a researcher at a frontier lab tasked with training a new model, you would have a finite budget of time and GPU availability (C), and you probably wouldn't be reading this&#8230; Anyway, your goal is to find the sweet spot between the size of the model (N) and the amount of data (D).</p><p>Literature suggests that performance improves predictably as long as we scale up N and D in tandem. However, you enter a regime of diminishing returns if one is held fixed while the other increases.</p><p>This balance is defined by the <strong>Chinchilla Law</strong> (named after a DeepMind paper), which states there is a "golden ratio" for efficiency. To train a compute-optimal model, the training dataset should be roughly <strong>20 times the number of parameters</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pPfp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pPfp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 424w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 848w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 1272w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pPfp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png" width="937" height="296" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35778928-91fd-45e8-980d-62617b49f7c6_937x296.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:296,&quot;width&quot;:937,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:82434,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/181876602?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2768f5-eb11-4e80-a525-13fd432ff58a_937x296.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pPfp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 424w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 848w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 1272w, https://substackcdn.com/image/fetch/$s_!pPfp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35778928-91fd-45e8-980d-62617b49f7c6_937x296.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: Google DeepMind</figcaption></figure></div><p>If you build a massive model but don't have enough data to "fill" it, you have somewhat wasted compute. If you have a massive dataset but a tiny model, you have wasted information.</p><h2>Mo Money Mo Problem</h2><p>Scaling works, but it is extremely expensive.</p><p>Sam Altman has indicated that training GPT-4 cost over <strong>$100 million</strong>. Dario Amodei (CEO of Anthropic) recently noted that we are rapidly approaching models that cost <strong>$1 billion</strong> to train, with $10 billion runs on the horizon.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OTet!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OTet!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 424w, https://substackcdn.com/image/fetch/$s_!OTet!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 848w, https://substackcdn.com/image/fetch/$s_!OTet!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 1272w, https://substackcdn.com/image/fetch/$s_!OTet!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OTet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png" width="1091" height="590" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:590,&quot;width&quot;:1091,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92978,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://fullmetalresearcher.substack.com/i/181876602?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OTet!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 424w, https://substackcdn.com/image/fetch/$s_!OTet!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 848w, https://substackcdn.com/image/fetch/$s_!OTet!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 1272w, https://substackcdn.com/image/fetch/$s_!OTet!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86dbd1b0-e0cb-4eb9-981a-6f739391d578_1091x590.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: AI Index 2025 Annual Report (Stanford)</figcaption></figure></div><p></p><p>When we think of these costs, we usually think of GPUs. However, researchers at <strong>Epoch AI</strong> have broken down the real costs of these massive training runs:</p><ul><li><p><strong>47&#8211;67% Hardware:</strong> The cost of the chips (GPUs/TPUs) and networking gear.</p></li><li><p><strong>29&#8211;49% R&amp;D:</strong> The cost of human talent. (To put this in perspective, the war for talent is so fierce that compensation packages for top researchers can reach hundreds of millions).</p></li><li><p><strong>2&#8211;6% Energy:</strong> The electricity bill.</p></li></ul><h3>How is this calculated?</h3><p>Since labs rarely share the total cost of training, researchers use an <strong>amortized hardware</strong> approach.</p><ol><li><p><strong>Usage:</strong> Calculate total "chip-hours" (e.g. 10000 GPUs running for 3 months).</p></li><li><p><strong>Depreciation:</strong> Multiply by the hourly cost of owning that hardware over its lifespan.</p></li><li><p><strong>Energy:</strong> Add the electricity costs.</p></li></ol><p>The result is clear: <em>building frontier models is a race that only the wealthiest organizations can afford</em>.</p><h2>The Bottleneck: Are We Running Out of Data?</h2><p>We know we need money for GPUs, but we also need D (Data).</p><p>The Chinchilla Law tells us we need roughly 20 tokens of data for every parameter in the model. As models grow, our hunger for text grows with them. In pre-training, models ingest tens of trillions of words. The problem is that <strong>the internet is finite.</strong></p><p>High-quality human text is not growing fast enough to satisfy the exponential requirements of LLMs.</p><p>At current growth rates, we may fully utilize the stock of high-quality public text data around <strong>2028</strong>. This "data wall" is compounded by the fact that many modern models are "over-trained" (using far more than the 20:1 ratio) to make them cheaper to run during inference. Economic incentives rule&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b2xw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b2xw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 424w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 848w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 1272w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b2xw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!b2xw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 424w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 848w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 1272w, https://substackcdn.com/image/fetch/$s_!b2xw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F210d03c5-c56c-471b-bed9-0c840080ac27_2415x1359.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>The Path Forward</h2><p>If we run out of human text, does the revolution stop? Unlikely. History shows that when a bottleneck is reached, humans innovate.</p><p>Frontier labs are already deploying strategies to circumvent the data wall:</p><ul><li><p><strong>Multi-Epoch Training:</strong> Training on the same data multiple times (though this has diminishing returns).</p></li><li><p><strong>Synthetic Data:</strong> Using smart models to generate high-quality data to train newer models.</p></li><li><p><strong>Multimodality:</strong> Moving beyond text to train on video, audio, and real-world physics data.</p></li></ul><p>While the "easy" scaling of the last decade (simply scraping more web text) is coming to an end, the era of scaling intelligence is far from over. We may face a period where models are undertrained relative to their potential, but the incentive to solve this problem is worth trillions of dollars.</p><p>As we saw with the shift from CPUs to GPUs, and from distinct eras of compute, the industry will likely find a way to keep the line going up.<br><br><strong>Keep moving forward.<br><br>Further Reading:<br></strong></p><ul><li><p>Sevilla, J., Heim, L., Ho, A., Besiroglu, T., Hobbhahn, M., &amp; Villalobos, P. (2022). Compute Trends Across Three Eras of Machine Learning.</p></li><li><p><em>Kaplan, J., et al. (2020).</em> Scaling Laws for Neural Language Models.</p></li><li><p>OpenAI. (2024). <em>Learning to Reason with LLMs</em>. OpenAI Research.</p></li><li><p>Hoffmann, J., et al. (2022). Training Compute-Optimal Large Language Models. </p></li><li><p>Cottier, B., et al. (2024). The Rising Costs of Training Frontier AI Models. <em>Epoch AI Research</em>.</p></li><li><p>Villalobos, P., et al. (2022). Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning. </p><p></p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[A brief memoir of Artificial Intelligence]]></title><description><![CDATA[Sometimes I have the impression that people fail to appreciate how far we've come in the realm of Artificial Intelligence.]]></description><link>https://fullmetalresearcher.substack.com/p/a-brief-memoir-of-artificial-intelligence</link><guid isPermaLink="false">https://fullmetalresearcher.substack.com/p/a-brief-memoir-of-artificial-intelligence</guid><pubDate>Sun, 14 Dec 2025 10:25:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/iuPD8T2OQjs" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Sometimes I have the impression that people fail to appreciate how far we've come in the realm of Artificial Intelligence. It's common to read statements such as "Yes, but current models are unable to do that" or see people pointing out trivial errors. For this reason, we need to take a step back to get an idea of the rate of progress we are witnessing.</p><h3>The Promising Fifties</h3><p>I could start this chapter from the Hellenistic period, but that would be somewhat boring. So, I will instead start this account from the fifties, because the fifties were insane. Remember how cool Benedict Cumberbatch was in <em>The Imitation Game</em>? Well... Alan Turing was even cooler in real life<em>.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>In 1950, Turing posed a philosophical question: "Can a machine think?" From there, he designed the Turing Test. A computer would be able to pass this test if a human interrogator, holding a conversation with the machine, cannot tell whether the written responses come from a person or from a computer.</p><p>To pass the test, a machine would need the following capabilities:</p><ul><li><p><strong>NLP</strong> (Natural Language Processing) to communicate.</p></li><li><p><strong>Memory</strong> to store information.</p></li><li><p><strong>Reasoning</strong> to answer questions and draw conclusions.</p></li><li><p><strong>Fluid Intelligence</strong> to adapt to new situations.</p></li></ul><p>Turing was able to challenge the idea that "thinking machines" were absurd, but it took a long time to reach that state. At the same time, he warned that developing thinking machines could pose a serious risk to the human race.</p><p>The following year, in 1951, two Harvard students (Minsky and Edmonds) built the first neural network computer, the <strong>SNARC</strong>.</p><p>Turing's paper (<em>Computing Machinery and Intelligence</em>) captured the interest of academia. in 1956, US researchers gathered at Dartmouth College for two months to understand if a machine could actually think. The heart of the project&#8217;s goal is best summarized by the famous proposal submitted to the Rockefeller Foundation:</p><blockquote><p><em>"The study is to proceed on the basis of the conjecture that every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it."</em></p></blockquote><p>As one can imagine, they weren't able to answer this question and create a thinking machine over a single summer. However, the workshop laid the foundation for the recognition of Artificial Intelligence as a field, and as time passed, more researchers tried to answer Turing's question.</p><p>To be precise, more than answering Turing&#8217;s original question, researchers were interested in proving that a machine could perform specific tasks, or just to mess with arguments of the form &#8220;a computer will never do X&#8221; (This is the fifth of the nine common objections in Turing's paper).</p><p>So, one of the things a machine should struggle with is generalization, right? In 1957, Newell and Simon introduced the <strong>General Problem Solver (GPS)</strong>. This was a big step towards solving the mystery of whether machines could think. But what is it?</p><p><em>"The General Problem-Solver (GPS) is a computer program being used for explorations both into the general mechanisms involved in problem-solving and into the way humans solve problems."</em> While the GPS successfully solved logic puzzles and geometric proofs, it struggled significantly when applied to complex, real-world problems because the number of possibilities became too large to manage.</p><p>Wait, we're not done. In the same period, Arthur Samuel (an American electrical engineer) developed a Checkers-playing program. The program was able to teach itself how to play checkers, eventually defeating its creator through <strong>self-play</strong> (what we now call reinforcement learning). It was Samuel who coined the term "machine learning" in 1959, and his program can be thought of as the grandfather of <strong>Deep Blue</strong> and <strong>AlphaGo</strong>. Isn't that cool? Before the end of the decade, John McCarthy published an important paper describing the <strong>Advice Taker</strong>, a hypothetical program with general knowledge of the world. Think of it like an ancestor of modern LLMs.</p><h4>The Boring Sixties</h4><p>Researchers somewhat failed to acknowledge that their first successes were partly because the models were attempting to solve problems that were simple in nature (i<em>d est</em> with a limited number of steps). Perhaps scientists shared the idea that in a short amount of time, by scaling up computational power, algorithms could solve problems orders of magnitude greater in complexity. This wasn't necessarily wrong, but computational power and computer time back then were severely limited.</p><p>The <strong>Lighthill Report (1973)</strong> stated that researchers in the field of Artificial Intelligence had failed to solve the problem of "Combinatorial Explosion" (when the number of possible solutions or states grows exponentially with problem size, so that even supercomputers can't solve it). The report formed the basis for the British government to end support for AI research. In the same period, Minsky and Papert published a book criticizing the <strong>Perceptron</strong> (a simple neural network), proving it couldn't solve basic logic puzzles like XOR. This criticism was devastating, and eventually, research funding for neural nets went basically to zero.</p><div><hr></div><h4>Two Decades of Knowledge</h4><p>The seventies saw a paradigm shift in how AI research was performed. While previous models focused on generalization (like the GPS), new models focused on narrow domains of knowledge&#8212;hence the name <strong>Expert Systems</strong>.</p><p><em>"In the knowledge lies the power."</em> &#8212; Feigenbaum</p><p>One could say the shift was from researching <em>general</em> intelligence to researching a <em>narrow</em> form of it. In fact, the objective for these models was to emulate experts in a  specific domain.</p><p>The architecture of an expert system has three components:</p><ol><li><p><strong>Knowledge Base:</strong> Where expert knowledge (represented as a set of "if-then" rules) is stored.</p></li><li><p><strong>Inference Engine:</strong> Applies the knowledge contained in the base to help the user solve the problem.</p></li><li><p><strong>User Interface:</strong> Because otherwise, how do you even visualize that?</p></li></ol><p>It is worth noting that earlier models showed some form of predictability in their output and an understanding of what happened inside them. As we move to deep learning, we'll notice that this is not the case. We just have an idea about what goes<br>on under the hood.</p><p>What is cool is that here we can observe actual useful applications: in the medical domain with <strong>MYCIN</strong> (to diagnose blood infections) and <strong>Dendral</strong>, as well as in the business domain.</p><p>The same period saw a growing interest in making programs that understand language. Paired with the encouraging results of expert systems, this made governments invest significant amounts of money into AI research.</p><p>But after the excitement, the difficulties returned. Researchers started acknowledging the limits of this domain. Research saw a decrease in funds, and so we entered a period described as the "AI Winter."</p><p><em>(We're so back / It's so over)</em></p><h3>The Neural Network Strikes Back (But I Need More Compute)</h3><p>By the mid-1980s, the field of neural networks was colder than Siberia in December. Funding was close to zero, researchers were demoralized, and there was a general belief that neural networks were more useless than a pet rock. The point is, while simple neural networks could learn basic patterns, they were unable to solve complex problems. The issue was mainly computational (you needed immense power to train them).</p><p>In 1986, the paper "<em>Learning representations by back-propagating errors"</em> was released. Rumelhart, Williams, and Hinton popularized the <strong>backpropagation</strong> algorithm, showing that deep neural networks (those with more hidden layers) could handle real-world problems. The algorithm made it possible to teach neural networks to learn patterns by adjusting their connections based on errors.</p><p>I've reserved a full chapter on neural networks, so I won't discuss the nuts and bolts of backpropagation here, but it's important to understand that the paper laid the foundation for the modern AI revolution based on deep learning.</p><blockquote><p><em>Note: It&#8217;s my view that to make it in life you need to be delusional and go against the mainstream (being a contrarian). This is what Geoffrey Hinton and colleagues did when they decided to focus on neural networks even though no one at that time could see their application. Perhaps in the midst of the AI winter, they found a beautiful summer.</em></p></blockquote><h3>Machine Learning</h3><p>There were many contributors to the field of machine learning, but I want to focus on Rich Sutton's work, since it will be essential to explain modern models like AlphaGo and AlphaFold. We described earlier Arthur Samuel's contribution to the field, or, if we want, his contribution to <strong>Reinforcement Learning</strong> (a method where an &#8220;agent&#8221; learns to make decisions by interacting with an environment, using trial-and-error to maximize cumulative rewards). About three decades later, Sutton mathematically described how agents make decisions in unknown environments, getting a reward for each action, and this is how Reinforcement Learning came to be a specific field.</p><p><strong>The World Wide Web</strong></p><p>Berners-Lee's creation of the World Wide Web laid the foundation for algorithms to be trained on huge amounts of data.</p><p><em>"We propose that a logical next step for the research community would be to direct efforts towards increasing the size of annotated training collections, while deemphasizing the focus on comparing different learning techniques trained only on small training corpora."</em> (Banko &amp; Brill, 2001).</p><p>This is similar to what AI research labs do right now. The focus is on getting enough high-quality data to train the model rather than defining the next architecture.</p><p>Basically, one thing you have to understand (at a high level) is that the model <em>wants</em> to learn, and increasing the dataset size improves the predictive power of the algorithm. Soon, having access to a wider array of data (we&#8217;re talking orders of magnitude of difference) made it possible to apply new (and old) algorithms across multiple domains.</p><h3>Deep Learning Works</h3><p><em>"In 15 words: deep learning worked, got predictably better with scale, and we dedicated increasing resources to it."</em> &#8212; Sam Altman (September 2024)</p><p>The fact that deep learning worked and enabled us to produce Large Language Models is the reason I'm writing this.</p><p><strong>What is deep learning?</strong> <em>Deep learning is a subset of machine learning that layers algorithms to create an "artificial neural network." These layers allow the computer to learn from vast amounts of data and make intelligent decisions on its own, much like a human brain does.</em></p><p>Deep Learning is not new. Here's a video of Yann LeCun using a convolutional neural network to make a model recognize handwritten digits in 1995. <br></p><div id="youtube2-iuPD8T2OQjs" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;iuPD8T2OQjs&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/iuPD8T2OQjs?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p></p><p>There was an issue, though, in developing really capable algorithms: access to computing power to train these models. The diffusion of GPUs eventually fixed that, and from the second decade of the 21st century, deep learning really took off. We can trace this success to a specific moment, so now I'll leave the technicalities aside to tell one of the coolest stories I know.</p><div><hr></div><p>The 2011-2012 biennium was a great time for music. M83 released "<em>Hurry Up, We're Dreaming"</em> and Grimes released "<em>Visions."</em> Among other things, in  Toronto, Geoffrey Hinton (the grandfather of modern artificial intelligence) tasked his two best students, Sutskever and Krizhevsky, to teach a computer how to see using Nvidia GPUs. So, the students bought two gaming GeForce GTX 580 GPUs to teach a neural network how to recognize images. The GPUs were configured in Krizhevsky&#8217;s bedroom, and his parents had to pay the insane electricity bill.</p><p>Our heroes needed data to train the neural network, right? As luck would have it, Stanford Professor Fei-Fei Li had assembled the <strong>ImageNet</strong> database with millions of manually labeled images. These images were used to teach the neural network how to see. You can imagine that at the beginning, the neural network has to guess what the image is, but as time goes by, it eventually learns to recognize what's inside. The network had multiple layers; one learned color, another shape, and so on. It makes errors, but eventually, it learns. The students were able to win the 2012 ImageNet competition, smashing their rivals.</p><p>This was probably the most important moment for artificial intelligence in the last 20 years. The main contribution to the field was proving that GPUs could train networks orders of magnitude faster than CPUs.</p><h3>Modern AI</h3><p>One could write multiple books about the modern state of Artificial Intelligence, but I think there are some defining elements that could give us a broad overview. It's a quest toward super-intelligence among a handful of research labs around the world.</p><p>Perhaps one could say that the origins of this competitive landscape trace back to <strong>DeepMind</strong>, founded in 2010 with the mission to "solve intelligence"&#8212;and one could argue they are doing that. But the astonishing fact is that they literally started 15 years ago. Eventually, DeepMind was acquired by Google in 2014.</p><p><strong>Fun fact:</strong> Elon Musk was an early investor in DeepMind. One day, he was reviewing an email by Demis Hassabis (one of DeepMind&#8217;s founders) about a recent breakthrough where one of his models found an unconventional way to win the Atari game <em>Breakout</em>. Larry Page (Google) was on the same plane and asked Musk which company he was discussing with Luke Nosek. Eventually, Page acquired DeepMind, and this was perhaps one of the most successful acquisitions of all time.</p><p>Musk eventually founded <strong>OpenAI</strong> with Sam Altman as a non-profit research lab and was even able to lure away key personnel from Google, including Ilya Sutskever and Dario Amodei. The first would later found Safe Superintelligence (SSI) and the second, Anthropic.</p><p>In this mess, Google researchers published a paper destined to change our lives forever: "<em>Attention Is All You Need"</em> (2017), introducing the <strong>Transformer</strong> architecture&#8212;the building block of modern LLMs. The "GPT" in ChatGPT stands for <strong>Generative Pre-trained Transformer</strong>.</p><p>Eventually, it was OpenAI that seized the opportunity for commercialization, leading to the launch of ChatGPT, the fastest-growing product in history. This moment shocked other labs, showing OpenAI&#8217;s advantage. Google responded with a strategic overhaul, unifying its two major AI research groups (Brain and DeepMind) and standardizing efforts around a single powerful model called <strong>Gemini</strong>. This intense competition now sees a handful of highly funded labs:</p><ul><li><p>Google DeepMind</p></li><li><p>OpenAI</p></li><li><p>Anthropic</p></li><li><p>DeepSeek</p></li><li><p>xAI (Grok)</p></li></ul><p>They're in a race to scale deep learning and unlock superintelligence. Hang tight, because this is just the beginning of the journey.</p><h5>Further Reading:</h5><ul><li><p>Turing, A. M. (1950). Computing machinery and intelligence. </p></li></ul><ul><li><p>Minsky, M. L. (1954). <em>Theory of Neural-Analog Reinforcement Systems and Its Application to the Brain Model Problem</em></p></li><li><p>McCarthy, J., Minsky, M. L., Rochester, N., &amp; Shannon, C. E. (1955). <em>A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence</em>.</p></li><li><p>Samuel, A. L. (1959). Some studies in machine learning using the game of checkers.</p></li><li><p>McCarthy, J. (1959). Programs with Common Sense. </p></li><li><p>Minsky, M., &amp; Papert, S. (1969). <em>Perceptrons: An Introduction to Computational Geometry</em>.</p></li><li><p>Lighthill, J. (1973). <em>Artificial Intelligence: A General Survey</em>. Science Research Council.</p></li><li><p>Feigenbaum, E. A. (1977). The Art of Artificial Intelligence: Themes and Case Studies of Knowledge Engineering.</p></li><li><p>Shortliffe, E. H. (1976). <em>Computer-Based Medical Consultations: MYCIN</em>.</p></li><li><p>Sutton, R. S. (1988). Learning to predict by the methods of temporal differences.</p></li><li><p>Banko, M., &amp; Brill, E. (2001). Scaling to Very Very Large Corpora for Natural Language Disambiguation. </p></li><li><p>Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., &amp; Fei-Fei, L. (2009). ImageNet: A Large-Scale Hierarchical Image Database. </p></li><li><p>Krizhevsky, A., Sutskever, I., &amp; Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks. </p></li><li><p>Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L, &amp; Polosukhin, I. (2017). Attention Is All You Need. </p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://fullmetalresearcher.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>