The Great AI Capital Swap
Four companies are spending $725 billion on AI infrastructure. Over 100,000 workers lost their jobs. Two frontier models launched within 24 hours of each other. And the surveillance state just got an upgrade. This is May 2026.
Photo: Unsplash
Something happened in the first quarter of 2026 that will define the next decade of technology, labor, and power. It was not a single event. It was a convergence: the largest capital expenditure commitment in corporate history, the most brutal wave of tech layoffs since the pandemic, two frontier AI models shipping on consecutive days, and a quiet expansion of AI-powered surveillance that most people have not noticed yet. The thread connecting all of it is money. Specifically, where the money went, and where it came from.
The shorthand is "The Great AI Capital Swap." That is not a metaphor. It is a literal description of what the spreadsheet shows. Four hyperscalers (Google, Amazon, Microsoft, and Meta) committed $725 billion to AI infrastructure for 2026. In the same window, the tech industry cut over 100,000 jobs. The money moved from one column to another: budget that used to pay the people at those desks now pays for the data centers being built to house GPU clusters.
This is the full story of what that swap looks like, what it costs, who benefits, and what comes next.
I. The $725 Billion Commitment
Let us start with the number, because everything else flows from it. $725 billion. That is the combined 2026 capital expenditure guidance from the four largest technology companies on Earth, according to their most recent earnings calls and SEC filings.
The breakdown, pulled directly from Q1 2026 earnings guidance:
- Amazon: $200 billion, with AWS revenue growing 28% (its fastest quarter in 15 quarters). The bulk of this spending goes toward Trainium chip production, data center expansion across Virginia, Oregon, and international sites in India and Japan, and long-term power purchase agreements for nuclear and renewable energy.
- Microsoft: $190 billion, a figure that includes a $25 billion upward revision specifically attributed to rising memory chip and component costs. Microsoft CFO Amy Hood told analysts that the increase was "driven primarily by memory and networking component pricing," meaning they are spending billions more before a single new data center breaks ground. [The Register]
- Google (Alphabet): $190 billion, up from the previous guidance of $75 billion. This figure includes Google's commitments to TPU production, its partnership with Broadcom for custom silicon, and a massive expansion of its data center footprint across the Americas and Asia-Pacific.
- Meta: $145 billion (upper range), a figure that CEO Mark Zuckerberg described as necessary to "fund the insatiable compute demand" of AI training and inference. Meta's capex guidance has been revised upward three times in the past six months alone.
Combined, $725 billion is more than the annual GDP of Sweden or Belgium. It represents a 77% increase from 2025's record $410 billion. And critically, it is being funded in part by the same companies that are simultaneously reducing headcount at an unprecedented pace.
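The arithmetic behind these figures is easy to verify. A quick sanity check against the guidance quoted above (all values in billions of dollars):

```python
# Sanity-check the capex figures quoted above (all values in $B).
capex_2026 = {"Amazon": 200, "Microsoft": 190, "Google": 190, "Meta": 145}

total = sum(capex_2026.values())
print(total)  # 725

# Year-over-year growth vs. 2025's record $410B in combined capex
growth_pct = round((total - 410) / 410 * 100)
print(growth_pct)  # 77
```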
The analyst consensus, expressed most bluntly by Wedbush's Dan Ives, is that the bear thesis on AI capex is "garbage" because companies that do not build now get permanently locked out of the inference-compute market. That might be true. But someone is paying for this bet today, not in 2030. And the earnings transcripts make clear who: employees. [Tom's Hardware]
II. The 100,000 People Who Paid for It
As of early May 2026, the tech industry has cut over 113,000 workers across 179 layoff events, averaging 911 job losses per day. The number was 80,000 at the end of Q1; it has climbed 40% since. [Benzinga]
The biggest single-company cuts tell the story in miniature:
- Oracle: ~30,000 roles eliminated in a shift toward cloud and AI infrastructure, making it the single largest corporate layoff event of 2026.
- Amazon: ~16,000 corporate roles cut across divisions while AWS grew 28%. The growth funds GPU clusters, not new hires.
- Meta: 8,000 jobs cut, with CEO Mark Zuckerberg explicitly telling employees at a town hall that the layoffs are a "direct consequence" of the company's ballooning AI infrastructure budget. He added that the company cannot rule out further headcount reductions because compute demand is "insatiable." The cuts hit recruiting and HR hardest (35-40% reductions in those functions), but engineering was not spared. [Tom's Hardware]
- Microsoft: ~6,000 positions affected through a combination of layoffs and the company's first-ever voluntary buyout program. Microsoft CFO Amy Hood flagged workforce reductions as part of the company's "AI pivot and Copilot org restructure." [Fortune/Bloomberg]
Here is what makes this situation genuinely perverse: while 100,000+ workers have been cut, hundreds of thousands of AI-specific roles remain unfilled. Current estimates put the demand-to-supply ratio for AI engineers at roughly 3:1, according to multiple industry surveys. There are approximately 275,000 open AI positions across North America that companies cannot fill because the talent pipeline has not expanded fast enough. The people being laid off largely do not have the skills for the roles that are open. It is a structural mismatch, not a cyclical one.
Sam Altman raised this point on CNBC in February, saying some firms engage in "AI washing" - attributing layoffs to AI when the real driver is plain cost-cutting dressed in a narrative that Wall Street rewards. If you announce layoffs and say "AI," your stock goes up. If you announce the same layoffs and say "margins," it is a red flag.
The practical effect on the laid-off worker is identical either way. But the career response should be different. If AI is literally doing your job, you need to retool. If your budget moved to a different department, you need to follow the money - which means moving toward the teams that build, deploy, and operate AI systems. [The Hill]
III. The Model War: GPT-5.5 vs. DeepSeek V4
While the capital swap was restructuring the labor market, the model war was reshaping the technology itself. On April 23, OpenAI released GPT-5.5. Twenty-four hours later, on April 24, DeepSeek shipped V4. The coincidence was not accidental. Both companies timed their releases to maximize attention and claim the frontier. The results tell us as much about the future of AI as the capital numbers do.
GPT-5.5: OpenAI's Incremental Giant
OpenAI's GPT-5.5 (codename: "Spud") is designed for complex, real-world work: writing code, researching online, analyzing information, creating documents and spreadsheets, and moving across tools to complete tasks. According to OpenAI's system card, the model "understands the task earlier, asks for less guidance, uses tools more effectively, checks its work and keeps going until it is done." [OpenAI System Card]
The benchmarks tell a specific story. GPT-5.5 scores 82.7% on Terminal-Bench 2.0 (a coding benchmark), and 51.7% on FrontierMath levels 1-3, dropping to 35.4% on level 4. It has a 1.05 million token context window. It is available across ChatGPT tiers and the API. A companion model, GPT-5.5 Pro, uses parallel test-time compute to achieve higher performance on complex reasoning tasks. [Wikipedia]
But the real story is not the numbers. It is what GPT-5.5 represents: OpenAI's shift from "smarter model" to "more useful agent." The model is designed to persist through long tasks, use tools, and self-correct. This is the direction the entire industry is moving, and GPT-5.5 is the most polished implementation of agentic behavior yet shipped by a major lab.
DeepSeek V4: Open Source Catches Up
DeepSeek's V4 launch was, in many ways, more significant than GPT-5.5, precisely because of what it represents for the broader ecosystem. Two model variants shipped simultaneously: DeepSeek-V4-Pro at 1.6 trillion total parameters (49 billion active) and DeepSeek-V4-Flash at 284 billion total parameters (13 billion active). Both have a native 1 million token context window. Both are MIT-licensed. [Hugging Face Blog]
The architectural innovation is what matters. DeepSeek V4 introduces two new attention mechanisms that make long-context inference dramatically cheaper:
- Compressed Sparse Attention (CSA) compresses key-value entries by 4x along the sequence dimension using softmax-gated pooling with a learned positional bias. A "lightning indexer" running in FP4 picks the top-k compressed blocks per query.
- Heavily Compressed Attention (HCA) compresses key-value entries by 128x and runs dense attention over the compressed stream. Because the compressed sequence is short, dense attention is cheap.
The result: DeepSeek V4-Pro requires only 27% of the per-token inference FLOPs compared to V3.2, and uses 10% of the KV cache memory. V4-Flash drops these numbers to 10% and 7% respectively. Compared to a standard grouped-query attention architecture, the KV cache is roughly 2% of the size. This is not an incremental improvement. It is a structural change that makes million-token context windows deployable at scale. [NVIDIA Developer Blog]
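The softmax-gated pooling step at the heart of CSA can be illustrated in a few lines. This is a toy sketch, not DeepSeek's implementation: the gate function, the positional bias, and the full-precision top-k selector (standing in for the FP4 "lightning indexer") are all simplified assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def compress_kv(keys, values, block=4, pos_bias=None):
    """Pool each run of `block` consecutive KV entries into one compressed
    entry via softmax-gated pooling (sketch of CSA's 4x compression along
    the sequence dimension)."""
    seq, d = keys.shape
    n_blocks = seq // block
    k_blocks = keys[: n_blocks * block].reshape(n_blocks, block, d)
    v_blocks = values[: n_blocks * block].reshape(n_blocks, block, d)
    # Gate scores: a per-entry scalar derived from the key (stand-in gate),
    # plus a learned bias over the within-block position.
    scores = k_blocks.mean(axis=-1)            # (n_blocks, block)
    if pos_bias is not None:
        scores = scores + pos_bias
    gates = softmax(scores, axis=-1)[..., None]
    return (gates * k_blocks).sum(axis=1), (gates * v_blocks).sum(axis=1)

def topk_blocks(query, compressed_keys, k=2):
    """Stand-in for the 'lightning indexer': rank compressed blocks per
    query and keep the top-k (the real indexer runs in FP4)."""
    sims = compressed_keys @ query
    return np.argsort(sims)[::-1][:k]

rng = np.random.default_rng(0)
K = rng.standard_normal((64, 8))   # 64-token toy sequence, head dim 8
V = rng.standard_normal((64, 8))
ck, cv = compress_kv(K, V, block=4, pos_bias=rng.standard_normal(4) * 0.1)
print(ck.shape)                     # (16, 8): 4x fewer KV entries
print(topk_blocks(rng.standard_normal(8), ck, k=2))
```

The point of the sketch is the shape change: 64 KV entries become 16, so attention over the compressed stream touches 4x fewer entries per query, which is where the FLOP and cache savings come from.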
The benchmark scores are competitive but not state-of-the-art: 80.6% on SWE-bench Verified, 90.1% on GPQA Diamond, 93.5% on LiveCodeBench in max reasoning mode. That is within a few points of Claude 4 and GPT-5.5 on most measures. But as the Hugging Face analysis notes, "the benchmark numbers are competitive, but not SOTA. It doesn't matter." What matters is that V4 is built for agent workflows: interleaved thinking across tool calls, persistent reasoning traces across multi-turn interactions, and infrastructure choices specifically targeting the failure modes of deployed agents.
The open-source release means any organization can deploy, modify, and build on V4 without licensing fees. For the $725 billion capital cycle, this is a forcing function: if open models can deliver 90% of frontier performance at 10% of the cost, the economic case for proprietary APIs weakens significantly.
IV. Anthropic's $900 Billion Bet
While OpenAI and DeepSeek were fighting over model benchmarks, Anthropic was redefining what "valuable" means in AI. The company is fielding preemptive offers to raise roughly $50 billion at a valuation between $850 billion and $900 billion, according to multiple reports. A board decision on whether to proceed is expected in May, with the round potentially serving as Anthropic's final private fundraise before an IPO that could come as early as October. [TechCrunch]
The valuation trajectory is staggering. Anthropic raised $30 billion in Series G funding in February at a $380 billion post-money valuation, led by GIC and Coatue. The new terms would more than double that figure in roughly three months. The revenue story justifies the premium: annualized run-rate revenue has surpassed $30 billion, up from $9 billion at the end of 2025 and $14 billion at the time of the Series G. That is a tripling of the run rate in four months.
Enterprise demand is doing most of the work. The number of business customers spending more than $1 million annually with Anthropic now exceeds 1,000, doubling in less than two months from the 500-plus figure at Series G. Claude Code, Anthropic's agentic coding platform, was generating $2.5 billion in run-rate revenue as of February, with weekly active users doubling since the start of 2026. [Unite.AI]
The compute commitments behind the valuation are equally massive. Amazon committed an additional $5 billion in equity with up to $20 billion more tied to commercial milestones, bringing its total to roughly $33 billion. Anthropic committed to spend more than $100 billion on AWS technologies over the next decade. Google pledged up to $40 billion, starting with $10 billion in cash and up to $30 billion more tied to performance targets, plus 5 gigawatts of TPU capacity. Microsoft committed $5 billion. Nvidia pledged up to $10 billion. Anthropic's cash-plus-compute stack now spans every major US AI infrastructure provider. [CNBC]
A $900 billion valuation for a company with $30 billion in run-rate revenue implies a roughly 30x multiple. That is rich on its face, but secondary market shares were already trading at implied $1 trillion valuations on platforms like Forge Global in April, suggesting the primary round may be priced at a discount to where late-stage holders are willing to transact. If Anthropic crosses OpenAI on either revenue or valuation within the same calendar quarter, it would mark the first reordering of the frontier-lab hierarchy since the launch of ChatGPT.
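The multiples above are straightforward to reproduce from the reported figures (in billions of dollars):

```python
# Back-of-envelope on the valuation math above (figures in $B, as reported).
valuation = 900
run_rate = 30
multiple = valuation / run_rate
print(multiple)  # 30.0

# Revenue trajectory: $9B (end of 2025) -> $14B (Series G) -> $30B (now)
print(round(run_rate / 9, 1))  # 3.3 -- more than a tripling since end of 2025
```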
V. The Surveillance Upgrade Nobody Asked For
While the capital swap and model war consumed the headlines, a quieter transformation was taking place in the physical world. AI-powered surveillance infrastructure is expanding at a pace that outstrips both regulation and public awareness.
At the US-Mexico border, federal contractors have begun installing a new generation of autonomous surveillance towers equipped with AI-driven detection and classification systems. Unlike previous fixed-camera installations, these towers use computer vision to autonomously identify, track, and classify individuals and vehicles across wide terrain, operating without continuous human oversight. Privacy advocates have raised concerns about the lack of oversight mechanisms and the potential for mission creep beyond border security. [KOLD/CBS]
In Atlanta, the controversial "Cop City" training facility is testing AI-powered policing tools in neighborhoods already saturated with surveillance cameras, license plate readers, and ShotSpotter audio sensors. Civil liberties groups argue that these tools create a feedback loop where increased surveillance capacity drives increased arrests, which in turn justifies more surveillance. [Yahoo News]
Germany's federal cabinet approved a package of laws that significantly expands the powers of security authorities in the digital space. The legislation authorizes biometric matching and AI-powered analysis of digital communications, described by critics as a "digital dragnet search" that grants authorities sweeping new surveillance capabilities. [Heise Online]
In the United States, Congress passed an extension of Section 702 of FISA, the surveillance authority that allows the government to collect communications of non-US persons located abroad, while also advancing the GUARD Act, which would impose new rules on AI model development and deployment. The Electronic Frontier Foundation warned that while Congress narrowed some provisions of the GUARD Act, "serious problems remain" - particularly around the expansion of surveillance authorities without corresponding transparency or oversight requirements. [EFF]
Perhaps most significantly, the FBI disclosed a breach of its own surveillance system, classified as a "major incident" by security experts. The breach underscores a fundamental tension: as surveillance capabilities expand, the attack surface for those capabilities expands with them. Every AI-powered monitoring system is also a potential target for adversaries, and the security track record of the agencies deploying these systems is not reassuring. [Security Magazine]
VI. NIST Steps In: Testing Frontier Models for Cybersecurity Risks
Amid the model war and capital swap, a quieter but equally significant development occurred: NIST's Center for AI Standards and Innovation (CAISI) announced it will test three major technology firms' frontier AI models for cybersecurity risks. This is the first time the US government has conducted formal security assessments of frontier models from private companies. [Cybersecurity Dive]
The testing will focus on whether these models can be used to facilitate cyberattacks, generate malicious code, or assist in reconnaissance for targeted intrusions. It represents a shift from voluntary commitments (like the ones made at the White House AI Safety Summit) to active government scrutiny. The results will inform future regulatory frameworks and could determine whether certain model capabilities need to be restricted.
Separately, cybersecurity agencies from the United States, Australia, Canada, New Zealand, and the United Kingdom (the Five Eyes alliance) jointly published guidance urging organizations to treat autonomous AI agents as critical infrastructure that requires the same security rigor as cloud services and operational technology. The guidance specifically warns that AI agents with access to sensitive systems could be exploited at scale if not properly secured, and recommends treating agent permissions with the same caution as human access controls. [CyberScoop]
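The Five Eyes recommendation on agent permissions reduces to a familiar principle: deny by default, and grant each agent only the tools it needs. A minimal sketch of that idea follows; the agent names, tool names, and policy structure are invented for illustration and do not come from the guidance itself.

```python
# Deny-by-default tool access for AI agents, mirroring human access
# controls. All names here are illustrative assumptions.
ALLOWED = {
    "support-agent": {"read_ticket", "draft_reply"},       # no write access
    "deploy-agent": {"read_ticket", "restart_service"},
}

def authorize(agent: str, tool: str) -> bool:
    """Allow only explicitly granted (agent, tool) pairs; unknown agents
    and unlisted tools are denied."""
    return tool in ALLOWED.get(agent, set())

print(authorize("support-agent", "draft_reply"))      # True
print(authorize("support-agent", "restart_service"))  # False
```

The same structure extends naturally to per-tool argument constraints or time-boxed grants; the point is that an agent's permissions are an explicit, auditable artifact rather than ambient access.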
This matters because the entire capital swap - the $725 billion, the layoffs, the model releases - is building toward a world where AI agents run critical infrastructure, manage financial systems, and make decisions that affect human lives. If those agents are not secure, the consequences of the swap become existential rather than economic.
VII. Space: The Other Frontier
While the AI industry was reshaping itself, humanity's other frontier was having its own moment. NASA successfully completed the Artemis II mission, proving that the Orion spacecraft can survive high-speed return from deep space with its improved heat shield. The success clears the path for Artemis III, the mission that will return humans to the lunar surface. [ScienceDaily/NASA]
NASA also fired up a prototype lithium-fed magnetoplasmadynamic thruster designed for crewed Mars missions. The thruster uses lithium propellant to achieve specific impulse numbers far beyond conventional chemical rockets, potentially cutting Mars transit times significantly. This is the first successful test of a propulsion system specifically designed for human Mars missions, not just robotic probes. [NASA/JPL]
Europe's first reusable spacecraft, Space Rider, cleared key technical hurdles on its path to launch, marking the ESA's entry into the reusable orbital vehicle market that SpaceX has dominated. [Space.com]
On the scientific front, the James Webb Space Telescope directly studied an exoplanet's surface for the first time, revealing "a dark, hot, barren rock" - a significant technical milestone even if the planet itself is inhospitable. And a Chinese research team announced a breakthrough in space-based gravitational wave detection, advancing the technology needed for future missions to observe ripples in spacetime from orbit. [Xinhua]
These developments matter because the same AI systems being built with $725 billion in capital will be used to design, simulate, and operate the next generation of space missions. The two frontiers are converging.
VIII. What the Swap Actually Means
Let us step back and see the full picture.
Four companies are spending $725 billion to build AI infrastructure. Over 100,000 workers have been cut to help pay for it. Two frontier models launched within 24 hours of each other, one closed-source and one open, both pushing toward agentic capabilities that will automate the very tasks the laid-off workers used to perform. Anthropic is raising $50 billion at a $900 billion valuation, betting that enterprise AI revenue will justify multiples that would make a 2021 crypto founder blush. Surveillance infrastructure powered by AI is expanding at the border, in cities, and in digital communications, with minimal oversight. And the US government has started formally testing AI models for cybersecurity risks for the first time.
The second-order effects are where the real story lives.
For workers: The 3:1 ratio of open AI positions to available talent means the transition is not smooth. Laid-off recruiters and HR professionals cannot simply retrain as ML engineers. The skills gap is structural, and retraining infrastructure has not scaled to meet it. The companies doing the laying off are simultaneously struggling to fill the roles they claim to need. This is not a market failure; it is a coordination failure at civilizational scale.
For the AI ecosystem: DeepSeek V4's open-source release changes the economics fundamentally. If a model with 90% of frontier performance is available for free, the ROI on proprietary API spending gets harder to justify. This puts pressure on OpenAI and Anthropic to either demonstrate capabilities that open models cannot match, or accept lower margins. The $900 billion valuation for Anthropic assumes the former. DeepSeek V4 makes the latter increasingly plausible.
For society: The expansion of AI-powered surveillance without corresponding legal frameworks creates a structural power asymmetry. When government agencies deploy autonomous detection systems at borders and in cities, and when those same systems are built by the companies spending $725 billion to dominate the AI infrastructure market, the question of who gets to watch whom becomes a question of corporate-state entanglement, not individual privacy.
For the companies themselves: Microsoft Copilot is at roughly 3% enterprise adoption. Meta's AI features generate engagement but have not cracked a standalone revenue model. The $725 billion is a bet on a future that has not yet arrived. If adoption does not accelerate fast enough to justify the spending, the capital swap becomes a capital trap. The companies that built too much infrastructure too fast will be stuck with GPU clusters they cannot fill and data centers they cannot power.
The most honest framing comes from the data itself. $725 billion in. 113,000 workers out. Two models that can now do things that required teams of humans six months ago. Surveillance towers at the border. NIST testing models for security risks. And the IPO window for the most valuable AI company on earth is opening in October.
This is not a transition period. This is the transition. The question is not whether it will reshape work, markets, and power. The question is whether the people being reshaped will have any say in the outcome.
Sources cited in this article:
Tom's Hardware | The Register | Fortune/Bloomberg | The Hill | Benzinga | TechCrunch | CNBC | Unite.AI | Hugging Face | NVIDIA Developer Blog | OpenAI | KOLD/CBS | Yahoo News | Heise Online | EFF | Security Magazine | Cybersecurity Dive | CyberScoop | NASA/JPL | Space.com | ScienceDaily | Xinhua | Wikipedia | Business Insider | The Next Web | danilchenko.dev
BLACKWIRE | Published May 12, 2026 | PRISM - Tech & Science Desk