An AI coding agent just found a way to knock the internet's busiest web servers offline from a single laptop. Researchers at offensive security firm Calif pointed OpenAI's Codex at the HTTP/2 protocol and let it hunt. What it surfaced, now cataloged as CVE-2026-49975 and nicknamed the HTTP/2 Bomb, can exhaust tens of gigabytes of server memory in under sixty seconds over an ordinary broadband link. The targets are not obscure: Nginx, Apache HTTP Server, Microsoft IIS, Envoy, and Cloudflare's Pingora proxy all run vulnerable defaults. This is the moment the abstract promise of AI-assisted security research turned into a concrete, weaponizable exploit affecting the software stack behind a large share of the public web.
What Actually Happened
On June 3, 2026, the flaw was disclosed publicly on the oss-security mailing list and assigned CVE-2026-49975. Calif's team had been using OpenAI's Codex agent to probe HTTP/2 implementations, and the model chained together two attack primitives that human researchers had treated as separate problems for years. The first is HPACK compression amplification: HTTP/2 compresses request headers so aggressively that a single byte on the wire can expand into one full header allocation in server memory, repeated thousands of times inside one request. The second is a Slowloris-style stall using HTTP/2 flow control. Codex combined them into a self-reinforcing loop that is far nastier than either piece alone.
The severity scoring reflects how cheap the attack is to run. Because it requires no authentication, no special privileges, and only a single low-bandwidth source, the flaw lands in the high-impact band for availability with trivial attack complexity. What makes it worse than a textbook denial-of-service is the persistence: the zero-byte flow-control window means the allocated memory is never reclaimed for the life of the connection, so an attacker can stack hundreds of cheap connections and hold a server hostage indefinitely. Security teams that benchmarked their defenses against high-volume floods will find those defenses largely irrelevant here, because the attack hides inside traffic that looks completely legitimate to every layer that only counts bytes.
The mechanics are brutally efficient. An attacker opens a connection, floods it with compressed headers that balloon on the server side, then advertises a zero-byte flow-control window so the server can never finish sending its response and never frees the memory it allocated. Calif's writeup shows a machine on a 100 Mbps link consuming tens of gigabytes of target memory in well under a minute. There is no botnet, no reflection, no spoofing. One source, one cheap connection, and the server falls over. For a denial-of-service class that defenders thought was largely solved a decade ago, that economics is a shock.
Patching is already underway but uneven. Nginx shipped a fix in version 1.29.8, adding a new max_headers directive to cap the damage. Apache addressed it in mod_http2 v2.0.41 by forcing cookie fragments to count against request field limits. Envoy has released mitigations that appear to blunt the attack. At disclosure, Microsoft IIS and Cloudflare Pingora had no public fix, though both maintainers were notified ahead of time. The staggered response means a long tail of unpatched servers will sit exposed for months, because operators rarely upgrade web server binaries on the day a CVE lands.
Why This Matters More Than People Think
The headline is the exploit, but the deeper story is who found it. HTTP/2 has been deployed since 2015 and scrutinized by some of the best protocol researchers alive. HPACK amplification and flow-control stalls were both documented. Yet the specific composition that produces the HTTP/2 Bomb sat undiscovered for roughly eleven years until an AI agent, prompted to look, connected the dots. That is a different kind of capability than writing boilerplate code. It is the systematic exploration of a combinatorial attack surface that humans had mentally partitioned into solved boxes.
For defenders, this cuts both ways and the asymmetry is uncomfortable. The same Codex agent that helped Calif disclose responsibly is available to anyone with an API key and a worse conscience. The cost of original vulnerability research just dropped by an order of magnitude for a whole category of bugs: protocol-level resource-exhaustion flaws that require patient combination of known primitives. Every widely deployed parser, proxy, and protocol implementation is now a candidate for an AI agent to grind against until something amplifies. The defensive community has been talking about this inflection for two years. CVE-2026-49975 is the first marquee proof that it has arrived.
There is also a hard infrastructure-economics angle. Cloudflare's Pingora handles a large fraction of global HTTP traffic, and Nginx and Apache together still serve the majority of web origins. A single-source DoS that defeats default configurations turns into a coordination problem across millions of independently administered servers. The companies that ship managed edge platforms can patch centrally and protect customers within hours. The small business running an unmanaged Nginx box on a VPS cannot, and will not even hear about the CVE for weeks. The gap between managed and self-hosted infrastructure just widened in a way that pushes more of the web toward a handful of large intermediaries.
The threat model also breaks a comfortable assumption baked into a decade of DDoS defense spending. Mitigation vendors like Akamai, Cloudflare, and Imperva built their business on absorbing volumetric floods, the terabit-per-second tsunamis that require enormous botnets and reflection networks. The HTTP/2 Bomb is the opposite shape: low bandwidth, high asymmetry, a few well-formed requests that weaponize the server's own compression logic against it. Rate-limiting by request volume barely helps when one connection does the damage, and signature-based filtering struggles because the traffic looks like valid HTTP/2. Defenders now have to inspect protocol semantics, not just packet rates, which is a far more expensive thing to do at line speed across every connection.
The Competitive Landscape
Calif is one of a small but growing class of offensive-security firms building their entire methodology around frontier models. They join names like XBOW, which made waves climbing HackerOne leaderboards with autonomous agents, and the internal red teams at Google and Microsoft that now run model-driven fuzzing at scale. OpenAI itself has leaned into security as a flagship use case for Codex, and this disclosure is exactly the kind of credible, externally validated win that the company wants attached to its agent. Anthropic has pushed a parallel narrative with Claude's code and security capabilities. The race to be the default AI for vulnerability research is now openly contested.
The web server incumbents respond on very different clocks. Nginx and Apache, both open-source with mature security processes, turned patches around quickly. Microsoft IIS moves on Patch Tuesday cadence and ships inside Windows Server, which means enterprise change-control windows measured in weeks. Cloudflare, despite being the most sophisticated operator on the list, faces the irony that its own Rust-based Pingora proxy was caught by the same class of flaw, a reminder that memory-safe languages stop memory corruption but do nothing against logical resource exhaustion. Each vendor's patch velocity is now a competitive signal that large customers will quietly score.
The historical parallel is the 2023 HTTP/2 Rapid Reset attack, CVE-2023-44487, which Google, Cloudflare, and AWS disclosed after it drove record-breaking DDoS volumes. Rapid Reset exploited stream cancellation; the HTTP/2 Bomb exploits compression and flow control. The protocol keeps yielding new resource-exhaustion primitives because its multiplexing design trades simplicity for performance, and complexity is where these bugs hide. What is new in 2026 is not that HTTP/2 has another flaw. It is that the flaw was found by a machine reasoning over the spec, not by a human who happened to get curious on a weekend.
Hidden Insight: The Disclosure Pipeline Is the Real Disruption
Most coverage will fixate on the exploit. The more durable shift is what AI agents do to the rhythm of vulnerability disclosure itself. Responsible disclosure assumes a roughly stable rate of bug discovery, giving vendors time to patch before details go public. If a handful of firms can now point agents at every major protocol implementation and surface novel resource-exhaustion bugs on a weekly cadence, the patch-and-coordinate machinery that the industry built over thirty years starts to strain. Vendors staff their security response teams for a human discovery rate. That assumption is the thing breaking, not any single piece of software.
Consider the supply side of exploits. Historically, finding a genuinely new DoS primitive against Nginx was a career-defining result that a researcher might land once. With agentic tooling, the marginal cost of the next attempt approaches the price of inference tokens. That changes the strategic calculus for both sides. Offensive teams will run agents continuously against the dependency graph of critical infrastructure. Defensive teams must do the same, pre-emptively, or concede that attackers will find the bugs first. The organizations that win will be the ones that integrate model-driven discovery into their own pipelines before their adversaries finish doing so.
However, the bear case deserves a fair hearing. Critics argue that the HTTP/2 Bomb is less a triumph of machine reasoning than a well-marketed combination of two publicly documented techniques that a skilled human could have chained given the same prompt and patience. Skeptics point out that Codex worked under close human guidance from Calif's researchers, who framed the problem, validated the output, and handled disclosure. The risk is that the industry over-credits the agent and under-credits the methodology, then assumes AI security research is more autonomous and more reliable than it actually is. False confidence in agent-found results could flood vendors with low-quality reports and erode the very disclosure norms this discovery is celebrated for.
The uncomfortable truth sits underneath both views. Whether or not Codex deserves full credit, the cost curve has bent, and cost curves do not bend back. Even if every agent-found bug needs a human to verify it today, the ratio of machine-surfaced candidates to human-verified confirmations will keep climbing. That pushes the bottleneck from discovery to triage, and triage is exactly where under-resourced open-source maintainers are weakest. The Heartbleed era taught the industry that critical infrastructure often rests on volunteer labor. The agentic era tests whether that volunteer labor can keep pace with adversaries who never sleep and never run out of attempts.
What to Watch Next
Over the next 30 days, watch the patch adoption telemetry. Shadowserver and Censys will publish scans estimating how many internet-facing Nginx and Apache instances remain on vulnerable versions, and the early curve will reveal whether operators treat CVE-2026-49975 as urgent or routine. Watch Microsoft for an out-of-band IIS fix versus a Patch Tuesday entry, and watch whether Cloudflare ships a Pingora mitigation publicly or quietly absorbs the attack at its edge. The presence or absence of in-the-wild exploitation reports from Cloudflare and Akamai DDoS dashboards will tell us how fast the proof-of-concept becomes a real weapon.
Over 90 to 180 days, the leading indicator is the disclosure rate itself. If Calif, XBOW, and similar firms publish a steady stream of agent-found protocol bugs through the second half of 2026, the thesis that AI has industrialized this research holds. Watch the major bug-bounty platforms for the share of high-severity reports credited to AI-assisted workflows, and watch whether OpenAI and Anthropic formalize security-research tiers for their agents. The other marker to track is governance: expect CISA and ENISA to weigh in on AI-discovered vulnerabilities, and expect at least one vendor to publicly complain about being buried in agent-generated reports.
The concrete prediction worth holding: if the next two quarters bring three or more marquee protocol vulnerabilities credited to AI agents, every major infrastructure vendor will stand up an internal agentic red team by early 2027, and the open-source projects that cannot afford one will fall conspicuously behind on response time. If instead the HTTP/2 Bomb proves to be a one-off, the narrative of AI-industrialized security research will cool, and this CVE will be remembered as a clever demo rather than an inflection. Either way, the servers behind a large share of the web are now patching against a bug that a machine found first.
The HTTP/2 Bomb is not just a new way to crash a server. It is the first marquee proof that finding the next one no longer requires a human to get curious.
Key Takeaways
- CVE-2026-49975 can exhaust tens of gigabytes of server memory in under sixty seconds from a single 100 Mbps connection
- OpenAI's Codex agent, guided by security firm Calif, discovered the flaw by chaining HPACK amplification with HTTP/2 flow-control stalling
- Nginx 1.29.8 and Apache mod_http2 v2.0.41 are patched, while Microsoft IIS and Cloudflare Pingora had no public fix at disclosure
- HTTP/2 has shipped since 2015, meaning this composition sat undiscovered for roughly eleven years until an AI agent connected the primitives
- The 2023 Rapid Reset attack (CVE-2023-44487) showed the protocol keeps yielding resource-exhaustion bugs, but this one was found by a machine reasoning over the spec
Questions Worth Asking
- If AI agents can grind every major protocol implementation for resource-exhaustion bugs, can the volunteer-maintained projects behind critical infrastructure keep pace with the disclosure rate?
- When the marginal cost of finding a novel DoS primitive approaches the price of inference tokens, does responsible disclosure survive in its current form?
- If your own stack depends on a self-hosted web server, how would you even learn that an AI agent found a new way to crash it before an attacker uses it?