The Agentic Web — Syama Mishra

The web we have was built for browsing. Pages sized for screens. Type at reading distance. Hierarchy for scanning. Search engines counting clicks. Analytics counting linger. Brands competing for what you will remember. The whole stack runs on one assumption: a person is on the other end, working through it in real time, and the value of the visit is in the visit itself.

That assumption is so deep most of us have stopped noticing it. The web is not a neutral medium. It is a presentation layer, optimised over thirty years for one mode of reading — slow, exploratory, attention-rich. Pages were the medium because, for thirty years, perusal was the only way a human could resolve their intent into an outcome. Looking, comparing, deciding, clicking. There was no other way to get from I want something to I have it.

The old web was paid for in attention. The next one is paid for in results.

The web is not just one layer of digital infrastructure among several. It is where modern life happens — where people work, shop, learn, find services, find each other, find themselves. Right now, two parallel motions are building the agentic substrate underneath all of that. Computer Use and browser automation reach outward from the agent into the legacy web that wasn't built for them — bridging, adapting, using the human surface to get the work done. Native connectors reach inward, into applications' own scripting and capability layers — Claude inside Blender drives the Python API directly, the way a Blender developer would; Adobe ships fifty pro-grade tools as a single connector. Both motions are real, both are necessary, both are the substrate growing.

The move has already happened, faster than any platform shift before it. ChatGPT was the fastest consumer product on record to a hundred million users. In under three years the chat became a surface carrying around 2.5 billion prompts a day. And the most revealing number is what those prompts are not: when OpenAI and Harvard studied the traffic, only about a quarter of it was search-style information-seeking. Google still handles something like eight to twenty times ChatGPT's search volume — the chat did not win search. It absorbed everything that was never search to begin with. The other three-quarters is writing, reasoning, planning, building, delegating — the intent the old web had no single surface for. The chat became where life happens not by replacing the search box but by catching everything the search box never held.

The economics pull one direction. Tokens are the currency of the modern AI economy. Every screenshot, every DOM parse, every reasoning step needed to interpret a human-facing surface is paid for in tokens. Native interfaces — purpose-built API calls, structured capability advertising, efficient introductions — cost a fraction of what bridging costs. Multiplied by the scale of agentic interaction this is arriving at, the difference is the difference between affordable and not.

And then the network effect changes the curve entirely. Each productive node added to the agentic web doesn't just become individually addressable — it becomes collaborative with every other connected node, through agents as the orchestration layer. Blender plus Adobe is a workflow as soon as both have agentic nodes; the producer in Ableton can route their work through Splice's sample library and Adobe's renderer and Blender's modelling without anyone having built bilateral integrations. Adobe's apps talking to each other stops being a competitive moat the moment everything talks to everything else through a shared substrate. The capability surface multiplies combinatorially. The web of agents grows non-linearly because every node is simultaneously a productive surface and a collaborator.

None of which means attention stops mattering. Attention does not disappear when the medium changes — it relocates, and the advertising funnel follows it wherever it pools, the way it tracked eyeballs from print to broadcast to search to the feed. If the chat is where intent now gathers, the funnel reorients to point at the chat. That is already beginning. So the next web is not simply paid for in results instead of attention; it is the site of a contest between the two. The results economy — agents paying for capabilities, value flowing to where the work actually happens — is the new structural possibility. The attention economy is the incumbent, and incumbents have gravity. Which model becomes load-bearing is not settled by which is more elegant. It is settled by what gets built, and who builds it first.

The new surface

The chat is where the human meets the web now. Not at a homepage. Not on a landing page. In conversation, with an agent. The browsing happens there. The comparing happens there. The weighing, the deciding — all of it, in dialogue. By the time anything reaches the website, the choosing is over. What arrives at the site is the action.

The website is no longer where intent gets resolved. It is where the resolved intent gets executed. The chat became the perusal layer. The site became the action layer. Two different jobs, two different surfaces, two different readers.

This is happening across the application layer too. In late April 2026, Anthropic and a coalition of major creative software companies — Blender, Autodesk, Adobe, Ableton, Splice, Affinity by Canva, Resolume, SketchUp — launched Claude for Creative Work: the entire creative production stack, across 3D, 2D, video, audio, live performance and architecture, simultaneously made agent-readable from inside the chat. Adobe alone shipped fifty pro-grade tools as a single connector. Anthropic joined the Blender Development Fund as a patron; the Blender connector is built on MCP, accessible to other LLMs as well. The chat is where the creative work begins. The visual workspace is no longer where it has to.

Warp went further the same day. The terminal company open-sourced their entire client and rebuilt contribution itself around agents — contributors don't write code directly any more, they go through agents that orchestrate the implementation. OpenAI sponsors the repo. The chat is now where developers contribute to the software they help build. Another surface where the work begins.

And from the agent's side, the website looks different. Hero images do not register. Typography does not land. Microcopy does not charm. Testimonials do not get skimmed. Trust badges do not get noticed. The entire visual layer — the part most websites spend the most money on — is not in the picture. The agent reads the page the way a forensic accountant reads a love letter: looking for structure, intent, capability. Everything else is overhead.

Expensive and slow

Agents are doing something strange. Reading a medium that was not built for them. Browsing, clicking, screenshotting, parsing the DOM. Recovering meaning from interfaces that were never meant to give it up.

It works. The result comes back. But almost everything between the request and the result is work the agent has to do because the page is shaped for someone else. Every screenshot, a translation cost. Every parse, a token cost. Every interface change, a brittleness cost. Every cookie banner, every modal, every consent flow, a friction cost. None of it is the work itself. All of it is the work of getting to the work.

An agent doing what it needs to do should not have to pretend to be a person to do it.

What's emerging in the field — browser automation, computer use, screenshot reasoning — is one way to bridge the gap, and most of the work to date has gone into it. It works. It is also the agent web grafted onto the human web, which is not the same thing as a web built for agents.

The good news: the digital substrate is generous about this. Unlike a city or a road network, a new layer of the web can be added in parallel to the old one — cheaply, deliberately, while the old one keeps running. The agentic web does not have to wait for the visual web to end. It can grow alongside it.

The moment the agent becomes the primary reader, every assumption underneath the old web starts to wobble. Layout was for human eyes. Branding was for human memory. Search was for human queries. Commerce was for human attention. Identity was for human verification. None of it was designed for a medium where the visit is invisible and the result is everything.

That is when I started seeing a different web.

Late 2024. Anthropic ships MCP. The conversation around it is about plumbing for AI-assisted development — coding assistants reaching databases, agents calling APIs, integration glue for software workflows. That is the use case Anthropic launches it for. That is the use case the discourse extends. Power-user feature. Narrow but useful. The kind of thing you note and mean to come back to.

I read the same protocol release. What I saw was a different shape entirely.

The same substrate that lets a coding assistant reach a database lets any agent reach any capability. It doesn't care whether the capability is a Postgres query or a Shopify catalogue. It doesn't care whether the agent is a coding assistant or a personal assistant booking a flight. The same plumbing carries both. Which means the agent web is not a developer tool waiting to scale — it is the next medium through which people find, transact, and decide. And the visual web, the one that took thirty years to build, is the wrong medium for it.

That read was not in the discourse around MCP, because the discourse was somewhere else. It came from looking at the protocol itself, in the months when the conversation had not yet caught up.

Once you see it, you cannot unsee it. The visual web is a poor fit for agents, and the next medium would need shaping by a lot of forward-looking people, in a lot of places. I wanted to be one of them.

I did not have a platform for that. No standards-body seat, no public work in the space, no network in the rooms where this kind of architecture gets discussed. So I started building — to take the read and make it concrete enough to test. The code was where the thinking happened: design decisions that look clean in the abstract reveal themselves the moment you have to ship them. And once it existed, it was something to point at.

Two integration directions follow from this read, and they are not opposed. Agents have to be able to use the web that exists — that work is necessary, because legacy will outlive any of us. The web can also grow a new layer better suited to agents — that work is cheap, because the digital substrate allows parallel construction. The two compound.

Both directions converge. Whether a site exposes structured tools natively, or a gateway service adapts a legacy site on its behalf, the agent-facing surface speaks the same protocol. The native architecture of the agentic web is the shared destination of every integration strategy. Which is why I built the protocol first.

I called it MCPI — Model Context Protocol Integration. The Integration was the load-bearing word. The design held two paths together: a specification for a web where agents are first-class readers, and a gateway service that could give a business or person a unified agent-readable interface without rebuilding anything. A small business could subscribe, connect their existing tools — Shopify, Facebook, Google Workspace, calendar, payments — and present a single coherent identity to the agent web: their inventory, their availability, their bookings, their inbox, their brand. The vision was that anyone could be in the agent web tomorrow, layered over the SaaS they already use, without waiting for any of those vendors to ship native support. DNS-based discovery so the agent could find either path under the same name. A Hello Protocol for context-aware introduction. A plugin architecture for capability advertising. A trust layer for referrals. Brand metadata so a site could ship its identity — colours, type, tone, voice — as structured data for the agent to render. A Chrome extension to test it all against a real site.

The Hello Protocol

The smallest piece of it carries the most weight.

An agent encounters a website. Today, it does what a human would do — it loads the page. Then it does something a human never has to do: it reverse-engineers the page back into structured meaning. Every visit is a parse, a guess, a reconstruction.

The Hello Protocol replaces all of that with a question and an answer.

The agent asks: what are you, and what can you do?

The site answers: I am a store, here are my categories, here are my capabilities, here is who I trust, here is how to transact, here is who I refer to.

No HTML. No DOM. No screenshot. No parsing. The site tells the agent directly, in a structure built for the reader on the other end.

And it adapts. A shopping agent, a research agent, and a support agent all asking the same site the same question get three different answers — not because the site has multiple identities, but because what the site needs to say about itself depends on what the agent is trying to do. The introduction is contextual, generated in conversation, shipped in the form most useful to the recipient. The site stops handing back a single rendered page and starts contributing to whatever experience the agent is building on the user's side.

A site that introduces itself in three hundred tokens is not a smaller version of a website. It is a different kind of object entirely.

This is not an optimisation. It is a category change. Once a site can introduce itself this way, the visual layer becomes optional. Useful when a human shows up. Skippable when one doesn't.

DNS as the root

When I started building MCPI, I knew immediately that DNS records were the right root for agent discovery — same way email and websites are addressed. I prototyped with what was accessible: a TXT record, _mcp.example.com TXT "v=mcp1 endpoints=…". It works. But it has always been a workaround for what the architecture really needs — a first-class, MX-style record that lets a domain say, for agent-directed traffic, contact this host. The dynamic intelligence — capability negotiation, trust, load — sits at the resolver behind the record. DNS is just the bootstrap. MX exists not because email was impossible without it, but because the substrate needed a clean, standardised routing primitive. The agentic web has the same need and no equivalent yet. Email's substrate solved this in the 1980s. The agentic web's hasn't.

The full architecture is three layers that compose. The DNS record is the address — small, typed, cached, geographically distributed through the substrate DNS was engineered for. The catalog at the address is the transport — the manifest describing what the domain exposes, in whatever the field's agreed schema turns out to be. And the responder behind the catalog is the mature form: not a static file describing capability, but a live representative agent doing intent matching, capability negotiation, and multi-vendor coordination in real time. The same URL can serve either — a JSON file at entry level, a running Concierge Agent at maturity. The client agent doesn't have to know which is on the other end. A static catalog is a slightly cleaner DOM to scrape. A live representative is where the domain actually acts on its own behalf on the agentic web.

And once the native architecture exists, the integration strategies arrange themselves around it. Sites can adopt the native protocol directly. Sites that won't can be wrapped by gateway services that speak the protocol on the agent-facing side, adapting to whatever those sites already are on the legacy side. Browser automation continues to handle whatever falls through. The strategies look different from the outside, but they all converge on the same agent-facing destination — the native architecture is what the agent meets, regardless of what is behind it. That is why MCPI was a specification first, and a gateway service second. The protocol had to exist before the gateway had something to translate into.

What followed

Nov 2024 Anthropic open-sources MCP.
Mar 2025 MCPI — specification, gateway service, Chrome extension.
Apr 2025 Google announces A2A with 50+ enterprise partners.
May 2025 CopilotKit launches AG-UI.
May 2025 Coinbase launches x402.
Jun 2025 A2A moves to the Linux Foundation.
Jul 2025 Cloudflare ships Pay per Crawl, an x402-based mechanism for paying AI crawlers.
Sep 2025 Google announces AP2 with 60+ partners including Mastercard, PayPal, Coinbase, and Adobe.
Nov 2025 Anthropic and OpenAI co-author the MCP Apps Extension.
Dec 2025 Google launches A2UI.
Feb 2026 Google launches WebMCP in Chrome (W3C incubation with Microsoft).
Apr 2026 A2UI v0.9. Stripe Sessions 2026. Cloudflare-Stripe agent-driven service provisioning.
Jun 2026 Google announces ARD — domain-rooted capability publication with federated registries, under the Linux Foundation.

The architecture I read in MCP in early 2025 is what the industry has built since. Not because I shaped it — I didn't — but because the read was correct. The shape was implicit in the protocol; I happened to see it early, when most of the conversation was about something else.

The field has begun to describe the agentic web's substrate as a six-layer protocol stack — three core, three at contested edges. MCP for tool and data access. A2A for inter-agent coordination and delegation. AG-UI for the wiring between agent runtimes and user-facing applications. Then A2UI for structured rendering, AP2 for agent-led commerce authorisation, and x402 for machine-to-machine payment of digital resources. Three answers to three architectural questions: what can the agent use, who else can it work with, how does the human stay in the loop — and three answers to the harder questions of how the agentic web restructures rendering, commerce, and economics underneath.

MCPI was sketching this whole stack as a unified architecture in early 2025, before most of the layers had names. It extended MCP — the foundation it was built on. It sketched A2A's territory through the Agent Network Protocol vision, with autonomous mesh networks, task delegation, federated topologies, and trust accumulation. It treated A2UI's territory through brand-as-structured-data, identity shipped as data and rendered by the receiving agent. It sketched AP2's territory through the Concierge Agent's intent handling, capability matching, multi-vendor coordination, and trust-network compensation — the protocol AP2 has now shipped with sixty-plus partners and a Mandate architecture for verifiable user intent across vendor boundaries. And it flagged x402's territory through reverse fee economics and structured monetisation for capability access, the question now formalised as machine-to-machine payment at the HTTP layer. The discovery-and-trust layer arrived last: in June 2026 Google announced ARD, domain-rooted capability catalogs with federated registries, under the Linux Foundation — the domain-as-trust-root direction MCPI has argued for since day one, shipped as the catalog transport standard the field needed. MCPI's DNS record is the address, ARD is the transport, the responder behind it can be static or agentic. Three moves that compose. The industry has spent the last fifteen months articulating these as separate protocols. MCPI was already articulating them together.

In late April 2026, Stripe used their annual Sessions conference to position the company as economic infrastructure for AI, with agentic commerce at the centre. The Universal Commerce Protocol with Google lets customers buy products inside Gemini and AI Mode. The Machine Payments Protocol, co-authored with Tempo, gives agents programmatic transaction primitives — microtransactions, recurring payments, streaming payments. The Agentic Commerce Suite lets any business become agent-ready by uploading their catalogue. Treasury accounts can now be opened in the agent's name, with cards issued for agents to make purchases autonomously. As part of the same launch, Cloudflare and Stripe co-designed a protocol for agents acting on behalf of users to provision accounts, register domains, start paid subscriptions, and deploy production code — across vendors, with budgets the agent can spend without ever seeing the credit card. The Concierge Agent vision was MCPI's sketch of this whole layer: every domain having an intelligent representative that handles intent, capability matching, multi-vendor coordination, and trust-network compensation. The agent web doing commerce on the user's behalf, across vendors, through a vendor-neutral substrate. It is shipping in working form.

The architectural direction MCPI was pointing at is the direction the standards bodies have since built out, layer by layer. That is the through-line worth flagging.

Around the same time, Sam Altman, the head of OpenAI, publicly named the same architectural gap: "feels like a good time to seriously rethink how operating systems and user interfaces are designed (also the internet; there should be a protocol that is equally usable by people and agents)." A protocol equally usable by people and agents. That is the native architecture this piece has been describing — and that MCPI was designed to embody fifteen months earlier.

I was taking the read seriously enough to be in the MCP working group from its early days, bringing the consumer-web read into a conversation that was, at that point, focused on developer use cases. Once the standards bodies started extending the substrate in the wider direction, shipping the gateway service alone would have forked the ecosystem rather than helped it consolidate. The gateway stayed at the design stage. The work shifted: from building MCPI, to bringing its reads into working-group conversations, to writing about what the protocol layer is now opening up but not answering. The form changes. The seriousness doesn't.

Because the interesting question is not which protocol wins. It is what gets built on top.

There are two architectural directions for AI and the web. They look similar from the outside. They are not the same.

Navigating the existing web

Agents that interact with human UIs

Automation of the current interface layer

Coordination across existing services

Solving today's problems today

Building the next web

Structured capability APIs

The visual layer becomes optional

The web restructures around intent

Solving tomorrow's architecture today

The first is necessary. It bridges where we are to where we are going. Browser automation, computer use, screenshot reasoning — these will exist for as long as legacy websites exist, which is to say, forever.

The second is the one I find more interesting. It asks what the web should look like once we stop pretending the agent is a person.

The shape of it

Inside the next web, you express an intent. Your agent figures out where to go. Sites respond with capabilities, not pages. The result comes back rendered by your agent, in your preferred form — to your screen, your voice, your wearable, your wall.

The visual layer does not disappear. It moves. From the site to the user. From the brand to the individual. From the page to the rendering surface. The site no longer controls the experience. It controls the identity. Colours, type, tone, voice, values, edges — shipped as structured data, interpreted by the user's agent, on the user's terms.

So brand strategy stops being "design an experience" and becomes "define an identity that survives any rendering context." A brand has to be itself when it is read aloud by a smart speaker, summarised by a research agent, or quoted in a multi-vendor bundle assembled by someone else's AI. None of those surfaces are under the brand's control. All of them are now where the brand is met.

If your brand cannot survive being rendered by a stranger's AI, it was never really a brand. It was a layout.

Ecommerce changes shape under the same pressure. The storefront is no longer a place. It is a capability surface — inventory, pricing, fulfilment, returns, all addressable. The "shop" is wherever the user happens to be standing. The competition is no longer who builds the prettiest page. It is who is most useful to the agent acting on the user's intent.

Search changes too. It stops being "ten blue links" and becomes resolution — the right capability, found, vetted, called. Whoever shapes how agents resolve intent into action has the same kind of leverage Google had over the human web. Possibly more.

And identity. A personal site stops being a homepage and becomes a machine-readable introduction. What you do, what you offer, who you trust, how to reach you — shipped once, rendered everywhere. The page you used to build for visitors becomes the manifest your agent presents on your behalf.

Each of these is the same story told at a different layer. The site stops being where the experience happens. It becomes the source of the data the experience is built from. The rendering moves to the user's side. The control moves with it.

Cloudflare ships a public diagnostic for this now — a five-category scanner that checks any site for the discoverability signals, content accessibility, bot-access controls, native agent protocols (MCP, A2A, WebMCP, Agent Skills), and commerce primitives (x402, MPP, UCP, ACP) the agentic web expects. The architecture has consolidated enough to be inspectable. The tool is also a useful place for anyone reading this to start: run it on your own site, and what comes back is an honest read of how far along the arc you already are.

The water cycle problem

There is a deeper question hiding under all of this, and it is the one most likely to break the agentic web before it gets built.

The old web ran on a kind of water cycle. People made things. Other people read those things, were inspired, made more things. Forums, blogs, wikis, journalism, open source — all grew out of the same loop: contribution feeding consumption feeding contribution.

An agentic reader replaces the human one and consumes at industrial scale. It does not, on its own, return anything to the systems that produced what it read. If the only economic relationship between agents and the web is extraction, the web stops producing what makes the agents useful. The reservoirs run dry.

An agentic web that takes without giving back is not a sustainable architecture. It is an extraction event with a long tail.

Cloudflare's Pay per Crawl, introduced in mid-2025, is one mechanism for this. It lets sites charge AI crawlers using HTTP 402 — a status code that has been sitting unused in the spec for thirty years, waiting for exactly this moment. There will be others: referral economics, subscription bundles, agent-side budgets, micropayments, content licensing, structured data pricing tiers. What I notice underneath all of them is the same dynamic: an agentic web that consumes at industrial scale without putting anything back is a system whose substrate stops renewing. The shape of the reciprocity matters less than whether reciprocity exists.

This is not a policy concern bolted onto the architecture. It is part of the architecture. A web designed for agentic consumption without economic reciprocity is a web designed to dry up.

Why I am writing this now

The protocol infrastructure is being built — by Google, Anthropic, OpenAI, Microsoft, the W3C, the Linux Foundation, Cloudflare, and others. A2A, MCP Apps, WebMCP, A2UI, Pay per Crawl, the broader MCP ecosystem. Real teams, real momentum.

The architectural plumbing is the part of the transition that gets the most attention because it is the part that consolidates first. It is also the part where the answers are easiest to converge on. Below it sits the wider set of questions this piece has been circling.

What does a website become when no human ever visits it. What does a brand mean when it cannot control a single pixel. What does ecommerce look like when the storefront is a capability and the agent decides which one to hand the user's money to. What does identity look like when an agent is acting on your behalf, and the site you are transacting with never sees you.

What does taste look like when curation, layout, and tone all happen on the user's side, rendered by the user's own AI according to the user's own preferences. What does discovery look like when the question stops being "what should I show this user" and becomes "what would this user's agent reach for, and why."

How do creators get paid in a substrate that consumes them at industrial scale. How do agents acting on someone's behalf maintain consent and accountability across long-running tasks. How does the legacy web get brought along, or does it not.

These are not single-discipline questions. Each of them sits across protocol, product, design, brand, economics, and the lived human experience of the resulting system. They are the next set of rooms.

I think these questions deserve more thought. That is part of why I am writing this — to put my own attempts into the conversation, and to find others doing the same.

I want to be in more rooms with folks building this — find me, get in touch, or say hello.

The
Agentic
Web

A Web Built for Browsing

The new surface

Expensive and slow

What I Saw, What I Built

The Hello Protocol

DNS as the root

What followed

The Web That Comes Next

Navigating the existing web

Building the next web

The shape of it

The water cycle problem

Why I am writing this now

TheAgenticWeb

A Web Built for Browsing

The new surface

Expensive and slow

What I Saw, What I Built

The Hello Protocol

DNS as the root

What followed

The Web That Comes Next

Navigating the existing web

Building the next web

The shape of it

The water cycle problem

Why I am writing this now

The
Agentic
Web