AG-UI (Agent–User Interaction Protocol) Notes

Why AG-UI?

The first time someone comes across AG-UI, they usually ask:

We already have SSE / WebSocket. Why do we need AG-UI?

Here’s the distinction:

SSE/WebSocket solve how to transmit (Transport)
AG-UI solves what to transmit (Interaction Protocol)

Analogy:

Layer	Role
TCP	Transmits byte streams
HTTP	Defines request/response semantics
SSE	Defines server-push streaming
AG-UI	Defines Agent–UI interaction event semantics

Limitations of SSE

SSE only sends events:

event: message
data: hello

But it doesn’t know:

Is this plain text?
Is this a Tool Call?
Is this an approval request?
Is this a state sync?

To SSE, it’s all just strings.

That’s why different Agent Frameworks end up defining their own event formats.

For example:

LangGraph:

{
  "type": "tool_start"
}

Another framework:

{
  "event": "tool.begin"
}

The frontend has to adapt to each one.

The Core Value of AG-UI

AG-UI defines a unified event model:

{
  "type": "TOOL_CALL_START"
}

{
  "type": "TEXT_MESSAGE_CONTENT"
}

{
  "type": "STATE_DELTA"
}

No matter the backend:

LangGraph
OpenAI Agents SDK
CrewAI
PydanticAI
Custom Agent

The frontend sees the same events.

How AG-UI Relates to MCP and A2A

MCP

Solves:

Agent ↔ Tool

For example:

GitHub
Jira
Slack
MySQL

A2A

Solves:

Agent ↔ Agent

For example:

Research Agent
    ↓
Coding Agent
    ↓
Review Agent

AG-UI

Solves:

Agent ↔ UI

For example:

React
Vue
Mobile App
Desktop App

Who Are the Two Parties in AG-UI Communication?

Many sources describe it as:

User ↔ Agent

But a more accurate framing is:

Agent Runtime ↔ Agent Client

Or:

Agent Backend ↔ Agent Frontend

Common scenarios:

Web

React/Vue
      ↕
    AG-UI
      ↕
 Agent Runtime

Mobile

iOS/Android
      ↕
    AG-UI
      ↕
 Agent Runtime

Desktop

Electron/Tauri
       ↕
     AG-UI
       ↕
  Agent Runtime

Putting AG-UI Into Practice

Frontend

Without AG-UI

React listens to SSE directly:

const es = new EventSource("/agent");

Then parses whatever custom events the backend sends.

Downside:

Every Agent Framework needs its own adapter.

With AG-UI

The frontend only deals with standard events:

TEXT_MESSAGE_START
TEXT_MESSAGE_CONTENT
TEXT_MESSAGE_END

TOOL_CALL_START
TOOL_CALL_END

STATE_DELTA

For example:

client.subscribe(event => {
  switch(event.type) {
    case "TOOL_CALL_START":
      showTool(event.toolName)
      break
  }
})

Switching the Agent Runtime doesn’t require frontend changes.

Backend

LangGraph

Native events:

on_tool_start
on_tool_end
on_llm_new_token

Transformation:

LangGraph Event
      ↓
 AG-UI Event

For example:

on_tool_start
      ↓
TOOL_CALL_START

LangChain

Using a Callback Handler:

class AgUiHandler(BaseCallbackHandler):
    ...

Mapping:

on_tool_start
      ↓
TOOL_CALL_START

on_llm_new_token
      ↓
TEXT_MESSAGE_CONTENT

OpenCode

OpenCode has its own event system:

task_start
tool_start
tool_end
message_chunk

Integration:

OpenCode Event
      ↓
 AG-UI Adapter
      ↓
React/Vue/UI

What Actually Makes AG-UI Valuable

A lot of people think it’s just about token streaming.

Turns out streaming is the easy part.

What’s actually valuable:

Human In The Loop

Agent:

About to delete the production database

Sends:

{
  "type": "APPROVAL_REQUEST"  // actual AG-UI uses the interrupt mechanism; this is a simplified illustration
}

Frontend automatically shows:

Approve
Reject

After the user chooses, execution continues.

Shared State

Agent:

{
  "type": "STATE_DELTA"
}

Updates the shared state.

React/Vue sync the UI automatically.

Generative UI

Agent outputs:

{
  "type": "UI_COMPONENT", // actual AG-UI implements this through tool calls + state; this is a conceptual illustration
  "component": "table"
}

Frontend renders automatically:

Table
Chart
Form
Card

Architect’s Perspective

If the system is just:

React
   ↓
Agent

Then SSE is enough.

But if the goal is:

Support LangGraph
Support ADK
Support OpenAI Agents
Support OpenCode
Support custom Agents

And also:

Web
Mobile
Desktop

That’s where AG-UI’s value shows up.

Unified as:

Agent Adapter
      ↓
    AG-UI
      ↓
 Unified UI

TL;DR

MCP = The USB port for Agents

A2A = The RPC protocol for Agents

AG-UI = The frontend protocol for Agents

Or:

SSE = The delivery company

AG-UI = The standardized packaging inside the box

You can get by without AG-UI.

AG-UI’s goal is to standardize the connection between different Agent Frameworks and different frontend frameworks.

AG-UI (Agent–User Interaction Protocol) Notes

AG-UI (Agent–User Interaction Protocol) Notes

Why AG-UI?

Limitations of SSE

The Core Value of AG-UI

How AG-UI Relates to MCP and A2A

MCP

A2A

AG-UI

Who Are the Two Parties in AG-UI Communication?

Web

Mobile

Desktop

Putting AG-UI Into Practice

Frontend

Without AG-UI

With AG-UI

Backend

LangGraph

LangChain

OpenCode

What Actually Makes AG-UI Valuable

Human In The Loop

Shared State

Generative UI

Architect’s Perspective

TL;DR

FEATURED TAGS

CATEGORIES