The short answer is that the agent is not living inside a simple linear tool loop. It lives inside a larger event landscape: an append-only timeline, an uncached operational tail, a rolling sources pool, subsystem/widget delivery, distributed isolated execution, and explicit workspace activation. Once those requirements are real, pure tool calling stops being the whole shape of the problem.
1. The context is not just assistant actions
Native tool-calling transcripts are good at representing a very specific causal chain: the assistant decides, the tool runs, the result comes back, the assistant continues. But React sees more than that.
The timeline can contain user prompts, attachments, plan updates, workspace notices, cache warnings, steer events, feedback, source updates, and service alerts. Some of these are caused by the agent. Some are caused by runtime or external systems. Some are caused by the operator in the middle of the turn. All of them can matter.
That is why the main shared state is the rendered timeline rather than a provider-managed transcript of tool calls.
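The timeline idea above can be sketched in a few lines. This is a hypothetical model, not the real schema: the event kinds, actor names, and fields are illustrative assumptions drawn from the list in this section.

```python
from dataclasses import dataclass, field

# Illustrative sketch only: kind/actor vocabularies are assumptions,
# not the actual React timeline contract.
@dataclass
class TimelineEvent:
    kind: str    # e.g. "user_prompt", "tool_result", "plan_update", "steer"
    actor: str   # e.g. "agent", "runtime", "operator", "external"
    payload: dict

@dataclass
class Timeline:
    events: list = field(default_factory=list)

    def append(self, event: TimelineEvent) -> None:
        # Append-only: events are added at the tail and never rewritten.
        self.events.append(event)

tl = Timeline()
tl.append(TimelineEvent("user_prompt", "operator", {"text": "run the report"}))
tl.append(TimelineEvent("workspace_notice", "runtime", {"status": "cold"}))
```

The point of the shape is that the agent is only one of several actors appending to the same shared state.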
2. ANNOUNCE is not a normal message
React relies on an uncached operational tail called ANNOUNCE. This is where runtime places high-frequency state that should grab model attention immediately:
- authoritative temporal context
- open plan state
- workspace status
- compaction and pruning warnings
- fresh operational notices and mitigation hints
That surface exists because the system wants a place for information that is cheap to refresh, cheap to place at the tail, and deliberately outside the normal cache story. You can emulate pieces of that with ordinary messages, but you do not get the same explicit operational board semantics.
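A minimal sketch of what "cheap to refresh, placed at the tail" means in practice: rebuild the operational block from current state every round instead of caching it. The field names and layout here are assumptions for illustration, not the real ANNOUNCE format.

```python
from datetime import datetime, timezone

def render_announce(plan_state: str, workspace_status: str, warnings: list) -> str:
    """Rebuild the uncached operational tail from scratch each round.
    Field names are illustrative, not the actual ANNOUNCE contract."""
    lines = [
        f"time: {datetime.now(timezone.utc).isoformat()}",  # authoritative temporal context
        f"plan: {plan_state}",
        f"workspace: {workspace_status}",
    ]
    lines += [f"warning: {w}" for w in warnings]  # compaction/pruning notices
    return "ANNOUNCE\n" + "\n".join(lines)

tail = render_announce("step 2 of 4 open", "warm", ["context at 80% of budget"])
```

Because the block is regenerated and appended last, it never fights the cache for freshness.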
3. One generation has multiple consumers
React does not generate one flat response and then hope downstream consumers can peel it apart. It uses channeled generation. A single round can produce:
```
<channel:thinking>brief user-visible progress</channel:thinking>
<channel:ReactDecisionOutV2>validated decision JSON</channel:ReactDecisionOutV2>
<channel:code>raw executable code</channel:code>
```
These outputs are not interchangeable. The UI wants the progress text immediately. The runtime wants the structured decision for validation and tool dispatch. The execution subsystem wants raw code without having to recover it from a JSON argument string.
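Splitting one generation into per-channel consumers can be sketched with a simple parser. This assumes well-formed, non-nested channel tags; the real implementation would stream rather than parse a complete string.

```python
import re

# Matches <channel:NAME>body</channel:NAME>; the backreference \1 enforces
# that the closing tag names the same channel.
CHANNEL_RE = re.compile(r"<channel:(\w+)>(.*?)</channel:\1>", re.DOTALL)

def split_channels(raw: str) -> dict:
    """Route each channel body to its own consumer.
    Sketch only: assumes complete, well-formed output."""
    return {name: body for name, body in CHANNEL_RE.findall(raw)}

raw = (
    "<channel:thinking>checking sources</channel:thinking>"
    '<channel:ReactDecisionOutV2>{"action": "exec"}</channel:ReactDecisionOutV2>'
    "<channel:code>print('hi')</channel:code>"
)
parts = split_channels(raw)
# parts["code"] reaches the exec subsystem verbatim, with no JSON unescaping step.
```

Notice that the code body never passes through a JSON argument string, which is exactly what the channel separation buys.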
| Layer | Main consumer | Why it is separate |
|---|---|---|
| thinking | Operator-facing stream | Immediate short progress without exposing tool or code internals. |
| ReactDecisionOutV2 | Runtime validator | Strict structured decision contract for tool execution. |
| code | Exec subsystem / code widget | Raw code must be streamed, validated, and packaged independently. |
Provider-native tool calling is good at one of these layers. React needs all three at once.
4. Widgets and subsystem delivery matter during the round
This is one of the practical reasons the argument should not stay abstract. When React starts an execution workflow, the platform can surface a widget-like panel on the client before the turn is finished. The client may receive:
- thinking progress
- structured subsystem payloads
- code stream chunks
- later status updates like preparing, executing, completed, or error
That lifecycle is implemented through the communicator and subsystem/canvas delivery patterns. The tool call alone is only one moment in a larger streamed interaction contract.
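The delivery order described above can be sketched as a sequence of mid-turn messages. The `send` callback stands in for the communicator, whose actual interface is not specified here; the message shapes are assumptions.

```python
def deliver(send, code_chunks):
    """Sketch of the mid-turn delivery contract. `send` is a stand-in
    for the communicator; payload shapes are illustrative."""
    send({"status": "preparing"})            # widget panel can appear now
    for chunk in code_chunks:
        send({"code_chunk": chunk})          # streamed before the turn ends
    send({"status": "executing"})
    send({"status": "completed"})            # or {"status": "error"} on failure

received = []
deliver(received.append, ["print(", "'hi')"])
```

The key property is that the client sees several messages per round, so a single tool-call/tool-result pair cannot represent the whole interaction.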
5. Distributed execution changes the workspace story
React is not designed as a machine-bound personal agent. A turn can start on a node that has no warm workspace for that user and conversation. That pushes the system toward explicit logical namespaces and runtime-owned workspace hydration.
That is why the system uses explicit logical paths like fi:, ar:, so:, and tc:, and why historical workspace access is mediated through react.pull(...) rather than by pretending every turn begins with a full project tree already mounted.
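A minimal sketch of what a logical namespace buys: paths are parsed and resolved on demand rather than assumed to exist on local disk. The namespace semantics here are assumptions; only the four prefixes come from the text.

```python
VALID_NAMESPACES = {"fi", "ar", "so", "tc"}  # prefixes from the text; meanings assumed

def parse_logical_path(path: str) -> tuple:
    """Split a logical path like 'fi:reports/q3.csv' into (namespace, key).
    Sketch only: real resolution would go through runtime-owned hydration
    (e.g. react.pull) rather than the local filesystem."""
    prefix, sep, rest = path.partition(":")
    if not sep or prefix not in VALID_NAMESPACES:
        raise ValueError(f"unknown namespace in path: {path!r}")
    return prefix, rest

ns, key = parse_logical_path("fi:reports/q3.csv")
```

Because the path is logical, a cold node can resolve it by hydrating state on demand instead of requiring a warm machine-bound workspace.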
6. The real choice is SDK-owned orchestration vs provider-owned orchestration
It is tempting to frame the question as “tool-calling models versus non-tool-calling models,” but that is not actually the interesting split here. The real split is:
provider-native orchestration vs SDK-owned orchestration
React chooses the second one. The reason is portability and control. The same timeline model, ANNOUNCE semantics, workspace rules, and channeled streaming contract should survive model-provider changes instead of being rebuilt around whichever tool-calling surface one backend happens to expose.
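One way to see the portability argument is to shrink the provider-specific surface to a single method while the SDK owns everything else. This is a sketch under that assumption; the interface names are invented for illustration.

```python
from typing import Protocol

class Provider(Protocol):
    """The only backend-specific surface in this sketch: prompt in, text out.
    Timeline rendering, ANNOUNCE, and channel parsing live in the SDK."""
    def generate(self, prompt: str) -> str: ...

def run_round(provider: Provider, rendered_timeline: str, announce_tail: str) -> str:
    # The SDK owns prompt assembly, so the contract survives a provider swap.
    return provider.generate(rendered_timeline + "\n" + announce_tail)

class FakeProvider:
    def generate(self, prompt: str) -> str:
        return "<channel:thinking>ok</channel:thinking>"

out = run_round(FakeProvider(), "timeline...", "ANNOUNCE ...")
```

Swapping backends then means implementing one adapter, not rebuilding the orchestration loop around a different tool-calling surface.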
7. This choice has real costs
This is not free elegance. We accept several forms of engineering cost:
- we own protocol parsing and validation
- we own channel parsing and delivery
- we own more runtime complexity around execution and workspace state
- we own more documentation burden
- we own more prompt discipline
The trade is worthwhile only because the runtime requirements are already larger than a simple message-level tool loop.
Conclusion
Pure tool calling is often a good fit when the world is mostly linear and the important state transitions are dominated by assistant-invoked functions. React v2 is working in a different shape of system: timeline-first, announce-aware, multi-channel, distributed, and workspace-governed.
Once those constraints are real, the question is no longer “why not just use tool calling?” The question becomes: what is the narrowest protocol that still preserves the semantics of the system we actually built? For React, the answer is a custom one.