Google ADK vs LangGraph for Production Agents

Compare Google ADK and LangGraph for production agents: state, human approval, deployment, observability, pricing, and the decision rule.

Thursday, June 11, 2026

Omid Saffari

Google ADK vs LangGraph for Production Agents

Use LangGraph when the agent is a product workflow that must pause, resume, fork, and get human approval before side effects. Use Google ADK when the agent is a Google Cloud workload and the fastest safe path is Agent Platform Runtime, Sessions, Memory Bank, evaluation, and managed observability.

Verdict: LangGraph for Control, ADK for Google Agent Platform

LangGraph is the safer default for a production agent that behaves like a workflow engine: it has state, branches, retries, manual review, and side effects that must be replayed or resumed without losing the run. Google ADK is the better choice when your company has already chosen Google Cloud as the agent platform and wants the runtime, session store, memory store, evaluation service, tracing, and identity layer to sit inside that boundary.

That is the real split. ADK is not just a Python helper around Gemini. Google positions ADK as an open-source agent development framework for building, debugging, and deploying reliable agents at enterprise scale, available in Python, TypeScript, Go, Java, and Kotlin. It gives teams a code-first way into Google Agent Platform services.

LangGraph is not just another agent framework. LangChain describes LangGraph as a low-level orchestration framework and runtime for long-running, stateful agents, focused on durable execution, streaming, human-in-the-loop, and persistence. Its value is that the graph, state transitions, and interruption points are explicit in your application.

The wrong choice is picking based on which demo looks shorter. A production agent fails on run ownership: who stores state, who resumes after a crash, who blocks irreversible tools, who traces the run, and who pays when session and memory traffic grows. If those answers need to live in your product code and span providers, use LangGraph. If those answers should live in Google Agent Platform because the agent is a Google Cloud workload, use ADK.

LangGraph's production orchestration model is already a strong fit for teams that need explicit runtime control. The ADK comparison becomes sharper because Google is offering a platform path, not just a local SDK.

Google ADK documentation homepage — Google ADK positions itself as an open-source framework for production agents across Python, TypeScript, Go, Java, and Kotlin.

The Axis Is Who Owns the Run

The framework decision is a run-ownership decision. Every production agent needs a durable answer for the same object: a user request enters, tools are called, state changes, approval may be needed, memory may be read or written, and the run either finishes, pauses, retries, or fails.

In LangGraph, that object is modeled as graph state. Persistence saves graph state as checkpoints at every step of execution and organizes the history into threads. Those checkpoints enable human-in-the-loop workflows, conversational memory, time travel debugging, and fault-tolerant execution. If a run pauses for approval, thread_id is the pointer that tells the checkpointer which state to load when the run resumes.

In ADK, the run model is framed around agent platform services. ADK's session docs separate Session, State, and Memory: Session is the single ongoing interaction, State is data inside the current conversation, and Memory is searchable cross-session information. Google Agent Platform then adds managed Agent Runtime, Sessions, Memory Bank, Example Store, Evaluation Service, tracing, built-in observability, Code Execution, and Computer Use.

That changes the engineering trade. LangGraph says: put the workflow truth in code, then choose the hosting and observability stack. ADK says: write the agent in a Google-supported framework, then push it toward Google-managed agent infrastructure.

For a technical founder shipping a workflow agent that can refund an invoice, create a ticket, or write back to a customer database, LangGraph usually gives the clearer safety boundary. The graph node that proposes the action can stop before the tool call, persist the state, expose the approval payload, and resume only when the reviewer responds. That approval step is part of the product.

For a platform team standardizing agent workloads on Google Cloud, ADK can reduce runtime assembly. The same team can use Google Cloud identity, deployment paths, managed runtime, memory services, and tracing rather than assembling every layer independently.

LangGraph documentation overview — LangGraph is built around low-level orchestration for long-running, stateful agents.

Production Comparison

The practical choice is not which framework can call a tool. Both can. The choice is which one gives your team the least ambiguous production control surface.

Decision axis	Choose Google ADK when...	Choose LangGraph when...	Production risk to inspect	Cost surface
Runtime ownership	Agent Runtime is the intended deployment target and Google Cloud is the platform boundary.	The agent must run across clouds, providers, queues, or an existing app runtime.	Runtime portability, incident ownership, and how quickly you can replay a failed run.	Agent Runtime has a free tier of 50 vCPU hours and 100 GiB hours per month per project, then $0.0864 per vCPU hour and $0.009 per GiB-hour.
State model	Session, State, Memory, Sessions, and Memory Bank should be managed platform services.	State transitions, checkpoints, and resume behavior need to be explicit in code.	Hidden state writes, missing state snapshots, and unclear ownership of memory retention.	Memory Bank storage is $0.25 per 1,000 memories stored per month, plus LLM generation costs; retrieval is $0.50 per 1,000 memories returned after the first 1000 memories returned per month.
Human approval	Approval can be built around platform services or a custom ADK orchestration pattern.	Approval is part of the workflow and must pause at exact points before side effects.	Tool calls executing before review, non-repeatable approvals, and untraceable reviewer input.	LangSmith Developer is $0 per seat per month with up to 5k base traces per month; Plus is $39 per seat per month with up to 10k base traces per month and one dev-sized agent deployment.
Tools and integrations	You want ADK integrations such as MCP tools, Google Cloud Agent Identity, Agent Registry, API Registry, Apigee API Hub, Application Integration, BigQuery tools, Google Cloud Trace, Code Execution Tool with Agent Runtime, or Dapr.	You need a model-agnostic graph that can call tools from LangChain, custom code, MCP, or other app services without adopting a single cloud platform.	Too many tools, weak authorization boundaries, and missing per-tool audit data.	LangSmith Enterprise is custom priced for alternative hosting, hybrid or self-hosted deployment, custom SSO and RBAC, support SLA, and custom packages.
Evaluation and observability	You want ADK's evaluation model plus Google Agent Platform's Example Store, Evaluation Service, tracing, monitoring, and built-in observability.	You want LangSmith tracing, evals, annotation queues, and deployment while keeping orchestration code portable.	Traces that omit state, tool arguments, approval decisions, or memory reads.	Sessions billing is based on stored session events that include content, including user requests, model responses, function calls, and function responses.

The table hides one important point: the open-source framework cost is rarely the bill that surprises you. The surprise is trace volume, session events, memory retrieval, code execution, and idle orchestration work. Google says Agent Runtime billing is rounded to the nearest second and idle time is not billed. That is useful, but it does not remove session and memory costs once the agent is actually used.

LangSmith pricing has a different shape. The Developer plan is free per seat and includes tracing, evals, annotation queues, monitoring, one Fleet agent, and up to 50 Fleet runs per month. The Plus plan is $39 per seat per month and includes one dev-sized agent deployment, up to 500 Fleet runs per month, access to LangSmith Engine, and access to LangSmith Sandboxes. The cost question is not "which SDK is free?" It is "where will traces, runs, deployments, and reviewer queues live once usage is real?"

Use LangGraph When Approval Is Part of the Product

LangGraph wins when a human decision is a normal branch in the agent, not an exception path. Its interrupts pause graph execution at specific points and wait for external input before continuing. When an interrupt is triggered, LangGraph saves the graph state with persistence and resumes later with the same thread.

That matters for any workflow with side effects: purchase approval, account change, customer email, code change, data export, access request, or support escalation. The important question is not whether a person can look at the output. The important question is whether the runtime can stop before the irreversible action, store exactly what the agent wanted to do, expose that payload to a reviewer, then continue with the same state after review.

A minimal approval shape looks like this:

Python

from langgraph.types import interrupt, Command

def approval_node(state):
    approved = interrupt({
        "action": state["proposed_action"],
        "reason": state["reason"],
        "tool": state["tool_name"],
    })
    return {"approved": approved}

config = {"configurable": {"thread_id": request_id}}
stream = graph.stream_events(input_state, config=config, version="v3")

if stream.interrupted:
    enqueue_review(stream.interrupts)

graph.invoke(Command(resume={"approved": True}), config=config)

The code is small, but the design rule is the product. The approval payload should include the proposed action, the tool name, the arguments that will be sent, the user-visible consequence, and the reason the agent believes the action is allowed. The reviewer response should be stored as a first-class event, not as a note in a chat transcript.

That is where LangGraph's persistence model earns its complexity. Checkpoints are not only for crash recovery. They let you inspect state, pause for review, resume after approval, fork a run for debugging, and avoid re-running successful nodes after a failure. If the agent is part of a regulated or high-trust product workflow, those mechanics are not optional.

LangGraph also leaves the platform choice open. You can deploy through LangSmith, run your own server, use a database-backed checkpointer, send traces to your observability stack, or wrap it in the app infrastructure your team already operates. That flexibility costs engineering time, but it avoids making Google Agent Platform the permanent owner of your agent runtime.

The OpenAI Agents SDK versus LangGraph decision follows the same pattern: the more the agent looks like a durable business process, the more explicit orchestration matters.

Use Google ADK When the Platform Is the Product Boundary

Google ADK wins when the agent is meant to live inside Google's agent platform and the managed services remove work your team would otherwise own. ADK's strongest production path is not only the local framework. It is the path from code to Agent Runtime, Sessions, Memory Bank, evaluation, tracing, identity, and Google Cloud integrations.

ADK can start locally with familiar package installs:

Bash

pip install google-adk
npm install @google/adk
go get google.golang.org/adk

The platform story matters after that. ADK's Agent Runtime deployment guide supports Python and Go v1.2.0. Standard deployment uses Cloud Console and the ADK command line interface. Agents CLI deployment sets up a Google Cloud environment with CI/CD, infrastructure-as-code, and deployment pipelines, and it requires a Google Cloud project with billing enabled.

That is useful when the agent is not a standalone product runtime. For example, a SaaS platform team already using Google Cloud may want agents to call BigQuery, use Google Cloud Agent Identity, connect APIs through Apigee API Hub, trace with Google Cloud Trace, and persist conversational context through Sessions and Memory Bank. In that case, choosing ADK can make the agent easier to govern because the security and runtime boundaries align with the rest of the platform.

ADK also gives a clearer path for platform-style evaluations. Its evaluation docs recommend automated evaluations beyond prototype and split evaluation into trajectory/tool-use evaluation plus final-response evaluation. That is the right split for production agents because the final answer can be acceptable while the path was unsafe, wasteful, or policy-violating.

The production caution is lock-in and hidden platform cost. Agent Runtime pricing is compute and memory based, but Agent Platform also prices session events and memory operations. Sessions billing is based on stored session events that include content. Memory Bank charges for stored memories and returned memories. Starting February 11, 2026, billing begins for Code Execution, Sessions, and Memory Bank. If your agent writes memory too eagerly, stores noisy session events, or retrieves memory on every request, the platform services become part of the unit economics.

So the ADK checklist is not only "can it deploy?" It is:

Bound the platform state
Decide which data belongs in Session, which data belongs in State, and which data deserves Memory. ADK's own docs warn that in-memory session, state, and long-term knowledge data is lost when the application restarts, so production storage needs an explicit service choice.
Price the run path
Model Agent Runtime compute, RAM, session events, Memory Bank storage, Memory Bank retrieval, and Code Execution separately. The free runtime tier helps early testing, but production cost follows the services the agent actually touches.
Trace trajectory, not just output
Store tool choice, tool arguments, retrieved memory, evaluator result, final response, and any reviewer action. A trace that only records the final message is not enough for incident review.

The Decision Rule That Flips the Choice

Default to LangGraph unless Google Agent Platform materially reduces the production work your team must own. Flip to ADK when Google Cloud is already the control plane and the agent benefits from managed runtime, identity, session, memory, evaluation, and tracing services.

The cleanest decision rule:

If this is true	Use
The agent has irreversible side effects and human review is a core workflow branch.	LangGraph
The agent must run across model providers or cloud boundaries.	LangGraph
The agent is deployed as part of a Google Cloud platform standard.	Google ADK
The agent's hardest problem is managed session state, long-term memory, and platform governance inside Google Cloud.	Google ADK
The team wants managed deployment through LangSmith but still wants framework choice.	LangGraph or wrapped ADK, but keep the runtime boundary explicit

There is also a hybrid path, but it should be temporary or deliberately bounded. LangSmith Deployment documents support for agents built with LangGraph, Google ADK, and other frameworks. It can deploy Google ADK agents as a LangGraph with a wrapper package. That is useful for migration and unified observability, but it also creates another boundary: ADK owns one part of the agent semantics, LangSmith owns deployment and tracing, and your product still owns the side effects.

Hybrid stacks fail when nobody can answer a simple incident question: where did this run pause, what state was persisted, which reviewer approved it, what memory was read, and what exact tool payload executed? If the stack cannot answer that quickly, the framework choice was not finished.

Implementation Checklist

Whichever framework you choose, build the control layer before traffic exposes the gaps. A production agent needs durable IDs, policy checks, evaluator outputs, approval records, and cost telemetry from the first shipped version.

Log these fields for every run:

run_id, thread_id, user or account scope, and environment.
Model provider, model name, prompt or instruction version, and tool registry version.
Tool call name, tool arguments, tool result, tool latency, and whether the tool was dry-run, approved, blocked, or executed.
Memory read keys, memory write candidates, memory writes accepted, and memory writes rejected.
Retrieval sources or context documents when the agent reads knowledge.
Human reviewer, approval payload, reviewer decision, and resume event.
Evaluator name, trajectory result, final-response result, and policy violations.
Token usage, runtime compute, session event count, memory retrieval count, and code execution usage where the platform exposes them.

The framework only makes some of those fields easier. It does not remove the need to design the operational surface. LangGraph gives you a direct place to bind state and approvals. ADK gives you a direct path into Google-managed runtime services. Both still need a product-specific answer for access control, evaluator thresholds, cost ceilings, and incident review.

Use LangGraph if your team wants the agent runtime to be part of the application architecture. Use Google ADK if your team wants Google Agent Platform to be the agent runtime. Do not choose either because the starter example is shorter.

Is Google ADK better than LangGraph?

Google ADK is better when Google Agent Platform is already the target runtime and managed sessions, memory, evaluation, tracing, and deployment are part of the requirement. LangGraph is better when explicit workflow state, human approval, replay, and provider flexibility matter more than adopting a single cloud platform.

Can Google ADK agents run on LangSmith?

Yes. LangSmith Deployment documents Google ADK as a supported framework and describes deploying an ADK agent through a wrapper package. Treat that as a runtime-boundary decision, because ADK, LangSmith, and your product each own different parts of state, deployment, and side effects.

Does LangGraph replace LangChain?

No. LangChain components can still handle models, tools, and common agent loops. LangGraph is the orchestration runtime for durable execution, streaming, human-in-the-loop, and persistence, and the LangGraph docs state that you do not need to use LangChain to use LangGraph.

Which one is cheaper?

The open-source SDK price is not the useful comparison. Google Agent Runtime has a free monthly runtime tier, then compute, RAM, session, memory, and sandbox costs; LangSmith has plan and usage pricing for traces, evals, deployment, and agent runs. Price the whole run path, not the import statement.

Scope Your Agent Build

Design the runtime, approval gates, evals, observability, and cost controls before your agent sees production traffic.

Last Updated

Jun 11, 2026

CategoryAgents

Google ADK vs LangGraph for Production Agents

Verdict: LangGraph for Control, ADK for Google Agent Platform

The Axis Is Who Owns the Run

Production Comparison

Use LangGraph When Approval Is Part of the Product

Use Google ADK When the Platform Is the Product Boundary

Bound the platform state

Price the run path

Trace trajectory, not just output

The Decision Rule That Flips the Choice

Implementation Checklist

Scope Your Agent Build

More from Agents

Context Engineering vs Prompt Engineering for Production Agents

Agent Memory for Production AI Systems

OpenAI Agents SDK vs Pydantic AI for Production Agents

OpenAI Agents SDK TypeScript vs Python for Production Agents

LangChain vs LangGraph for Production Agents

OpenAI Agents SDK vs LangGraph for Production Agents

One letter, every week. Working systems — not hot takes.