LangSmith for LangGraph

Build reliable agents with LangGraph and LangSmith

LangGraph gives you the control to build stateful, complex agent workflows. LangSmith gives you the visibility, testing, and production feedback loop to ship them with confidence.

Get started with LangSmith Read the LangGraph docs

Get started free. No credit card required.

LangSmith platform dashboard showing LangGraph agent traces

Building the agent is only the beginning

Production agents need more than a framework

LangGraph gives your team the primitives to build flexible, stateful agent workflows with full control over branching, memory, and multi-agent coordination. But once your agent is in production, you need tracing to understand failures, evaluations to test changes safely, monitoring to catch regressions, and a continuous improvement loop. That is where LangSmith comes in.

The production layer for LangGraph agents

Trace every LangGraph run

Get full visibility into every node, edge, and tool call in your LangGraph workflow. See exactly what happened at each step and why.

Debug complex agent behavior

Pinpoint failures in multi-step, branching workflows without guesswork. Replay traces, inspect state, and understand what went wrong.

Evaluate changes before shipping

Run offline evaluations against datasets before you deploy. Catch regressions, measure improvements, and ship with confidence.

Monitor production performance

Track quality metrics, latency, and cost across all your LangGraph agents in real time. Get alerts before issues become outages.

Turn production traces into improvements

Annotate production traces, build evaluation datasets from real traffic, and close the loop between what users experience and what you ship next.

Full agent development lifecycle

LangGraph and LangSmith work together at every stage

From your first prototype to a production system handling thousands of users, LangGraph and LangSmith support the full lifecycle.

Build

Design controllable, stateful agent workflows in LangGraph. Use nodes, edges, and state to model exactly how your agent should reason and act.

Test

Use LangSmith to run evaluations against curated datasets. Catch regressions and validate improvements before they reach users.

Deploy

Deploy your LangGraph agent to production with LangSmith Deployment. Built-in support for long-running agents, human-in-the-loop, and stateful sessions.

Monitor

Track quality, latency, and cost across every production run. Surface anomalies and regressions with online evaluations and alerting.

Improve

Use production traces as a feedback loop. Annotate failures, build new test cases, update prompts and graph logic, then re-evaluate and ship with confidence.

Built for LangGraph complexity

Full support for the most demanding agent architectures

LangSmith is designed to handle the complexity that comes with real-world LangGraph agents.

Branching paths and conditional logic

Trace agents that take different paths based on state, model output, or external signals. Understand which branch ran and why.

State changes across nodes

Inspect how agent state evolves step by step across your graph. Compare state before and after each node to diagnose unexpected behavior.

Tool calls and external integrations

See every tool call your agent makes, the inputs it passed, and the outputs it received. Debug third-party API failures without reading logs.

Human-in-the-loop workflows

Track approval steps, interruption points, and human feedback in your LangGraph workflows. Understand how human input shapes agent decisions.

Long-running and multi-agent systems

Monitor agents that run for minutes or hours. Trace interactions across multi-agent systems where one agent delegates to another.

Version comparisons

Compare performance across agent versions side by side. Know whether a change to your graph, prompt, or tool improved outcomes before you ship it.

How it works

Build your agent in LangGraph

Define your agent as a graph with nodes, edges, and state. LangGraph gives you low-level control to model any agent architecture, from simple ReAct loops to complex multi-agent pipelines.

Trace and debug in LangSmith

Add one line of instrumentation to start capturing full traces of every LangGraph run. Inspect node inputs, outputs, state transitions, and tool calls in a structured UI.

Evaluate changes before shipping

Create evaluation datasets from production traces or manually curated examples. Run automated evaluators to score agent outputs and catch regressions before they reach users.

Monitor production behavior

Track quality, latency, and cost metrics across all production runs. Set up online evaluations that score live traffic automatically and alert you when performance degrades.

Improve prompts, code, tools, and graph structure

Use production insights to guide improvements. Annotate failures, update your graph logic or prompts, re-evaluate against your test suite, and ship with confidence.

Get started with LangSmith

Frequently asked questions

Ready to build more reliable LangGraph agents?

Use LangGraph to design controllable agent workflows. Use LangSmith to debug, evaluate, monitor, and improve them in production.

Get started with LangSmith