No more pattern-hunting in traces
Individual trace review does not reveal failure patterns at scale. Engine clusters failures automatically so you see what is broken and how often, without digging through thousands of traces.
LangSmith Engine watches your production traces, clusters failures into named issues, diagnoses root causes, and proposes fixes and eval coverage automatically.
Available now in public beta. No credit card required.
Link a LangSmith tracing project. Engine immediately starts monitoring your production traces for failure signals across errors, evaluator results, latency anomalies, and user feedback.
Give Engine access to your codebase and it can read relevant code, diagnose root causes, and draft targeted PRs with fixes ready for your review.
Engine surfaces clustered issues in a prioritized list, each with a proposed fix, a custom online evaluator, and failing traces added to your offline eval dataset. You review and ship.



LangSmith Engine uses the traces, evals, and development context teams already capture in LangSmith to help them improve agents without standing up new infrastructure.
From clustered failure detection to drafted fixes and eval coverage, Engine handles manual triage work that takes most teams hours.
Engine monitors traces for explicit errors, evaluator failures, latency spikes, negative user feedback, and unusual behavior, finding both known and undetected problems. When it detects a repeated pattern, it clusters those traces into a named issue with severity, timeline, and evidence links.
Connect with our team to see how
Built on LangSmith Infrastructure
LangSmith Engine is powered by your existing LangSmith setup. It reads from your current tracing projects and evaluator results, and writes back into your existing offline eval workflow.
Link your repository and Engine can read relevant code to diagnose root causes and open targeted PRs directly against your codebase.
LangSmith is SOC 2 Type II certified with comprehensive security controls for enterprise teams.
Trust centerIndividual trace review does not reveal failure patterns at scale. Engine clusters failures automatically so you see what is broken and how often, without digging through thousands of traces.
Engine proposes a code or prompt fix and an evaluator for the same issue at the same time. When you merge the fix, the evaluator is already in place to catch regressions.
Engine works on top of your current LangSmith tracing projects and evaluator results. Connect a project, optionally connect your repo, and it starts surfacing issues immediately.

"Engine saves our team hours of digging by not only identifying emerging failure modes, but also proactively suggesting evals and code changes to resolve them quickly."

"It has dramatically reduced our time-to-detection and time-to-triage, meaning less time spent hunting through traces and a much shorter path from broken behavior to a deployed fix."
Talk to our team about how Engine can automate failure detection, root cause diagnosis, and eval coverage for your agents. 20-minute demo.