Langfuse vs LangSmith: pricing, quotas & features (2025)
Open-source LLM engineering platform — observability, prompts, evals
vs. LLM observability, testing & evaluation — by LangChain
Data sourced from vendor documentation · Last updated May 2026
Summary
Langfuse and LangSmith are both ai-observability platforms, addressing the same core use case with different implementation philosophies and trade-offs. Both offer a free tier, making it easy to prototype without a credit card. Langfuse's paid plans start at $29/month, lower than LangSmith's entry point at $39/month. Langfuse has a broader documented feature set (16 vs 14 features). Budget-sensitive teams will find Langfuse easier to justify early on. All pricing and quota data below is sourced from Langfuse and LangSmith's official documentation — not generated by AI or estimated.
Langfuse vs LangSmith: Comparativa de precios, cuotas y características (2025)
En esta comparativa analizamos Langfuse y LangSmith lado a lado — incluyendo precios mensuales, límites del tier gratuito, características técnicas, cuotas de uso (almacenamiento, transferencia, usuarios activos mensuales) y los interfaces de desarrollo disponibles. Todos los datos proceden de la documentación oficial de cada proveedor, no de respuestas generadas por IA.
Langfuse es una plataforma de la categoría ai-observability — Open-source LLM engineering platform — observability, prompts, evals. Ofrece 6 tiers de precio: Hobby (Cloud Free) gratuito, Self-Hosted (OSS) gratuito, Core desde $29/mes, Pro desde $199/mes. Su catálogo en Staxly documenta 16 características y 6 interfazes para desarrolladores.
LangSmith pertenece a la categoría ai-observability — LLM observability, testing & evaluation — by LangChain. Ofrece 3 tiers de precio: Developer (Free) gratuito, Plus desde $39/mes, Enterprise (personalizado). Su catálogo documenta 14 características y 6 interfazes para desarrolladores.
A continuación encontrarás los tiers de precio completos de ambas plataformas, una matriz de cuotas del tier gratuito (transferencia, almacenamiento, MAU, llamadas a la API y otros límites), el listado completo de características y los interfaces (CLI, SDKs, REST, GraphQL, MCP) disponibles para integrar cada servicio.
¿Necesitas estos datos en tu agente de IA (Claude Code, Cursor, Zed)? Instala gratis el servidor MCP de Staxly y tendrás acceso estructurado a Langfuse, LangSmith y más de 130 plataformas para desarrolladores.
Pricing tiers
Langfuse
LangSmith
Free-tier quotas head-to-head
Comparing hobby on Langfuse vs developer on LangSmith.
| Metric | Langfuse | LangSmith |
|---|---|---|
| No overlapping quota metrics for these tiers. | ||
Features
Langfuse · 16 features
- Annotation Queues — Human reviewers rate traces. Unlimited on Pro+.
- Dashboards — Aggregate metrics, cost, quality across projects.
- Datasets — Curate test sets from production traces. Run experiments.
- EU Cloud Region — GDPR-compliant hosting in EU.
- Evaluations — LLM-as-judge, manual scores, custom model-graded evaluators.
- LLM Cost Tracking — Automatic cost calculation per provider/model.
- OpenTelemetry Native — OTel SDK → Langfuse endpoint works out of box.
- Playground — Test prompts + models + variables live.
- Prompt Management — Version, tag, label prompts. Reference from code by label.
- Public API — Full REST API for ingest, query, prompt management.
- Python @observe decorator — One-line decorator to trace any function.
- Self-Hosting — Docker Compose + k8s Helm chart.
- Sessions — Group related traces (conversations, agent runs).
- Tracing — Capture every LLM call, tool call, nested span with inputs/outputs/cost.
- Users Tracking — Segment traces by user ID, track per-user cost.
- Webhooks — Subscribe to trace completion events.
LangSmith · 14 features
- Alerts — Threshold alerts on latency, cost, eval metrics.
- Annotation Queues — Human-review workflows for trace quality rating.
- Custom Dashboards — Aggregate metrics dashboards per project/tag.
- Datasets — Collect examples → use as eval sets or training data.
- Evaluations — LLM-as-judge, embedding similarity, custom Python evaluators, offline batch eval…
- LLM Tracing — Automatic trace every LLM call + tool call + chain step.
- LangChain Integration — Auto-trace any LangChain/LangGraph run with env var.
- LangGraph Integration — First-class trace + eval for LangGraph agents.
- OpenTelemetry Export — Export traces as OTLP to Datadog/Honeycomb/etc.
- Playground — Test prompts + models inline before deploying.
- Prompt Canvas — Visual prompt editor with live test + eval.
- Prompt Hub — Public + private prompt library with versioning.
- Self-Hosted (Enterprise) — Docker + k8s deployment in your infra.
- Threads + Sessions — Group traces into conversational sessions.
Developer interfaces
| Kind | Langfuse | LangSmith |
|---|---|---|
| CLI | — | LangSmith CLI |
| SDK | langfuse-js, langfuse-python | langsmith-js, langsmith-python |
| REST | Langfuse REST API | LangSmith REST API |
| MCP | Langfuse MCP Server | LangSmith MCP |
| OTHER | Langfuse Dashboard, OpenTelemetry endpoint | LangSmith Dashboard |
Key takeaways
- Both Langfuse and LangSmith offer a free tier — Langfuse ("Hobby (Cloud Free)") and LangSmith ("Developer (Free)") — with no credit card required to start.
- The entry-level paid plan is $29/month for Langfuse (Core) vs. $39/month for LangSmith (Plus).
- Langfuse has a broader documented feature set (16 features) vs. LangSmith (14 features) in Staxly's catalog.
- Developer integrations differ: only LangSmith offers CLI.
Want this comparison in your AI agent's context? Install the free Staxly MCP server.