Day 55 of 60 · AI-system specific

AI agent tracing & observability

Multi-step agents are opaque. When the answer is wrong, you don't know if it was the retrieval, the tool call, or the reasoning. Traces make the steps legible, every prompt, every response, every state.

ProblemMulti-step agentic systems are opaque, when output is wrong, you can't see which tool call, which retrieval, or which reasoning step caused it.

How it works

Capture every prompt, model response, tool call, and intermediate state in a structured trace. Tie traces to evals and production user feedback. Adopt OpenTelemetry GenAI semantic conventions (standardised 2025).

What it catches

Tool-call failures, retrieval misses, latency cliffs, cost regressions, prompt-version effects, judge-vs-user disagreement. Required for any production agent.

Tools

Langfuse · Hybrid Arize Phoenix · OSS LangSmith · SaaS Helicone · Hybrid OTel GenAI · OSS

Verdict by project size

Small

Opt

Medium

Must

Large

Must

Extra-large

Must

Cost

Project size	Setup	Maint / mo	Tool / mo	CI / run
Small <10k LOC	4h	1h	$0	,
Medium 10–100k LOC	2d	5h	$200	,
Large 100k–1M LOC	10d	25h	$2k	,
Extra-large >1M LOC	40d	120h	$15k	,

Setup = engineer-days to first useful run · Maint = engineer-hours / month at steady state · Tool = out-of-pocket $ / month · CI = minutes added (or saved) per pipeline run

Lifecycle & ownership

When in lifecycle

Test Operate Observe

Per merge · Runs after merge to main; nightly heavy jobs.

Who owns it

ML / AI Engineer

Models, evals, drift, guardrails

Collaborates with: Developer, Security / AppSec

Reference implementations

OpenTelemetry GenAI conventions
Reference semantic conventions for tracing LLM calls and agent operations.
Langfuse examples
LLM tracing and observability examples for prompts, tools, and scores.
Phoenix tracing tutorials
Agent and RAG tracing tutorials with evaluation feedback loops.

Quick check

AI agent tracing & observability adopted which 2025 standard?

One question. Pick the best answer. Your streak is saved locally on this device.

Save the lesson

Download SVG ↓

Screenshot for a 1:1, drop it in Slack, or download the SVG.

All 60 days →