add agent evaluation example with @observe and component-level tracing by Ajay6601 · Pull Request #2585 · confident-ai/deepeval

Ajay6601 · 2026-03-31T02:08:33Z

Adds examples/agent_evaluation/ with a complete, runnable example showing DeepEval v3.0 agent evaluation capabilities:

test_agent_eval.py
Three ways to evaluate an agent:

evaluate () function (quickest)
pytest integration (deepeval test run)
Component-level eval with @observe and update_current_span
README.md with Quick start guide with metric descriptions

Uses a mock agent with retriever to keep the example (no external API calls needed beyond the evaluation LLM).

… with TaskCompletion, AnswerRelevancy, custom GEval

vercel · 2026-03-31T02:08:37Z

@Ajay6601 is attempting to deploy a commit to the Confident AI Team on Vercel.

A member of the Team first needs to authorize it.

penguine-ip · 2026-04-14T06:24:28Z

Hey @Ajay6601 the observed callback is no longer supported - it is the evals iterator right now: https://deepeval.com/docs/evaluation-component-level-llm-evals#run-component-level-evals

@observe

- Use evals_iterator loop instead of observed_callback - Nest @observe components to form proper trace hierarchy - Move TaskCompletionMetric to evals_iterator (trace-level) - Keep AnswerRelevancyMetric on @observe (span-level) - Add update_current_span for runtime test case creation

Ajay6601

Thanks for the review! Updated to use evals_iterator with @observe + update_current_span per the current docs. The example now demonstrates trace-level metrics via evals_iterator(metrics=[ ]) and span-level metrics via @observe(metrics=[ ]), with proper nested spans forming the trace hierarchy.

Add a complete agent evaluation example demonstrating end-to-end eval…

95170cf

… with TaskCompletion, AnswerRelevancy, custom GEval

Ajay6601 force-pushed the feat/add-agent-eval-example branch from a67d2a6 to 0d9c390 Compare April 15, 2026 03:47

Ajay6601 force-pushed the feat/add-agent-eval-example branch from 0d9c390 to 6567fe3 Compare April 15, 2026 03:49

Ajay6601 commented Apr 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add agent evaluation example with @observe and component-level tracing#2585

add agent evaluation example with @observe and component-level tracing#2585
Ajay6601 wants to merge 2 commits intoconfident-ai:mainfrom
Ajay6601:feat/add-agent-eval-example

Ajay6601 commented Mar 31, 2026

Uh oh!

vercel Bot commented Mar 31, 2026

Uh oh!

penguine-ip commented Apr 14, 2026

Uh oh!

Ajay6601 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Ajay6601 commented Mar 31, 2026

Uh oh!

vercel Bot commented Mar 31, 2026

Uh oh!

penguine-ip commented Apr 14, 2026

Uh oh!

Ajay6601 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants