Pull requests: confident-ai/deepeval
Pull requests list
fix: return 1.0 when no knowledge retention verdicts exist
#2636 opened Apr 28, 2026 by NgDMau
feat: Integrate OpenRouterModel into Metric utilities
#2632 opened Apr 26, 2026 by Djalal-H
implement TrustScoreMetric for evaluating RAG source trustworthiness
#2623 opened Apr 22, 2026 by Danish2op
fix: prevent duplicate test-case rows on pytest retry
#2619 opened Apr 17, 2026 by amitkojha05
3 tasks done
feat(test_case): make trace_dict public for post-hoc agentic evaluation
#2600 opened Apr 4, 2026 by tiffanychum (Contributor)
3 tasks
fix: multi-root traces silently drop root spans from evaluation and export
#2599 opened Apr 4, 2026 by aerosta (Contributor)
fix: batched upload permanently truncates in-memory test run
#2597 opened Apr 4, 2026 by aerosta (Contributor)
Add AG2 integration for multi-agent tracing
#2596 opened Apr 3, 2026 by faridun-ag2
8 tasks done
Fix/predictable temp file and race condition in gpu utils vulnerability
#2593 opened Apr 2, 2026 by AseemPrasad
fixing secure_exec sandbox escape via getattr vulnerability
#2592 opened Apr 2, 2026 by AseemPrasad
examples: add RAIL Score responsible AI evaluation example
#2591 opened Apr 2, 2026 by SumitVermakgp
add agent evaluation example with @observe and component-level tracing
#2585 opened Mar 31, 2026 by Ajay6601 (Contributor)
feat: add penalize_ambiguous_claims to AnswerRelevancyMetric
#2573 opened Mar 25, 2026 by Krishnachaitanyakc
3 tasks
fix(ragas): update capture_metric_type call for new telemetry signature
#2568 opened Mar 22, 2026 by sachinML
Add GoodMem integration for memory-powered retrieval
#2566 opened Mar 19, 2026 by bassammalik
4 of 5 tasks
fix: include tool and trace state in evaluation cache keys
#2561 opened Mar 19, 2026 by aerosta (Contributor)
fix: preserve metric snapshots when async metric tasks fail in indicator
#2560 opened Mar 18, 2026 by aerosta (Contributor)
feat: add native Groq model integration for high-speed evaluations
#2556 opened Mar 17, 2026 by Jayachander123
4 tasks done