Skip to content

Pull requests: confident-ai/deepeval

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add MseeP.ai badge
#2637 opened Apr 28, 2026 by mseep-ai Loading…
Feature/trust score metric 17823921494002512507
#2625 opened Apr 22, 2026 by Danish2op Loading…
fix: guard against None input in trimAndLoadJson
#2620 opened Apr 19, 2026 by bongho Loading…
fix: prevent duplicate test-case rows on pytest retry
#2619 opened Apr 17, 2026 by amitkojha05 Loading…
3 tasks done
fix trace crashes from concurrent access
#2616 opened Apr 16, 2026 by gauravyad86 Loading…
POLLUX LLM-Judge metric
#2610 opened Apr 10, 2026 by ulyanaisaeva Loading…
Allow default model to be set via env
#2602 opened Apr 5, 2026 by A-Vamshi Collaborator Loading…
feat(test_case): make trace_dict public for post-hoc agentic evaluation
#2600 opened Apr 4, 2026 by tiffanychum Contributor Loading…
3 tasks
fix: batched upload permanently truncates in-memory test run
#2597 opened Apr 4, 2026 by aerosta Contributor Loading…
Add AG2 integration for multi-agent tracing
#2596 opened Apr 3, 2026 by faridun-ag2 Loading…
8 tasks done
add agent evaluation example with @observe and component-level tracing
#2585 opened Mar 31, 2026 by Ajay6601 Contributor Loading…
[NOT MERGABLE] OpenAI embedder changes
#2582 opened Mar 30, 2026 by A-Vamshi Collaborator Draft
Add GoodMem integration for memory-powered retrieval
#2566 opened Mar 19, 2026 by bassammalik Loading…
4 of 5 tasks
fix: include tool and trace state in evaluation cache keys
#2561 opened Mar 19, 2026 by aerosta Contributor Loading…
ProTip! no:milestone will show everything without a milestone.