Skip to content

feat(rl): add off-policy IS correction hook (current policy vs rollout)#2084

Open
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:off-policy-is
Open

feat(rl): add off-policy IS correction hook (current policy vs rollout)#2084
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:off-policy-is

feat(rl): add off-policy IS correction hook (current policy vs rollout)

e40db8f
Select commit
Loading
Failed to load commit list.
Sign in for the full log view

Annotations

1 warning
agent-test (0, test_agent/test_adapters.py)
succeeded Jun 24, 2026 in 48s