Normalize rewards by standard deviation of discounted return in MuJoCo#149
Open
vzhuang wants to merge 1 commit into
Open
Normalize rewards by standard deviation of discounted return in MuJoCo#149vzhuang wants to merge 1 commit into
vzhuang wants to merge 1 commit into