From-scratch benchmark of SAC, MBPO & MACURA on Gymnasium MuJoCo — uncertainty-aware model-based RL (ICML 2024 reimplementation).
benchmark reinforcement-learning pytorch gymnasium mujoco model-based-rl soft-actor-critic mbpo macura
-
Updated
Jun 11, 2026 - Python