v1.1.2
Changelog
This version adds normalized advantage and epsilon-clip support, which pairs with CoMLRL 1.1.2.
Normalizing the advantage can make convergence stable (no significant) at the cost of slightly higher VRAM use. Clipping too strictly will hurt.
- Gray - Repeated Bandit
- Light Green - Plain
- Red - Level Feedback
- Blue - Expert Edits