sb3-soft Documentation

Reinforcement learning algorithms with soft Q-targets, built for Stable-Baselines3.

sb3-soft currently provides: