Stable Baselines
Overview: module code

All modules for which code is available

  • stable_baselines.a2c.a2c
  • stable_baselines.acer.acer_simple
  • stable_baselines.acktr.acktr
  • stable_baselines.bench.monitor
  • stable_baselines.common.base_class
  • stable_baselines.common.callbacks
  • stable_baselines.common.cmd_util
  • stable_baselines.common.distributions
  • stable_baselines.common.env_checker
  • stable_baselines.common.evaluation
  • stable_baselines.common.noise
  • stable_baselines.common.policies
  • stable_baselines.common.schedules
  • stable_baselines.common.tf_util
  • stable_baselines.common.vec_env.base_vec_env
  • stable_baselines.common.vec_env.dummy_vec_env
  • stable_baselines.common.vec_env.subproc_vec_env
  • stable_baselines.common.vec_env.vec_check_nan
  • stable_baselines.common.vec_env.vec_frame_stack
  • stable_baselines.common.vec_env.vec_normalize
  • stable_baselines.common.vec_env.vec_video_recorder
  • stable_baselines.ddpg.ddpg
  • stable_baselines.ddpg.policies
  • stable_baselines.deepq.dqn
  • stable_baselines.deepq.policies
  • stable_baselines.gail.dataset.dataset
  • stable_baselines.gail.dataset.record_expert
  • stable_baselines.gail.model
  • stable_baselines.her.her
  • stable_baselines.her.replay_buffer
  • stable_baselines.her.utils
  • stable_baselines.ppo1.pposgd_simple
  • stable_baselines.ppo2.ppo2
  • stable_baselines.results_plotter
  • stable_baselines.sac.policies
  • stable_baselines.sac.sac
  • stable_baselines.td3.policies
  • stable_baselines.td3.td3
  • stable_baselines.trpo_mpi.trpo_mpi

© Copyright 2018-2021, Stable Baselines Revision 550db0d6.
