sb3

202 models • 1 total models in database
Sort by:

ppo-CartPole-v1

2,079
0

sac-Humanoid-v3

229
2

dqn-BeamRiderNoFrameskip-v4

NaNK
192
0

dqn-MountainCar-v0

142
1

dqn-CartPole-v1

124
0

dqn-Acrobot-v1

78
0

a2c-BreakoutNoFrameskip-v4

NaNK
72
2

td3-Ant-v3

71
0

sac-HalfCheetah-v3

69
2

dqn-BreakoutNoFrameskip-v4

NaNK
64
2

td3-Hopper-v3

63
0

td3-Swimmer-v3

53
1

sac-Ant-v3

53
1

ppo-LunarLander-v2

53
0

td3-Walker2d-v3

53
0

td3-HalfCheetah-v3

52
0

a2c-BipedalWalkerHardcore-v3

NaNK
51
0

a2c-MsPacmanNoFrameskip-v4

51
0

ppo-Walker2DBulletEnv-v0

49
0

ppo-BreakoutNoFrameskip-v4

NaNK
47
0

ppo-BipedalWalker-v3

NaNK
47
0

dqn-SpaceInvadersNoFrameskip-v4

44
4

ppo-Acrobot-v1

38
1

ppo-Walker2d-v3

37
0

ppo-MountainCar-v0

35
1

ppo-PongNoFrameskip-v4

35
1

ppo-MsPacmanNoFrameskip-v4

35
1

a2c-MountainCar-v0

33
0

ppo-LunarLanderContinuous-v2

31
0

ppo-SpaceInvadersNoFrameskip-v4

30
0

a2c-PongNoFrameskip-v4

30
0

a2c-Acrobot-v1

29
0

Ppo Lstm CarRacing V0

28
2

ppo-BipedalWalkerHardcore-v3

NaNK
28
1

a2c-BipedalWalker-v3

NaNK
28
1

ppo_lstm-CartPoleNoVel-v1

28
0

ddpg-BipedalWalker-v3

NaNK
28
0

ppo-MiniGrid-Empty-Random-5x5-v0

28
0

ppo-AntBulletEnv-v0

27
0

td3-MountainCarContinuous-v0

27
0

a2c-QbertNoFrameskip-v4

27
0

dqn-PongNoFrameskip-v4

26
2

a2c-SpaceInvadersNoFrameskip-v4

26
0

ppo-HopperBulletEnv-v0

25
0

ppo-HalfCheetahBulletEnv-v0

25
0

a2c-AsteroidsNoFrameskip-v4

25
0

tqc-PandaPickAndPlace-v1

24
7

ppo-HalfCheetah-v3

24
1

a2c-RoadRunnerNoFrameskip-v4

24
0

a2c-SeaquestNoFrameskip-v4

24
0

a2c-BeamRiderNoFrameskip-v4

NaNK
24
0

sac-MountainCarContinuous-v0

24
0

dqn-RoadRunnerNoFrameskip-v4

23
1

a2c-CartPole-v1

23
0

a2c-EnduroNoFrameskip-v4

23
0

tqc-PandaStack-v1

23
0

ppo-Pendulum-v1

22
3

a2c-Pendulum-v1

22
1

a2c-LunarLander-v3

A2C Agent playing LunarLander-v3 This is a trained model of a A2C agent playing LunarLander-v3 using the stable-baselines3 library and the RL Zoo. The RL Zoo is a training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included. RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo SB3: https://github.com/DLR-RM/stable-baselines3 SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib SBX (SB3 + Jax): https://github.com/araffin/sbx If you installed the RL Zoo3 via pip (`pip install rlzoo3`), from anywhere you can do:

22
1

ppo-CarRacing-v0

22
0

ppo-SeaquestNoFrameskip-v4

21
0

ddpg-Walker2DBulletEnv-v0

21
0

qrdqn-SpaceInvadersNoFrameskip-v4

20
1

ppo-RoadRunnerNoFrameskip-v4

19
0

a2c-Walker2DBulletEnv-v0

19
0

tqc-PandaPush-v1

19
0

dqn-MsPacmanNoFrameskip-v4

19
0

ppo-EnduroNoFrameskip-v4

18
0

tqc-PandaSlide-v1

18
0

tqc-FetchPush-v1

18
0

ppo-ReacherBulletEnv-v0

17
0

ppo-BeamRiderNoFrameskip-v4

NaNK
17
0

sac-Hopper-v3

17
0

sac-LunarLanderContinuous-v2

17
0

sac-BipedalWalker-v3

NaNK
17
0

ppo-MountainCarContinuous-v0

16
1

ppo-QbertNoFrameskip-v4

16
0

a2c-Hopper-v3

16
0

a2c-LunarLanderContinuous-v2

16
0

sac-BipedalWalkerHardcore-v3

NaNK
16
0

a2c-AntBulletEnv-v0

15
1

sac-Walker2DBulletEnv-v0

15
1

tqc-Walker2d-v3

15
1

tqc-BipedalWalker-v3

NaNK
15
1

ppo-AsteroidsNoFrameskip-v4

15
0

sac-Walker2d-v3

15
0

tqc-Hopper-v3

15
0

tqc-Walker2DBulletEnv-v0

15
0

ddpg-Pendulum-v1

15
0

sac-Pendulum-v1

14
0

tqc-Pendulum-v1

14
0

trpo-AntBulletEnv-v0

14
0

trpo-Walker2DBulletEnv-v0

14
0

a2c-ReacherBulletEnv-v0

14
0

a2c-HopperBulletEnv-v0

14
0

a2c-HalfCheetahBulletEnv-v0

14
0

tqc-PandaReach-v1

13
1

tqc-Humanoid-v3

13
0

ddpg-MountainCarContinuous-v0

13
0

tqc-parking-v0

13
0

a2c-Humanoid-v3

13
0

a2c-MountainCarContinuous-v0

12
0

trpo-Walker2d-v3

12
0

trpo-HopperBulletEnv-v0

12
0

trpo-HalfCheetahBulletEnv-v0

12
0

trpo-Ant-v3

12
0

qrdqn-BreakoutNoFrameskip-v4

NaNK
12
0

sac-HopperBulletEnv-v0

12
0

dqn-LunarLander-v2

12
0

tqc-Ant-v3

12
0

a2c-Walker2d-v3

12
0

qrdqn-MsPacmanNoFrameskip-v4

12
0

ppo-Ant-v3

11
2

tqc-FetchPickAndPlace-v1

11
2

ppo-MiniGrid-DoorKey-5x5-v0

11
1

trpo-MountainCar-v0

11
0

trpo-LunarLander-v2

11
0

ppo-Swimmer-v3

11
0

ppo-Hopper-v3

11
0

td3-ReacherBulletEnv-v0

11
0

a2c-LunarLander-v2

11
0

dqn-EnduroNoFrameskip-v4

11
0

tqc-HopperBulletEnv-v0

11
0

ddpg-HopperBulletEnv-v0

11
0

tqc-FetchReach-v1

11
0

tqc-FetchSlide-v1

11
0

a2c-Ant-v3

11
0

demo-hf-CartPole-v1

10
1

ppo_lstm-PendulumNoVel-v1

10
0

trpo-CartPole-v1

10
0

td3-HopperBulletEnv-v0

10
0

td3-LunarLanderContinuous-v2

10
0

td3-Walker2DBulletEnv-v0

10
0

qrdqn-PongNoFrameskip-v4

10
0

qrdqn-Acrobot-v1

10
0

qrdqn-RoadRunnerNoFrameskip-v4

10
0

qrdqn-BeamRiderNoFrameskip-v4

NaNK
10
0

qrdqn-CartPole-v1

10
0

sac-AntBulletEnv-v0

10
0

dqn-AsteroidsNoFrameskip-v4

10
0

tqc-Swimmer-v3

10
0

tqc-MountainCarContinuous-v0

10
0

td3-Humanoid-v3

9
1

trpo-Acrobot-v1

9
0

trpo-HalfCheetah-v3

9
0

trpo-Swimmer-v3

9
0

trpo-Pendulum-v1

9
0

trpo-LunarLanderContinuous-v2

9
0

td3-BipedalWalker-v3

NaNK
9
0

td3-BipedalWalkerHardcore-v3

NaNK
9
0

qrdqn-MountainCar-v0

9
0

qrdqn-SeaquestNoFrameskip-v4

9
0

qrdqn-AsteroidsNoFrameskip-v4

9
0

qrdqn-QbertNoFrameskip-v4

9
0

sac-Swimmer-v3

9
0

sac-HalfCheetahBulletEnv-v0

9
0

dqn-SeaquestNoFrameskip-v4

9
0

dqn-QbertNoFrameskip-v4

9
0

ppo_lstm-MountainCarContinuousNoVel-v0

8
0

trpo-Hopper-v3

8
0

trpo-ReacherBulletEnv-v0

8
0

trpo-BipedalWalker-v3

NaNK
8
0

trpo-MountainCarContinuous-v0

8
0

td3-AntBulletEnv-v0

8
0

td3-Pendulum-v1

8
0

a2c-HalfCheetah-v3

8
0

a2c-Swimmer-v3

8
0

sac-ReacherBulletEnv-v0

8
0

tqc-HalfCheetah-v3

8
0

tqc-ReacherBulletEnv-v0

8
0

tqc-HalfCheetahBulletEnv-v0

8
0

tqc-LunarLanderContinuous-v2

8
0

ddpg-ReacherBulletEnv-v0

8
0

ddpg-LunarLanderContinuous-v2

8
0

td3-HalfCheetahBulletEnv-v0

7
0

qrdqn-LunarLander-v2

7
0

tqc-AntBulletEnv-v0

7
0

tqc-BipedalWalkerHardcore-v3

NaNK
7
0

qrdqn-EnduroNoFrameskip-v4

6
0

ddpg-AntBulletEnv-v0

6
0

ddpg-HalfCheetahBulletEnv-v0

6
0

ppo-MiniGrid-FourRooms-v0

6
0

ppo-MiniGrid-KeyCorridorS3R1-v0

6
0

ars-Pendulum-v1

5
0

ppo-MiniGrid-MultiRoom-N4-S5-v0

5
0

ppo-MiniGrid-ObstructedMaze-2Dlh-v0

5
0

ppo-MiniGrid-GoToDoor-5x5-v0

4
0

ars-Acrobot-v1

3
0

ars-HalfCheetah-v3

3
0

ars-Hopper-v3

3
0

ppo-MiniGrid-RedBlueDoors-6x6-v0

3
0

ars-Walker2d-v3

2
0

ars-MountainCar-v0

2
0

ars-Swimmer-v3

2
0

ars-LunarLanderContinuous-v2

2
0

ars-MountainCarContinuous-v0

2
0

ars-CartPole-v1

2
0

ars-Ant-v3

2
0

ppo-MiniGrid-Fetch-5x5-N2-v0

2
0

ppo-MiniGrid-PutNear-6x6-N2-v0

2
0

ppo-MiniGrid-Unlock-v0

1
2

ppo-MiniGrid-LockedRoom-v0

1
0