sb3
202 models • 1 total models in database
ppo-CartPole-v1
—
2,079
0
sac-Humanoid-v3
—
229
2
dqn-BeamRiderNoFrameskip-v4
—
192
0
dqn-MountainCar-v0
—
142
1
dqn-CartPole-v1
—
124
0
dqn-Acrobot-v1
—
78
0
a2c-BreakoutNoFrameskip-v4
—
72
2
td3-Ant-v3
—
71
0
sac-HalfCheetah-v3
—
69
2
dqn-BreakoutNoFrameskip-v4
—
64
2
td3-Hopper-v3
—
63
0
td3-Swimmer-v3
—
53
1
sac-Ant-v3
—
53
1
ppo-LunarLander-v2
—
53
0
td3-Walker2d-v3
—
53
0
td3-HalfCheetah-v3
—
52
0
a2c-BipedalWalkerHardcore-v3
—
51
0
a2c-MsPacmanNoFrameskip-v4
—
51
0
ppo-Walker2DBulletEnv-v0
—
49
0
ppo-BreakoutNoFrameskip-v4
—
47
0
ppo-BipedalWalker-v3
—
47
0
dqn-SpaceInvadersNoFrameskip-v4
—
44
4
ppo-Acrobot-v1
—
38
1
ppo-Walker2d-v3
—
37
0
ppo-MountainCar-v0
—
35
1
ppo-PongNoFrameskip-v4
—
35
1
ppo-MsPacmanNoFrameskip-v4
—
35
1
a2c-MountainCar-v0
—
33
0
ppo-LunarLanderContinuous-v2
—
31
0
ppo-SpaceInvadersNoFrameskip-v4
—
30
0
a2c-PongNoFrameskip-v4
—
30
0
a2c-Acrobot-v1
—
29
0
ppo_lstm-CarRacing-v0
—
28
2
ppo-BipedalWalkerHardcore-v3
—
28
1
a2c-BipedalWalker-v3
—
28
1
ppo_lstm-CartPoleNoVel-v1
—
28
0
ddpg-BipedalWalker-v3
—
28
0
ppo-MiniGrid-Empty-Random-5x5-v0
—
28
0
ppo-AntBulletEnv-v0
—
27
0
td3-MountainCarContinuous-v0
—
27
0
a2c-QbertNoFrameskip-v4
—
27
0
dqn-PongNoFrameskip-v4
—
26
2
a2c-SpaceInvadersNoFrameskip-v4
—
26
0
ppo-HopperBulletEnv-v0
—
25
0
ppo-HalfCheetahBulletEnv-v0
—
25
0
a2c-AsteroidsNoFrameskip-v4
—
25
0
tqc-PandaPickAndPlace-v1
—
24
7
ppo-HalfCheetah-v3
—
24
1
a2c-RoadRunnerNoFrameskip-v4
—
24
0
a2c-SeaquestNoFrameskip-v4
—
24
0
a2c-BeamRiderNoFrameskip-v4
—
24
0
sac-MountainCarContinuous-v0
—
24
0
dqn-RoadRunnerNoFrameskip-v4
—
23
1
a2c-CartPole-v1
—
23
0
a2c-EnduroNoFrameskip-v4
—
23
0
tqc-PandaStack-v1
—
23
0
ppo-Pendulum-v1
—
22
3
a2c-Pendulum-v1
—
22
1
a2c-LunarLander-v3
A2C agent playing LunarLander-v3. This is a trained model of an A2C agent playing LunarLander-v3 using the stable-baselines3 library and the RL Zoo. The RL Zoo is a training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included. RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo SB3: https://github.com/DLR-RM/stable-baselines3 SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib SBX (SB3 + Jax): https://github.com/araffin/sbx If you installed the RL Zoo3 via pip (`pip install rl_zoo3`), from anywhere you can do:
—
22
1
ppo-CarRacing-v0
—
22
0
ppo-SeaquestNoFrameskip-v4
—
21
0
ddpg-Walker2DBulletEnv-v0
—
21
0
qrdqn-SpaceInvadersNoFrameskip-v4
—
20
1
ppo-RoadRunnerNoFrameskip-v4
—
19
0
a2c-Walker2DBulletEnv-v0
—
19
0
tqc-PandaPush-v1
—
19
0
dqn-MsPacmanNoFrameskip-v4
—
19
0
ppo-EnduroNoFrameskip-v4
—
18
0
tqc-PandaSlide-v1
—
18
0
tqc-FetchPush-v1
—
18
0
ppo-ReacherBulletEnv-v0
—
17
0
ppo-BeamRiderNoFrameskip-v4
—
17
0
sac-Hopper-v3
—
17
0
sac-LunarLanderContinuous-v2
—
17
0
sac-BipedalWalker-v3
—
17
0
ppo-MountainCarContinuous-v0
—
16
1
ppo-QbertNoFrameskip-v4
—
16
0
a2c-Hopper-v3
—
16
0
a2c-LunarLanderContinuous-v2
—
16
0
sac-BipedalWalkerHardcore-v3
—
16
0
a2c-AntBulletEnv-v0
—
15
1
sac-Walker2DBulletEnv-v0
—
15
1
tqc-Walker2d-v3
—
15
1
tqc-BipedalWalker-v3
—
15
1
ppo-AsteroidsNoFrameskip-v4
—
15
0
sac-Walker2d-v3
—
15
0
tqc-Hopper-v3
—
15
0
tqc-Walker2DBulletEnv-v0
—
15
0
ddpg-Pendulum-v1
—
15
0
sac-Pendulum-v1
—
14
0
tqc-Pendulum-v1
—
14
0
trpo-AntBulletEnv-v0
—
14
0
trpo-Walker2DBulletEnv-v0
—
14
0
a2c-ReacherBulletEnv-v0
—
14
0
a2c-HopperBulletEnv-v0
—
14
0
a2c-HalfCheetahBulletEnv-v0
—
14
0
tqc-PandaReach-v1
—
13
1
tqc-Humanoid-v3
—
13
0
ddpg-MountainCarContinuous-v0
—
13
0
tqc-parking-v0
—
13
0
a2c-Humanoid-v3
—
13
0
a2c-MountainCarContinuous-v0
—
12
0
trpo-Walker2d-v3
—
12
0
trpo-HopperBulletEnv-v0
—
12
0
trpo-HalfCheetahBulletEnv-v0
—
12
0
trpo-Ant-v3
—
12
0
qrdqn-BreakoutNoFrameskip-v4
—
12
0
sac-HopperBulletEnv-v0
—
12
0
dqn-LunarLander-v2
—
12
0
tqc-Ant-v3
—
12
0
a2c-Walker2d-v3
—
12
0
qrdqn-MsPacmanNoFrameskip-v4
—
12
0
ppo-Ant-v3
—
11
2
tqc-FetchPickAndPlace-v1
—
11
2
ppo-MiniGrid-DoorKey-5x5-v0
—
11
1
trpo-MountainCar-v0
—
11
0
trpo-LunarLander-v2
—
11
0
ppo-Swimmer-v3
—
11
0
ppo-Hopper-v3
—
11
0
td3-ReacherBulletEnv-v0
—
11
0
a2c-LunarLander-v2
—
11
0
dqn-EnduroNoFrameskip-v4
—
11
0
tqc-HopperBulletEnv-v0
—
11
0
ddpg-HopperBulletEnv-v0
—
11
0
tqc-FetchReach-v1
—
11
0
tqc-FetchSlide-v1
—
11
0
a2c-Ant-v3
—
11
0
demo-hf-CartPole-v1
—
10
1
ppo_lstm-PendulumNoVel-v1
—
10
0
trpo-CartPole-v1
—
10
0
td3-HopperBulletEnv-v0
—
10
0
td3-LunarLanderContinuous-v2
—
10
0
td3-Walker2DBulletEnv-v0
—
10
0
qrdqn-PongNoFrameskip-v4
—
10
0
qrdqn-Acrobot-v1
—
10
0
qrdqn-RoadRunnerNoFrameskip-v4
—
10
0
qrdqn-BeamRiderNoFrameskip-v4
—
10
0
qrdqn-CartPole-v1
—
10
0
sac-AntBulletEnv-v0
—
10
0
dqn-AsteroidsNoFrameskip-v4
—
10
0
tqc-Swimmer-v3
—
10
0
tqc-MountainCarContinuous-v0
—
10
0
td3-Humanoid-v3
—
9
1
trpo-Acrobot-v1
—
9
0
trpo-HalfCheetah-v3
—
9
0
trpo-Swimmer-v3
—
9
0
trpo-Pendulum-v1
—
9
0
trpo-LunarLanderContinuous-v2
—
9
0
td3-BipedalWalker-v3
—
9
0
td3-BipedalWalkerHardcore-v3
—
9
0
qrdqn-MountainCar-v0
—
9
0
qrdqn-SeaquestNoFrameskip-v4
—
9
0
qrdqn-AsteroidsNoFrameskip-v4
—
9
0
qrdqn-QbertNoFrameskip-v4
—
9
0
sac-Swimmer-v3
—
9
0
sac-HalfCheetahBulletEnv-v0
—
9
0
dqn-SeaquestNoFrameskip-v4
—
9
0
dqn-QbertNoFrameskip-v4
—
9
0
ppo_lstm-MountainCarContinuousNoVel-v0
—
8
0
trpo-Hopper-v3
—
8
0
trpo-ReacherBulletEnv-v0
—
8
0
trpo-BipedalWalker-v3
—
8
0
trpo-MountainCarContinuous-v0
—
8
0
td3-AntBulletEnv-v0
—
8
0
td3-Pendulum-v1
—
8
0
a2c-HalfCheetah-v3
—
8
0
a2c-Swimmer-v3
—
8
0
sac-ReacherBulletEnv-v0
—
8
0
tqc-HalfCheetah-v3
—
8
0
tqc-ReacherBulletEnv-v0
—
8
0
tqc-HalfCheetahBulletEnv-v0
—
8
0
tqc-LunarLanderContinuous-v2
—
8
0
ddpg-ReacherBulletEnv-v0
—
8
0
ddpg-LunarLanderContinuous-v2
—
8
0
td3-HalfCheetahBulletEnv-v0
—
7
0
qrdqn-LunarLander-v2
—
7
0
tqc-AntBulletEnv-v0
—
7
0
tqc-BipedalWalkerHardcore-v3
—
7
0
qrdqn-EnduroNoFrameskip-v4
—
6
0
ddpg-AntBulletEnv-v0
—
6
0
ddpg-HalfCheetahBulletEnv-v0
—
6
0
ppo-MiniGrid-FourRooms-v0
—
6
0
ppo-MiniGrid-KeyCorridorS3R1-v0
—
6
0
ars-Pendulum-v1
—
5
0
ppo-MiniGrid-MultiRoom-N4-S5-v0
—
5
0
ppo-MiniGrid-ObstructedMaze-2Dlh-v0
—
5
0
ppo-MiniGrid-GoToDoor-5x5-v0
—
4
0
ars-Acrobot-v1
—
3
0
ars-HalfCheetah-v3
—
3
0
ars-Hopper-v3
—
3
0
ppo-MiniGrid-RedBlueDoors-6x6-v0
—
3
0
ars-Walker2d-v3
—
2
0
ars-MountainCar-v0
—
2
0
ars-Swimmer-v3
—
2
0
ars-LunarLanderContinuous-v2
—
2
0
ars-MountainCarContinuous-v0
—
2
0
ars-CartPole-v1
—
2
0
ars-Ant-v3
—
2
0
ppo-MiniGrid-Fetch-5x5-N2-v0
—
2
0
ppo-MiniGrid-PutNear-6x6-N2-v0
—
2
0
ppo-MiniGrid-Unlock-v0
—
1
2
ppo-MiniGrid-LockedRoom-v0
—
1
0
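
Most entries above follow an `algo-EnvId` naming pattern, such as `ppo-CartPole-v1` or `tqc-PandaPickAndPlace-v1`. As a minimal sketch, assuming that pattern holds and that the algorithm prefixes are exactly the ones visible in this listing, a name can be split into its algorithm and environment ID as shown below. The helper `split_model_name` and the `ALGOS` tuple are hypothetical names introduced for illustration; they are not part of any library.

```python
from typing import Tuple

# Algorithm prefixes observed in the listing above (an assumption: the
# listing is taken as the complete set for this sketch).
ALGOS = (
    "a2c", "ars", "ddpg", "dqn", "ppo", "ppo_lstm",
    "qrdqn", "sac", "td3", "tqc", "trpo",
)


def split_model_name(name: str) -> Tuple[str, str]:
    """Split an 'algo-EnvId' model name into (algo, env_id).

    The split happens at the first hyphen only, because environment IDs
    may contain hyphens themselves (e.g. 'MiniGrid-Empty-Random-5x5-v0').
    """
    algo, sep, env_id = name.partition("-")
    if not sep or not env_id or algo not in ALGOS:
        raise ValueError(f"name does not match 'algo-EnvId': {name!r}")
    return algo, env_id


if __name__ == "__main__":
    for model in ("ppo-CartPole-v1", "ppo_lstm-CartPoleNoVel-v1",
                  "ppo-MiniGrid-Empty-Random-5x5-v0"):
        print(model, "->", split_model_name(model))
```

Names that do not follow the pattern, such as `demo-hf-CartPole-v1`, raise a `ValueError` in this sketch rather than being guessed at.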