davila7/claude-code-templates/stable-baselines3
Use this skill for reinforcement learning tasks: training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.), creating custom Gym environments, implementing callbacks for monitoring and control, running vectorized environments for parallel training, and integrating with deep RL workflows. Use it when users request RL algorithm implementation, agent training, environment design, or RL experimentation.
Risk Score: 0 out of 100
Popularity: 19,944 stars, 1,854 forks
Updated: Feb 9, 2026