Works with
Claude Code
Cursor
GitHub Copilot
Windsurf
Codex
Gemini CLI
Cline
Amp
Markdown
T2 SCANNED
Skill v1.0.0 · orchestra-research/ai-research-skills/grpo-rl-training
orchestra-research · AI/ML · orchestra-research/ai-research-skills · 06-post-training/grpo-rl-training/SKILL.md · Updated Mar 20, 2026
orchestra-research/ai-research-skills
community
Install
# npm
$ npx vskill@latest install orchestra-research/ai-research-skills/grpo-rl-training
# bun
$ bunx vskill@latest install orchestra-research/ai-research-skills/grpo-rl-training
# pnpm
$ pnpx vskill@latest install orchestra-research/ai-research-skills/grpo-rl-training
# yarn
$ yarn dlx vskill@latest install orchestra-research/ai-research-skills/grpo-rl-training
# alternative
$ npx vskill@latest install orchestra-research/ai-research-skills --skill grpo-rl-training
No evaluation data available yet.
4.9k stars · 391 forks · 437d trend
72d ago · Works with all 39 vskill-compatible agents