Works with
Claude Code
Cursor
GitHub Copilot
Windsurf
Codex
Gemini CLI
Cline
Amp
Markdown
T1 LOW TRUST
Skill v1.0.0 (1 version) · orchestra-research/ai-research-skills/lm-evaluation-harness
orchestra-research · AI/ML · orchestra-research/ai-research-skills · 11-evaluation/lm-evaluation-harness/SKILL.md · Updated Mar 20, 2026
orchestra-research/ai-research-skills
community
Install
# npm
$ npx vskill@latest install orchestra-research/ai-research-skills/lm-evaluation-harness
# bun
$ bunx vskill@latest install orchestra-research/ai-research-skills/lm-evaluation-harness
# pnpm
$ pnpx vskill@latest install orchestra-research/ai-research-skills/lm-evaluation-harness
# yarn
$ yarn dlx vskill@latest install orchestra-research/ai-research-skills/lm-evaluation-harness
# alternative
$ npx vskill@latest install orchestra-research/ai-research-skills --skill lm-evaluation-harness
No evaluation data available yet.
5.3k stars · 423 forks · 437d trend
72d ago
Works with all 39 vskill-compatible agents