Works with: Claude Code, Cursor, GitHub Copilot, Windsurf, Codex, Gemini CLI, Cline, Amp
Markdown
T1 LOW TRUST
Skill v1.0.0 · applied-artificial-intelligence/claude-code-toolkit/llm-evaluation
applied-artificial-intelligence·DevOps·applied-artificial-intelligence/claude-code-toolkit·skills/llm-evaluation/SKILL.md ↗·Updated Mar 19, 2026
applied-artificial-intelligence/claude-code-toolkit
community
Install
# npm
$ npx vskill@latest install applied-artificial-intelligence/claude-code-toolkit/llm-evaluation
# bun
$ bunx vskill@latest install applied-artificial-intelligence/claude-code-toolkit/llm-evaluation
# pnpm
$ pnpx vskill@latest install applied-artificial-intelligence/claude-code-toolkit/llm-evaluation
# yarn
$ yarn dlx vskill@latest install applied-artificial-intelligence/claude-code-toolkit/llm-evaluation
# alternative
$ npx vskill@latest install applied-artificial-intelligence/claude-code-toolkit --skill llm-evaluation
No evaluation data available yet.
44 stars · 12 forks · 197d trend
79d ago · Works with all 39 vskill-compatible agents