Start
liu-2023-agentbench
liu-2023-agentbench - Skill Dossier
liu-2023-agentbench

liu-2023-agentbench

Comprehensive benchmark suite for evaluating LLM agents across diverse interactive environments

Uncategorized

Share this skill

Skills use the open SKILL.md standard — the same file works across all platforms.

Install all 544 skills as a plugin
claude plugin marketplace add curiositech/windags-skills claude plugin install windags-skills

Claude activates liu-2023-agentbench automatically when your task matches its description.

View on GitHub
"Use liu-2023-agentbench to help me build a feature system"
"I need expert help with comprehensive benchmark suite for evaluating llm a..."