liu-2023-agentbench

Comprehensive benchmark suite for evaluating LLM agents across diverse interactive environments

Uncategorized

Share this skill

Skills use the open SKILL.md standard — the same file works across all platforms.

Install all 551 skills as a plugin

claude plugin marketplace add curiositech/windags-skills claude plugin install windags-skills

Claude activates liu-2023-agentbench automatically when your task matches its description.