Dynamically Auditing the Open Skill Ecosystem for LLM Agents
Overall Results
Radar view of each model's score by task category
Open Skills
Specialized prompt-and-tool kits that agents can invoke to do real work, sourced from the open ecosystem and evaluated on the same benchmark.