THUDM/AgentBench

A comprehensive benchmark for evaluating large models as autonomous Agents.

SDKagent

Visit THUDM/AgentBench website →

Category
Infraestructura IA
Official URL
https://github.com/THUDM/AgentBench
Last updated
Wed Apr 08
Tags
SDK · agent