4 curated 모델 평가 AI tools, ranked by rating and freshness and updated daily.
UC Berkeley's open crowdsourced arena for evaluating AI models.
Leaderboar
HuggingFace's open-source large model benchmark leaderboard for comparing model performance.
Open source
A blind community voting arena for ranking large language models through head-to-head battles.
A comprehensive large model evaluation platform built by Shanghai AI Lab.