Search our LLM Benchmark Suite for Humanities Image Data and compare model performance.
Check out our leaderboard and find out which providers and models work best with which kinds of data.
We currently have 11 datasets subjected to 811 tests in 1339 runs.
Search for test runs and compare results
This is an open and ongoing project. You're welcome to join us!