Search our LLM Benchmark Suite for Humanities Image Data and compare model performance.
Check out our leaderboard to find out which providers and models work best with which kinds of data.
We currently have 11 datasets subjected to 1135 tests in 1873 runs.
Search for test runs and compare results
This is an open and ongoing project. You're welcome to join us!