RISE Humanities Data Benchmark, v0.5.0-pre1

Search our LLM Benchmark Suite for Humanities Image Data and compare model performance.

See aggregated Results

Check out our leaderboard and find out which providers and models work best with which kinds of data.

Browse Datasets

We currently have 11 datasets, subjected to 811 tests across 1339 runs.

Inspect Results

Search for test runs and compare their results.

Contribute Your Own

This is an open and ongoing project. You're welcome to join us!