Extract names, locations, signatures from table-like meeting minutes of Mines de Costano S.A., 1930s - 1960s
This benchmark has been run 20 times. It uses a fuzzy metric.
Tested providers: openai
Tested models: gpt-5.4-2026-03-05, gpt-5.3-codex, gpt-5.2-2025-12-11
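
The page does not say how the fuzzy metric is computed. As a rough illustration only, the sketch below assumes a per-field string similarity averaged over all ground-truth fields and scaled to 0-100. The function names `fuzzy_score` and `mean_fuzzy_score`, the use of Python's `difflib.SequenceMatcher`, and the case-insensitive comparison are assumptions, not the benchmark's actual scoring code.

```python
from difflib import SequenceMatcher


def fuzzy_score(expected: str, predicted: str) -> float:
    """Return a 0-100 similarity between two strings (assumed case-insensitive)."""
    a = expected.strip().lower()
    b = predicted.strip().lower()
    if not a and not b:
        return 100.0  # two empty fields count as a perfect match
    # ratio() is 2 * matches / (len(a) + len(b)), a value in [0, 1]
    return 100.0 * SequenceMatcher(None, a, b).ratio()


def mean_fuzzy_score(expected: dict[str, str], predicted: dict[str, str]) -> float:
    """Average the per-field scores over every field in the ground truth.

    A field missing from the prediction is compared against an empty string.
    """
    if not expected:
        return 0.0
    scores = [
        fuzzy_score(value, predicted.get(key, ""))
        for key, value in expected.items()
    ]
    return sum(scores) / len(scores)
```

Under these assumptions, a call such as `mean_fuzzy_score(truth, output)` with two dictionaries of extracted fields (for example hypothetical keys `name`, `location`, `signature`) rewards near-matches like minor spelling variants instead of scoring them as outright misses, which suits transcriptions of historical documents.
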
| Score | Date | Provider | Model |
|---|---|---|---|
| 83.63 | 1 week ago | openai | gpt-5.3-codex |
| 83.31 | 1 week ago | openai | gpt-5.3-codex |
| 84.79 | 1 week ago | openai | gpt-5.3-codex |
| 83.29 | 1 week ago | openai | gpt-5.3-codex |
| 81.99 | 1 week ago | openai | gpt-5.3-codex |

| Role | Contributors |
|---|---|
| Domain expert | alexandra_binnenkade, sven_lienhard |
| Data curator | sven_lienhard |
| Analyst | sven_lienhard, Sorin Marti |
| Engineer | Sorin Marti |