RISE Humanities Data Benchmark

A test run is a single execution of a benchmark test using a defined model configuration.
Each run represents how a particular large language model (LLM) — such as GPT-4, Claude-3, or Gemini — performed on a given task at a specific time, with specific settings.

A test run includes:

Prompt and role definition – what the model was asked to do and from what perspective (e.g. “as a historian”).
Model configuration – provider, model version, temperature, and other generation parameters.
Results – the model’s actual response and its evaluation (scores such as F1 or accuracy).
Usage and cost data – token counts and calculated API costs.
Metadata – information like the test date, benchmark name, and person who executed it.

Together, test runs make it possible to compare models, providers, and configurations across benchmarks in a transparent and reproducible way.

Result 31 of 71

Test T0603 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	mistral
Model	mistral-small-2506

Temperature	0.0
Dataclass	Table

Normalized Score	23.30 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.23	0.28	0.16	0.40	61	1001	5075	1517
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 110.4K IT + 115.1K OT = 225.5K TT

Cost: 0.011$ + 0.035$ = 0.046$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 32 of 71

Test T0609 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	gpt-4o-mini-2024-07-18

Temperature	0.0
Dataclass	Table

Normalized Score	74.61 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.75	0.75	0.73	0.77	61	1984	751	599
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 2.4M IT + 51.5K OT = 2.4M TT

Cost: 0.357$ + 0.031$ = 0.388$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 33 of 71

Test T0614 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	gpt-5.2-2025-12-11

Temperature	0.0
Dataclass	Table

Normalized Score	96.32 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.96	0.91	0.96	0.96	61	2488	95	95
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 243.3K IT + 39.6K OT = 282.9K TT

Cost: 0.426$ + 0.554$ = 0.980$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 34 of 71

Test T0613 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	gpt-5.1-2025-11-13

Temperature	0.0
Dataclass	Table

Normalized Score	89.02 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.89	0.86	0.87	0.91	61	2348	344	235
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 189.4K IT + 37.3K OT = 226.7K TT

Cost: 0.237$ + 0.373$ = 0.610$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 35 of 71

Test T0615 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	o3-2025-04-16

Temperature	0.0
Dataclass	Table

Normalized Score	90.79 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.91	0.86	0.90	0.92	61	2371	269	212
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 193.4K IT + 145.1K OT = 338.5K TT

Cost: 0.387$ + 1.161$ = 1.548$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 36 of 71

Test T0612 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	gpt-5-nano-2025-08-07

Temperature	0.0
Dataclass	Table

Normalized Score	80.64 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.81	0.78	0.83	0.78	61	2018	404	565
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 270.6K IT + 389.7K OT = 660.3K TT

Cost: 0.014$ + 0.156$ = 0.169$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 37 of 71

Test T0582 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	cohere
Model	command-a-vision-07-2025

Temperature	0.0
Dataclass	Table

Normalized Score	78.43 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.78	0.80	0.83	0.74	61	1863	370	655
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 141.5K IT + 55.3K OT = 196.8K TT

Cost: 0.354$ + 0.553$ = 0.907$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 38 of 71

Test T0601 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	mistral
Model	mistral-medium-2505

Temperature	0.0
Dataclass	Table

Normalized Score	4.22 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.04	0.02	0.02	0.18	61	451	18398	2068
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 39.1K IT + 347.9K OT = 387.0K TT

Cost: 0.016$ + 0.696$ = 0.711$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 39 of 71

Test T0608 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	openai
Model	gpt-4o-2024-08-06

Temperature	0.0
Dataclass	Table

Normalized Score	90.55 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.91	0.87	0.89	0.92	61	2382	296	201
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 201.4K IT + 36.6K OT = 238.0K TT

Cost: 0.503$ + 0.366$ = 0.870$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Result 40 of 71

Test T0604 at 2026-02-10

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration

Provider	mistral
Model	pixtral-large-2411

Temperature	0.0
Dataclass	Table

Normalized Score	22.75 %
Test time	unknown seconds

Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring

Fuzzy Score	F1 micro / macro		Micro precision/recall		Tue/False Positives
n/a	0.23	0.27	0.16	0.41	61	1021	5437	1497
			Micro Precision	Micro Recall	Instances	TP	FP	FN

Costs / Pricing

Pricing Date: n/a, n/a.

Tokens: 122.0K IT + 144.3K OT = 266.3K TT

Cost: 0.244$ + 0.866$ = 1.110$

Cite: Hindermann, Marti, Alberto, et al., (2025). RISE-UNIBAS/humanities_data_benchmark, 10.5281/zenodo.16941752

Search Test Runs

Search Results
Show compact results Refine Search New Search

Download JSON Download CSV

Test T0603 at 2026-02-10

Test T0609 at 2026-02-10

Test T0614 at 2026-02-10

Test T0613 at 2026-02-10

Test T0615 at 2026-02-10

Test T0612 at 2026-02-10

Test T0582 at 2026-02-10

Test T0601 at 2026-02-10

Test T0608 at 2026-02-10

Test T0604 at 2026-02-10

Search Test Runs

Search Results Show compact results Refine Search New Search Download Download JSON Download CSV

Test T0603 at 2026-02-10

Test T0609 at 2026-02-10

Test T0614 at 2026-02-10

Test T0613 at 2026-02-10

Test T0615 at 2026-02-10

Test T0612 at 2026-02-10

Test T0582 at 2026-02-10

Test T0601 at 2026-02-10

Test T0608 at 2026-02-10

Test T0604 at 2026-02-10

Search Results
Show compact results Refine Search New Search

Download JSON Download CSV