RISE Humanities Data Benchmark, 0.5.0-pre1

Search Test Runs

 

A test run is a single execution of a benchmark test using a defined model configuration.
Each run represents how a particular large language model (LLM) — such as GPT-4, Claude-3, or Gemini — performed on a given task at a specific time, with specific settings.

A test run includes:

  • Prompt and role definition – what the model was asked to do and from what perspective (e.g. “as a historian”).
  • Model configuration – provider, model version, temperature, and other generation parameters.
  • Results – the model’s actual response and its evaluation (scores such as F1 or accuracy).
  • Usage and cost data – token counts and calculated API costs.
  • Metadata – information like the test date, benchmark name, and person who executed it.

Together, test runs make it possible to compare models, providers, and configurations across benchmarks in a transparent and reproducible way.

Search Results

Your search for Benchmark 'personnel_cards__true' with Search Hidden 'False' returned 57 results, showing page 5 of 6.
Result 41 of 57

Test T0606 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideropenai
Modelgpt-4.1-mini-2025-04-14
  
Temperature0.0
DataclassTable
  
Normalized Score87.31 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.87 0.83 0.84 0.91 61 2359 462 224
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 281.7K IT + 37.4K OT = 319.1K TTCost: 0.113$0.060$0.172$
Result 42 of 57

Test T0589 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Providergenai
Modelgemini-2.5-flash-lite
  
Temperature0.0
DataclassTable
  
Normalized Score87.57 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.88 0.83 0.84 0.91 61 2297 431 221
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 90.5K IT + 62.3K OT = 152.7K TTCost: 0.009$0.025$0.034$
Result 43 of 57

Test T0578 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideranthropic
Modelclaude-opus-4-20250514
  
Temperature0.0
DataclassTable
  
Normalized Score75.99 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.76 0.76 0.75 0.77 61 1938 645 580
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 229.0K IT + 63.5K OT = 292.4K TTCost: 3.435$4.760$8.195$
Result 44 of 57

Test T0587 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Providergenai
Modelgemini-2.0-flash-lite
  
Temperature0.0
DataclassTable
  
Normalized Score89.35 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.89 0.84 0.90 0.89 61 2235 250 283
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 221.2K IT + 68.1K OT = 289.3K TTCost: 0.017$0.020$0.037$
Result 45 of 57

Test T0613 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideropenai
Modelgpt-5.1-2025-11-13
  
Temperature0.0
DataclassTable
  
Normalized Score90.76 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.91 0.87 0.89 0.92 61 2381 283 202
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 189.4K IT + 37.3K OT = 226.7K TTCost: 0.237$0.373$0.610$
Result 46 of 57

Test T0586 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Providergenai
Modelgemini-2.0-flash
  
Temperature0.0
DataclassTable
  
Normalized Score92.38 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.92 0.87 0.93 0.92 61 2310 173 208
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 221.2K IT + 60.7K OT = 281.9K TTCost: 0.022$0.024$0.046$
Result 47 of 57

Test T0610 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideropenai
Modelgpt-5-2025-08-07
  
Temperature0.0
DataclassTable
  
Normalized Score93.50 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.93 0.89 0.94 0.93 61 2402 153 181
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 189.4K IT + 212.7K OT = 402.1K TTCost: 0.237$2.127$2.364$
Result 48 of 57

Test T0598 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Providermistral
Modelministral-8b-2512
  
Temperature0.0
DataclassTable
  
Normalized Score12.60 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.13 0.11 0.08 0.34 61 867 10381 1651
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 110.4K IT + 223.5K OT = 333.9K TTCost: 0.017$0.034$0.050$
Result 49 of 57

Test T0580 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideranthropic
Modelclaude-sonnet-4-20250514
  
Temperature0.0
DataclassTable
  
Normalized Score92.23 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.92 0.87 0.92 0.92 61 2320 193 198
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 229.0K IT + 60.5K OT = 289.5K TTCost: 0.687$0.908$1.595$
Result 50 of 57

Test T0605 at 2026-02-09

{'document-type': ['index-card'], 'writing': ['handwritten', 'typed', 'printed'], 'century': [20], 'layout': ['table', 'form'], 'task': ['transcription', 'document-understanding', 'data-correction'], 'language': ['de', 'fr']}

Configuration
Provideropenai
Modelgpt-4.1-2025-04-14
  
Temperature0.0
DataclassTable
  
Normalized Score90.03 %
Test timeunknown seconds
Prompt

Extract all information from this personnel card table and return it as a structured JSON object. The card contains rows documenting employment history with positions, locations, salary information, dates, and remarks.

REQUIRED JSON STRUCTURE:
```json
{
  "rows": [
    {
      "row_number": 1,
      "dienstliche_stellung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "dienstort": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "gehaltsklasse": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      },
      "jahresgehalt_monatsgehalt_taglohn": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "standardized numeric form or null",
        "is_crossed_out": false
      },
      "datum_gehaltsänderung": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "YYYY-MM-DD format or null",
        "is_crossed_out": false
      },
      "bemerkungen": {
        "diplomatic_transcript": "exact text from card",
        "interpretation": "expanded/standardized form or null",
        "is_crossed_out": false
      }
    }
  ]
}
```

COLUMN DEFINITIONS:
1. **dienstliche_stellung**: Official position or job title (e.g., "Assistent", "Professor", "Sekretär")
2. **dienstort**: Place of service or work location (e.g., "Basel", "Zürich")
3. **gehaltsklasse**: Salary class or grade (e.g., "III", "IV", roman numerals or numbers)
4. **jahresgehalt_monatsgehalt_taglohn**: Salary amount (annual/monthly/daily wage)
5. **datum_gehaltsänderung**: Date of salary change or effective date
6. **bemerkungen**: Remarks, notes, or additional comments

FIELD EXTRACTION RULES:

**diplomatic_transcript** (REQUIRED for all fields):
- Transcribe EXACTLY as written on the card
- Include all abbreviations, punctuation, and formatting as they appear
- Preserve original capitalization and spacing
- Include currency symbols and separators (e.g., "Fr. 2'400.-", "3.700.-")
- For dates, copy the exact format (e.g., "1. Jan. 1946", "1.April 1945")
- Use empty string "" for empty cells
- Be sure to escape ditto marks (repetition marks such as `"` indicating "same as above")
- DO NOT expand abbreviations or standardize formats
- Do not transcribe checkmarks

**interpretation** (OPTIONAL - use null if not applicable):
- For ditto marks (repetition marks such as `"`): Replace with the actual repeated value from the previous row in the same column
  - For example, if previous row's dienstliche_stellung is "Hilfsleiterin" and current row has `"`, interpret as "Hilfsleiterin"
  - Do NOT add explanatory text like "wie oben" or similar - just provide the repeated value
- Never expand abbreviations
- For dates: Convert to ISO format YYYY-MM-DD
  - "1. Jan. 1946" → "1946-01-01"
  - "1.April 1945" → "1945-04-01"
- For salary amounts: Extract numeric value only (remove currency symbols, separators)
  - "Fr. 2'400.-" → "2400"
  - "3.700.-" → "3700"
  - "5.094.-" → "5094"
  "6,30.-" → "6.3"
- For salary class: Convert roman numerals to arabic if clear
  - "III" → "3"
  - "IV" → "4"
- Do not convert roman numerals to arabic if not salary, date, or salary class
- Use null if no interpretation/expansion is needed or if the field is empty
- "+ {word}" where {word} is a word does not need interpretation

**is_crossed_out** (REQUIRED for all fields):
- Set to true if text in this field is crossed out, struck through, or deleted
- Set to false if text is normal (not crossed out)
- Empty cells should have is_crossed_out: false

ROW HANDLING:
- Number rows sequentially starting from 1 (top to bottom)
- Include ALL rows that have ANY content in ANY column
- Empty rows (all cells empty) should be omitted
- A row with only one filled cell should still be included

IMPORTANT NOTES:
- Return ONLY the JSON object, no additional text or explanation
- Every field must have all three sub-fields: diplomatic_transcript, interpretation, is_crossed_out
- diplomatic_transcript must never be null (use empty string "" for empty cells)
- interpretation can be null when no expansion/standardization applies
- Process the entire table from top to bottom
- Maintain consistent row numbering throughout

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
n/a 0.90 0.87 0.88 0.93 61 2393 340 190
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 201.4K IT + 38.3K OT = 239.7K TTCost: 0.403$0.307$0.709$