RISE Humanities Data Benchmark, 0.5.0-pre1

Search Test Runs

 

A test run is a single execution of a benchmark test using a defined model configuration.
Each run represents how a particular large language model (LLM) — such as GPT-4, Claude-3, or Gemini — performed on a given task at a specific time, with specific settings.

A test run includes:

  • Prompt and role definition – what the model was asked to do and from what perspective (e.g. “as a historian”).
  • Model configuration – provider, model version, temperature, and other generation parameters.
  • Results – the model’s actual response and its evaluation (scores such as F1 or accuracy).
  • Usage and cost data – token counts and calculated API costs.
  • Metadata – information like the test date, benchmark name, and person who executed it.

Together, test runs make it possible to compare models, providers, and configurations across benchmarks in a transparent and reproducible way.

Search Results

Your search for Benchmark 'medieval_manuscripts__true' with Search Hidden 'False' returned 85 results, showing page 6 of 9.
Result 51 of 85

Test T0284 at 2026-01-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providergenai
Modelgemini-2.0-flash-lite
  
Temperature0.0
DataclassDocument
  
Normalized Score58.60 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.61 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 28.4K IT + 2.8K OT = 31.1K TTCost: 0.002$0.001$0.003$
Result 52 of 85

Test T0272 at 2026-01-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providergenai
Modelgemini-2.5-pro
  
Temperature0.0
DataclassDocument
  
Normalized Score66.50 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.68 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 9.3K IT + 2.8K OT = 12.1K TTCost: 0.012$0.028$0.040$
Result 53 of 85

Test T0293 at 2026-01-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Provideranthropic
Modelclaude-sonnet-4-5-20250929
  
Temperature0.0
DataclassDocument
  
Normalized Score25.70 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.26 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 35.9K IT + 3.7K OT = 39.6K TTCost: 0.108$0.056$0.164$
Result 54 of 85

Test T0441 at 2025-11-25

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providermistral
Modelmistral-small-2506
  
Temperature0.0
DataclassDocument
  
Normalized Score9.70 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.18 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 9.0K IT + 2.0K OT = 11.0K TTCost: 0.001$0.001$0.001$
Result 55 of 85

Test T0409 at 2025-11-25

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Provideropenai
Modelgpt-5.1-2025-11-13
  
Temperature0.0
DataclassDocument
  
Normalized Score62.40 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.65 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 18.8K IT + 2.5K OT = 21.3K TTCost: 0.024$0.025$0.048$
Result 56 of 85

Test T0271 at 2025-11-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providergenai
Modelgemini-2.5-flash
  
Temperature0.0
DataclassDocument
  
Normalized Score60.10 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.65 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: n/an/aTokens: 9.3K IT + 3.1K OT = 12.4K TTCost: 0.003$0.008$0.011$
Result 57 of 85

Test T0298 at 2025-10-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providermistral
Modelmistral-large-latest
  
Temperature0.0
DataclassDocument
  
Normalized Score59.50 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.62 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: 5 months ago2025-10-24Tokens: 20.4K IT + 2.8K OT = 23.3K TTCost: 0.041$0.017$0.058$
Result 58 of 85

Test T0278 at 2025-10-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Provideropenai
Modelgpt-4.1-nano
  
Temperature0.0
DataclassDocument
  
Normalized Score35.10 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.39 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: 5 months ago2025-10-24Tokens: 24.0K IT + 2.0K OT = 26.0K TTCost: 0.002$0.001$0.003$
Result 59 of 85

Test T0295 at 2025-10-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Providermistral
Modelmistral-medium-2508
  
Temperature0.0
DataclassDocument
  
Normalized Score61.40 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.65 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: 5 months ago2025-10-24Tokens: 20.4K IT + 3.0K OT = 23.4K TTCost: 0.008$0.006$0.014$
Result 60 of 85

Test T0281 at 2025-10-24

{'document-type': ['manuscript'], 'writing': ['handwritten'], 'century': [15], 'language': ['de'], 'layout': ['prose'], 'task': ['transcription']}

Configuration
Provideropenai
Modelgpt-5-nano
  
Temperature0.0
DataclassDocument
  
Normalized Score19.40 %
Test timeunknown seconds
Prompt

IDENTITY and PURPOSE:

You are presented with an image from a 15th century medieval manuscript written in Basel in late medieval German. Your task is to extract the text from the manuscript in the specified JSON format. You must extract all text exactly as it appears in the manuscript, maintaining historical spellings, punctuation, and formatting. Do not resolve abbreviations.

The manuscript may contain:
1. A main text body (often in one column as continuous text)
2. A folio number (if present)
3. Additional notes or text written in the margins (labeled as addition1, addition2, etc.)

You must:
- Identify and transcribe the main text
- Extract the folio number if visible (use empty string "" if not visible)
- Identify and transcribe any marginal additions separately
- Preserve line breaks with \n
- Maintain all historical spellings and abbreviations exactly as written. If a letter is superscribed, normalize it by writing the superscribed letter after the main letter, e.g. "u with superscribed o" is spelled as "uo". If special characters are used for abbreviations, do not resolve them, but try to transcribe the special character according to the Medieval Unicode Font Initiative. ꝛ for "er" and ꝰ for "us" or "em" might be the most common special characters.
- Do not modernize or correct the text
- Do not use OCR or attempt to interpret unclear text - transcribe what you can see
- If a field has no content, use an empty string "" (not null)

Take a deep breath and think step by step about the layout of the page. First identify the folio number, then the main text area, then any marginal additions. Return only a JSON file with no additional commentary.

EXAMPLE OUTPUT:

{
  "folios": [
    {
      "folio": "15",
      "text": "In disem jare kam der kunig\n mit grossem here in daz lant\n vnd belagerte die stat Basel\n do wertend sich die burger\n mit grosser kraft vnd tugent\n vnd triben den kunig dannen\n mit schanden vnd verlust",
      "addition1": "Anno domini 1444",
      "addition2": "",
      "addition3": ""
    }
  ]
}

Results

no valid result

Scoring
Fuzzy Score F1 micro / macro Micro precision/recall Tue/False Positives
0.19 n/a n/a n/a n/a n/a n/a n/a n/a
      Micro Precision Micro Recall Instances TP FP FN
Costs / Pricing
Pricing Date: 5 months ago2025-10-24Tokens: 20.6K IT + 20.8K OT = 41.4K TTCost: 0.001$0.008$0.009$