A test run is a single execution of a benchmark test using a defined model configuration.
Each run represents how a particular large language model (LLM) — such as GPT-4, Claude-3, or Gemini — performed on a given task at a specific time, with specific settings.
A test run includes:
Together, test runs make it possible to compare models, providers, and configurations across benchmarks in a transparent and reproducible way.
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | genai |
| Model | gemini-2.0-flash |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 73.00 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.73 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 10.2K IT + 8.4K OT = 18.7K TT | Cost: 0.001$ + 0.003$ = 0.004$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | openai |
| Model | gpt-4.1-nano |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 0.00 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.00 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 12.4K IT + 212 OT = 12.6K TT | Cost: 0.001$ + 0.000$ = 0.001$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | mistral |
| Model | pixtral-large-latest |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 36.00 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.41 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 19.2K IT + 7.9K OT = 27.2K TT | Cost: 0.038$ + 0.048$ = 0.086$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | openai |
| Model | gpt-5-mini |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 45.10 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.47 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 7.7K IT + 16.7K OT = 24.4K TT | Cost: 0.002$ + 0.033$ = 0.035$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | openai |
| Model | gpt-4.1-mini |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 0.00 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.05 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 9.3K IT + 4.3K OT = 13.6K TT | Cost: 0.004$ + 0.007$ = 0.011$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | openai |
| Model | gpt-4o-mini |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 22.10 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.27 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 130.7K IT + 4.6K OT = 135.3K TT | Cost: 0.020$ + 0.003$ = 0.022$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | genai |
| Model | gemini-2.5-flash |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 90.00 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.92 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 3.9K IT + 9.7K OT = 13.6K TT | Cost: 0.001$ + 0.024$ = 0.025$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | anthropic |
| Model | claude-opus-4-20250514 |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 44.20 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.46 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 10.7K IT + 10.9K OT = 21.6K TT | Cost: 0.160$ + 0.821$ = 0.981$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | genai |
| Model | gemini-2.5-pro |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 92.90 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.95 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 3.9K IT + 9.7K OT = 13.6K TT | Cost: 0.005$ + 0.097$ = 0.102$ |
{'document-type': ['book-page'], 'writing': ['printed'], 'century': [18, 19], 'language': ['de'], 'layout': ['prose'], 'script-style': ['fraktur'], 'task': ['transcription']}
| Provider | openai |
| Model | gpt-5 |
| Temperature | 0.0 |
| Dataclass | Document |
| Normalized Score | 13.60 % |
| Test time | unknown seconds |
## IDENTITY AND PURPOSE
You are an OCR and information extraction system trained to process historical newspaper pages printed in 18th-century German using Fraktur type. The pages contain mostly classified advertisements. Your task is to identify and extract each advertisement *exactly as printed*, including historical spellings, typographic errors, punctuation, and formatting.
## INSTRUCTIONS
- Extract **all advertisements** from the input image, one after the other, following the sequence on the page.
- Maintain the **original spelling**, capitalization, and any **typos or non-standard forms**.
- Follow these transcription rules:
- the long s (ſ) is transcribed as "s"
- "/" is transcribed as ","
- Use the masthead of the newspaper only to extract the date, ignore other content.
- The layout is typically **two-column**; extract ads from both columns, including the ad number.
- Return the result as a **JSON object** in the specified format and **nothing else** (no explanations, summaries, or additional text).
- For each advertisement, include:
- `"date"`: the publication date of the page in ISO 8061 format (YYYY-MM-DD)
- `"tags_section"`: the heading under which the advertisement appears
- `"text"`: the full advertisement text
## EXAMPLE OUTPUT
{
"advertisements": [
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "5. Ein kleines, jedoch listiges Lehrbuch der Zauberkunst, lange im Gebrauche des jungen Bartolomeus Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zum Verkauff offeriert",
"text": "6. Ein rarer, mit Edelsteinen besetzter Saxophon-Kasten, aus dem Besitze der Jungfer Lisa Simpson."
},
{
"date": "1731-01-02",
"tags_section": "Es werden zu Entleihen begehrt",
"text": "7. Ein gar prachtvoller, jedoch etwas zerlesener Band mit Rezepten von Margaretha Simpsonin."
}
]
}
no valid result
| Fuzzy Score | F1 micro / macro | Micro precision/recall | Tue/False Positives | |||||
| 0.15 | n/a | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| Micro Precision | Micro Recall | Instances | TP | FP | FN | |||
| Pricing Date: 6 months ago, 2025-09-30. | Tokens: 6.3K IT + 19.6K OT = 25.9K TT | Cost: 0.008$ + 0.196$ = 0.204$ |