Question 1

What is LightOnOCR-2 1B best for?

Accepted Answer

It is best for end-to-end OCR from document images, including PDFs, scans, receipts, forms, and scientific articles, with strong handling of multi-column layouts, tables, and math notation.

Question 2

How does LightOnOCR-2 1B compare in speed and size to other OCR models?

Accepted Answer

It is approximately 9x smaller than prior best-performing models and 3.3x faster than Chandra OCR, 1.7x faster than OlmOCR, and 5x faster than dots.ocr, while achieving state-of-the-art accuracy on OlmOCR-Bench.

Question 3

What license is LightOnOCR-2 1B released under?

Accepted Answer

The model checkpoint is released under Apache License 2.0. The training dataset and evaluation benchmarks are released under their respective licenses.

Question 4

How do I call LightOnOCR-2 1B via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key. Send a chat completion request with an image URL or base64-encoded image as a user message, and receive the OCR text in the response.

Question 5

What input format does LightOnOCR-2 1B expect?

Accepted Answer

It accepts one image per request, provided as a URL or base64 data URI. Images should be rendered at 200 DPI with a longest dimension of approximately 1540px for best results.

Task	Optical Character Recognition (OCR)
Architecture	Vision-Language Transformer
Parameters	1 billion
License	Apache 2.0
Inference Speed	5.71 pages/s on H100

LightOnOCR-2 1B

specs

about this model

Key Strengths

Benchmark Results

Variants

Training Data

best for

FAQ

related vision-language models