Question 1

What is Surya OCR 2 best for?

Accepted Answer

Surya OCR 2 is best for document OCR with layout analysis, table recognition, and reading order extraction, especially for multilingual documents.

Question 2

How does Surya OCR 2 compare in accuracy to other OCR models?

Accepted Answer

Surya OCR 2 scores 83.3% on olmOCR-bench, ranking 4th overall and top under 3B parameters.

Question 3

What are the license terms for Surya OCR 2?

Accepted Answer

The code is Apache 2.0. The model weights use a modified AI Pubs Open Rail-M license, free for research, personal use, and startups under $5M funding/revenue. Broader commercial use requires a license from Datalab.

Question 4

How do I call Surya OCR 2 via the API on gigarouter?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with an API key. Send an image or PDF as input and receive JSON output with text, layout, and table data.

Question 5

What input formats does Surya OCR 2 support?

Accepted Answer

It supports images (JPEG, PNG, etc.) and PDF files. Output is a structured JSON with per-block text, bounding boxes, layout labels, and confidence scores.

Task	OCR, Layout Analysis, Table Recognition
Architecture	VLM (Vision Language Model)
Parameters	650M
License	Code: Apache 2.0, Weights: Modified AI Pubs Open Rail-M (free for research, personal use, and startups under $5M)

Task	Example output
Detection
OCR
Layout
Table recognition

Surya OCR 2

specs

about this model

Key benchmarks and capabilities

Output formats

Visual examples

best for

FAQ

related vision-language models