skip to content
gigarouter gigarouter
models / vision-language · coming soon

Chandra

datalab-to/chandra

published Oct 2025 · updated Mar 2026

Chandra is a vlm model that converts images and PDFs into structured markdown, HTML, and JSON with preserved layout.

est. price
~$1.341
/ 1k images · estimated, set at launch
API providers
0
downloads / mo
138.4K
license
openrail

specs

TaskOCR (Document Intelligence)
ArchitectureVision Language Model (VLM)
LicenseOpenRAIL-M (model), Apache 2.0 (code)
Languages90+ languages (v2), 40+ languages (v1)

about this model

Chandra is a visual language model for OCR that converts images and PDFs into structured markdown, HTML, and JSON while preserving layout information.

Key strengths include high-accuracy text extraction with strong handwriting support, accurate form reconstruction (including checkboxes), and robust performance on tables, mathematical expressions, and complex multi-column layouts. The model also extracts images and diagrams with captions and structured data, and supports over 40 languages.

Benchmark Performance

Chandra was evaluated using the olmocr benchmark, which measures OCR accuracy across diverse document types:

Bar chart comparing olmocr scores across models
ModelArXivOld Scans MathTablesOld ScansHeaders and FootersMulti columnLong tiny textBaseOverallSource
Datalab Chandra v0.1.082.280.388.050.490.881.292.399.983.1 ± 0.9Own benchmarks
Datalab Marker v1.10.083.869.774.832.386.679.485.799.676.5 ± 1.0Own benchmarks
Mistral OCR API77.267.560.629.393.671.377.199.472.0 ± 1.1olmocr repo
Deepseek OCR75.272.379.733.396.166.780.199.775.4 ± 1.0Own benchmarks
GPT-4o (Anchored)53.574.570.040.793.869.360.696.869.9 ± 1.1olmocr repo
Gemini Flash 2 (Anchored)54.556.172.134.264.761.571.595.663.8 ± 1.2olmocr repo
Qwen 3 VL70.275.145.637.589.162.143.094.364.6 ± 1.1Own benchmarks
olmOCR v0.3.078.679.972.943.995.177.381.298.978.5 ± 1.1olmocr repo
dots.ocr82.164.288.340.994.182.481.299.579.1 ± 1.0dots.ocr repo

Chandra v0.1.0 achieves an overall score of 83.1 ± 0.9, leading on categories such as Old Scans Math (80.3), Tables (88.0), Old Scans (50.4), Long tiny text (92.3), and Base (99.9).

Example Outputs

Example conversion showing a complex document page with text, tables, and images faithfully rendered in markdown

Example conversions for tables, forms, handwriting, books, math, newspapers, and other document types are available in the model repository, demonstrating high-fidelity layout preservation and accurate content extraction.

best for

FAQ

What formats does Chandra output?

Chandra outputs markdown, HTML, or JSON with detailed layout information.

How do I call Chandra via the gigarouter API?

Use the OpenAI-compatible endpoint with your API key. Submit an image or PDF and specify the output format.

What is the license for Chandra?

The model uses OpenRAIL-M; the inference code is Apache 2.0. Commercial self-hosting requires a separate license.

How many languages does Chandra support?

Chandra 2 supports 90+ languages; the earlier version supports 40+.

Is Chandra available for on-premise deployment?

Yes, for on-prem licensing contact Datalab. A managed API is also available with SOC 2 Type 2 compliance.

not yet live

We're benchmarking and onboarding Chandra as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related vision-language models

compare all →