skip to content
gigarouter gigarouter
models / image-to-text · coming soon

trocr-small-handwritten

microsoft/trocr-small-handwritten

A popular open image-to-text model, with 448.6K downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

status
coming soon
API providers
0
downloads / mo
448.6K

about this model

Model Overview

TrOCR-small-handwritten is a transformer-based optical character recognition (OCR) model fine-tuned on the IAM handwriting dataset. It uses an encoder-decoder architecture: an image Transformer (initialized from DeiT) encodes input images, and a text Transformer (initialized from UniLM) autoregressively generates text. Images are processed as sequences of 16x16 fixed-size patches with absolute position embeddings.

Key Strengths

  • Designed specifically for handwritten text recognition from single text-line images.
  • Leverages pre-trained vision and language transformers for robust performance on handwriting.
  • Fine-tuned on the IAM dataset, a standard benchmark for handwriting recognition.

Best For

Optical character recognition (OCR) on handwritten text-line images, such as scanned documents, notes, or historical manuscripts.

Benchmark Results

The model card does not include specific benchmark numbers or evaluation metrics. The model was introduced in the paper TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models (Li et al., 2021).

Example Input

The model accepts single text-line images. A sample image from the IAM database is shown below:

Citation

Li et al., 2021: TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

not yet live

We're benchmarking and onboarding trocr-small-handwritten as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.