models / image-to-text · coming soon

PP-LCNet_x1_0_doc_ori

PaddlePaddle/PP-LCNet_x1_0_doc_ori

A popular open image-to-text model, with 445.3K downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

status

coming soon

API providers

downloads / mo

445.3K

license

apache-2.0

about this model

PP-LCNet_x1_0_doc_ori is a document image orientation classification model that identifies the orientation of document images across four categories: 0°, 90°, 180°, and 270°.

Key Strengths

This model is designed to pre-process document images—such as scanned documents or ID photos—that may have been captured at arbitrary rotations. By classifying orientation before OCR, it improves downstream text recognition accuracy. It achieves 99.06% recognition average accuracy with a storage size of only 7 MB.

Best For

Correcting rotated document images in scanning or photo-capture workflows
Preprocessing inputs for OCR pipelines to boost overall recognition accuracy
Digital document management and automated document image quality improvement

Performance

Model	Recognition Avg Accuracy (%)	Model Storage Size (M)	Categories
PP-LCNet_x1_0_doc_ori	99.06	7	0°, 90°, 180°, 270°

Example Output

For a sample input image, the model predicts a class ID of 2 (180° rotation) with a confidence score of 0.90971:

Visualized classification result showing the original image and its predicted orientation

Integration

Gigarouter hosts this model as a managed, OpenAI-compatible API. You can call it directly without any local installation or dependency management. The model can also be used as part of a document preprocessing pipeline that combines orientation classification with geometric distortion correction for more robust document image handling.

not yet live

We're benchmarking and onboarding PP-LCNet_x1_0_doc_ori as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.