PP-LCNet_x1_0_doc_ori
PaddlePaddle/PP-LCNet_x1_0_doc_ori
A popular open image-to-text model, with 445.3K downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.
about this model
Key Strengths
This model is designed to pre-process document images—such as scanned documents or ID photos—that may have been captured at arbitrary rotations. By classifying orientation before OCR, it improves downstream text recognition accuracy. It achieves 99.06% recognition average accuracy with a storage size of only 7 MB.
Best For
- Correcting rotated document images in scanning or photo-capture workflows
- Preprocessing inputs for OCR pipelines to boost overall recognition accuracy
- Digital document management and automated document image quality improvement
Performance
| Model | Recognition Avg Accuracy (%) | Model Storage Size (M) | Categories |
|---|---|---|---|
| PP-LCNet_x1_0_doc_ori | 99.06 | 7 | 0°, 90°, 180°, 270° |
Example Output
For a sample input image, the model predicts a class ID of 2 (180° rotation) with a confidence score of 0.90971:
Integration
Gigarouter hosts this model as a managed, OpenAI-compatible API. You can call it directly without any local installation or dependency management. The model can also be used as part of a document preprocessing pipeline that combines orientation classification with geometric distortion correction for more robust document image handling.
We're benchmarking and onboarding PP-LCNet_x1_0_doc_ori as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.