skip to content
gigarouter gigarouter
tasks / object detection

Hosted object detection models

36 models · 0 live as APIs · benchmarked & compared

Object detection models locate and classify objects within images or documents. They solve problems such as extracting tables from scanned PDFs (e.g., microsoft/table-transformer-structure-recognition, TahaDouaji/detr-doc-table-detection), detecting table regions in layouts (microsoft/table-transformer-detection, microsoft/table-transformer-structure-recognition-v1.1-all), and identifying small objects in general scenes (hustvl/yolos-small). Other models target document layout parsing (PaddlePaddle/PP-DocLayoutV3_safetensors) or high-accuracy detection across diverse categories (PekingU/rtdetr_v2_r50vd, PekingU/rtdetr_r50vd_coco_o365).

In production, these models are often deployed as part of a document processing pipeline, a real-time video analysis system, or a batch annotation service. They are typically called via an API that accepts an image or a document page and returns bounding boxes with class labels and confidence scores. Integration involves preprocessing inputs, handling model inference, and post-processing outputs for downstream tasks such as OCR, data extraction, or automation.

Choosing among object detection models involves a trade-off between model size, inference speed, and detection quality. Larger backbones (e.g., r50vd-based RT-DETR models) tend to achieve higher accuracy but require more compute and latency. Smaller models such as yolos-small trade some accuracy for faster inference and lower memory footprint. Domain-specific models (like the table transformers) are purpose-built for particular use cases and generally outperform general-purpose models on their target task. The right choice depends on the acceptable throughput, hardware budget, and precision requirements of your application.

For most call volumes, using a hosted API eliminates the overhead of provisioning GPUs, managing inference frameworks, and scaling for variable demand — making it a simpler and more cost-effective option than self-hosting.

compare

modelparamsdownloads/mopricestatus
microsoft/table-transformer-structure-recognition28.8M1.8M~$0.047 / 1k imagescoming soon
microsoft/table-transformer-detection28.8M1.5M~$0.047 / 1k imagescoming soon
hustvl/yolos-small30.7M713.6K~$0.047 / 1k imagescoming soon
PaddlePaddle/PP-DocLayoutV3_safetensors33.3M341.1K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_v2_r50vd43M309.8K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_r50vd_coco_o36543M254.5K~$0.047 / 1k imagescoming soon
microsoft/table-transformer-structure-recognition-v1.1-all28.8M239.5K~$0.047 / 1k imagescoming soon
TahaDouaji/detr-doc-table-detection41.6M208.1K~$0.047 / 1k imagescoming soon
keremberke/yolov8m-table-extraction-176.4Kat launchcoming soon
hustvl/yolos-tiny6.5M100.9K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_r101vd_coco_o36576.8M99.4K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_v2_r18vd20.2M97.1K~$0.047 / 1k imagescoming soon
Anzhc/Anzhcs_YOLOs-75.5Kat launchcoming soon
PekingU/rtdetr_r50vd43M63.7K~$0.047 / 1k imagescoming soon
foduucom/stockmarket-pattern-detection-yolov8-42.1Kat launchcoming soon
morsetechlab/yolov11-license-plate-detection-26.5Kat launchcoming soon
keremberke/yolov5m-license-plate-23.8Kat launchcoming soon
valentinafevu/yolos-fashionpedia-21.4Kat launchcoming soon
microsoft/conditional-detr-resnet-5043.5M18.5K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_r18vd_coco_o36520.2M17.3K~$0.047 / 1k imagescoming soon
Ultralytics/YOLOv8-10.4Kat launchcoming soon
iitolstykh/YOLO-Face-Person-Detector-10.2Kat launchcoming soon
Ultralytics/YOLO11-9.8Kat launchcoming soon
PekingU/rtdetr_r18vd20.2M9K~$0.047 / 1k imagescoming soon
Ultralytics/YOLO26-8.5Kat launchcoming soon
SenseTime/deformable-detr40.2M8.1K~$0.047 / 1k imagescoming soon
Fuyucchi/yolov8_animeface-7.8Kat launchcoming soon
facebook/detr-resnet-101-dc560.7M7.1K~$0.047 / 1k imagescoming soon
PekingU/rtdetr_v2_r101vd76.8M6.9K~$0.047 / 1k imagescoming soon
Xenova/detr-resnet-50-6.5Kat launchcoming soon
tech4humans/conditional-detr-50-signature-detector43.5M6.2K~$0.047 / 1k imagescoming soon
mosesb/best-comic-panel-detection-4.6Kat launchcoming soon
mudler/locate-anything.cpp-gguf-4.6Kat launchcoming soon
ustc-community/dfine-small-coco10.4M4.5K~$0.047 / 1k imagescoming soon
jameslahm/yoloe-4.3Kat launchcoming soon
Armaggheddon/yolo11-document-layout-4.2Kat launchcoming soon