Hosted image segmentation models
9 models · 0 live as APIs · benchmarked & compared
Image segmentation models partition an image into distinct regions, enabling pixel-level classification. This capability solves problems such as isolating a product from its background for e-commerce catalogs (BriaAI/RMBG-1.4, ZhengPeng7/BiRefNet), extracting document layouts for automated data entry (PaddlePaddle/PP-DocLayoutV3), or performing salient object detection for photo editing (CIDAS/clipseg-rd64-refined, PramaLLC/BEN2). In production, these models are typically called via batch processing pipelines or real-time inference endpoints, where a raw image is passed to the model and the output mask is used for downstream cropping, compositing, or analysis.
Choosing between these nine models involves a trade-off between size, quality, and speed. Lightweight variants such as ZhengPeng7/BiRefNet_lite and Xenova/modnet offer faster inference at lower memory cost, while larger versions like ZhengPeng7/BiRefNet_HR and BEN2 deliver finer boundaries and higher accuracy on challenging scenes. For most production workloads, calling a hosted API eliminates the overhead of managing GPU infrastructure, scaling, and model updates, making it the simpler and more cost-effective option.
compare
| model | params | downloads/mo | price | status |
|---|---|---|---|---|
| CIDAS/clipseg-rd64-refined | 150.7M | 1M | ~$0.047 / 1k images | coming soon |
| ZhengPeng7/BiRefNet | 220.7M | 733.7K | ~$0.094 / 1k images | coming soon |
| briaai/RMBG-1.4 | 44.1M | 309.5K | ~$0.047 / 1k images | coming soon |
| Xenova/modnet | - | 54.3K | at launch | coming soon |
| PaddlePaddle/PP-DocLayoutV3 | - | 42.6K | at launch | coming soon |
| PramaLLC/BEN2 | 94.6M | 26.4K | ~$0.047 / 1k images | coming soon |
| ZhengPeng7/BiRefNet_lite | 44.4M | 24.8K | ~$0.047 / 1k images | coming soon |
| ZhengPeng7/BiRefNet_HR | 220.7M | 23.5K | ~$0.094 / 1k images | coming soon |
| ZhengPeng7/BiRefNet-portrait | 220.7M | 19.7K | ~$0.094 / 1k images | coming soon |