models / reranker · coming soon

Qwen3-VL-Reranker-2B

Qwen/Qwen3-VL-Reranker-2B

A popular open reranker model, with 300.3K downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price

~$0.008

/ 1k docs · estimated, set at launch

API providers

downloads / mo

300.3K

license

apache-2.0

about this model

The Qwen3-VL-Reranker-2B is a specialized multimodal reranking model built on the Qwen3-VL foundation. It accepts a (query, document) pair where each may contain text, images, screenshots, videos, or arbitrary combinations of these modalities, and outputs a precise relevance score. Designed to work alongside the Qwen3-VL-Embedding model in a two-stage retrieval pipeline, it refines initial recall results to significantly boost retrieval accuracy.

Key Strengths

Multimodal versatility: Handles text, images, screenshots, and videos within a unified framework, supporting image-text retrieval, video-text matching, visual question answering, and multimodal content clustering.
High-precision reranking: Delivers superior relevance scoring compared to baseline rerankers and base embedding models across diverse tasks.
Multilingual support: Covers over 30 languages, inheriting Qwen3-VL’s multilingual capabilities.
Instruction-aware: Customizable input instructions for different tasks; evaluations show 1–5% improvement when using task-specific prompts (English recommended).
Context length: 32K tokens.

Ideal Use Cases

Performing high-accuracy reranking in multimodal search pipelines, visual document retrieval, and cross-modal understanding applications where initial embeddings require refinement.

Benchmark Performance

Model	Size	MMEB-v2 Retrieval (Avg)	MMEB-v2 Image	MMEB-v2 Video	MMEB-v2 VisDoc	MMTEB Retrieval	JinaVDR	ViDoRe v3
Qwen3-VL-Embedding-2B	2B	73.4	74.8	53.6	79.2	68.1	71.0	52.9
jina-reranker-m0	2B	–	68.2	–	85.2	–	82.2	57.8
Qwen3-VL-Reranker-2B	2B	75.1	73.8	52.1	83.4	70.0	80.9	60.8
Qwen3-VL-Reranker-8B	8B	79.2	80.7	55.8	86.3	74.9	83.6	66.7

Qwen3-VL-Reranker-2B performance overview

For further details, including full benchmark evaluation, hardware requirements, and inference performance, refer to the technical report, blog, and GitHub repository.

not yet live

We're benchmarking and onboarding Qwen3-VL-Reranker-2B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.