skip to content
gigarouter gigarouter
models / reranker · coming soon

Qwen3-Reranker-4B

Qwen/Qwen3-Reranker-4B

A popular open reranker model, with 1.8M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price
~$0.008
/ 1k docs · estimated, set at launch
API providers
0
downloads / mo
1.8M
license
apache-2.0

about this model

The Qwen3-Reranker-4B is a text reranking model from the Qwen3 Embedding series, designed for ranking tasks such as retrieval, code retrieval, and classification. It is built on the dense foundational model of Qwen3, inheriting multilingual capabilities (supporting over 100 languages), long-context understanding (up to 32k tokens), and reasoning skills. The model is instruction-aware, allowing custom prompts to improve performance for specific tasks, languages, or scenarios. Using instructions typically yields a 1% to 5% improvement in downstream tasks.

Key Strengths

  • Multilingual and cross-lingual support: Handles more than 100 languages, including programming languages, enabling robust multilingual and code retrieval.
  • Long context window: 32k token sequence length for processing extensive documents and queries.
  • Instruction-aware: Developers can provide custom instructions to tailor ranking behavior for specific use cases (e.g., classification, domain-specific retrieval).
  • Scalable sizes: Part of a series with 0.6B, 4B, and 8B variants, allowing selection based on efficiency and effectiveness.

Benchmark Performance

The Qwen3 Embedding series achieves state-of-the-art results on text retrieval benchmarks. The 8B embedding model holds the No.1 position on the MTEB multilingual leaderboard as of June 5, 2025 (score 70.58). The reranking models, including the 4B variant, excel in various text retrieval scenarios, though specific reranking benchmark scores are not provided in the model card.

Model Details

Property Value
Model TypeText Reranking
Parameters4B
Context Length32k
Supported Languages100+
Instruction AwareYes

Qwen3 Embedding series overview

For further details, refer to the official blog and GitHub repository.

not yet live

We're benchmarking and onboarding Qwen3-Reranker-4B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.