skip to content
gigarouter gigarouter
models / reranker

Qwen3-Reranker-0.6B

Qwen/Qwen3-Reranker-0.6B

A hosted reranker model - call it over an OpenAI-compatible API, no GPU to run.

price
$0.008
/ 1k docs
throughput
660 docs/s

about this model

Qwen3-Reranker-0.6B is a text reranking model built on the Qwen3 dense foundation. It scores and ranks documents relative to a query, outputting relevance logits (or probabilities after sigmoid). With 0.6 billion parameters and support for over 100 languages, it is designed for efficient and accurate search result reordering.

Key Strengths

  • Instruction-aware: The model accepts custom instructions per task, enabling task- or domain-specific behavior. Using instructions typically yields a 1–5% improvement over default prompts across most downstream tasks.
  • Multilingual and cross-lingual: Inherits the multilingual capabilities of Qwen3, covering 100+ natural languages and multiple programming languages, making it suitable for global and code retrieval scenarios.
  • Long-context support: Handles input sequences up to 32,768 tokens, accommodating lengthy queries and documents.

What It Is Best For

Text retrieval, code retrieval, text classification, text clustering, and bitext mining. As a reranker, it is most effective when placed after an initial retrieval step to refine and order candidate documents by relevance.

Benchmark Context

The Qwen3 Embedding series achieved state-of-the-art results across a range of text embedding and ranking tasks. The 8B embedding model ranks No. 1 on the MTEB multilingual leaderboard (as of June 5, 2025, score 70.58). The reranker models — including this 0.6B variant — excel in various text retrieval scenarios, benefiting from the same foundational advances in multilingual capability, long-text understanding, and reasoning.

Model Specifications

PropertyValue
TypeText Reranking
Parameters0.6B
Layers28
Max Sequence Length32,768 tokens
Instruction AwareYes
Qwen3 Embedding series

This model is hosted by gigarouter as a managed, OpenAI‑compatible API. No installation or local loading is required — simply send queries and documents via API call to obtain relevance scores.

call it
# rerank documents by relevance; billed per document
curl https://gigarouter.ai/v1/rerank \
  -H "Authorization: Bearer $GR_KEY" \
  -d '{"model":"Qwen/Qwen3-Reranker-0.6B","query":"capital of France",
       "documents":["Paris is the capital of France.","Bananas are yellow."]}'