jina-reranker-v3
jinaai/jina-reranker-v3
A popular open reranker model, with 949.9K downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.
about this model
Model Overview
jina-reranker-v3 is a 0.6B-parameter multilingual document reranker designed for listwise reranking. It employs a novel "last but not late interaction" architecture: instead of separate multi-vector encoding, the model performs causal self-attention between the query and all documents within a single context window, extracting relevance scores from the last token embedding of each document. This approach enables simultaneous processing of up to 64 documents within a 131K token context window.
Benchmark Performance
The model achieves state-of-the-art results on BEIR (61.94 nDCG@10) while being 10x smaller than generative listwise rerankers. The following table compares jina-reranker-v3 against other rerankers on standard benchmarks (nDCG@10):
| Model | Size | BEIR | MIRACL | MKQA | CoIR |
|---|---|---|---|---|---|
| jina-reranker-v3 | 0.6B | 61.94 | 66.83 | 67.92 | 70.64 |
| jina-reranker-v2 | 0.3B | 57.06 | 63.65 | 67.90 | 56.14 |
| jina-reranker-m0 | 2.4B | 58.95 | 66.75 | 68.19 | 63.55 |
| bge-reranker-v2-m3 | 0.6B | 56.51 | 69.32 | 67.88 | 36.28 |
| mxbai-rerank-base-v2 | 0.5B | 58.40 | 55.32 | 64.24 | 65.71 |
| mxbai-rerank-large-v2 | 1.5B | 61.44 | 57.94 | 67.06 | 70.87 |
| Qwen3-Reranker-0.6B | 0.6B | 56.28 | 57.70 | 65.34 | 65.18 |
| Qwen3-Reranker-4B | 4.0B | 61.16 | 67.52 | 67.52 | 73.91 |
| jina-code-embeddings-0.5b | 0.5B | - | - | - | 73.94 |
We're benchmarking and onboarding jina-reranker-v3 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.