ms-marco-MiniLM-L2-v2
cross-encoder/ms-marco-MiniLM-L2-v2
A popular open reranker model, with 1.2M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.
about this model
The cross-encoder/ms-marco-MiniLM-L2-v2 model is a compact cross-encoder fine-tuned for passage reranking. Given a query and a set of candidate passages (for example, retrieved via a first-stage retrieval system like ElasticSearch), the model assigns a relevance score to each query–passage pair. Passages can then be sorted by score to produce a reranked result list. This model was trained on the MS Marco Passage Ranking task.
Key Strengths
- Efficient: With only 2 layers (MiniLM-L2 architecture), it offers a favorable speed–quality trade-off, processing approximately 4,100 query–passage pairs per second on a V100 GPU.
- Competitive performance: Achieves an NDCG@10 of 71.01 on the TREC Deep Learning 2019 benchmark and an MRR@10 of 34.85 on the MS Marco Dev set.
- Suitable for production reranking pipelines where low latency and throughput are important.
Performance Benchmark
The table below compares this model with other pre-trained cross-encoders on standard reranking metrics (NDCG@10 and MRR@10) and throughput (docs/second on a V100 GPU).
| Model-Name | NDCG@10 (TREC DL 19) | MRR@10 (MS Marco Dev) | Docs / Sec |
|---|---|---|---|
| cross-encoder/ms-marco-TinyBERT-L2-v2 | 69.84 | 32.56 | 9000 |
| cross-encoder/ms-marco-MiniLM-L2-v2 | 71.01 | 34.85 | 4100 |
| cross-encoder/ms-marco-MiniLM-L4-v2 | 73.04 | 37.70 | 2500 |
| cross-encoder/ms-marco-MiniLM-L6-v2 | 74.30 | 39.01 | 1800 |
| cross-encoder/ms-marco-MiniLM-L12-v2 | 74.31 | 39.02 | 960 |
| cross-encoder/ms-marco-TinyBERT-L2 | 67.43 | 30.15 | 9000 |
| cross-encoder/ms-marco-TinyBERT-L4 | 68.09 | 34.50 | 2900 |
| cross-encoder/ms-marco-TinyBERT-L6 | 69.57 | 36.13 | 680 |
| cross-encoder/ms-marco-electra-base | 71.99 | 36.41 | 340 |
| nboost/pt-tinybert-msmarco | 63.63 | 28.80 | 2900 |
| nboost/pt-bert-base-uncased-msmarco | 70.94 | 34.75 | 340 |
| nboost/pt-bert-large-msmarco | 73.36 | 36.48 | 100 |
| Capreolus/electra-base-msmarco | 71.23 | 36.89 | 340 |
We're benchmarking and onboarding ms-marco-MiniLM-L2-v2 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.