skip to content
gigarouter gigarouter
models / reranker · coming soon

ms-marco-MiniLM-L2-v2

cross-encoder/ms-marco-MiniLM-L2-v2

A popular open reranker model, with 1.2M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price
~$0.008
/ 1k docs · estimated, set at launch
API providers
0
downloads / mo
1.2M
license
apache-2.0

about this model

The cross-encoder/ms-marco-MiniLM-L2-v2 model is a compact cross-encoder fine-tuned for passage reranking. Given a query and a set of candidate passages (for example, retrieved via a first-stage retrieval system like ElasticSearch), the model assigns a relevance score to each query–passage pair. Passages can then be sorted by score to produce a reranked result list. This model was trained on the MS Marco Passage Ranking task.

Key Strengths

  • Efficient: With only 2 layers (MiniLM-L2 architecture), it offers a favorable speed–quality trade-off, processing approximately 4,100 query–passage pairs per second on a V100 GPU.
  • Competitive performance: Achieves an NDCG@10 of 71.01 on the TREC Deep Learning 2019 benchmark and an MRR@10 of 34.85 on the MS Marco Dev set.
  • Suitable for production reranking pipelines where low latency and throughput are important.

Performance Benchmark

The table below compares this model with other pre-trained cross-encoders on standard reranking metrics (NDCG@10 and MRR@10) and throughput (docs/second on a V100 GPU).

Model-Name NDCG@10 (TREC DL 19) MRR@10 (MS Marco Dev) Docs / Sec
cross-encoder/ms-marco-TinyBERT-L2-v2 69.84 32.56 9000
cross-encoder/ms-marco-MiniLM-L2-v2 71.01 34.85 4100
cross-encoder/ms-marco-MiniLM-L4-v2 73.04 37.70 2500
cross-encoder/ms-marco-MiniLM-L6-v2 74.30 39.01 1800
cross-encoder/ms-marco-MiniLM-L12-v2 74.31 39.02 960
cross-encoder/ms-marco-TinyBERT-L2 67.43 30.15 9000
cross-encoder/ms-marco-TinyBERT-L4 68.09 34.50 2900
cross-encoder/ms-marco-TinyBERT-L6 69.57 36.13 680
cross-encoder/ms-marco-electra-base 71.99 36.41 340
nboost/pt-tinybert-msmarco 63.63 28.80 2900
nboost/pt-bert-base-uncased-msmarco 70.94 34.75 340
nboost/pt-bert-large-msmarco 73.36 36.48 100
Capreolus/electra-base-msmarco 71.23 36.89 340
not yet live

We're benchmarking and onboarding ms-marco-MiniLM-L2-v2 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.