models / reranker · coming soon

mmarco-mMiniLMv2-L12-H384-v1

cross-encoder/mmarco-mMiniLMv2-L12-H384-v1

A popular open reranker model, with 1.6M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price

~$0.008

/ 1k docs · estimated, set at launch

API providers

downloads / mo

1.6M

license

apache-2.0

about this model

The cross-encoder/mmarco-mMiniLMv2-L12-H384-v1 is a multilingual cross-encoder model for document reranking, hosted as an OpenAI-compatible API on gigarouter. It was trained on the MMARCO dataset — a machine-translated version of MS MARCO covering 14 languages — and uses the multilingual MiniLMv2 architecture as its base.

Task

Given a query and a set of candidate passages (e.g., retrieved via ElasticSearch), the model assigns a relevance score to each (query, passage) pair. Passages can then be sorted by score to produce a reranked result list.

Key Strengths

Multilingual: trained on 14 languages (via MMARCO) and shows strong cross-lingual transfer to other languages beyond that set.
Compact and efficient: the MiniLMv2-L12-H384 architecture balances inference speed with ranking quality.

Best For

Information retrieval pipelines that require a reranking step after an initial retrieval stage. It is suitable for applications needing multilingual or cross-lingual search, question answering, and document retrieval.

References

For implementation details and usage patterns: SBERT.net Retrieve & Re-rank.
Training code: SBERT.net Training MS Marco.

not yet live

We're benchmarking and onboarding mmarco-mMiniLMv2-L12-H384-v1 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.