skip to content
gigarouter gigarouter
models / embeddings · coming soon

gte-large-en-v1.5

Alibaba-NLP/gte-large-en-v1.5

A popular open embeddings model, with 1.1M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price
~$0.008
/ 1M tokens · estimated, set at launch
API providers
0
downloads / mo
1.1M
license
apache-2.0

about this model

Alibaba-NLP/gte-large-en-v1.5 is an English text embedding model that generates dense vector representations for tasks such as retrieval, clustering, classification, and semantic similarity. It supports a maximum sequence length of 8192 tokens and produces 1024-dimensional embeddings.

Key strengths

  • State-of-the-art performance on the MTEB benchmark within the 400M parameter size category.
  • Competitive results on the LoCo long-context retrieval benchmark, demonstrating strong handling of extended documents.
  • Built on a transformer++ encoder backbone (BERT + RoPE + GLU) optimized for long contexts.

Benchmark results

MTEB (English, 56 tasks): average score 65.39. LoCo (long-context retrieval, 5 tasks): average score 86.71.

ModelParams (M)Dim.Max lengthMTEB avg.LoCo avg.
gte-large-en-v1.54341024819265.3986.71
mxbai-embed-large-v1335102451264.68
bge-large-en-v1.5335102451264.23

MTEB subtask breakdown (gte-large-en-v1.5): Classification 77.75, Clustering 47.95, Pair Classification 84.63, Reranking 58.50, Retrieval 57.91, STS 81.43, Summarization 30.91.

Best use cases

  • Long-document retrieval and search, where queries and documents exceed 512 tokens.
  • High‑accuracy semantic similarity, clustering, and classification tasks in English.
  • Applications requiring a lightweight (434M parameter) model with competitive MTEB scores.
not yet live

We're benchmarking and onboarding gte-large-en-v1.5 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.