skip to content
gigarouter gigarouter
models / embeddings · coming soon

nomic-embed-text-v1.5

nomic-ai/nomic-embed-text-v1.5

A popular open embeddings model, with 16.9M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price
~$0.008
/ 1M tokens · estimated, set at launch
API providers
0
downloads / mo
16.9M
license
apache-2.0

about this model

nomic-embed-text-v1.5 is a text embedding model that produces high-quality vectors for retrieval, clustering, classification, and semantic search. It supports Matryoshka Representation Learning, allowing developers to truncate embeddings to any dimension (64, 128, 256, 512, or 768) with a negligible performance trade-off. The model accepts sequences up to 8,192 tokens.

Task-specific prefixes

To achieve optimal results, prepend the appropriate instruction prefix to each input:

  • search_document – for documents in a RAG index
  • search_query – for queries to search against a document index
  • clustering – for grouping texts by topic
  • classification – for texts used as features in a classifier

Performance

The table below shows MTEB scores at various embedding dimensions. The full 768‑dimension version scores 62.28, while reducing to 512 dimensions retains 61.96.

ModelSequence LengthDimensionMTEB
nomic-embed-text-v1819276862.39
nomic-embed-text-v1.5819276862.28
nomic-embed-text-v1.5819251261.96
nomic-embed-text-v1.5819225661.04
nomic-embed-text-v1.5819212859.34
nomic-embed-text-v1.581926456.10
Performance vs dimension

Text embeddings from this model can also be used alongside the companion vision model nomic-embed-vision-v1.5, which is aligned to the same embedding space.

not yet live

We're benchmarking and onboarding nomic-embed-text-v1.5 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.