models / embeddings · coming soon

jina-embeddings-v2-small-en

jinaai/jina-embeddings-v2-small-en

A popular open embeddings model, with 1.3M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price

~$0.008

/ 1M tokens · estimated, set at launch

API providers

downloads / mo

1.3M

license

apache-2.0

about this model

jina-embeddings-v2-small-en is an English monolingual embedding model hosted on gigarouter. It is based on a BERT architecture (JinaBERT) with symmetric bidirectional ALiBi, enabling a maximum sequence length of 8192 tokens despite being trained on 512-length sequences. The model has 33 million parameters, supporting fast and memory-efficient inference.

Key Strengths

Supports up to 8192 token sequence length, suitable for long documents.
33M parameters for low-latency, memory-efficient deployment.
Pretrained on C4 and fine-tuned on over 400 million sentence pairs with hard negatives from diverse domains.

Best For

Long document retrieval
Semantic textual similarity
Text reranking and recommendation
RAG and LLM-based generative search

Benchmark Performance

According to LlamaIndex, to achieve peak performance in both hit rate and MRR, the combination of OpenAI or JinaAI-Base embeddings with CohereRerank or bge-reranker-large rerankers stands out.

Model Variants

Model	Parameters	Languages
jina-embeddings-v2-small-en	33M	English
jina-embeddings-v2-base-en	137M	English
jina-embeddings-v2-base-zh	161M	Chinese-English bilingual
jina-embeddings-v2-base-de	161M	German-English bilingual

Technical details are available in the Jina Embeddings V2 paper.

not yet live

We're benchmarking and onboarding jina-embeddings-v2-small-en as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.