Qwen3-Embedding-4B

Qwen/Qwen3-Embedding-4B

A popular open embedding model, with 2.6M downloads a month. gigarouter is benchmarking it and plans to host it as an OpenAI-compatible API.

est. price

~$0.008 / 1M tokens · estimated, set at launch

downloads / mo

2.6M

license

apache-2.0

about this model

Qwen3-Embedding-4B is a 4-billion-parameter text embedding model from the Qwen3 Embedding series, optimized for multilingual text embedding and ranking tasks. It is designed for text retrieval, code retrieval, classification, clustering, and bitext mining across more than 100 languages.

Key Capabilities

Context length: 32K tokens.
Embedding dimension: Up to 2,560; supports Matryoshka Representation Learning (MRL) for user-defined dimensions from 32 to 2,560.
Instruction-aware: Both embedding and reranking models accept task-specific instructions; using instructions typically improves downstream task performance by 1%–5%.
Multilingual: Supports 100+ natural languages and multiple programming languages for code retrieval.
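The MRL and instruction-aware items above can be illustrated with a small sketch. It assumes the common MRL convention that a shorter embedding is obtained by keeping the leading components and re-normalizing, and it assumes an "Instruct: ... / Query: ..." prompt template; both should be checked against the upstream model card before use. The function names are hypothetical, and the input vector is a stand-in rather than real model output.

```python
import math

FULL_DIM = 2560  # maximum embedding dimension listed above for the 4B model
MIN_DIM = 32     # smallest user-defined dimension listed above


def truncate_embedding(vector, dim):
    """Keep the first `dim` components and L2-normalize the result."""
    if not MIN_DIM <= dim <= len(vector):
        raise ValueError(f"dim must be between {MIN_DIM} and {len(vector)}")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # an all-zero vector cannot be normalized
    return [x / norm for x in head]


def with_instruction(task, query):
    """Prefix a query with a task instruction (assumed template)."""
    return f"Instruct: {task}\nQuery: {query}"


# Stand-in vector; a real one would come from the model.
full = [math.sin(i + 1) for i in range(FULL_DIM)]
small = truncate_embedding(full, 256)

prompt = with_instruction(
    "Given a web search query, retrieve relevant passages that answer the query",
    "what is matryoshka representation learning",
)
```

Re-normalizing after truncation keeps cosine similarity and dot product interchangeable for the shortened vectors. Instructions are usually applied to queries only, with documents embedded as-is, though that too is a convention to verify for this model.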

Performance

The 8B variant of the Qwen3 Embedding series ranked No. 1 on the MTEB multilingual leaderboard as of June 5, 2025, with a score of 70.58. Qwen3-Embedding-4B shares the same architecture and training methodology and targets strong multilingual and cross-lingual retrieval quality; the 70.58 score is for the 8B model, not this one.

Model Series Overview

Model Type	Model	Size	Layers	Sequence Length	Embedding Dimension	MRL Support	Instruction Aware
Text Embedding	Qwen3-Embedding-0.6B	0.6B	28	32K	1024	Yes	Yes
Text Embedding	Qwen3-Embedding-4B	4B	36	32K	2560	Yes	Yes
Text Embedding	Qwen3-Embedding-8B	8B	36	32K	4096	Yes	Yes
Text Reranking	Qwen3-Reranker-0.6B	0.6B	28	32K	–	–	Yes
Text Reranking	Qwen3-Reranker-4B	4B	36	32K	–	–	Yes
Text Reranking	Qwen3-Reranker-8B	8B	36	32K	–	–	Yes
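The embedding dimensions in the table translate directly into index storage. The sketch below is a rough estimate, assuming vectors are stored as uncompressed float32 values (4 bytes each) with no index overhead; the helper name is hypothetical.

```python
BYTES_PER_FLOAT32 = 4  # assumption: uncompressed float32 storage

# Maximum embedding dimensions from the table above.
MAX_DIMS = {
    "Qwen3-Embedding-0.6B": 1024,
    "Qwen3-Embedding-4B": 2560,
    "Qwen3-Embedding-8B": 4096,
}


def index_size_bytes(num_vectors, dim, bytes_per_value=BYTES_PER_FLOAT32):
    """Raw storage for `num_vectors` embeddings of width `dim`."""
    return num_vectors * dim * bytes_per_value


for name, dim in MAX_DIMS.items():
    gb = index_size_bytes(1_000_000, dim) / 1e9
    print(f"{name}: {gb:.3f} GB per million vectors")
```

Under these assumptions, one million full-width vectors take about 4.096 GB for the 0.6B model, 10.240 GB for the 4B model, and 16.384 GB for the 8B model. Choosing a smaller MRL dimension reduces the figure proportionally.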

[Figure: Qwen3 Embedding model architecture diagram]

For further details, refer to the official blog post and GitHub repository.

not yet live

We're benchmarking and onboarding Qwen3-Embedding-4B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.
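Since the model is not live yet, the following is only a sketch of what a request could look like if the hosted API follows the OpenAI embeddings request shape (POST to an /embeddings path with "model" and "input" fields). The base URL is a placeholder, and the model identifier and support for the optional "dimensions" field are assumptions, not confirmed details of the service. The request is built but deliberately not sent.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com/v1"  # placeholder, not a real endpoint
MODEL_ID = "Qwen/Qwen3-Embedding-4B"     # assumed to match the hosted model name


def build_embedding_request(texts, api_key, dimensions=None):
    """Build (but do not send) an OpenAI-style embeddings request."""
    payload = {"model": MODEL_ID, "input": list(texts)}
    if dimensions is not None:
        # Assumption: the host maps this field to an MRL output size.
        payload["dimensions"] = dimensions
    return urllib.request.Request(
        url=f"{BASE_URL}/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_embedding_request(
    ["hello world"], api_key="YOUR_API_KEY", dimensions=1024
)
# Once an endpoint exists, the request could be sent with:
# with urllib.request.urlopen(request) as resp:
#     body = json.load(resp)
```

Building the request separately from sending it makes the payload easy to inspect and adjust once the real base URL and parameters are published.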