models / vision-language · coming soon

Qwen3.6-35B-A3B-FP8

Qwen/Qwen3.6-35B-A3B-FP8

A popular open vision-language model, with 6.2M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price

~$1.341

/ 1k images · estimated, set at launch

API providers

downloads / mo

6.2M

license

apache-2.0

about this model

Qwen3.6-35B-A3B-FP8 is a causal language model with a vision encoder (VLM) hosted on gigarouter as an OpenAI-compatible API. It contains 35 billion total parameters with 3 billion activated through a mixture-of-experts architecture (256 experts, 8 routed plus 1 shared expert). Fine-grained FP8 quantization (block size 128) yields performance nearly identical to the original model. Native context length is 262,144 tokens, extensible to approximately 1,010,000 tokens.

Key Capabilities

Optimised for agentic coding, particularly frontend workflows and repository-level reasoning. A new thinking preservation feature retains reasoning context from historical messages, reducing overhead in iterative development. The model also handles general agent tasks.

Benchmark Results

Coding agent benchmarks (compared to Qwen3.5-27B, Gemma4-31B, Qwen3.5-35BA3B, and Gemma4-26BA4B):

	Qwen3.5-27B	Gemma4-31B	Qwen3.5-35BA3B	Gemma4-26BA4B	Qwen3.6-35BA3B
Coding Agent
SWE-bench Verified	75.0	52.0	70.0	17.4	73.4
SWE-bench Multilingual	69.3	51.7	60.3	17.3	67.2
SWE-bench Pro	51.2	35.7	44.6	13.8	49.5
Terminal-Bench 2.0	41.6	42.9	40.5	34.2	51.5
Claw-Eval (Avg)	64.3	48.5	65.4	58.8	68.7
Claw-Eval (Pass^3)	46.2	25.0	51.0	28.0	50.0
SkillsBench (Avg5)	27.2	23.6	4.4	12.3	28.7
QwenClawBench	52.2	41.7	47.7	38.7	52.6
NL2Repo	27.3	15.5	20.5	11.6	29.4
QwenWebBench	1068	1197	978	1178	1397
General Agent
TAU3-Bench	68.4	67.5	68.9	59.0	67.2

Benchmark Results

Model diagram

For further details, refer to the Qwen3.6-35B-A3B blog post.

not yet live

We're benchmarking and onboarding Qwen3.6-35B-A3B-FP8 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.