Qwen3.6-27B-FP8
Qwen/Qwen3.6-27B-FP8
A popular open vision-language model, with 4.9M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.
about this model
Overview
Qwen3.6-27B-FP8 is a 27-billion-parameter causal language model with a vision encoder, optimized for agentic coding and vision-language tasks. It is hosted on gigarouter as a managed, OpenAI-compatible API, eliminating the need for local infrastructure.
Key Strengths
- Agentic Coding: Enhanced fluency and precision in frontend workflows and repository-level reasoning.
- Thinking Preservation: Retains reasoning context from historical messages to streamline iterative development.
- Extended Context: Native context length of 262,144 tokens, extensible up to 1,010,000 tokens.
- FP8 Quantization: Fine-grained FP8 quantization (block size 128) delivers performance nearly identical to the original model.
Benchmark Results
Qwen3.6-27B achieves competitive scores on coding agent and knowledge benchmarks:
| Benchmark | Qwen3.6-27B |
|---|---|
| SWE-bench Verified | 77.2 |
| SWE-bench Pro | 53.5 |
| SWE-bench Multilingual | 71.3 |
| Terminal-Bench 2.0 | 59.3 |
| SkillsBench Avg5 | 48.2 |
| QwenWebBench | 1487 |
| NL2Repo | 36.2 |
| Claw-Eval Avg | 72.4 |
| Claw-Eval Pass^3 | 60.6 |
| QwenClawBench | 53.4 |

Model Architecture
- Parameters: 27B
- Hidden Dimension: 5120
- Layers: 64
- Hidden Layout: 16 × (3 × (Gated DeltaNet → FFN) → 1 × (Gated Attention → FFN))
- Context Length: 262,144 native, up to 1,010,000 tokens
- Training Stage: Pre-training & Post-training
Best For
Developers building agentic coding applications, particularly those requiring repository-level reasoning, frontend workflow automation, and long-context understanding. The model is also suited for vision-language tasks via its integrated vision encoder.
For further details, see the Qwen3.6-27B blog post.
We're benchmarking and onboarding Qwen3.6-27B-FP8 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.