skip to content
gigarouter gigarouter
models / vision-language · coming soon

Qwen3.6-27B-FP8

Qwen/Qwen3.6-27B-FP8

A popular open vision-language model, with 4.9M downloads a month. gigarouter benchmarks and hosts it as an OpenAI-compatible API.

est. price
~$1.341
/ 1k images · estimated, set at launch
API providers
0
downloads / mo
4.9M
license
apache-2.0

about this model

Overview

Qwen3.6-27B-FP8 is a 27-billion-parameter causal language model with a vision encoder, optimized for agentic coding and vision-language tasks. It is hosted on gigarouter as a managed, OpenAI-compatible API, eliminating the need for local infrastructure.

Key Strengths

  • Agentic Coding: Enhanced fluency and precision in frontend workflows and repository-level reasoning.
  • Thinking Preservation: Retains reasoning context from historical messages to streamline iterative development.
  • Extended Context: Native context length of 262,144 tokens, extensible up to 1,010,000 tokens.
  • FP8 Quantization: Fine-grained FP8 quantization (block size 128) delivers performance nearly identical to the original model.

Benchmark Results

Qwen3.6-27B achieves competitive scores on coding agent and knowledge benchmarks:

BenchmarkQwen3.6-27B
SWE-bench Verified77.2
SWE-bench Pro53.5
SWE-bench Multilingual71.3
Terminal-Bench 2.059.3
SkillsBench Avg548.2
QwenWebBench1487
NL2Repo36.2
Claw-Eval Avg72.4
Claw-Eval Pass^360.6
QwenClawBench53.4

Benchmark Results

Model Architecture

  • Parameters: 27B
  • Hidden Dimension: 5120
  • Layers: 64
  • Hidden Layout: 16 × (3 × (Gated DeltaNet → FFN) → 1 × (Gated Attention → FFN))
  • Context Length: 262,144 native, up to 1,010,000 tokens
  • Training Stage: Pre-training & Post-training

Best For

Developers building agentic coding applications, particularly those requiring repository-level reasoning, frontend workflow automation, and long-context understanding. The model is also suited for vision-language tasks via its integrated vision encoder.

For further details, see the Qwen3.6-27B blog post.

not yet live

We're benchmarking and onboarding Qwen3.6-27B-FP8 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.