Question 1

What is the aggressive variant and how does it differ from the balanced?

Accepted Answer

The aggressive variant applies stronger refusal removal, fully unlocking the model. It may still append short disclaimers baked into the base training but generates full content.

Question 2

Does this model support image and video inputs?

Accepted Answer

Yes, it is natively multimodal. You need the main GGUF file and the mmproj vision encoder to use image/video inputs with compatible runtimes.

Question 3

What are the recommended generation settings?

Accepted Answer

For thinking mode use temperature=0.6, top_p=0.95, top_k=20, min_p=0. For non-thinking mode use temperature=0.7, top_p=0.8, top_k=20, min_p=0. Maintain at least 128K context to preserve thinking capabilities.

Question 4

How can I call this model via gigarouter's API?

Accepted Answer

Use the OpenAI-compatible endpoint with your gigarouter API key. Refer to gigarouter documentation for the exact endpoint and model identifier.

Question 5

What is the maximum context length?

Accepted Answer

Native context is 262K tokens, extendable to 1M tokens using YaRN.

Task	other
Architecture	Hybrid: Gated DeltaNet linear attention + full softmax attention (3:1 ratio)
Parameters	9B
Context length	262K native (up to 1M with YaRN)

Qwen3.5 9B Uncensored Aggressive

specs

about this model

Key capabilities

Inference settings

best for

FAQ

related specialist model models