Gemma 4 12B V2 Coding Agentic
yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF
published Jun 2026 · updated Jun 2026
Gemma 4 12B V2 Coding Agentic is a text-generation model that serves as a specialized local coding and tool-using agent for multi-step technical tasks.
specs
| Task | Text Generation |
| Architecture | Gemma 4 12B |
| Parameters | 12B |
| License | Apache 2.0 |
about this model
Gemma4-12B v2 (yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF) is a text-generation model specialized for coding and agentic tool-use tasks, fine-tuned from Google's Gemma 4 12B instruction-tuned base model. It is designed to read, reason, use tools, and work through multi-step technical tasks before acting, making it suitable for local, private agentic workflows.
Key Strengths and Benchmarks
| tau2-bench telecom (20 tasks, Q8_0, local harness) | Score |
|---|---|
Official gemma-4-12B-it (base) | ~15% |
| Gemma4-12B v2 | ~55% |
The model achieves roughly 3.5× higher performance than the base model on the tau2-bench telecom benchmark, which mirrors real terminal/debugging work by requiring diagnose → fix → verify loops. In a separate fabrication probe, the model grounds actions before acting (0% fabrication, on par with the base model). The base model tends to abandon tasks (transfer to human), while v2 persists through the agentic loop.
Trade-offs and Specialization
This fine-tune is specialized for coding, terminal, and technical-agentic work. It trades a small amount of general-knowledge breadth (MMLU-Pro scores slightly below the base model) for its agentic capabilities. It is not designed for customer-service tasks (e.g., tau2-bench retail).
Training and Architecture
v2 builds on v1 with a significant agentic push: multi-step tool-use trajectories (read → reason → act → verify) in Gemma 4's native tool protocol, verified chain-of-thought for Python coding tasks, and curated reasoning/instruction data to maintain broad competence. All reasoning is distilled CoT from Opus 4.8. The model uses Gemma's native thought channel before answering, and is distributed under Apache 2.0.
best for
- ·Local coding and debugging with multi-step tool use
- ·Terminal and command-line agentic tasks
- ·Private, offline technical problem-solving
FAQ
It is specialized for coding, terminal, and technical-agentic tasks like multi-step debugging and tool use.
It is released under Apache 2.0, free to use, modify, and redistribute.
It scores roughly 3.5x higher on the tau2-bench telecom agentic benchmark but slightly lower on general knowledge benchmarks like MMLU-Pro.
Use the gigarouter OpenAI-compatible endpoint with your API key, passing the model name and your prompt.
Q4_K_M is the recommended sweet spot, requiring about 6.87 GB of storage.
We're benchmarking and onboarding Gemma 4 12B V2 Coding Agentic as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.