skip to content
gigarouter gigarouter
models / text generation · coming soon

Agents-A1

InternScience/Agents-A1

published Jun 2026 · updated Jul 2026

Agents-A1 is a text-generation model that uses a 35B Mixture-of-Experts architecture to perform long-horizon agentic tasks such as search, engineering, scientific research, instruction following, and tool-calling.

status
coming soon
API providers
0
downloads / mo
3.5K
license
apache-2.0

specs

TaskText Generation
ArchitectureMixture-of-Experts (MoE)
Parameters35B total (3B active)
LicenseApache-2.0

about this model

Agents-A1 is a 35B Mixture-of-Experts text-generation model from InternScience that achieves trillion-parameter-level performance by scaling the agent horizon rather than the number of parameters. It is designed to unify six heterogeneous agentic domains—long-horizon search, engineering tasks, scientific research, instruction following, general agentic tasks, and scientific agentic tasks—into a single deployable model.

Training and Architecture

The model is trained using a three-stage recipe: full-domain supervised fine-tuning for broad agentic alignment, domain-level teacher models for specialized expertise, and multi-teacher domain-routed on-policy distillation with salient vocabulary alignment. This process leverages agentic trajectories averaging 45K tokens in length, built from a long-horizon knowledge-action infrastructure that connects external knowledge, actions, observations, and verifier outcomes.

Key Strengths

  • Agentic reasoning: decomposes complex tasks into executable sub-steps and adapts strategies based on intermediate results.
  • Tool use: natively supports function calling and integration with APIs, code interpreters, search engines, and other external tools.
  • Scientific and professional reasoning: handles tool-integrated scientific reasoning and professional knowledge question answering.
  • Instruction following: precisely follows detailed, multi-constraint instructions across diverse domains.

Benchmark Performance

Despite its ~35B parameter class, Agents-A1 achieves overall state-of-the-art results on several challenging benchmarks and remains highly competitive against frontier-scale systems such as GPT-5.5, DeepSeek-V4-pro, and Kimi-K2.6.

Benchmark Agents-A1 Score Notable Comparison
Seal-056.4Overall SOTA
HiPhO46.4Overall SOTA
FrontierScience-Olympiad79.0Overall SOTA
FrontierScience-Research40.0Overall SOTA
IFBench80.6Overall SOTA
IFEval94.8Overall SOTA
BrowseComp75.5Best among comparable ~35B models
XBench-DS-251086.0Best among comparable ~35B models
GAIA96.0Best among comparable ~35B models
SciCode44.3Best among comparable ~35B models
HLE with tools47.6Best among comparable ~35B models
MolBench-bind56.8Best among comparable ~35B models

The model is released under the Apache-2.0 license. Quantized variants including FP8, Q4_K_M-GGUF, Q8_0-GGUF, and F16-GGUF are available in the Agents-A1 collection on Hugging Face.

best for

FAQ

What is the model size and architecture of Agents-A1?

Agents-A1 is a 35B total parameter Mixture-of-Experts model with approximately 3B active parameters per forward pass.

What license is Agents-A1 released under?

It is released under the Apache-2.0 license.

How does Agents-A1 compare to larger models like GPT-5.5 or DeepSeek-V4-pro?

Despite being only 35B parameters, Agents-A1 achieves leading or competitive results on benchmarks like Seal-0, IFBench, HiPhO, and FrontierScience-Olympiad, often matching or exceeding trillion-parameter models.

How can I call Agents-A1 via the API?

Use the gigarouter OpenAI-compatible endpoint with your API key to send text-generation requests to the model.

What domains does Agents-A1 specialize in?

It unifies six heterogeneous domains: long-horizon search, engineering tasks, scientific research, instruction following, general agentic tasks, and scientific agentic tasks.

not yet live

We're benchmarking and onboarding Agents-A1 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related text generation models

compare all →