Open-model inference

Private Inference for Coding Agents

Fast, EU-based endpoint for open-weight models. Zero data retention, zero training, and optimized for long-context workloads.

  • CodePrivate
  • ModelsOpen
  • SetupMinutes
Agent integrations
Zro
Claude Code
Codex
Cursor
Cline
Opencode
Hermes
  • API
  • AgentsCLI + IDE
  • InfraEU regions
Coding models
Available models
MiniMax M3MiniMaxGLM-5.2Z.ai
DeepSeek V4 ProComing soon
Kimi K2.6Coming soon

Open coding models. One endpoint.

Open-source models are becoming competitive with closed-source systems for coding tasksi. Zro starts with MiniMax M3 and GLM-5.2, with more open coding models coming soon.

Details

Built for private inference

Yes. Zro exposes OpenAI-compatible access for chat completions, so existing clients and agent tools can point at the Zro base URL.

Yes. Zro also supports Anthropic-compatible Messages requests at /v1/messages for tools that expect that API shape.

No. Prompt and completion bodies are not retained by default after inference is processed.

Yes. Zro is built for responsive, streaming inference, so developer tools, agents, and production apps do not have to trade speed for privacy.

MiniMax M3 and GLM-5.2 are available now, with more open-model options being added across regions.

No. Customer prompts and completions are never used for training, fine-tuning, evaluations, analytics, or dataset creation.

Zro runs on privacy-forward EU infrastructure. Current regions include Finland and France.

Pro includes monthly credits that do not roll over. Active Pro accounts can add one-time top-ups, and top-up credits expire after 180 days.