Sandbox

Klavis Sandbox provides MCP servers across a wide range of categories for training tool-use LLMs and AI agents:

50+ MCP servers that need authentication. Klavis handles accessibility and reproducibility.
200+ stateless MCP servers via the Klavis API, no authentication needed, for easy scaling.

The Problem

For LLM researchers, setting up an LLM training or reinforcement learning environment for real-world tool use is complex and painful:

Managing different environments or test accounts
Implementing MCP Servers and handling various authentication issues
Initializing realistic data
Resetting states between multiple runs
Ensuring isolation across concurrent sessions

The Solution

Klavis MCP Sandbox as a Service solves these challenges. In addition to letting your model interact with our comprehensive MCP server ecosystem, you can use our sandbox infrastructure to easily dump and reset data on any concurrent run.

Our sandbox infrastructure is horizontally scalable, so it can handle as many concurrent sessions as you need.

Lifecycle

Create

Request a sandbox based on the external services you need (Snowflake, Gmail, CRM, ERP, etc.) and get an MCP server URL for that isolated instance.

Initialize (seed)

Load a deterministic “world state” in JSON format. We handle everything: creating databases, setting up CRM data, ERP systems, and more.
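As a rough illustration of what a deterministic seed can look like, the sketch below builds a small world state in Python and serializes it with sorted keys, so the same state always produces byte-identical JSON. The service names and field layout (`crm`, `contacts`, `email`, `inbox`) and the helper `serialize_seed` are assumptions made for this example, not the actual Klavis seed schema; check the Sandbox API reference for the real format.

```python
import json

# Hypothetical world state: the service names and field layout below are
# illustrative assumptions, not the actual Klavis seed schema.
world_state = {
    "crm": {
        "contacts": [
            {"id": "c-001", "name": "Ada Lovelace", "email": "ada@example.com"},
            {"id": "c-002", "name": "Alan Turing", "email": "alan@example.com"},
        ]
    },
    "email": {"inbox": []},
}


def serialize_seed(state: dict) -> str:
    """Serialize a seed so that equal states always produce identical JSON."""
    return json.dumps(state, sort_keys=True, separators=(",", ":"))


seed_json = serialize_seed(world_state)
```

Using fixed IDs rather than generated ones, and avoiding timestamps such as "now", keeps every run starting from the same state.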

Interact (MCP)

Let your LLM / AI agent use MCP tools against the sandbox as if it were the real app. You can use multiple MCP servers with many tools simultaneously.
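Under the hood, an MCP tool invocation is a JSON-RPC 2.0 request with the method `tools/call`. The sketch below only shows the shape of that message; the tool name `create_contact` and its arguments are hypothetical, and in practice an MCP client library would build the request and perform the initialization handshake for you.

```python
import json
from itertools import count

# Monotonically increasing JSON-RPC request ids.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 request for the MCP `tools/call` method."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and arguments, used only for illustration.
request = build_tool_call("create_contact", {"name": "Grace Hopper"})
payload = json.dumps(request)
```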

Dump (verify)

Snapshot the full sandbox state and compare it programmatically against your ground truth to check whether your LLM completed the task correctly.
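One way to do the comparison is a recursive diff over the dumped JSON. The following is a minimal sketch that assumes the dump and the ground truth are both plain JSON-like Python values; `diff_states` is a hypothetical helper, not part of the Klavis API, and the sample data is invented for the example.

```python
def diff_states(expected, actual, path=""):
    """Return a list of readable differences between two JSON-like values."""
    if isinstance(expected, dict) and isinstance(actual, dict):
        diffs = []
        for key in sorted(set(expected) | set(actual)):
            child = f"{path}.{key}" if path else str(key)
            if key not in actual:
                diffs.append(f"{child}: missing in dump")
            elif key not in expected:
                diffs.append(f"{child}: unexpected in dump")
            else:
                diffs.extend(diff_states(expected[key], actual[key], child))
        return diffs
    if (
        isinstance(expected, list)
        and isinstance(actual, list)
        and len(expected) == len(actual)
    ):
        # Lists are compared position by position, so order matters.
        diffs = []
        for i, (exp_item, act_item) in enumerate(zip(expected, actual)):
            diffs.extend(diff_states(exp_item, act_item, f"{path}[{i}]"))
        return diffs
    if expected != actual:
        return [f"{path or '<root>'}: expected {expected!r}, got {actual!r}"]
    return []


# Invented sample data: the agent was supposed to leave the name unchanged.
ground_truth = {"crm": {"contacts": [{"id": "c-001", "name": "Ada"}]}}
dumped = {"crm": {"contacts": [{"id": "c-001", "name": "Ada L."}]}}
differences = diff_states(ground_truth, dumped)
task_passed = not differences
```

An empty list means the run matched the ground truth; otherwise each entry points at the path that differs, which is useful as a reward signal or for debugging a failed episode. If record order is not guaranteed in a dump, sort the lists by ID before comparing.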

Reset / Delete

Wipe the sandbox back to a clean state and kick off the next run.
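The seed, interact, dump, reset loop can be sketched with an in-memory stand-in. `FakeSandbox` and `seeded` below are hypothetical names written for this example and do not call Klavis; the point is the pattern of resetting in a `finally` block so that a failed run never leaks state into the next one.

```python
import copy
from contextlib import contextmanager


class FakeSandbox:
    """In-memory stand-in for a sandbox; NOT the Klavis client."""

    def __init__(self):
        self.state = {}

    def initialize(self, seed: dict) -> None:
        # Copy the seed so runs cannot mutate the original.
        self.state = copy.deepcopy(seed)

    def dump(self) -> dict:
        return copy.deepcopy(self.state)

    def reset(self) -> None:
        self.state = {}


@contextmanager
def seeded(sandbox, seed):
    """Seed the sandbox for one run and always reset it afterwards."""
    sandbox.initialize(seed)
    try:
        yield sandbox
    finally:
        sandbox.reset()


seed = {"tickets": []}
sandbox = FakeSandbox()
results = []
for run in range(3):
    with seeded(sandbox, seed) as sb:
        # Stand-in for the agent acting through MCP tools.
        sb.state["tickets"].append({"id": run})
        results.append(sb.dump())
```

Each run starts from the same seed and sees none of the previous run's changes, which is the property the reset step is meant to provide.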

Resources

Example Notebook

Create sandboxes, seed data, run an agent, then dump and clean up.

Sandbox API

Manage isolated sandbox environments for training/eval: pooling, init, export, teardown.

Fireworks + Klavis

Use Klavis MCP Sandbox with Eval Protocol for model training and RL at scale.
