IBM · florenzi002 · Mar 3, 2026 · Mar 3, 2026 · Mar 9, 2026 · Mar 9, 2026
diff --git a/INSTRUCTIONS.md b/INSTRUCTIONS.md
@@ -13,6 +13,7 @@ This directory contains the MCP servers and infrastructure for the AssetOpsBench
   - [fmsr](#fmsr)
   - [tsfm](#tsfm)
   - [wo](#wo)
+  - [sandbox](#sandbox)
 - [Plan-Execute Runner](#plan-execute-runner)
   - [How it works](#how-it-works)
   - [CLI](#cli)
@@ -84,6 +85,7 @@ To start a server manually for testing:
 
 ```bash
 uv run utilities-mcp-server
+uv run sandbox-mcp-server
 uv run iot-mcp-server
 uv run fmsr-mcp-server
 uv run tsfm-mcp-server
@@ -137,6 +139,20 @@ uv run wo-mcp-server
 | `current_date_time` | — | Return the current UTC date and time as JSON |
 | `current_time_english` | — | Return the current UTC time as a human-readable string |
 
+### Sandbox
+
+**Path:** `src/servers/sandbox/main.py`
+
+**Requires:** Docker or Podman (container runtime auto-detected)
+
+**Pre-installed Libraries:** matplotlib 3.9.3, numpy 2.2.1, pandas 2.2.3, pyarrow 18.1.0, pydantic 2.10.5, pympler 1.1, scikit-learn 1.6.1, seaborn 0.13.2
+
+| Tool | Arguments | Description |
+|---|---|---|
+| `execute_python_file` | `file_path`, `requirements?`, `input_files?`, `timeout?`, `output_files?` | Execute a Python file from workspace in an isolated container by copying it into the container |
+| `execute_python_code` | `code`, `requirements?`, `input_files?`, `input_file_paths?`, `timeout?`, `output_files?` | Execute arbitrary Python code in an isolated container with no network access, limited CPU/memory, and optional pip package installation. Supports both string content and workspace file paths as inputs |
+| `execute_python_script` | `script_content`, `input_data?`, `input_file_paths?`, `requirements?`, `timeout?` | Simplified interface for scripts that read from `data.json` and write to `output.json`. Supports workspace file paths as inputs |
+
 ### fmsr
 
 **Path:** `src/servers/fmsr/main.py`
@@ -348,6 +364,7 @@ runner = PlanExecuteRunner(
     server_paths={
         "iot":       "iot-mcp-server",
         "utilities": "utilities-mcp-server",
+        "sandbox": "sandbox-mcp-server",
         "fmsr":      "fmsr-mcp-server",
         "tsfm":      "tsfm-mcp-server",
     },
@@ -369,6 +386,10 @@ Add the following to your Claude Desktop `claude_desktop_config.json`:
       "command": "/path/to/uv",
       "args": ["run", "--project", "/path/to/AssetOpsBench", "utilities-mcp-server"]
     },
+    "sandbox": {
+      "command": "/path/to/uv",
+      "args": ["run", "--project", "/path/to/AssetOpsBench", "sandbox-mcp-server"]
+    },
     "iot": {
       "command": "/path/to/uv",
       "args": ["run", "--project", "/path/to/AssetOpsBench", "iot-mcp-server"]
@@ -400,6 +421,7 @@ uv run pytest src/ -v
 ```
 
 Integration tests are auto-skipped when the required service is not available:
+
 - IoT integration tests require `COUCHDB_URL` (set in `.env`)
 - Work order integration tests require `COUCHDB_URL` (set in `.env`)
 - FMSR integration tests require `WATSONX_APIKEY` (set in `.env`)
@@ -416,6 +438,7 @@ uv run pytest src/ -v -k "not integration"
 ```bash
 uv run pytest src/servers/iot/tests/test_tools.py -k "not integration"
 uv run pytest src/servers/utilities/tests/
+uv run pytest src/servers/sandbox/tests/
 uv run pytest src/servers/fmsr/tests/ -k "not integration"
 uv run pytest src/servers/tsfm/tests/ -k "not integration"
 uv run pytest src/servers/wo/tests/test_tools.py -k "not integration"
@@ -445,18 +468,18 @@ uv run pytest src/ -v
 │                     workflow/                        │
 │                                                      │
 │  PlanExecuteRunner.run(question)                     │
-│  ┌────────────┐   ┌────────────┐   ┌──────────────┐ │
-│  │  Planner   │ → │  Executor  │ → │  Summariser  │ │
-│  │            │   │            │   │              │ │
-│  │ LLM breaks │   │ Routes each│   │ LLM combines │ │
-│  │ question   │   │ step to the│   │ step results │ │
-│  │ into steps │   │ right MCP  │   │ into answer  │ │
-│  └────────────┘   │ server via │   └──────────────┘ │
+│  ┌────────────┐   ┌────────────┐   ┌──────────────┐  │
+│  │  Planner   │ → │  Executor  │ → │  Summariser  │  │
+│  │            │   │            │   │              │  │
+│  │ LLM breaks │   │ Routes each│   │ LLM combines │  │
+│  │ question   │   │ step to the│   │ step results │  │
+│  │ into steps │   │ right MCP  │   │ into answer  │  │
+│  └────────────┘   │ server via │   └──────────────┘  │
 │                   │ stdio      │                     │
 └───────────────────┼────────────┼─────────────────────┘
                     │ MCP protocol (stdio)
-         ┌──────────┼──────────┬──────────┬──────┐
-         ▼          ▼          ▼          ▼      ▼
-        iot     utilities    fmsr       tsfm    wo
-      (tools)    (tools)    (tools)   (tools) (tools)
+         ┌──────────┼──────────┬──────────┬──────┬─────────┐
+         ▼          ▼          ▼          ▼      ▼         ▼
+        iot     utilities    fmsr       tsfm    wo      sandbox
+      (tools)    (tools)    (tools)   (tools) (tools)   (tools)
 ```
diff --git a/pyproject.toml b/pyproject.toml
@@ -32,6 +32,7 @@ iot-mcp-server = "servers.iot.main:main"
 utilities-mcp-server = "servers.utilities.main:main"
 fmsr-mcp-server = "servers.fmsr.main:main"
 tsfm-mcp-server = "servers.tsfm.main:main"
+sandbox-mcp-server = "servers.sandbox.main:main"
 wo-mcp-server = "servers.wo.main:main"
 
 
@@ -51,4 +52,3 @@ norecursedirs = ["src/tmp"]
 filterwarnings = [
     "ignore:Core Pydantic V1 functionality:UserWarning",
 ]
-
diff --git a/src/servers/sandbox/.env.example b/src/servers/sandbox/.env.example
@@ -0,0 +1,13 @@
+# Python Sandbox MCP Server Configuration
+
+# Logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL)
+LOG_LEVEL=WARNING
+
+# Container Runtime Configuration
+# Specify which container runtime to use: "docker" or "podman"
+# If not set, the system will auto-detect available runtime (Docker first, then Podman)
+# CONTAINER_RUNTIME=docker
+
+# Docker configuration (optional - uses defaults if not set)
+# Only relevant when using Docker as the container runtime
+# DOCKER_HOST=unix:///var/run/docker.sock
diff --git a/src/servers/sandbox/README.md b/src/servers/sandbox/README.md
@@ -0,0 +1,253 @@
+# Python Sandbox MCP Server
+
+A secure Model Context Protocol (MCP) server that provides isolated execution of arbitrary Python code using container technology (Docker or Podman).
+
+## Features
+
+- **Maximum Isolation**: Executes code in Docker containers with no network access
+- **Resource Limits**: CPU and memory constraints to prevent resource exhaustion
+- **Secure by Default**: Runs as non-root user with minimal privileges
+- **Flexible Input/Output**: Support for input files and output file retrieval
+- **Package Installation**: Install pip packages on-demand within the sandbox
+- **Timeout Protection**: Configurable execution timeouts
+
+## Installation
+
+### Prerequisites
+
+You need either **Docker** or **Podman** installed on your system:
+
+- **Docker**: [Installation Guide](https://docs.docker.com/get-docker/)
+- **Podman**: [Installation Guide](https://podman.io/getting-started/installation)
+
+The server will automatically detect which container runtime is available and use it.
+
+### Setup
+
+1. Install Python dependencies:
+
+```bash
+pip install -r requirements.txt
+```
+
+1. The container image will be built automatically on first use, or you can build it manually:
+
+**With Docker:**
+
+```bash
+docker build -t python-sandbox:latest .
+```
+
+**With Podman:**
+
+```bash
+podman build -t python-sandbox:latest .
+```
+
+## Usage
+
+### As an MCP Server
+
+Run the server:
+
+```bash
+python main.py
+```
+
+The server exposes three tools:
+
+#### 1. `execute_python_file`
+
+Execute a Python file from your workspace by copying it into the container.
+
+**Parameters:**
+
+- `file_path` (str, required): Path to Python file relative to workspace directory
+- `requirements` (list[str], optional): Pip packages to install
+- `input_files` (dict[str, str], optional): Input files as filename -> content mapping
+- `timeout` (int, optional): Execution timeout in seconds (default: 60)
+- `output_files` (list[str], optional): Output filenames to retrieve
+
+**Example:**
+
+```python
+{
+    "file_path": "scripts/analysis.py",
+    "requirements": ["pandas", "numpy"],
+    "input_files": {"data.csv": "col1,col2\n1,2\n3,4"},
+    "output_files": ["results.json"],
+    "timeout": 120
+}
+```
+
+#### 2. `execute_python_code`
+
+Execute arbitrary Python code with full control over the environment.
+
+**Pre-installed Libraries:**
+
+- matplotlib==3.9.3: Plotting and visualization
+- numpy==2.2.1: Numerical computing
+- pandas==2.2.3: Data manipulation and analysis
+- pyarrow==18.1.0: Columnar data format support
+- pydantic==2.10.5: Data validation
+- pympler==1.1: Memory profiling
+- scikit-learn==1.6.1: Machine learning
+- seaborn==0.13.2: Statistical data visualization
+
+**Parameters:**
+
+- `code` (str, required): Python code to execute
+- `requirements` (list[str], optional): Pip packages to install
+- `input_files` (dict[str, str], optional): Input files as filename -> content mapping
+- `input_file_paths` (dict[str, str], optional): Input files as destination -> source path mappings from workspace
+- `timeout` (int, optional): Execution timeout in seconds (default: 60)
+- `output_files` (list[str], optional): Output filenames to retrieve
+
+**Example:**
+
+```python
+{
+    "code": "import pandas as pd\ndf = pd.DataFrame({'a': [1,2,3]})\nprint(df)",
+    "requirements": ["pandas"],
+    "timeout": 30
+}
+```
+
+**Example with workspace files:**
+
+```python
+{
+    "code": "import pandas as pd\ndf = pd.read_csv('data.csv')\nprint(df.head())",
+    "input_file_paths": {"data.csv": "datasets/mydata.csv"}
+}
+```
+
+#### 3. `execute_python_script`
+
+Simplified interface for scripts that read from `data.json` and write to `output.json`.
+
+**Pre-installed Libraries:**
+
+- matplotlib==3.9.3: Plotting and visualization
+- numpy==2.2.1: Numerical computing
+- pandas==2.2.3: Data manipulation and analysis
+- pyarrow==18.1.0: Columnar data format support
+- pydantic==2.10.5: Data validation
+- pympler==1.1: Memory profiling
+- scikit-learn==1.6.1: Machine learning
+- seaborn==0.13.2: Statistical data visualization
+
+**Parameters:**
+
+- `script_content` (str, required): Python script code
+- `input_data` (str, optional): JSON string saved as data.json
+- `input_file_paths` (dict[str, str], optional): Input files as destination -> source path mappings from workspace
+- `requirements` (list[str], optional): Pip packages to install
+- `timeout` (int, optional): Execution timeout in seconds (default: 60)
+
+**Example:**
+
+```python
+{
+    "script_content": "import json\nwith open('data.json') as f:\n    data = json.load(f)\nresult = {'count': len(data)}\nwith open('output.json', 'w') as f:\n    json.dump(result, f)",
+    "input_data": "{\"items\": [1, 2, 3]}"
+}
+```
+
+**Example with workspace files:**
+
+```python
+{
+    "script_content": "import pandas as pd\ndf = pd.read_csv('data.csv')\nresult = df.describe().to_json()\nwith open('output.json', 'w') as f:\n    f.write(result)",
+    "input_file_paths": {"data.csv": "datasets/mydata.csv"}
+}
+```
+
+## Container Runtime
+
+The server supports both Docker and Podman as container runtimes:
+
+### Automatic Detection
+
+By default, the server automatically detects which container runtime is available:
+
+1. First checks for Docker
+2. Falls back to Podman if Docker is not available
+3. Raises an error if neither is found
+
+### Manual Selection
+
+You can explicitly specify which runtime to use via the `CONTAINER_RUNTIME` environment variable:
+
+```bash
+# Use Docker
+export CONTAINER_RUNTIME=docker
+
+# Use Podman
+export CONTAINER_RUNTIME=podman
+```
+
+Or in your `.env` file:
+
+```
+CONTAINER_RUNTIME=podman
+```
+
+### Runtime Differences
+
+Both runtimes provide equivalent security and isolation. Key differences:
+
+- **Docker**: Uses the Docker Python SDK for container management
+- **Podman**: Uses CLI commands (no daemon required, rootless by default)
+
+## Security Features
+
+1. **Network Isolation**: Containers run with no network access
+2. **Resource Limits**:
+   - Memory: 512MB
+   - CPU: 50% of one core
+3. **Non-root User**: Code runs as user `sandbox` (UID 1000)
+4. **Filesystem Isolation**: Only mounted workspace directory is accessible
+5. **Timeout Protection**: Execution is terminated after timeout expires
+
+## Configuration
+
+Environment variables (see `.env.example`):
+
+- `LOG_LEVEL`: Logging level (default: WARNING)
+- `CONTAINER_RUNTIME`: Force specific runtime - "docker" or "podman" (default: auto-detect)
+- `DOCKER_HOST`: Docker daemon socket (default: unix:///var/run/docker.sock)
+
+## Architecture
+
+The sandbox uses container technology (Docker or Podman) for maximum isolation:
+
+```
+┌─────────────────────────────────────┐
+│         MCP Server (Host)           │
+│  ┌───────────────────────────────┐  │
+│  │   FastMCP Server              │  │
+│  │   - execute_python_code       │  │
+│  │   - execute_python_script     │  │
+│  └───────────┬───────────────────┘  │
+│              │                       │
+│              ▼                       │
+│  ┌───────────────────────────────┐  │
+│  │   Container Runtime           │  │
+│  │   (Docker or Podman)          │  │
+│  └───────────┬───────────────────┘  │
+└──────────────┼───────────────────────┘
+               │
+               ▼
+┌──────────────────────────────────────┐
+│     Container (Isolated)             │
+│  ┌────────────────────────────────┐  │
+│  │  Python 3.11 Runtime           │  │
+│  │  - No network access           │  │
+│  │  - Limited CPU/Memory          │  │
+│  │  - Non-root user               │  │
+│  │  - Temporary filesystem        │  │
+│  └────────────────────────────────┘  │
+└──────────────────────────────────────┘
+```
diff --git a/src/servers/sandbox/__init__.py b/src/servers/sandbox/__init__.py
@@ -0,0 +1,4 @@
+"""Python Sandbox MCP Server.
+
+Provides secure execution of arbitrary Python code in an isolated Docker container.
+"""