3DGS Video Processor

Containerized Rust service for processing multi-view video captures into 3D Gaussian Splatting models.

Overview

3DGS Video Processor is a long-running service that:

Watches directories for new video uploads (multiple videos per scene)
Extracts frames from all videos concurrently using FFmpeg
Reconstructs camera poses using pluggable reconstruction backends (COLMAP, Nerfstudio, etc.)
Trains 3D Gaussian Splatting models using swappable backends
Exports outputs in standard formats (.ply and .splat)
Manages processed data with configurable retention policies

Perfect for automated 3DGS processing pipelines in local or cloud environments (Azure Blob Storage via Blobfuse2).

Quick Start

The fastest way to get the full pipeline running is with the numbered E2E scripts in scripts/e2e/. These scripts install dependencies, download real test data, build the project, and run an end-to-end test — no GPU or Docker required.

Prerequisites

Ubuntu 24.04 (or compatible Debian-based system) with sudo access
Rust toolchain installed (cargo, rustc)

Step-by-Step

# 1. Install system dependencies (ffmpeg, colmap, unzip)
./scripts/e2e/00-install-deps.sh

# 2. Download the COLMAP South Building dataset and create test videos
./scripts/e2e/01-download-testdata.sh

# 3. Build the Rust binary (release mode)
./scripts/e2e/02-build.sh

# 4. Clean output directories (ensures a fresh state)
./scripts/e2e/03-cleanup.sh

# 5. Run the full E2E pipeline test
./scripts/e2e/04-run-e2e.sh --mode file

Step 5 starts the processor in watch mode, copies the test videos into the input directory, waits for COLMAP reconstruction and mock training to complete, then verifies the output:

📊 Output verification:
  ✅ PLY file(s):      1 (44K)
  ✅ SPLAT file(s):    1 (32K)
  ✅ manifest.json:    present

🎉 ALL E2E TESTS PASSED!

Output files are written to ./output/data/output/ (my_scene.ply, my_scene.splat, manifest.json).

Note: The E2E test uses BACKEND=mock (no GPU) with real COLMAP reconstruction and real FFmpeg frame extraction. To produce real 3DGS models, set BACKEND=gsplat or BACKEND=gaussian-splatting with a CUDA GPU.

Alternative: Docker Mode

If you prefer running via Docker (e.g., for production or Azure Blob Storage integration):

Local Docker Mode

# Create directory structure
mkdir -p input output processed error

# Run container
docker run -d \
  --name 3dgs-processor \
  --gpus all \
  -v $(pwd)/input:/data/input \
  -v $(pwd)/output:/data/output \
  -v $(pwd)/processed:/data/processed \
  -v $(pwd)/error:/data/error \
  -e INPUT_PATH=/data/input \
  -e OUTPUT_PATH=/data/output \
  -e PROCESSED_PATH=/data/processed \
  -e ERROR_PATH=/data/error \
  -e BACKEND=gsplat \
  3dgs-processor:gpu

# Copy multi-view videos to input directory
mkdir -p input/scene_001
cp view1.mp4 view2.mp4 view3.mp4 input/scene_001/

# Wait for processing (monitor logs)
docker logs -f 3dgs-processor

# Check output
ls output/scene_001/
# -> manifest.json, model.ply, model.splat

Azure Blob Storage Mode

docker run -d --privileged \
  --name 3dgs-processor \
  -e AZURE_STORAGE_CONNECTION_STRING="DefaultEndpointsProtocol=..." \
  -e AZURE_CONTAINER_NAME=3dgs-input \
  -e BACKEND=gsplat \
  youracr.azurecr.io/3dgs-processor:gpu

Features

Multi-Video Processing: Handles multiple videos per scene for better reconstruction
Pluggable Reconstruction: Swap SfM backends (COLMAP, Nerfstudio, Precalibrated)
Pluggable 3DGS Backends: Swap training implementations (gaussian-splatting, gsplat, 3DGS.cpp)
Watch Mode: Automatically processes new uploads with stability detection
Progress Tracking: Real-time progress monitoring with checkpoint-based resumption
Error Recovery: Retry logic with exponential backoff, error folder isolation
Resource Management: Temporary file cleanup, configurable retention policies
Cloud Ready: Native Azure Blob Storage support via Blobfuse2
Multi-Architecture: Runs on x86_64 and ARM64 (Apple Silicon, AWS Graviton)

Hardware Requirements

Minimum Requirements (Testing/Development)

Component	Specification
CPU	4 cores, 2.0 GHz+
RAM	8 GB
Storage	50 GB free (for processing temporary files)
GPU	None (CPU-only mode using mock backend)

Suitable for: Pipeline testing, development, small test scenes

Recommended Requirements (Small-Medium Scenes)

Component	Specification
CPU	8 cores, 3.0 GHz+ (Intel i7/Ryzen 7 or better)
RAM	16 GB
Storage	100 GB free SSD
GPU	6-8 GB VRAM (NVIDIA RTX 3060)

Suitable for: 2-5 videos, 30-60 seconds each, 1080p resolution

Processing Time: 5-15 minutes per scene (30,000 training iterations)

Production Requirements (Large Scenes)

Component	Specification
CPU	16+ cores, 3.5 GHz+ (Intel Xeon, AMD EPYC, Ryzen 9)
RAM	32 GB+
Storage	500 GB+ NVMe SSD
GPU	12-24 GB VRAM (NVIDIA RTX 3090/4090, A100)

Suitable for: 5-10+ videos, 2-5 minutes each, 4K resolution

Processing Time: 10-30 minutes per scene (30,000+ iterations)

GPU Recommendations

NVIDIA CUDA (Best Performance):

✅ RTX 3060/3070/3080/3090 (8-24 GB VRAM)
✅ RTX 4060/4070/4080/4090 (8-24 GB VRAM)
✅ A100/A6000 (40-80 GB VRAM) for production
Backend: gsplat (optimized for CUDA)

CPU-Only (Testing):

⚠️ Mock backend only (no real training)
Use for: Pipeline testing, development, CI/CD

Storage Requirements Per Job

Estimate disk space needed based on input videos:

Input Size	Extracted Frames	COLMAP	Training	Output	Total
3×30s videos @ 1080p (500 MB)	2-4 GB	500 MB	1-2 GB	100-500 MB	~5-8 GB
5×60s videos @ 1080p (1.5 GB)	8-12 GB	2 GB	3-5 GB	200 MB-1 GB	~15-20 GB
10×120s videos @ 4K (10 GB)	40-60 GB	10 GB	10-15 GB	500 MB-2 GB	~60-90 GB

Recommendation: Allocate 15-20x input video size for temporary processing files.

Memory Requirements

RAM usage scales with:

Number of concurrent videos during frame extraction
COLMAP reconstruction complexity (number of images, features)
3DGS training batch size

Guidelines:

8 GB RAM: 2-3 videos, 1080p
16 GB RAM: 5-7 videos, 1080p or 2-3 videos @ 4K
32 GB RAM: 10+ videos, 4K or production workloads

Network Requirements (Azure Mode)

For Azure Blob Storage deployments:

Bandwidth: 100+ Mbps recommended (1 Gbps for production)
Latency: <50ms to Azure region (for blobfuse2 FUSE mounts)
Egress: Consider Azure egress costs for large output files

Platform Support

Platform	Architecture	Status	Notes
Linux	x86_64 (amd64)	✅ Full support	Recommended for production
Linux	ARM64 (aarch64)	✅ Full support	AWS Graviton, cloud ARM
macOS	Apple Silicon (ARM64)	⚠️ Development only	No blobfuse2, limited inotify
macOS	x86_64	⚠️ Development only	Intel Macs, same limitations
Windows	x86_64	🚧 Via WSL2	Docker Desktop + WSL2 required

Production Deployment: Use Linux (Ubuntu 22.04+, Debian 11+, RHEL 8+)

Installation

Using E2E Scripts (Recommended)

The scripts/e2e/ directory contains numbered scripts that handle everything:

git clone https://github.com/azure-samples/3DGS-accelerator.git
cd 3DGS-accelerator

# Install deps, download test data, build, and run
./scripts/e2e/00-install-deps.sh
./scripts/e2e/01-download-testdata.sh
./scripts/e2e/02-build.sh
./scripts/e2e/03-cleanup.sh
./scripts/e2e/04-run-e2e.sh --mode file

Build from Source (Manual)

Manual build without E2E scripts

# Clone repository
git clone https://github.com/azure-samples/3DGS-accelerator.git
cd 3DGS-accelerator

# Install system dependencies
sudo apt-get install -y ffmpeg colmap

# Build for current architecture
cargo build --release

# Or build multi-arch Docker image
./scripts/build-multiarch.sh --load

Pull from Registry

# CPU variant (mock backend, no GPU required) — published at cicorias/3dgs-processor
docker pull cicorias/3dgs-processor:cpu

# GPU variant — must be built locally or pushed to your own registry:
docker buildx build --target gpu -t youracr.azurecr.io/3dgs-processor:gpu .
docker push youracr.azurecr.io/3dgs-processor:gpu

Configuration

Configuration via environment variables (required) and YAML file (optional training parameters).

Required Environment Variables:

INPUT_PATH=/data/input                # Watch directory for new videos
OUTPUT_PATH=/data/output              # Output directory for .ply/.splat files
PROCESSED_PATH=/data/processed        # Archive for completed jobs
ERROR_PATH=/data/error                # Quarantine for failed jobs
BACKEND=gaussian-splatting            # 3DGS Backend: gaussian-splatting|gsplat|3dgs-cpp
RECONSTRUCTION_BACKEND=colmap         # Reconstruction: colmap|nerfstudio|precalibrated (default: colmap)

Reconstruction Backends:

The system supports pluggable reconstruction backends for camera pose estimation:

colmap (default) - Gold-standard Structure-from-Motion pipeline
nerfstudio - Load pre-computed camera poses from transforms.json
precalibrated - Use known camera poses from external calibration
Support for Polycam, Luma AI, and other mobile capture formats

See docs/CONFIGURATION.md for the complete configuration reference (all YAML settings and environment variables).

Documentation

Configuration Reference - Complete reference for all YAML and environment variable settings
User Guide - Complete configuration and usage documentation
Architecture - System design and component descriptions
Progress Tracking - Real-time monitoring and checkpointing
Deployment - Production deployment patterns and best practices
Troubleshooting - Common issues and solutions
PRD - Product requirements specification

Requirements

Runtime:

Docker 20.10+ (with buildx for multi-arch builds)
8GB+ RAM (16GB+ recommended for large scenes)
50GB+ disk space for processing

Bundled in Container:

FFmpeg 4.4+
COLMAP 3.8+
3DGS backends (pluggable)

Development

# Run tests (Note: Do NOT use --all-features due to plugin symbol conflicts)
cargo test

# Run integration tests
cargo test --test integration

# Run clippy
cargo clippy --all-targets -- -D warnings

# Format code
cargo fmt

# Generate test videos
./scripts/generate-test-videos.sh

Testing

E2E Pipeline Test (Recommended)

The fastest way to validate the full pipeline end-to-end:

# Run all E2E scripts in sequence (deps → data → build → clean → test)
./scripts/e2e/00-install-deps.sh
./scripts/e2e/01-download-testdata.sh
./scripts/e2e/02-build.sh
./scripts/e2e/03-cleanup.sh
./scripts/e2e/04-run-e2e.sh --mode file     # file/watch mode only
./scripts/e2e/04-run-e2e.sh --mode batch    # batch/Azurite mode only
./scripts/e2e/04-run-e2e.sh                  # both modes

This uses real COLMAP reconstruction, real FFmpeg frame extraction, and a mock training backend (no GPU required). Outputs are verified automatically.

Script	Purpose
`00-install-deps.sh`	Install ffmpeg, colmap, unzip via apt
`01-download-testdata.sh`	Download COLMAP South Building dataset, create 3 test videos
`02-build.sh`	`cargo build --release`
`03-cleanup.sh`	Wipe output directories for a clean run
`04-run-e2e.sh`	Start processor, trigger job, verify PLY/SPLAT/manifest output

Unit and Integration Tests

Synthetic test data (fast, no external downloads)

# Generate minimal COLMAP test data
python3 scripts/create_test_colmap_data.py

# Run integration tests
cargo test --test integration

# Test with mock backend (no GPU required)
BACKEND=mock cargo test

Docker Container E2E Tests

The tests/e2e/ test suite requires the app running inside a Docker container with the built image (3dgs-processor:test). These tests are skipped by default during cargo test to avoid hangs when Docker or the container image is unavailable.

# Run only the container-based E2E tests (requires Docker + built image)
cargo test --test e2e -- --ignored --nocapture

# Run ALL tests including container E2E tests
cargo test -- --include-ignored

Note: These tests orchestrate Docker containers from the host — they do not need to run inside a container. They do require Docker to be installed, running, and the 3dgs-processor:cpu image to be built (docker buildx build --target cpu -t 3dgs-processor:cpu .).

Real Dataset Testing (Quality Validation)

Tanks and Temples benchmark

# Download a scene (~2-20GB)
./scripts/download-tanks-and-temples.sh barn

# Run benchmark with automatic metrics collection
./scripts/benchmark-tanks-and-temples.sh barn

# Or test directly with Python training script
python3 scripts/gsplat_train.py \
  --data testdata/tanks-and-temples/barn/images \
  --colmap-dir testdata/tanks-and-temples/barn/colmap/sparse/0 \
  --model-dir outputs/barn \
  --iterations 7000 \
  --save-ply --save-splat

Note: Tanks and Temples provides pre-extracted images and COLMAP reconstructions, so we test the training backend directly (the Python script that the Rust processor calls internally). For testing the complete pipeline with videos, use the synthetic test videos or your own footage:

# Generate test videos
./scripts/generate-test-videos.sh

# Run full processor (watch → extract → COLMAP → train)
export INPUT_PATH=$PWD/testdata/sample_scene
export OUTPUT_PATH=$PWD/outputs  
export BACKEND=auto
cargo run --release

# Trigger processing
mkdir -p testdata/sample_scene/my_scene
cp testdata/sample_scene/*.mp4 testdata/sample_scene/my_scene/

Available scenes: barn, truck, church, caterpillar, courthouse, ignatius, meetingroom

See docs/TANKS_AND_TEMPLES_TESTING.md for complete testing guide.

License

MIT License - See LICENSE for details.

Support

Issues: GitHub Issues
Discussions: GitHub Discussions

Acknowledgments

Built with:

COLMAP - Structure-from-Motion
3D Gaussian Splatting - Original implementation
gsplat - Optimized 3DGS library

Trademarks

Trademarks This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft’s Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.cargo		.cargo
.github		.github
.vscode		.vscode
container-test		container-test
docker		docker
docs		docs
examples		examples
plugins		plugins
scripts		scripts
src		src
testdata		testdata
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
.markdownlintignore		.markdownlintignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTORS.md		CONTRIBUTORS.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
Dockerfile.preflight		Dockerfile.preflight
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
compose.yaml		compose.yaml
config.example.yaml		config.example.yaml
docker-build-podman.sh		docker-build-podman.sh
docker-build.sh		docker-build.sh
mise.toml		mise.toml

Folders and files

Latest commit

History

Repository files navigation

3DGS Video Processor

Overview

Quick Start

Prerequisites

Step-by-Step

Alternative: Docker Mode

Features

Hardware Requirements

Minimum Requirements (Testing/Development)

Recommended Requirements (Small-Medium Scenes)

Production Requirements (Large Scenes)

GPU Recommendations

Storage Requirements Per Job

Memory Requirements

Network Requirements (Azure Mode)

Platform Support

Installation

Using E2E Scripts (Recommended)

Build from Source (Manual)

Pull from Registry

Configuration

Documentation

Requirements

Development

Testing

E2E Pipeline Test (Recommended)

Unit and Integration Tests

Docker Container E2E Tests

Real Dataset Testing (Quality Validation)

License

Support

Acknowledgments

Trademarks

About

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages