AetherShell Terminal User Interface (TUI)

Overview

The AetherShell TUI provides a rich terminal-based interface for interacting with multi-modal Large Language Models (LLMs) and managing AI agent swarms. It offers an intuitive visual environment for complex AI workflows while maintaining the power and flexibility of the command line.

Terminal Requirements

⚠️ Important: The TUI requires a terminal with full ANSI/VT100 support and raw mode capabilities.

Recommended Terminals

✅ Works Best:

Windows Terminal (recommended for Windows)
Native PowerShell console (not VS Code integrated terminal)
Command Prompt with modern console host
iTerm2 (macOS)
GNOME Terminal, Konsole, Alacritty (Linux)

❌ Not Recommended:

VS Code integrated terminal (limited raw mode support)
Basic terminal emulators without alternate screen buffer
SSH sessions without proper terminal forwarding

Testing Your Terminal

# Quick test - if this works, TUI should work:
ae tui

# If you see rendering issues, try:
# 1. Use Windows Terminal or native PowerShell
# 2. Ensure your terminal supports 256 colors
# 3. Check terminal size is at least 80x24

Features

🎯 Core Capabilities

Multi-modal AI Chat: Support for text, images, audio, and video inputs
Agent Swarm Management: Create, monitor, and coordinate multiple AI agents
Media Processing: Terminal-friendly display and processing of media files
Real-time Monitoring: Live status updates for agents and conversations
Interactive Configuration: Settings management through the UI

📺 Interface Modes

Chat Mode - Multi-modal conversation interface
Agent Swarm Mode - Agent management and monitoring
Media Browser Mode - File management and attachment system
Settings Mode - Configuration and preferences

Quick Start

# Launch the TUI
ae tui

# Navigate between modes
Tab / Shift+Tab    # Cycle through modes
1, 2, 3, 4         # Jump directly to Chat, Agent, Media, Settings

# Basic navigation
↑↓ or j/k          # Navigate lists
Enter or i         # Start editing
Esc                # Cancel/return to navigation
Ctrl+C or q        # Quit application

Usage Guide

Chat Mode

The chat interface supports multi-modal conversations with AI models:

Key Bindings:
- i/Enter: Start typing a message
- m: Switch to media browser to attach files
- a: Switch to agent management
- c: Clear chat history

Multi-modal Workflow:

Switch to Media mode (press 3 or m)
Select media files (press Space to toggle selection)
Return to Chat mode (press 1 or b)
Type your message and press Enter
The AI will analyze both text and attached media

Agent Swarm Mode

Manage and monitor AI agents working on complex tasks:

Key Bindings:
- n: Create new agent
- d/Delete: Remove selected agent
- Enter/s: Assign task to agent
- c: Switch to chat mode

Agent Workflow:

Create agents with specialized capabilities
Assign tasks and monitor progress
Coordinate between agents through shared context
View real-time status updates and results

Media Browser Mode

Handle various media types in the terminal:

Key Bindings:
- Space/Enter: Toggle file selection
- o: Add file to library (placeholder in demo)
- c: Clear all selections
- d/Delete: Remove from library
- b: Return to chat with selected files

Supported Formats:

Images: JPG, PNG, GIF, WebP, BMP, TIFF, SVG
Videos: MP4, AVI, MOV, MKV, WebM, FLV
Audio: MP3, WAV, FLAC, AAC, OGG, M4A

Multi-modal AI Integration

Supported Models

The TUI integrates with various AI backends:

OpenAI: GPT-4V for vision, GPT-4 for text
Ollama: LLaVA for vision, Llama3 for text
OpenAI-compatible APIs: Custom endpoints
Text Generation Inference (TGI): High-performance serving

Configuration

Set environment variables to configure AI backends:

# OpenAI (with vision support)
export OPENAI_API_KEY="your-key"
export OPENAI_MODEL="gpt-4o"

# Ollama (with vision model)
export OLLAMA_URL="http://localhost:11434"
export OLLAMA_MODEL="llava"

# Custom endpoints
export AETHER_MODEL_URI="openai:gpt-4o"

Agent Swarm Architecture

Agent Types

Single Agents: Individual AI workers with specific tools
Specialized Agents: Domain-specific capabilities (file analysis, code generation, etc.)
Coordinator Agents: Manage task distribution and orchestration

Coordination Strategies

Round Robin: Equal task distribution
Load Balanced: Assignment based on agent capacity
Specialized: Task routing based on agent capabilities

Communication

Blackboard Pattern: Shared memory for inter-agent communication
Task Queue: Centralized task management
Real-time Updates: Live status monitoring in the TUI

Examples

Basic Multi-modal Chat

# Launch TUI and try:
# 1. Go to Media tab
# 2. Select an image
# 3. Go to Chat tab
# 4. Type: "What do you see in this image?"
# 5. Press Enter

Agent Swarm Coordination

# Create a swarm for code analysis:
# 1. Go to Agent tab
# 2. Create agent "FileAnalyzer" 
# 3. Create agent "DocumentGenerator"
# 4. Assign coordinated tasks
# 5. Monitor progress in real-time

Advanced Features

Session Management

Auto-save: Conversations and agent states persist
Export: Markdown export for chat sessions
Context Window: Intelligent message summarization

Performance

Streaming: Real-time response display
Caching: Media thumbnail generation
Async Processing: Non-blocking agent operations

Extensibility

Plugin Architecture: Custom tool integration
Model Registry: Easy backend switching
Custom Agents: Specialized agent implementations

Tips and Best Practices

Media Optimization: Use compressed formats for faster processing
Agent Coordination: Design complementary agent capabilities
Context Management: Use clear, descriptive prompts
Resource Monitoring: Watch agent status for bottlenecks
Session Organization: Regular exports for important conversations

Troubleshooting

Common Issues

No Response: Check AI backend configuration and API keys
Media Not Loading: Verify file format support and permissions
Agent Errors: Review tool allowlists and environment setup
Performance Issues: Monitor system resources and agent count

Debug Mode

# Enable debug logging
export RUST_LOG=debug
ae tui

Future Enhancements

Voice input/output support
Video frame analysis
Real-time collaboration
Custom UI themes
Performance profiling
Cloud backend integration

For more examples and advanced usage, see the examples/ directory:

09_tui_multimodal.ae - Multi-modal chat examples
10_tui_agent_swarm.ae - Agent coordination patterns
11_tui_showcase.ae - Complete feature demonstration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AetherShell Terminal User Interface (TUI)

Overview

Terminal Requirements

Recommended Terminals

Testing Your Terminal

Features

🎯 Core Capabilities

📺 Interface Modes

Quick Start

Usage Guide

Chat Mode

Agent Swarm Mode

Media Browser Mode

Multi-modal AI Integration

Supported Models

Configuration

Agent Swarm Architecture

Agent Types

Coordination Strategies

Communication

Examples

Basic Multi-modal Chat

Agent Swarm Coordination

Advanced Features

Session Management

Performance

Extensibility

Tips and Best Practices

Troubleshooting

Common Issues

Debug Mode

Future Enhancements

FilesExpand file tree

TUI_GUIDE.md

Latest commit

History

TUI_GUIDE.md

File metadata and controls

AetherShell Terminal User Interface (TUI)

Overview

Terminal Requirements

Recommended Terminals

Testing Your Terminal

Features

🎯 Core Capabilities

📺 Interface Modes

Quick Start

Usage Guide

Chat Mode

Agent Swarm Mode

Media Browser Mode

Multi-modal AI Integration

Supported Models

Configuration

Agent Swarm Architecture

Agent Types

Coordination Strategies

Communication

Examples

Basic Multi-modal Chat

Agent Swarm Coordination

Advanced Features

Session Management

Performance

Extensibility

Tips and Best Practices

Troubleshooting

Common Issues

Debug Mode

Future Enhancements