The AetherShell TUI provides a rich terminal-based interface for interacting with multi-modal Large Language Models (LLMs) and managing AI agent swarms. It offers an intuitive visual environment for complex AI workflows while maintaining the power and flexibility of the command line.
✅ Works Best:
- Windows Terminal (recommended for Windows)
- Native PowerShell console (not VS Code integrated terminal)
- Command Prompt with modern console host
- iTerm2 (macOS)
- GNOME Terminal, Konsole, Alacritty (Linux)
❌ Not Recommended:
- VS Code integrated terminal (limited raw mode support)
- Basic terminal emulators without alternate screen buffer
- SSH sessions without proper terminal forwarding
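The size and color requirements this section relies on can be checked before launching. The following is a minimal sketch, assuming a POSIX shell. `check_terminal` is a hypothetical helper and not part of AetherShell; its thresholds are the 256-color and 80x24 minimums named in the quick test below.

```shell
# Hypothetical helper (not shipped with AetherShell).
# Usage: check_terminal COLS LINES COLORS
# Returns 0 when all minimums are met; otherwise prints each failed check
# and returns 1.
check_terminal() {
  ct_cols=$1
  ct_lines=$2
  ct_colors=$3
  ct_status=0
  if [ "$ct_cols" -lt 80 ] || [ "$ct_lines" -lt 24 ]; then
    echo "terminal is ${ct_cols}x${ct_lines}; need at least 80x24"
    ct_status=1
  fi
  if [ "$ct_colors" -lt 256 ]; then
    echo "terminal reports ${ct_colors} colors; need at least 256"
    ct_status=1
  fi
  return "$ct_status"
}
```

To check the live terminal, pass it the values reported by `tput`, for example `check_terminal "$(tput cols)" "$(tput lines)" "$(tput colors)" && ae tui`. This does not detect alternate screen buffer or raw mode support, so a passing check is not a guarantee that the TUI will render correctly.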
# Quick test - if this works, TUI should work:
ae tui
# If you see rendering issues, try:
# 1. Use Windows Terminal or native PowerShell
# 2. Ensure your terminal supports 256 colors
# 3. Check terminal size is at least 80x24
- Multi-modal AI Chat: Support for text, images, audio, and video inputs
- Agent Swarm Management: Create, monitor, and coordinate multiple AI agents
- Media Processing: Terminal-friendly display and processing of media files
- Real-time Monitoring: Live status updates for agents and conversations
- Interactive Configuration: Settings management through the UI
- Chat Mode - Multi-modal conversation interface
- Agent Swarm Mode - Agent management and monitoring
- Media Browser Mode - File management and attachment system
- Settings Mode - Configuration and preferences
# Launch the TUI
ae tui
# Navigate between modes
Tab / Shift+Tab # Cycle through modes
1, 2, 3, 4 # Jump directly to Chat, Agent, Media, Settings
# Basic navigation
↑↓ or j/k # Navigate lists
Enter or i # Start editing
Esc # Cancel/return to navigation
Ctrl+C or q # Quit application
The chat interface supports multi-modal conversations with AI models:
Key Bindings:
- i/Enter: Start typing a message
- m: Switch to media browser to attach files
- a: Switch to agent management
- c: Clear chat history
Multi-modal Workflow:
- Switch to Media mode (press 3 or m)
- Select media files (press Space to toggle selection)
- Return to Chat mode (press 1 or b)
- Type your message and press Enter
- The AI will analyze both text and attached media
Manage and monitor AI agents working on complex tasks:
Key Bindings:
- n: Create new agent
- d/Delete: Remove selected agent
- Enter/s: Assign task to agent
- c: Switch to chat mode
Agent Workflow:
- Create agents with specialized capabilities
- Assign tasks and monitor progress
- Coordinate between agents through shared context
- View real-time status updates and results
Handle various media types in the terminal:
Key Bindings:
- Space/Enter: Toggle file selection
- o: Add file to library (placeholder in demo)
- c: Clear all selections
- d/Delete: Remove from library
- b: Return to chat with selected files
Supported Formats:
- Images: JPG, PNG, GIF, WebP, BMP, TIFF, SVG
- Videos: MP4, AVI, MOV, MKV, WebM, FLV
- Audio: MP3, WAV, FLAC, AAC, OGG, M4A
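The format lists above can be used to pre-sort files before attaching them. The following is a minimal sketch, assuming a POSIX shell and classification by file extension only. `media_kind` is a hypothetical helper, and the TUI's own detection may work differently (for example, by inspecting file contents).

```shell
# Hypothetical helper: classify a file by its extension, using only the
# extensions listed in this document.
# Prints one of: image, video, audio, unsupported.
media_kind() {
  # Lowercase the extension so "PHOTO.PNG" and "photo.png" are treated alike.
  mk_ext=$(printf '%s' "${1##*.}" | tr '[:upper:]' '[:lower:]')
  case $mk_ext in
    jpg|png|gif|webp|bmp|tiff|svg) echo image ;;
    mp4|avi|mov|mkv|webm|flv)      echo video ;;
    mp3|wav|flac|aac|ogg|m4a)      echo audio ;;
    *)                             echo unsupported ;;
  esac
}
```

For example, `media_kind demo.MP4` prints `video`. Variants such as `.jpeg` or `.tif` are reported as unsupported here because the lists above do not name them; whether the TUI accepts them is not stated.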
The TUI integrates with various AI backends:
- OpenAI: GPT-4V for vision, GPT-4 for text
- Ollama: LLaVA for vision, Llama3 for text
- OpenAI-compatible APIs: Custom endpoints
- Text Generation Inference (TGI): High-performance serving
Set environment variables to configure AI backends:
# OpenAI (with vision support)
export OPENAI_API_KEY="your-key"
export OPENAI_MODEL="gpt-4o"
# Ollama (with vision model)
export OLLAMA_URL="http://localhost:11434"
export OLLAMA_MODEL="llava"
# Custom endpoints
export AETHER_MODEL_URI="openai:gpt-4o"
- Single Agents: Individual AI workers with specific tools
- Specialized Agents: Domain-specific capabilities (file analysis, code generation, etc.)
- Coordinator Agents: Manage task distribution and orchestration
- Round Robin: Equal task distribution
- Load Balanced: Assignment based on agent capacity
- Specialized: Task routing based on agent capabilities
- Blackboard Pattern: Shared memory for inter-agent communication
- Task Queue: Centralized task management
- Real-time Updates: Live status monitoring in the TUI
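To make the Round Robin strategy listed above concrete, the following sketch hands tasks to agents in turn. It is an illustration of the general technique in POSIX shell, not AetherShell's scheduler; `assign_round_robin` and the task names are hypothetical, and agent names are assumed to be separated by single spaces.

```shell
# Hypothetical illustration of round-robin assignment.
# Usage: assign_round_robin "AGENT1 AGENT2 ..." TASK...
# Prints one "task -> agent" line per task.
assign_round_robin() {
  rr_agents=$1
  shift
  rr_queue=$rr_agents
  for rr_task in "$@"; do
    # Refill the queue once every agent has had a turn.
    [ -n "$rr_queue" ] || rr_queue=$rr_agents
    # Take the first name in the queue.
    rr_agent=${rr_queue%% *}
    # Drop that name from the queue (empty it if it was the last one).
    case $rr_queue in
      *" "*) rr_queue=${rr_queue#* } ;;
      *)     rr_queue= ;;
    esac
    echo "$rr_task -> $rr_agent"
  done
}
```

With two agents and three tasks, the first agent receives the first and third task. A Load Balanced strategy would instead pick the agent with the most spare capacity at each step, which requires per-agent state that this sketch does not track.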
# Launch TUI and try:
# 1. Go to Media tab
# 2. Select an image
# 3. Go to Chat tab
# 4. Type: "What do you see in this image?"
# 5. Press Enter
# Create a swarm for code analysis:
# 1. Go to Agent tab
# 2. Create agent "FileAnalyzer"
# 3. Create agent "DocumentGenerator"
# 4. Assign coordinated tasks
# 5. Monitor progress in real-time
- Auto-save: Conversations and agent states persist
- Export: Markdown export for chat sessions
- Context Window: Intelligent message summarization
- Streaming: Real-time response display
- Caching: Media thumbnail generation
- Async Processing: Non-blocking agent operations
- Plugin Architecture: Custom tool integration
- Model Registry: Easy backend switching
- Custom Agents: Specialized agent implementations
- Media Optimization: Use compressed formats for faster processing
- Agent Coordination: Design complementary agent capabilities
- Context Management: Use clear, descriptive prompts
- Resource Monitoring: Watch agent status for bottlenecks
- Session Organization: Regular exports for important conversations
- No Response: Check AI backend configuration and API keys
- Media Not Loading: Verify file format support and permissions
- Agent Errors: Review tool allowlists and environment setup
- Performance Issues: Monitor system resources and agent count
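For the "No Response" case, a quick first step is to confirm which backend variables are present in the current shell. The following is a minimal sketch, assuming a POSIX shell. `backend_status` is a hypothetical helper; it checks only the variable names that appear in this document and does not validate keys or contact any endpoint.

```shell
# Hypothetical helper: report which backend variables from this document
# are set, without printing their (possibly secret) values.
backend_status() {
  for bs_name in OPENAI_API_KEY OPENAI_MODEL OLLAMA_URL OLLAMA_MODEL AETHER_MODEL_URI; do
    # Indirect lookup: copy the value of the variable named by $bs_name.
    eval "bs_value=\${$bs_name:-}"
    if [ -n "$bs_value" ]; then
      echo "$bs_name=set"
    else
      echo "$bs_name=unset"
    fi
  done
}
```

Run `backend_status` in the same shell session that launches `ae tui`. A variable set in one terminal window is not visible in another unless it is exported from a shared profile.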
# Enable debug logging
export RUST_LOG=debug
ae tui
- Voice input/output support
- Video frame analysis
- Real-time collaboration
- Custom UI themes
- Performance profiling
- Cloud backend integration
For more examples and advanced usage, see the examples/ directory:
- 09_tui_multimodal.ae - Multi-modal chat examples
- 10_tui_agent_swarm.ae - Agent coordination patterns
- 11_tui_showcase.ae - Complete feature demonstration