Production Ready

AI Coding Tools Orchestrator

Coordinate multiple AI coding assistants to collaborate on complex software development tasks

Python 3.8+ · Vue 3 · Docker Ready · Kubernetes · Prometheus

Overview

🤝 Multi-Agent Collaboration

Coordinate Claude, Codex, Gemini, Copilot, and local model backends with intelligent workflows.

💻 Interactive CLI & Web UI

Choose between a powerful command-line interface or modern web UI with real-time updates.

⚙️ Configurable Workflows

Define custom collaboration patterns or use built-in workflows for different scenarios, including offline and hybrid execution.

🛡️ Production Ready

Security, monitoring, rate limiting, retry logic, and comprehensive test coverage built-in.

The latest implementation supports type-based adapter resolution (agents are referenced by dynamic names and resolved by their declared type), local/offline backends (Ollama and OpenAI-compatible servers), cloud-to-local fallback on recoverable failures, and live local model status probing in the Web UI.
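
A rough sketch of how such type-based resolution can work: a registry maps each agent's declared type to an adapter class, so agent names in config stay arbitrary. The class and registry names below are illustrative assumptions, not the project's actual API.

# Illustrative sketch of type-based adapter resolution -- names here
# are assumptions, not the project's actual API.
from typing import Dict, Type


class BaseAdapter:
    """Uniform interface every agent adapter implements."""

    def __init__(self, name: str, config: dict):
        self.name = name
        self.config = config

    def execute(self, prompt: str) -> str:
        raise NotImplementedError


class OllamaAdapter(BaseAdapter):
    def execute(self, prompt: str) -> str:
        ...  # POST the prompt to the Ollama /api/generate endpoint


# Map config "type" values to adapter classes; agent names stay dynamic.
ADAPTER_TYPES: Dict[str, Type[BaseAdapter]] = {
    "ollama": OllamaAdapter,
    # "llamacpp": LlamaCppAdapter, etc.
}


def resolve_adapter(name: str, agent_config: dict) -> BaseAdapter:
    """Pick an adapter class by the agent's declared type, not its name."""
    adapter_cls = ADAPTER_TYPES[agent_config["type"]]
    return adapter_cls(name, agent_config)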

How It Works

graph LR
    A[User Request] --> B[AI Orchestrator]
    B --> C{Offline Mode?}
    C -->|Yes| D[Route to Local Agent by type]
    C -->|No| E[Route to Cloud or Local Agent]
    D --> F[Execute Workflow Step]
    E --> F
    F --> G{Step Success?}
    G -->|Yes| H[Next Step]
    G -->|Recoverable failure| I[Fallback Agent]
    I --> H
    H --> J[Final Output + Files]

    style A fill:#667eea,stroke:#667eea,color:#fff
    style B fill:#764ba2,stroke:#764ba2,color:#fff
    style D fill:#43e97b,stroke:#43e97b,color:#fff
    style E fill:#4facfe,stroke:#4facfe,color:#fff
    style I fill:#f093fb,stroke:#f093fb,color:#fff
    style J fill:#00c853,stroke:#00c853,color:#fff
6+ AI Agents · 14+ Python Files · 7+ Workflows · 80%+ Test Coverage

Core Features

🤖 Multi-Agent Collaboration

Coordinate multiple AI assistants with specialized roles:

  • Codex: Initial implementation
  • Gemini: Code review & analysis
  • Claude: Refinement & documentation
  • Copilot: Alternative suggestions
  • Ollama/llama.cpp: Local offline execution

📴 Offline + Local LLM Support

  • Run local-only workflows with --offline
  • Dynamic agent keys resolved by type
  • Supported local types: ollama, llamacpp, localai, text-generation-webui
  • Built-in cloud-to-local fallback routing
  • CLI local model lifecycle: status/list/pull/remove
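
For example, the lifecycle commands map to invocations like these (exact arguments may vary by version):

./ai-orchestrator models status
./ai-orchestrator models list
./ai-orchestrator models pull codellama:13b
./ai-orchestrator models remove codellama:13b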

💬 Interactive Shell

  • REPL-style conversation interface
  • Smart follow-up detection
  • Full readline support & history
  • Session save/restore
  • Colored output with Rich

🌐 Modern Web UI

  • Vue 3 with Composition API
  • Real-time updates via Socket.IO
  • Monaco code editor (VS Code)
  • Pinia state management
  • File management & downloads
  • Conversation mode toggle

📊 Monitoring & Metrics

  • Prometheus metrics integration
  • Structured logging (structlog)
  • Health & readiness checks
  • Performance tracking
  • Error rate monitoring
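
As a sketch of what the Prometheus integration can look like with prometheus_client (the metric names below are assumptions, not the orchestrator's actual metrics):

# Hypothetical metric names; the prometheus_client calls are real API.
from prometheus_client import Counter, Histogram, start_http_server

TASKS = Counter("orchestrator_tasks_total", "Tasks executed",
                ["agent", "status"])
STEP_SECONDS = Histogram("orchestrator_step_seconds",
                         "Workflow step duration", ["agent"])


def run_step(agent: str) -> None:
    with STEP_SECONDS.labels(agent=agent).time():
        ...  # execute the workflow step
    TASKS.labels(agent=agent, status="success").inc()


start_http_server(8000)  # expose /metrics for Prometheus to scrape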

🛡️ Security Features

  • Input validation & sanitization
  • Rate limiting (token bucket)
  • Audit logging
  • Secret management
  • Security scanning (Bandit)
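
A minimal token-bucket limiter, sketched for illustration; the class shape and the rate/capacity values are arbitrary, not the project's actual implementation:

import time


class TokenBucket:
    def __init__(self, rate: float, capacity: float):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum burst size
        self.tokens = capacity
        self.updated = time.monotonic()

    def allow(self, cost: float = 1.0) -> bool:
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


bucket = TokenBucket(rate=5, capacity=10)  # ~5 req/s, bursts of 10
if not bucket.allow():
    raise RuntimeError("rate limit exceeded")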

Performance

  • Async execution support
  • Multi-layer caching
  • Connection pooling
  • Retry logic with backoff
  • Circuit breaker pattern
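
A compact sketch of retry with exponential backoff and jitter; the decorator shape and delay values are illustrative, not the project's actual code:

import random
import time
from functools import wraps


def retry(max_attempts: int = 3, base_delay: float = 1.0):
    def decorator(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return fn(*args, **kwargs)
                except Exception:
                    if attempt == max_attempts:
                        raise
                    # Exponential backoff with jitter: 1s, 2s, 4s, ...
                    delay = base_delay * 2 ** (attempt - 1)
                    time.sleep(delay + random.uniform(0, 0.5))
        return wrapper
    return decorator


@retry(max_attempts=3)
def call_agent(prompt: str) -> str:
    ...  # invoke an AI CLI or HTTP backend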

🚢 Deployment Options

  • Docker & Docker Compose
  • Kubernetes manifests
  • Systemd service files
  • CI/CD with GitHub Actions
  • Multi-environment configs

🔧 Code Quality

  • Type hints with Pydantic
  • 80%+ test coverage (pytest)
  • Black code formatting
  • Flake8 linting
  • MyPy type checking

System Architecture

The AI Orchestrator follows a modular, layered architecture with clear separation of concerns. It's designed for extensibility, reliability, and production-grade performance.

Runtime controls now include offline detection, fallback management, and local model endpoint probing, which enables hybrid and offline execution without changing the core orchestration flow.
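
A sketch of what such endpoint probing can look like; the helper name is an assumption, while /api/tags (Ollama) and /v1/models (OpenAI-compatible) are the standard model-listing endpoints:

# Hypothetical probe helper, similar in spirit to the Web UI's status probe.
import requests


def probe_local_backend(base_url: str, kind: str, timeout: float = 2.0) -> dict:
    """Return reachability and the models a local backend reports."""
    path = "/api/tags" if kind == "ollama" else "/v1/models"
    try:
        resp = requests.get(base_url.rstrip("/") + path, timeout=timeout)
        resp.raise_for_status()
        return {"reachable": True, "models": resp.json()}
    except requests.RequestException as exc:
        return {"reachable": False, "error": str(exc)}


print(probe_local_backend("http://localhost:11434", "ollama"))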

flowchart TB
    subgraph "User Interfaces"
        CLI[CLI Shell<br/>Click + Rich]
        WebUI[Web UI<br/>Vue 3 + Socket.IO]
    end
    subgraph "Core Orchestrator"
        Engine[Orchestration Engine]
        Workflow[Workflow Manager]
        Config[Config Manager]
        Session[Session Manager]
        Router[Type-based Adapter Resolver]
    end
    subgraph "Cross-Cutting Concerns"
        Metrics[Prometheus Metrics]
        Cache[Response Cache]
        Retry[Retry Logic]
        Security[Security Layer]
    end
    subgraph "AI Adapters"
        Claude[Claude Adapter]
        Codex[Codex Adapter]
        Gemini[Gemini Adapter]
        Copilot[Copilot Adapter]
        Ollama[Ollama Adapter]
        LlamaCpp[LlamaCpp Adapter]
    end
    subgraph "Runtime Controls"
        Offline[Offline Detector]
        Fallback[Fallback Manager]
        ModelStatus[Local Model Status Probe]
    end
    subgraph "External AI Tools"
        ClaudeCLI[Claude Code CLI]
        CodexCLI[OpenAI Codex CLI]
        GeminiCLI[Google Gemini CLI]
        CopilotCLI[GitHub Copilot CLI]
        OllamaAPI[Ollama API /api/generate]
        OpenAICompat[Local OpenAI-Compatible API /v1/completions]
    end

    CLI --> Engine
    WebUI --> Engine
    Engine --> Workflow
    Engine --> Config
    Engine --> Session
    Engine --> Router
    Engine --> Offline
    Engine --> Fallback
    WebUI --> ModelStatus
    ModelStatus --> OllamaAPI
    ModelStatus --> OpenAICompat
    Workflow --> Metrics
    Workflow --> Cache
    Workflow --> Retry
    Workflow --> Security
    Workflow --> Claude
    Workflow --> Codex
    Workflow --> Gemini
    Workflow --> Copilot
    Workflow --> Ollama
    Workflow --> LlamaCpp
    Claude --> ClaudeCLI
    Codex --> CodexCLI
    Gemini --> GeminiCLI
    Copilot --> CopilotCLI
    Ollama --> OllamaAPI
    LlamaCpp --> OpenAICompat

    style CLI fill:#667eea,stroke:#667eea,color:#fff
    style WebUI fill:#667eea,stroke:#667eea,color:#fff
    style Engine fill:#4facfe,stroke:#4facfe,color:#fff
    style Workflow fill:#43e97b,stroke:#43e97b,color:#fff
    style Offline fill:#ffe082,stroke:#ffca28,color:#000
    style Fallback fill:#f8bbd0,stroke:#ec407a,color:#000

1. Interface Layer

User-facing interfaces: CLI and Web UI

2. Orchestration Layer

Core business logic and workflow management

3. Cross-Cutting Layer

Security, caching, metrics, and logging

4. Adapter Layer

AI agent integrations with uniform interface

5. External Services

Third-party AI CLI tools

Design Patterns

Adapter Pattern

Uniform interface to different AI CLIs

Strategy Pattern

Configurable workflow strategies

Observer Pattern

Real-time UI updates via Socket.IO

Factory Pattern

Agent and workflow creation

Singleton Pattern

Config and metrics managers

Decorator Pattern

Retry, cache, and logging decorators
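
As an illustration of the Strategy pattern applied to workflows (class and registry names here are hypothetical, not the project's API):

from abc import ABC, abstractmethod


class WorkflowStrategy(ABC):
    @abstractmethod
    def run(self, task: str) -> str: ...


class QuickWorkflow(WorkflowStrategy):
    def run(self, task: str) -> str:
        return f"single-pass implementation of: {task}"


class ThoroughWorkflow(WorkflowStrategy):
    def run(self, task: str) -> str:
        return f"multi-agent review loop for: {task}"


STRATEGIES = {"quick": QuickWorkflow(), "thorough": ThoroughWorkflow()}


def execute(task: str, workflow: str = "quick") -> str:
    # The orchestrator swaps strategies without changing its own code.
    return STRATEGIES[workflow].run(task)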

Quick Start

1

Clone Repository

git clone <repository-url>
cd AI-Coding-Tools-Collaborative
2

Install Dependencies

pip install -r requirements.txt
chmod +x ai-orchestrator
3

Verify Installation

./ai-orchestrator --help
./ai-orchestrator agents
4

Start Interactive Shell

./ai-orchestrator shell

Local/Offline Quick Start

# Start local backend (example: Ollama)
ollama serve
ollama pull codellama:13b

# Check local backend and model status
./ai-orchestrator models status

# Run local-only workflow
./ai-orchestrator run "Build a Python CLI todo app" --workflow offline-default --offline

Prerequisites

  • Python 3.8+ - Core runtime
  • Node.js 20+ - For Web UI (optional)
  • At least one AI CLI - Claude, Codex, Gemini, or Copilot
  • Optional local backend - Ollama or any OpenAI-compatible local server for offline/hybrid runs
  • Docker - For containerized deployment (optional)

Example Usage

# Start interactive shell
./ai-orchestrator shell

orchestrator (default) > create a REST API for user management
✓ Task completed successfully!
📁 Generated Files:
  📄 api/routes.py
  📄 api/models.py

orchestrator (default) > add JWT authentication
💡 Detected as follow-up to previous task
✓ Authentication added!

orchestrator (default) > /save user-api-project
✓ Session saved!

Available Workflows

| Workflow        | Agents                                         | Iterations | Use Case                                           |
|-----------------|------------------------------------------------|------------|----------------------------------------------------|
| default         | Codex → Gemini → Claude                        | 3          | Production-quality code with review                |
| quick           | Codex only                                     | 1          | Fast prototyping and iteration                     |
| thorough        | Codex → Copilot → Gemini → Claude → Gemini     | 5          | Mission-critical or security-sensitive             |
| review-only     | Gemini → Claude                                | 2          | Analyzing existing code                            |
| document        | Claude → Gemini                                | 2          | Generating documentation                           |
| offline-default | local-code → local-instruct                    | 2          | Local-only execution in offline/air-gapped setups  |
| hybrid          | local-code → claude (fallback: local-instruct) | 2          | Local draft with cloud review and local failover   |

Workflow Execution Flow

graph TD
    START([Start]) --> LOAD[Load Workflow Config]
    LOAD --> NORM[Normalize legacy and steps formats]
    NORM --> VALIDATE[Validate workflow and agent availability]
    VALIDATE --> INIT[Initialize adapters by type]
    INIT --> ITER{Iteration < Max?}

    ITER -->|Yes| STEP[Execute step with primary agent]
    STEP --> OK{Success?}
    OK -->|Yes| CTX[Update context]
    OK -->|No recoverable| FB[Run configured fallback]
    FB --> CTX
    OK -->|No non-recoverable| FAIL[Record step failure]
    FAIL --> CTX
    CTX --> CHECK{Stop criteria met?}

    CHECK -->|No| ITER
    CHECK -->|Yes| AGG[Aggregate iteration outputs]
    ITER -->|No| AGG

    AGG --> REPORT[Generate final result]
    REPORT --> END([End])

    style START fill:#667eea,stroke:#667eea,color:#fff
    style END fill:#43e97b,stroke:#43e97b,color:#fff
    style FB fill:#f8bbd0,stroke:#ec407a,color:#000
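
The fallback branch above can be sketched as follows; RecoverableError and the adapter API are assumptions for illustration, not the project's actual interfaces:

class RecoverableError(Exception):
    """Timeout, rate limit, unreachable backend, etc."""


def execute_step(step: dict, agents: dict, context: dict) -> dict:
    primary = agents[step["agent"]]
    try:
        output = primary.execute(context["prompt"])
    except RecoverableError:
        fallback_name = step.get("fallback")
        if not fallback_name:
            raise
        # Recoverable failure: reroute the same prompt to the fallback agent.
        output = agents[fallback_name].execute(context["prompt"])
    context["last_output"] = output
    return context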

Custom Workflows

Define your own workflows in config/agents.yaml:

agents:
  my-custom-llama:
    type: llamacpp
    endpoint: http://localhost:9000
    offline: true
    enabled: true

workflows:
  custom:
    steps:
      - agent: "my-custom-llama"
        role: "implementer"
      - agent: "gemini"
        role: "reviewer"
        fallback: "my-custom-llama"
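
With that config in place, the custom workflow runs like any built-in one (the task string is just an example):

./ai-orchestrator run "Refactor the auth module" --workflow custom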

Technology Stack

Core

  • Python 3.8+
  • Click (CLI)
  • Rich (Terminal)
  • Pydantic (Validation)

Web UI

  • Vue 3 & Pinia
  • Vite
  • TailwindCSS
  • Monaco Editor

Backend

  • Flask
  • Flask-SocketIO
  • Socket.IO
  • Gunicorn

Monitoring

  • Prometheus
  • Structlog
  • Grafana
  • Health Checks

Testing

  • Pytest
  • Coverage
  • MyPy
  • Black

Deployment

  • Docker
  • Kubernetes
  • GitHub Actions
  • Systemd

Community & Support

💬 GitHub Discussions

Ask questions, share ideas, and connect with other users

Join Discussions →

🐛 Issue Tracker

Report bugs, request features, and track development

View Issues →

🤝 Contributing

Help improve the project with code, docs, or ideas

Contribution Guide →

📧 Security

Report security vulnerabilities responsibly

Security Policy →

Quick Contribution Guide

  1. Fork the repository
  2. Create a feature branch: git checkout -b feature/your-feature
  3. Make your changes with tests
  4. Run checks: make all
  5. Commit: git commit -m "feat: add amazing feature"
  6. Push and create a Pull Request

Built With

Click Rich Pydantic Vue 3 Flask Monaco Editor Pinia Prometheus Docker Kubernetes

See It In Action

Interactive CLI Shell

$ ./ai-orchestrator shell

Welcome to AI Orchestrator v1.0.0
Type /help for available commands

orchestrator (default) > create a Python REST API with FastAPI

🤖 Executing workflow: default
📊 Step 1/3: Codex (Implementation)
⏳ Processing...

✓ Implementation complete!

📊 Step 2/3: Gemini (Review)
⏳ Analyzing code...

✓ Review complete! Found 3 suggestions:
  • Add input validation
  • Include error handling
  • Add API documentation

📊 Step 3/3: Claude (Refinement)
⏳ Implementing improvements...

✓ Task completed successfully!

📁 Generated Files:
  📄 app/main.py (FastAPI app)
  📄 app/models.py (Pydantic models)
  📄 app/routes.py (API routes)
  📄 app/schemas.py (Request/response schemas)
  📄 tests/test_api.py (Unit tests)
  📄 requirements.txt (Dependencies)

Workspace: ./workspace/session-abc123

orchestrator (default) > add authentication
💡 Detected as follow-up to previous task

Modern Web Interface

The Web UI provides a visual interface with:

  • Real-time progress tracking with live updates
  • Monaco code editor (same as VS Code)
  • Pinia state management
  • File browser and management
  • Conversation mode for iterative development
  • Workflow and iteration visualization

Task Input

Multi-line textarea with syntax highlighting

Live Updates

Socket.IO for real-time progress

Code Editor

Full Monaco editor with IntelliSense

File Management

View, download, and manage generated files
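
Server-side, real-time updates with Flask-SocketIO can be as simple as the sketch below; the progress event name and payload shape are assumptions, not the project's actual schema:

from flask import Flask
from flask_socketio import SocketIO

app = Flask(__name__)
socketio = SocketIO(app, cors_allowed_origins="*")


def report_progress(step: int, total: int, agent: str) -> None:
    # Every connected browser receives the update in real time.
    socketio.emit("progress", {"step": step, "total": total, "agent": agent})


if __name__ == "__main__":
    socketio.run(app, port=5000)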

Workflow Execution

# Run with specific workflow
./ai-orchestrator run "Build authentication system" --workflow thorough

# Custom iterations
./ai-orchestrator run "Optimize database queries" --max-iterations 5

# With verbose output
./ai-orchestrator run "Add caching layer" --workflow default --verbose

# Dry run to preview
./ai-orchestrator run "Refactor code" --dry-run

# Load previous session
./ai-orchestrator shell --load my-project

Ready to Get Started?

Join developers using AI Orchestrator to build better software faster