Real-time · Local-first · Zero-config

Claude Code Agent Monitor

A professional monitoring platform for Claude Code agent activity. Captures sessions, agents, and tool events via native hooks, persists them in SQLite, and streams updates to a React UI over WebSocket — with no external services required.

Node.js ≥ 20 Express 4.21 React 18.3 TypeScript 5.7 JavaScript ES6 Vite 6 Tailwind CSS 3.4 SQLite 3 WebSocket MCP 1.0 OpenAPI 3.0 Swagger 3.0 i18next 22.4 Mermaid 10.2 better-sqlite3 12 React Router 6.28 Lucide Icons D3.js 7 PostCSS 8.5 Autoprefixer 10.4 ESLint 8.44 Python ≥ 3.6 Docker 20.10 Podman 4.0 Vitest 1.0 Testing Library 13 SSE Terraform ≥ 1.5 Kubernetes ≥ 1.24 Helm 3 Kustomize 5.0 Prometheus 2.x Grafana 10.x Nginx Ingress Coralogix OpenTelemetry AWS ECS | RDS Google Cloud Azure AKS Oracle Cloud GitLab CI Make 4.3 GitHub Actions VS Code Extension Claude Code Claude Plugins Claude CLI Electron 35 electron-builder web-push (VAPID) multer swagger-ui-express adm-zip tar cors ws uuid concurrently SMAppService macOS native APIs Windows native APIs NSIS installer Chromium (Electron) GET /api/metrics 4 Grafana dashboards Remote data sources SSH tunnel federation ccam CLI Data scope (local/all/selected) Import history File-header PR CI docker-compose monitoring npm run metrics:verify MIT License

◈ Introduction

System Overview

Claude Code Agent Monitor dashboard showing live agent cards, stats, and recent activity feed

📡 Live dashboard — real-time agent cards, stats, and activity feed

Claude Code Agent Monitor integrates with Claude Code through its native hook system. When Claude Code performs any action — tool use, session start, subagent orchestration, session end — it fires a hook that calls a small Node.js script bundled with this project. That script forwards the event over HTTP to the dashboard server, which stores it in SQLite and broadcasts it to the browser over WebSocket.

graph LR CC["Claude Code\nSession"]:::cc -->|"hooks fire on\ntool use / stop"| HH["hook-handler.js\n(stdin → HTTP)"]:::mid HH -->|"HTTP POST\n/api/hooks/event"| SRV["Express Server\n+ SQLite"]:::mid SRV -->|"WebSocket\nbroadcast"| UI["React Dashboard\n(browser)"]:::ui classDef cc fill:#6366f1,stroke:#818cf8,color:#fff classDef mid fill:#1a1a2b,stroke:#2e2e48,color:#e2e2f0 classDef ui fill:#10b981,stroke:#34d399,color:#fff

End-to-end data pipeline from Claude Code to the browser

< 50ms

Hook latency

Hook types captured

~30 MB

Server memory

1 M+

Events before slowdown

ℹ

Local-first by design

The server binds 127.0.0.1 (loopback) by default, so it is not network-reachable and everything runs on your machine. No data leaves your system. No API keys. No external services. Exposing it more widely is opt-in via DASHBOARD_HOST and should be paired with DASHBOARD_TOKEN.

◈ Features

What's Included

Every feature is driven by real hook events — nothing is hardcoded or simulated in production mode.

📡

Live Dashboard

Two tabs: Monitor shows overview stats, active agent cards with collapsible subagent hierarchy, and a recent activity feed whose item count fills available viewport height. Health renders a composite system health score ring, storage engine donut chart, cache/error/success gauges, tool invocation bars, subagent effectiveness ratios, model token distribution, and compaction stats. Both tabs auto-refresh every 5 seconds via WebSocket push so the view is always current without manual reload.

📋

Kanban Board

Toggle between Agents (Working / Waiting / Completed / Error) and Sessions (Active / Waiting / Completed / Error / Abandoned) swim lanes. A yellow Waiting column flags items sitting on the user — fresh prompt, between turns, or permission gate. Hover any column header for lifecycle tooltips explaining each state transition. Cards surface model name, cumulative cost, and the current tool being called. Counts update in real time via WebSocket so the board is always in sync with the live event store. Hover a Waiting badge to see why — needs input, turn done, at prompt, or interrupted.

📂

Sessions Table

Server-paginated table of every recorded session — each page fetches only its slice so cost computation stays bounded no matter how many sessions exist. Case-insensitive search across id, name, and cwd runs server-side with a 300 ms debounce; the status filter composes with search for precise narrowing. Each row shows the session's real name (synced live from the transcript — a /rename or claude -n title, else the auto title, else the first user prompt, with a short-ID fallback), status badge, agent count, duration, model, and estimated cost. Click any row to drill into the full session detail view with conversation transcript and agent hierarchy.

🔬

Session Detail

Per-session deep dive with a collapsible agent hierarchy tree and a full chronological event timeline showing every tool call name and summary. An overview panel at the top surfaces tile counters for events, tool calls, subagents, compactions, errors, and duration. Top-tool usage bars and a subagent type breakdown give quick distribution reads. The conversation viewer renders markdown with syntax highlighting, per-tool styled blocks, slash-command pills with their captured TUI output, and inline session-rename markers. Messages typed mid-turn (queued while Claude was still working) appear where Claude actually received them, and harness notifications are attributed to System. Export the entire session as JSON or share the permalink for async review. When the session is waiting on you, a yellow banner under the header names the reason and how long it has been waiting.

🔔

Alerts & Webhooks

A rules-based alerting engine evaluates the live event stream server-side: event pattern (match event type / tool / summary text, optionally N matches within a time window), inactivity, stuck agent, and token threshold — each with per-(rule, session, agent) cooldown dedup. Fired alerts surface in a live feed and fan out to 14 first-class webhook providers — Slack, Discord, Teams, Google Chat, Mattermost, Rocket.Chat, Telegram, PagerDuty, Opsgenie, Splunk On-Call, Zapier, Make, n8n, Pipedream — plus any generic JSON endpoint (with optional HMAC-SHA256 signing and custom headers). Delivery is detached and fail-safe with a request timeout, bounded retry/backoff, secret redaction, a one-click test probe, and a per-target delivery log. Rules and channels are managed together in Settings → Alerts.

🍎🪟

Desktop App (macOS & Windows)

A native desktop app — a macOS .app (shipped as a .dmg) and a Windows .exe (NSIS installer plus a no-install portable build) — built with Electron 35. It embeds the Express server in-process — require()-ing server/index.js directly, with no child process and no IPC — and renders the built React client in a BrowserWindow. Adds a menu-bar / notification-area (tray) icon, a native application menu, auto-start at login (macOS Login Items via SMAppService; Windows per-user HKCU\…\Run), and a single-instance lock. Closing the window hides it while the server keeps running, and the app auto-installs Claude Code hooks on first boot so an install-only user gets events flowing without a checkout.

📰

Activity Feed

Real-time streaming event log showing tool calls, agent state changes, errors, and compaction events as they arrive. Pause/resume with automatic buffering, paginated history for scrollback, and auto-scrolling to the latest entry. Click any row to expand its full hook payload inline. A dedicated Session → button navigates directly to session detail without collapsing the expanded state. Every entry is color-coded by event type and grouped by session for quick scanning of concurrent work.

📊

Analytics

Token usage breakdown by model with stacked bar charts, tool frequency rankings, agent type distribution donuts, and session outcome pie charts. A 52-week activity heatmap aligned by day-of-week shows density with hover tooltips. 30-day sparkline trends track cost and session volume at a glance. The cost summary panel totals input, output, and cache spend across all models. A live/offline indicator and auto-refresh via WebSocket keep everything current. All charts are responsive and adapt to mobile viewports.

⚡

WebSocket Push

Every UI update is pushed over a persistent WebSocket with sub-5 ms dispatch latency — zero polling anywhere. If the connection drops, automatic 2-second reconnect kicks in while a ping/pong heartbeat detects stale connections early. A sidebar indicator turns green/red so you always know whether you're live. The WebSocket carries typed JSON envelopes for new events, session updates, agent transitions, compaction results, and import progress — all parsed into the same eventBus the REST layer uses.

🖥️

Statusline

Standalone CLI statusline for Claude Code that prints model name, user, working directory, git branch, and a color-coded context-window bar (green → yellow → red). Token counts show input (green ↑), output (cyan ↓), and cache (dim) separately. Session cost in USD shifts color by configurable thresholds. ANSI-colored output updates on every turn. Python-based with a thin shell wrapper — drop it into your prompt or tmux status line. Works with any terminal emulator that supports 256-color ANSI.

🔄

History Import

Import existing Claude Code sessions from three sources — rescan the default ~/.claude/projects folder, scan any absolute path on disk, or drag-drop .jsonl, .zip, .tar.gz, and .gz archives through Settings → Import History. All paths funnel into the same ingestion pipeline the server uses for live hooks, so imported tokens and per-model cost match real-time capture exactly. Re-imports are idempotent via session-ID dedup, and archive extraction is guarded against path traversal and zip-bomb expansion.

🛰️

Continuous Project Sync

The startup auto-import of ~/.claude/projects is one-time and marker-gated, so a project folder created after first launch — whose sessions never flow through hooks (for example with host-only hooks disabled) — would stay invisible until a manual rescan. A background sync closes that gap with three triggers sharing one mtime cache and a single coalesced sweep: an immediate sweep at startup, a debounced fs.watch (recursive on macOS and Windows; root plus immediate child folders on Linux to avoid the userland recursive-watcher hazard) that fires the moment a new session file or project folder appears, and a periodic safety-net poll tunable via DASHBOARD_SESSION_SYNC_MS (default 30000 ms; 0 disables the poll while leaving the watcher running). Each sweep re-parses only files whose mtime advanced and broadcasts session_created / session_updated (plus the main agent) so the UI refreshes live, while an already-imported unchanged session is skipped without re-parsing.

🛜

Remote Data Sources

Pull Claude Code history from other machines over SSH. Each remote's ~/.claude/projects tree is mirrored via scp into a sandboxed staging directory, fed through the same importer as local history, and every session is tagged with its source. A background poller keeps remote data near-real-time, a global data-scope selector (Local only / All / a specific source) narrows the whole app, and sessions from a remote show a source badge. Auth defers entirely to the host's SSH stack — no passwords or secrets are stored. Managed in Settings → Remote Data Sources.

💾

Transcript Cache

Incremental JSONL reader shared across the hook handler, compaction scanner, conversation viewer, and import pipeline. Byte-offset tracking skips already-parsed content; cache hits short-circuit disk I/O so even sessions with tens of thousands of turns stay fast. It also extracts the live session title (custom-title / ai-title) so renames surface in real time, plus the first user prompt as a fallback descriptor for placeholder-named sessions and agents.

📉

Bounded Cache Memory

LRU eviction of cold session buffers plus a tail-cap on per-entry growable arrays (turn durations, API errors, compaction entries). A session that runs for days cannot grow a single cache entry without bound, and each entry stores its parsed result only once — no shadow copy.

🧭

Constant-Time Sweep

The periodic compaction sweep reads each active session's transcript path directly from sessions.transcript_path (a partial index covers exactly those rows), so the work is O(active sessions) instead of a json_extract scan over the whole events table.

🌳

Subagent Hierarchy

Collapsible parent–child agent tree rendered on both Dashboard and Session Detail. Agents with subagents display expand/collapse chevrons; leaf agents show a dot indicator. The tree auto-expands when any child transitions to active and correctly tracks backgrounded subagents without premature completion. Depth is unlimited — deeply nested chains render as indented rows with connecting lines. Each node shows model, current tool, status badge, and cumulative token cost for tracing spend down the spawn chain.

💰

Cost Tracking

Per-model cost estimation with configurable pricing rules — set input, output, and cache-read rates per model variant through the Settings UI. View total and per-session breakdowns on Sessions, Session Detail, and Analytics. Compaction- aware token accounting preserves baselines across context compressions so no usage is silently dropped. Cost chips appear on Kanban cards, session rows, and the sidebar summary. Subagent cards show each subagent's own cost, computed from that subagent's transcript token usage priced at current rates, so a subagent card no longer misleadingly reads as if it cost the whole session; main-agent cards still show the session total. Pricing changes retroactively recalculate all stored sessions, and imports apply the same rate table.

⚙️

Settings & Management

Model pricing editor with per-token rate configuration for every Claude variant. Hook installation status with one-click reinstall and per-hook health checks. Full JSON data export covering sessions, agents, events, tokens, and pricing rules. Session cleanup controls to abandon stale sessions or purge old data by age. Browser notification preferences with per-event toggles. A system information panel shows database row counts, file sizes, server uptime, and WebSocket connection status at a glance.

🔌

Local MCP Server

Local MCP sidecar with three transport modes — stdio for Claude Code native integration, HTTP+SSE for remote clients, and an interactive REPL for ad-hoc terminal queries. Exposes 25 typed tools across 6 domains: sessions, agents, events, analytics, settings, and system health. Every mutation is gated behind a tiered policy so nothing dangerous fires without opt-in. Retry-aware API access handles transient failures. Runs as a standalone Node process with no Docker or cloud dependency.

🧩

Claude + Codex Extensions

Instruction, skills, rules, and custom-agent layers for both Claude Code and Codex. Path-scoped rules target backend, frontend, MCP, and docs directories with context-appropriate guidelines. Reusable skills cover onboarding, feature shipping, live-issue debugging, release-readiness, and MCP operations. Specialized subagents for backend, frontend, and MCP code review run in parallel with focused tooling. Everything lives in .claude/ and is version-controlled alongside the codebase.

🔀

Workflow Graphs

D3.js-powered visualizations: an agent orchestration DAG showing spawn patterns across sessions, a tool-execution Sankey diagram mapping tool-to- tool transitions, and a directed pipeline graph with frequency labels. Every chart title carries an info icon that opens a popover explaining what it shows and how to read it. Hovering nodes, edges, and bars surfaces tooltips with share-of-source percentages, success-rate buckets, and timing patterns. All labels are translated to English, Vietnamese, and Chinese.

📊

Workflow Analytics

Subagent effectiveness scorecards with success-rate rings and day-of-week sparklines. Auto-detected workflow patterns expand on click into a detail panel with the full step chain, stats grid, and a narrative with loop detection. Model delegation flow, error propagation bars with API error cards, concurrency swim-lanes, and complexity bubble charts round out the view. Six headline stat cards each include an info popover explaining the metric and its current value. Status filter applies globally.

🧬

Workflow Runs

Surfaces "dynamic workflows" — the fleets of sub-agents spawned by the Workflow tool and self-paced /loop runs. These emit no hooks, so they are reconstructed from the on-disk run journal written when a workflow finishes (workflows/wf_<runId>.json) plus the inner subagents/agent-*.jsonl transcripts. Each run shows its phases and a per-agent token / tool-call / duration breakdown; a running workflow is detected from its launch script before the journal exists. Runs appear in a panel on the Workflows page and as a linked subsection on each session.

🔍

Session Drill-In

Searchable session selector with pagination to explore any session's agent tree, tool-call timeline, and event sequence. The detail page opens with a live-updating overview — tile counters for events, tool calls, subagents, compactions, errors, and duration. Top-tool usage bars and subagent breakdown give quick reads. The conversation viewer renders markdown with syntax highlighting. Cross-filter from DAG nodes, run compaction analysis, or export as JSON — all with real-time WebSocket auto-refresh.

🔔

Browser Notifications

Persistent browser notifications via Web Push (VAPID) for real-time alerts even when the tab is not focused or the browser is backgrounded. Includes macOS audio support so notifications are audible alongside system sounds. Per-event toggles let you choose which events fire — session starts, completions, errors, compactions, or agent spawns. Server-side subscription management ensures one push per event per browser. Works on Chrome, Edge, Firefox, and Safari 17+ with graceful degradation elsewhere.

🐳

Docker Deployment

Ready-to-use Dockerfile and docker-compose.yml for one-command deployment. Supports both Docker and Podman with persistent volume mounts for the SQLite database and hook data. Configurable port mapping via environment variables and a health-check endpoint the container runtime can poll. Multi-stage build keeps the image lean — only production deps and the compiled bundle ship. Run docker compose up -d --build and the dashboard is live with zero additional setup or configuration required.

🏪

Plugin Marketplace

Official Claude Code plugin marketplace shipping 10 plugins with 53 skills, 14 agents, 30 slash commands, 3 CLI tools, 3 hook configs, and 1 MCP server. Deep analytics with compaction-aware baselines, productivity automation, developer diagnostics, AI-powered workflow intelligence, and dashboard MCP integration. Five newer plugins go further: ccam-cost-guard (budget guardrails, spend forecasts, and cost-threshold alerts), ccam-sessions (session forensics — search, timeline, and transcript replay), ccam-workflows (multi-agent orchestration and fleet intelligence), ccam-quality (reliability and SLO checks), and ccam-config (Claude Code config and memory governance). Install with claude plugin install — no restart needed. Each listing shows author, license, homepage, and per-skill contribution breakdown. The Config Explorer's Plugins tab surfaces installed plugins with live status.

▶️

Run Claude

Spawn claude subprocesses straight from the dashboard with a chat-style streaming UI — multi-turn Conversation or single-shot Headless mode. One-click Resume on any past conversation spawns claude --resume seeded with the prior transcript. Re- attach reconciles in-memory logs with the on-disk JSONL so navigating away never loses history. Slash-command autocomplete, file references, live token/context-window meter, and a thinking-effort dial bring TUI parity to the browser. Same-origin guard blocks drive-by spawns.

⌨️

ccam CLI

The whole dashboard as terminal commands — ccam stats, kanban, tail, session <id>, analytics, alerts, pricing, import, doctor, export and more. Zero dependencies, linked by npm run setup, auto-discovers the running server, renders a full terminal UI (box-drawn tables, status icons, inline bar charts, agent trees), and pipes cleanly (colors off when not a TTY). The one destructive command is gated behind --yes.

🧰

Claude Config Explorer

A 12-tab inspector at /cc-config for everything Claude Code knows about: skills, subagents, slash commands, output styles, plugins, marketplaces, MCP servers, hooks, settings, memory, keybindings, and statusline scripts. The Settings tab leads with a Current configuration summary of the options /config controls — model, verbose, theme, output style, auto-compact, notifications — resolved across scopes. The Memory tab surfaces both the user and project CLAUDE.md files and the per-project file-based memory store — every auto-memory *.md under ~/.claude/projects/<slug>/memory/ (a MEMORY.md index plus one file per remembered fact, often 100+), grouped by project, searchable, and editable — the MEMORY.md index links are clickable and jump to (scroll to + highlight) the matching fact file. Create, edit, and delete the low-risk surfaces — text-file artifacts, per-project auto-memory files, and keybindings.json (structured inline editor) — with mandatory timestamped backups before every write. Plugins, MCP, hooks, and live settings stay read-only with explainer banners and copy-able CLI commands. See the dedicated Config Explorer section for the full tour. Per-plugin contribution breakdowns show author and license.

📱

Responsive Design

Mobile-first layouts with stacking grids, horizontally scrollable tables, and a collapsible sidebar that auto-hides below 1400 px. All pages adapt from phone to ultrawide with consistent navigation and readable typography. Kanban columns stack vertically on narrow screens, analytics charts reflow to single-column, and the activity feed stays fully swipeable. Touch targets meet 44 px minimum. Dark theme renders consistently across iOS Safari, Chrome, and Firefox with no flash of unstyled content.

🌊

Concurrency Timeline

Visualize parallel agent execution with a Gantt-style timeline showing overlapping subagent lifetimes, tool-call concurrency windows, and wait gaps. Color-coded bars distinguish working, waiting, and errored states so bottlenecks are immediately visible. Hover any bar for exact timestamps and duration. Zoom and pan across long-running sessions with hundreds of agents. The timeline shares the Workflows status filter so you can isolate active, completed, or errored sessions without leaving the view.

🔌

VS Code Extension

Professional VS Code extension with a real-time Activity Bar sidebar showing active sessions, agent counts, and recent events without leaving your editor. A status bar pulse monitor surfaces connection health and the latest event type at a glance. Deep navigation links open any session or analytics view directly in your browser. An embedded webview renders the full dashboard inside a VS Code tab with WebSocket push, theme sync, and responsive layout. Install from the marketplace or build from source.

🛡️

Error Propagation Map

Trace how errors cascade across agents and tool calls with a directed graph showing failure origins, retry paths, and recovery points. Each node displays the agent or tool that errored, the error message, and whether a retry succeeded or propagated upstream. Pinpoint root causes in deeply nested subagent chains. Horizontal bar charts rank the most error-prone tools and models. API error cards group failures by HTTP status and endpoint. Filter by session, time range, or error severity to narrow the view.

📲

Progressive Web App

Three independent PWAs — dashboard, landing page, and wiki — each with its own Web App Manifest and Service Worker. Install to your home screen or dock for a standalone, chrome-less experience. SVG icons with sizes="any" and iOS standalone meta tags included.

🔄

Fresh-by-Default Caching

The dashboard SW serves Vite's hashed /assets/* bundles cache-first (URLs are immutable per build) and treats everything else as network-first with cache fallback. Explicit Cache-Control headers on the production Express static middleware reinforce the policy, so a rebuild replaces the in-browser code without a hard refresh.

♻️

Auto-Reload on Update

A controllerchange listener in client/src/main.tsx reloads the page exactly once when a new SW takes over an already-controlled page. First installs do not reload, so the very first visit is never interrupted.

Screenshots

📡 Dashboard · Monitor — live overview of active sessions and agents. Stats tiles, collapsible subagent hierarchy cards, and a recent activity feed. Auto-refreshes every 5 s via WebSocket

🩺 Dashboard · Health — composite health score ring, storage engine donut, cache/error/success gauges, tool invocation bars, subagent effectiveness, and model token distribution

📋 Kanban Board (agents) — agents swim-laned by status: Working, Waiting, Completed, Error. Cards show model, cost, and current tool call. Yellow column flags agents waiting on user input

🗂️ Kanban Board (sessions) — sessions swim-laned across 5 columns: Active, Waiting, Completed, Error, Abandoned. Each card shows agent count, duration, model, and cumulative cost

📂 Sessions — searchable, filterable, server-paginated table. Each row shows status badge, agent count, duration, model, and cost. Click any row to drill into session detail

🤖 Session Detail · Agents — overview tiles (events, tool calls, subagents, errors, duration), top-tool usage bars, subagent type breakdown, and a collapsible parent–child agent hierarchy tree

💬 Session Detail · Conversation — full transcript viewer with markdown rendering, syntax-highlighted code blocks, per-tool sections, and collapsible thinking blocks

🔬 Session Detail · Timeline — chronological event timeline with multi-dimension filters, color-coded entries by type, expandable hook payloads, and direct links to the owning session and agent

📰 Activity Feed — real-time streaming event log with pause/resume buffering, multi-dimension filters, expandable hook payloads, color-coded entries, and per-row session navigation buttons

📊 Analytics — token usage by model, tool frequency bars, 52-week activity heatmap, 30-day sparkline trends, session outcome donuts, and cost summary with WebSocket auto-refresh

🔀 Workflows — D3.js agent orchestration DAG, tool-execution Sankey diagram, directed pipeline graph, effectiveness scorecards, concurrency swim-lanes, and complexity bubble charts

Dynamic Workflow Runs on the Workflows page

🧬 Workflow Runs — "dynamic workflows" spawned by the Workflow tool, reconstructed from on-disk run journals: status, agent count, tokens, and tool calls, expandable into a per-agent breakdown (phase, state, tokens, tools, duration) with humanized result previews

Dynamic Workflow Run expanded with phase filters and per-agent results

🧬 Workflow Runs · expanded — a run opened up: clickable color-coded phase filters, the per-agent metrics table, and a full list of clickable result items that expand to each agent's complete prompt and result

Dynamic Workflow Runs on the session detail page

🧬 Workflow Runs · in a session — the same fleets linked to their launching session, so a session's dynamic-workflow sub-agents and their folded-in token cost are visible inline

🧰 Claude Config Explorer — 12-tab inspector for skills, subagents, slash commands, plugins, MCP servers, hooks, settings, memory, keybindings, and statusline. Safe edits with backups

🧩 Claude Config Explorer · Skills — the Skills tab lists every discovered skill (user, project, and plugin) with its description and source, is searchable across the whole set, and opens any skill file for a safe, timestamp-backed edit

▶️ Run Claude — spawn or resume Claude subprocesses from the browser. Pick Conversation or Headless mode, set cwd, model, permission level, and thinking effort. Same-origin guard included

💬 Run Claude · live stream — character-by-character streaming output. Tool uses, tool results, and thinking blocks are collapsible. Active runs switcher juggles multiple sessions

⚙️ Settings — model pricing editor with per-token rates, hook installation status, JSON data export, session cleanup controls, browser notification toggles, and system info panel with DB stats

🔔 Settings · Alerts — rules-based alerting engine and outbound webhooks in one place: alert rules (event pattern / inactivity / stuck agent / token threshold) with per-rule cooldown, a live fired-alert feed, and 14 first-class webhook providers plus a generic JSON endpoint with optional HMAC signing

🛰️ Settings · Remote Data Sources — pull Claude Code activity from other machines over SSH: add sources by destination, test connectivity, sync manually or on a background poller, and switch the global data scope between local, all sources, or a specific machine with per-session source badges

📘 API Docs · Swagger UI — interactive OpenAPI 3.0 playground at /api/docs: collapsible endpoint groups, request/response schemas, auth headers, and try-it-out request execution against the live local server

📗 API Docs · ReDoc — a self-hosted, read-optimized rendering of the full OpenAPI 3.0 spec at /api/redoc, served entirely offline with no CDN. Complements the interactive Swagger UI at /api/docs; every backend route is documented with parameters, schemas, and examples

📊 Observability · Grafana — default home dashboard CCAM — Overview (four boards auto-provisioned): live fleet snapshot, database totals, breakdown charts, and rates from /api/metrics scrapes. Login admin / admin on port 3000

🔥 Observability · Prometheus console — pre-built landing page at /consoles/index.html with live metric cards, session/token tables, and drill-down links into the Graph UI (Prometheus 3.x compatible — queries the HTTP API directly)

📈 Observability · Prometheus Graph — run PromQL against scraped CCAM series (e.g. sum(ccam_sessions), ccam_events_total, rate(ccam_tokens_total[5m])) with starter expressions from the CCAM console and recording rules in monitoring/prometheus/ccam-rules.yml

Hook Events Captured

Hook Type	Trigger	Dashboard Action
`SessionStart`	Claude Code session begins	Creates session and main agent. Stamps `awaiting_input_since` (with `awaiting_reason` = `session_start`) so the row lands in Waiting from the start (the CLI is at a prompt) — except a `compact`-source SessionStart (mid-turn auto-compaction), which leaves the flag untouched so a working session stays Active. Reactivates resumed sessions. Abandons orphaned sessions with no activity for `DASHBOARD_STALE_MINUTES` (default 180).
`UserPromptSubmit`	User hits enter on a prompt	Clears the waiting flag and promotes the main agent to Working. The only reliable signal that text-only assistant turns have started — they emit no `PreToolUse` before `Stop`.
`PreToolUse`	Agent begins using a tool	Clears the waiting flag, sets agent → Working, `current_tool` set. If tool is `Agent`, subagent record created.
`PostToolUse`	Tool execution completes	Clears the waiting flag (covers permission-prompt approvals mid-tool). `current_tool` cleared. Agent stays Working.
`Stop`	Claude finishes a turn	Non-error: main agent → `waiting` — UI shows Waiting until the next user input. `stop_reason=error`: marks the agent and session Error. Background subagents keep running.
`SubagentStop`	Background agent finished	Matched subagent → Completed. Deliberately does not clear the waiting flag — a backgrounded subagent finishing tells us nothing about the human. Also kicks off a fire-and-forget JSONL scan (`scanAndImportSubagents`) that walks the session's `subagents/agent-*.jsonl` files, pairs `tool_use` ↔ `tool_result` blocks by `tool_use_id`, and emits per-tool `PreToolUse` + `PostToolUse` events under each subagent's own `agent_id` — surfaces tool calls that subagents make internally and which never fire any hooks. The same scan also rebuilds the nested-subagent hierarchy — it repoints each subagent's `parent_agent_id` to the true spawner recovered from the transcript's Task tool result (`toolUseResult.agentId`), so subagents that spawn their own subagents nest correctly instead of flattening under main.
`Notification`	Agent sends notification	Event logged to activity feed. Permission/input-prompt patterns (e.g. "needs your permission", "waiting for your input") set the agent to `waiting` and stamp `awaiting_input_since` (with `awaiting_reason` = `notification`). Compaction-related notifications tagged as `Compaction` events. Triggers a browser notification if enabled.
`Compaction`	`/compact` detected in JSONL	Creates a compaction subagent → Completed. Detected via `isCompactSummary` entries in the transcript. Token baselines preserve pre-compaction totals. Periodic scanner (cadence ~¼ of `DASHBOARD_STALE_MINUTES`) catches compactions when no hooks fire.
`APIError`	API error detected in transcript	Extracted from JSONL during history import, real-time transcript scanning, or the error detection watchdog. Captures quota limits, rate limits, auth failures, and other API errors. Immediately marks sessions and agents as error — previously recorded as events without changing status.
`Interrupted`	Turn cancelled by the user (`Esc`)	Synthesized by the watchdog because pressing `Esc` fires no hook. Recovered either from the transcript `[Request interrupted by user]` marker (flagged as `pendingInterrupt`) or, when `Esc` preceded any output and left no marker, from the idle-working timeout (`DASHBOARD_WORKING_IDLE_SECONDS`, default 120). Moves the session to Waiting — the same state a normal `Stop` produces.
`TurnDuration`	Per-turn timing recorded	Extracted from JSONL turn boundaries. Records the duration of each assistant turn for latency analysis.
`SessionEnd`	Claude Code CLI process exits	Drops the waiting flag. If the session is already in Error, the error state is preserved; otherwise marks all agents and the session as Completed. Evicts the session's transcript from the shared cache.

◈ Getting Started

Quick Start

Clone

Clone the repository to your machine

Install

Run npm run setup to install all dependencies

Start

Run npm run dev — server + client launch automatically

Use Claude

Start a new Claude Code session — events appear in real-time

bash

# 1. Clone
git clone https://github.com/hoangsonww/Claude-Code-Agent-Monitor.git
cd Claude-Code-Agent-Monitor

# 2. Install all dependencies (server + client)
npm run setup

# Optional: install hooks on the HOST (e.g. for container deployments)
# or if auto-install on server startup fails. Run this on the host, never
# inside a container — the installer refuses there (issue #193).
npm run install-hooks

# Optional: develop in a preconfigured VS Code Dev Container / Codespace
# (Node 22 + native build tools + Python, ports 4820/5173). Opt-in only —
# host-based dev is unaffected. See .devcontainer/.

# 3. Start in development mode
npm run dev
# → Express server on http://localhost:4820
# → Vite dev server on http://localhost:5173

# 4. (Optional) Build and run the local MCP server
npm run mcp:install
npm run mcp:build
npm run mcp:start              # stdio (for MCP host integration)
npm run mcp:start:http         # HTTP + SSE server on port 8819
npm run mcp:start:repl         # interactive CLI with tab completion

# 5. Open the dashboard
# http://localhost:5173  (dev)
# http://localhost:4820  (prod after npm run build && npm start)

Alternative: Docker / Podman

A multi-stage Dockerfile and docker-compose.yml are included. You can run the monitor with either Docker or Podman and keep the SQLite database in a named volume.

bash

# Docker Compose
docker compose up -d --build

# Podman Compose
CLAUDE_HOME="$HOME/.claude" podman compose up -d --build

# Plain Docker / Podman
docker build -t agent-monitor .
docker run -d --name agent-monitor \
  -p 127.0.0.1:4820:4820 \
  -v "$HOME/.claude:/root/.claude:ro" \
  -v agent-monitor-data:/app/data \
  agent-monitor

✓

Hooks auto-install in local mode

When you run the server directly on the host with npm run dev or npm start, it automatically writes Claude Code hook entries to ~/.claude/settings.json. If you run the dashboard in Docker or Podman, install hooks from the host with npm run install-hooks after the container is up, then restart Claude Code. The installer refuses to run inside a container (issue #193) so it never writes a container-internal handler path into a bind-mounted host ~/.claude; override with CCAM_ALLOW_CONTAINER_HOOKS=1 only if Claude Code itself runs in the container.

Optional: Enable MCP and Agent Extensions

This repository also ships a local MCP server under mcp/ and extension scaffolding for both Claude Code and Codex. These are optional for the dashboard UI, but recommended for complete local-agent workflows. The MCP server supports stdio (for host integration), HTTP+SSE (for remote clients), and an interactive REPL (for operator debugging).

bash

# MCP lifecycle
npm run mcp:install
npm run mcp:build
npm run mcp:start              # stdio (default — for MCP hosts)
npm run mcp:start:http         # HTTP + SSE server on port 8819
npm run mcp:start:repl         # interactive CLI with tab completion

Verification

After starting a Claude Code session, you should see:

Page	Expected
Sessions	Your session listed with status Waiting (a fresh CLI is sitting at the prompt) — flips to Active the moment Claude starts a turn
Kanban Board	A Main Agent card in the Waiting column until you type your first message; flips to Working on `UserPromptSubmit` / `PreToolUse` and back to Waiting after each `Stop`
Activity Feed	Events streaming in; click any row to expand payload, use "Session →" to drill into session details
Dashboard	Stats updating in real-time

⚠

Start server before Claude Code

Hooks only fire to a running server. If Claude Code was already running when you started the dashboard, restart the Claude Code session.

◈ Configuration

Configuration

Environment Variables

Variable	Default	Description
`DASHBOARD_PORT`	`4820`	Port the Express server listens on
`CLAUDE_DASHBOARD_PORT`	`4820`	Port used by the hook handler to reach the server (for custom port setups)
`MCP_DASHBOARD_BASE_URL`	`http://127.0.0.1:4820`	Base URL used by the local MCP server when calling dashboard APIs
`MCP_TRANSPORT`	`stdio`	MCP transport mode: `stdio`, `http`, `repl`
`MCP_HTTP_PORT`	`8819`	Port for the MCP HTTP+SSE server (only when `MCP_TRANSPORT=http`)
`MCP_HTTP_HOST`	`127.0.0.1`	Bind address for the MCP HTTP server
`DASHBOARD_DB_PATH`	`data/dashboard.db`	Path to the SQLite database file
`DASHBOARD_ALLOWED_HOSTS`	`(empty)`	Comma-separated extra `Host`-header names allowed past the DNS-rebinding guard — needed for a non-loopback client or scraper, such as Prometheus in Docker reaching `/api/metrics` via `host.docker.internal`; an unlisted host is rejected with `403 EBADHOST`
`DASHBOARD_WORKING_IDLE_SECONDS`	`120`	Idle-working timeout the watchdog uses to recover an `Esc` cancel that left no transcript marker
`DASHBOARD_LIVENESS_PROBE`	`1`	Set to `0` to disable the dead-session liveness reap — the watchdog completes active sessions with no running `claude` process; auto-disabled on Windows and in containers. Sessions forwarded from another machine (household hooks) report a non-POSIX `cwd` and are auto-skipped, so a mixed local + forwarded deployment no longer needs this off.
`DASHBOARD_LIVENESS_IDLE_SECONDS`	`60`	Idle gate for watchdog-tick liveness reaps — the transcript must not have been written for at least this long (last hook write is the fallback clock); startup passes skip the gate
`DASHBOARD_SESSION_SYNC_MS`	`30000`	Poll interval for the background sync of `~/.claude/projects`; `0` disables the poll but keeps the filesystem watcher
`NODE_ENV`	`development`	Set to `production` to serve built client from `client/dist/`

Hook Configuration

The server writes the following to ~/.claude/settings.json on every startup:

json

{
  "hooks": {
    "SessionStart": [
      { "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" SessionStart" }] }
    ],
    "PreToolUse": [
      { "matcher": "*", "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" PreToolUse" }] }
    ],
    "PostToolUse": [
      { "matcher": "*", "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" PostToolUse" }] }
    ],
    "Stop":         [{ "matcher": "*", "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" Stop" }] }],
    "SubagentStop": [{ "matcher": "*", "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" SubagentStop" }] }],
    "Notification": [{ "matcher": "*", "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" Notification" }] }],
    "SessionEnd": [
      { "hooks": [{ "type": "command", "command": "node \"/path/to/scripts/hook-handler.js\" SessionEnd" }] }
    ]
  }
}

Existing hooks are preserved. The installer only adds or updates entries containing hook-handler.js.

◈ Reference

Scripts Reference

Script	Command	Description
`setup`	`npm run setup`	Install all dependencies (server + client)
`dev`	`npm run dev`	Start server + client in development mode with hot reload
`dev:server`	`npm run dev:server`	Start only the Express server with `--watch`
`dev:client`	`npm run dev:client`	Start only the Vite dev server
`build`	`npm run build`	TypeScript check + Vite production build to `client/dist/`
`start`	`npm start`	Start Express in production mode serving built client
`install-hooks`	`npm run install-hooks`	Manually write Claude Code hooks to `~/.claude/settings.json`
`seed`	`npm run seed`	Insert demo sessions, agents, and events (8 sessions / 23 agents / 106 events)
`import-history`	`npm run import-history`	Import historical Claude Code sessions from `~/.claude` with deep JSONL extraction (API errors, turn durations, thinking blocks, subagent data)
`clear-data`	`npm run clear-data`	Delete all data from the database (keeps schema)
`test`	`npm test`	Run all server and client tests
`test:server`	`npm run test:server`	Server integration tests only (Node built-in test runner)
`test:client`	`npm run test:client`	Client unit tests only (Vitest + Testing Library)
`mcp:install`	`npm run mcp:install`	Install dependencies for the local MCP package under `mcp/`
`mcp:typecheck`	`npm run mcp:typecheck`	Type-check MCP source without emitting build output
`mcp:build`	`npm run mcp:build`	Compile MCP server into `mcp/build/`
`mcp:start`	`npm run mcp:start`	Start MCP server (stdio transport — for MCP hosts)
`mcp:start:http`	`npm run mcp:start:http`	Start MCP HTTP+SSE server on port 8819 (Streamable HTTP + legacy SSE)
`mcp:start:repl`	`npm run mcp:start:repl`	Start interactive MCP REPL with tab completion and colored output
`mcp:dev`	`npm run mcp:dev`	Run MCP server in dev mode with `tsx` (stdio)
`mcp:dev:http`	`npm run mcp:dev:http`	Run MCP HTTP server in dev mode with `tsx`
`mcp:dev:repl`	`npm run mcp:dev:repl`	Run MCP REPL in dev mode with `tsx`
`mcp:docker:build`	`npm run mcp:docker:build`	Build MCP container image with Docker (`agent-dashboard-mcp:local`)
`mcp:podman:build`	`npm run mcp:podman:build`	Build MCP container image with Podman (`localhost/agent-dashboard-mcp:local`)
`test:mcp`	`npm run test:mcp`	Run MCP server unit tests
`desktop:install`	`npm run desktop:install`	Install Electron + electron-builder under `desktop/`; rebuilds `better-sqlite3` for Electron's ABI. Preflights the native `better-sqlite3` build; prints actionable setup help (incl. a no-toolchain alternative) on failure
`desktop:build`	`npm run desktop:build`	Prebuild guard + `tsc` compile of the Electron main process into `desktop/out/`
`desktop:dev`	`npm run desktop:dev`	Build, then launch the desktop app against `desktop/out/main.js`
`desktop:test`	`npm run desktop:test`	Desktop smoke test — spawn Electron and probe `/api/health`
`desktop:dmg`	`npm run desktop:dmg`	Build both per-arch DMGs (arm64 + x64). Correct for release — slower (packages each arch)
`desktop:dmg:arm64`	`npm run desktop:dmg:arm64`	Build an Apple-Silicon-only DMG — fast (~1 min), recommended for a single machine
`desktop:dmg:x64`	`npm run desktop:dmg:x64`	Build an Intel-only DMG — fast (macOS host)
`desktop:dmg:universal`	`npm run desktop:dmg:universal`	Build one merged universal DMG (arm64 + x86_64 in a single file). Optional, slowest — not what the release ships
`desktop:win`	`npm run desktop:win`	Build the Windows NSIS installer `ClaudeCodeMonitor-Setup-<ver>-x64.exe` (Windows host)
`desktop:win:portable`	`npm run desktop:win:portable`	Build the no-install portable `ClaudeCodeMonitor-<ver>-x64-portable.exe` (Windows host)
`build:win-icon`	`npm run build:win-icon`	Regenerate `desktop/assets/icon.ico` from `icon.png` (PowerShell + .NET; Windows host)
`format`	`npm run format`	Format all files with Prettier
`format:check`	`npm run format:check`	Check formatting without writing

◈ Reference

ccam CLI

The full dashboard feature surface is also available from any terminal through the dependency-free ccam CLI (bin/ccam.js). It is linked automatically by npm run setup (via a fail-soft npm link), discovers the running server through the same ~/.claude/.agent-dashboard.json registry the hook handler uses (override with CLAUDE_DASHBOARD_PORT / DASHBOARD_PORT), and renders a full terminal UI — box-drawn tables, status icons, inline bar charts, and agent trees — that degrades to plain text when piped (--no-color / NO_COLOR / FORCE_COLOR respected), so it is equally at home in an interactive shell, grep pipelines, or CI.

For a live monitoring session, ccam repl (aliases shell / i) opens an interactive shell where you type commands without the ccam prefix — with a CCAM welcome banner, tab-completion, persisted arrow-key history, a live server-status prompt (● up / ○ offline), a grouped help / help <cmd> menu, and a watch [secs] <cmd> built-in that auto-refreshes any command. Each entered line runs as an isolated child process, so an offline refusal or a blocking tail never takes the shell down.

Group	Commands	What you get
Server	`status` · `start` · `repl`	Up/down indicator, a background server start that waits for health, and the interactive `ccam repl` shell — tab-completion, persisted history, a live server-status prompt, and a `watch` auto-refresh
Monitoring	`health` · `stats` · `kanban` · `tail`	Reachability, totals and status distributions, the Kanban board rendered as text columns, and a live event feed that polls for new rows every 2 seconds
Data	`sessions` · `session <id>` · `agents` · `events`	Filterable tables plus a per-session deep dive with an indented agent tree, per-session cost, and the most recent events
Insights	`analytics` · `workflows` · `runs` · `cost`	Token totals and top tools, workflow-intelligence stats with detected patterns, dynamic Workflow-tool runs, and the per-model cost breakdown, which can be scoped to a single session with `--session`, surfaces server-tool surcharges (web search / code execution), warns about models with usage but no matching pricing rule (priced at $0, excluded from the total), and points to `ccam pricing set`
Alerts & Webhooks	`alerts` · `alerts ack` · `rules` · `webhooks` · `webhooks test`	The fired-alert feed with acknowledge / acknowledge-all, the rule list, and synthetic test deliveries against any webhook target
Pricing	`pricing` · `pricing set / delete / reset`	Full pricing-rule CRUD with per-mtok rates — including 1-hour cache-write, fast-tier, and time-limited intro rates — straight from the terminal, with Fast In/Out and Intro In/Out columns in the list view
Import	`import rescan` · `import path <dir>`	The same idempotent ingestion pipeline the Settings page uses, reporting imported / backfilled / skipped / error counts
Remote Data Sources	`remote-sources list` · `remote-sources add` · `remote-sources test <id>` · `remote-sources sync [id]` · `remote-sources rm <id> [--purge]`	List, add, and test SSH data sources; sync one source (or all when no id is given); and remove a source, optionally purging the sessions it imported
Administration	`doctor` · `info` · `export` · `cleanup` · `reinstall-hooks` · `update-check` · `clear-data --yes` · `open`	Connectivity and hook diagnosis, raw system info, full JSON export, session cleanup, hook reinstallation, an update check that prints how far behind the canonical remote you are plus the copy-paste update command, and a guarded wipe that refuses to run without `--yes`

The CLI also manages the server itself: ccam status shows an up/down indicator and ccam start boots a detached production server. When the server is down, read-only commands fall back to reading data/dashboard.db directly under an explicit offline banner, while live or server-computed commands refuse with the specific reason.

ℹ

Safe by default

Read commands only issue GETs; mutating commands map one-to-one to explicit dashboard actions; and the single destructive command, clear-data, always refuses to run without an explicit --yes flag. Exit codes are script-friendly: 0 on success, 1 on any failure. See docs/CLI.md for the complete reference.

◈ Architecture

System Architecture

Core dashboard telemetry is composed of three processes (Claude hook source, dashboard server, browser UI). When the local MCP sidecar is enabled, it integrates with the same dashboard API via stdio, HTTP+SSE, or interactive REPL transport.

graph TB subgraph Claude["Claude Code Process"] CC["Claude Code CLI"] H0["SessionStart Hook"] H1["PreToolUse Hook"] H2["PostToolUse Hook"] H3["Stop Hook"] H4["SubagentStop Hook"] H5["Notification Hook"] H6["SessionEnd Hook"] CC --> H0 CC --> H1 CC --> H2 CC --> H3 CC --> H4 CC --> H5 CC --> H6 end subgraph Hook["Hook Layer"] HH["hook-handler.js"] end subgraph Server["Server Process — port 4820"] direction TB EX["Express App"] HR["/api/hooks"] SR["/api/sessions"] AR["/api/agents"] AN["/api/analytics"] DB[("SQLite WAL")] WSS["WebSocket Server"] EX --> HR EX --> SR EX --> AR EX --> AN HR -->|transaction| DB SR --> DB AR --> DB AN --> DB HR -->|broadcast| WSS end subgraph Client["Browser"] direction TB APP["React App"] WS_C["WS Client"] EB["Event Bus"] PAGES["Dashboard / Kanban /\nSessions / Analytics /\nWorkflows"] APP --> WS_C WS_C --> EB EB --> PAGES end H0 -->|stdin JSON| HH H1 -->|stdin JSON| HH H2 -->|stdin JSON| HH H3 -->|stdin JSON| HH H4 -->|stdin JSON| HH H5 -->|stdin JSON| HH H6 -->|stdin JSON| HH HH -->|"POST /api/hooks/event"| HR WSS -->|push| WS_C PAGES -->|fetch| EX

Full system architecture — Claude Code process → Hook Layer → Server → Browser

Agent State Machine

stateDiagram-v2 [*] --> waiting: ensureSession (first hook) waiting --> working: PreToolUse / UserPromptSubmit working --> working: PostToolUse (tool completed) working --> waiting: Stop (non-error) working --> waiting: Notification (input prompt) working --> waiting: Esc cancel (watchdog marker or idle timeout) waiting --> error: Stop with error working --> error: Stop with error waiting --> error: API error (watchdog) working --> error: API error (watchdog) error --> working: UserPromptSubmit / PreToolUse (recovery) working --> completed: SessionEnd waiting --> completed: SessionEnd note right of waiting Agent is between turns or awaiting user input end note

Agent status transitions driven by hook events. waiting is a real persisted status — agents start as waiting and return to it after each turn. Error recovery requires active user retry (UserPromptSubmit or PreToolUse). A background watchdog detects API errors in transcripts every 15 s. The same watchdog also recovers Esc-cancelled turns — via either the transcript [Request interrupted by user] marker or the idle-working timeout when Esc preceded any output — and moves the session to Waiting.

Session State Machine

stateDiagram-v2 [*] --> waiting: SessionStart startup/resume/clear (status=active + flag) active --> active: SessionStart compact (mid-turn — state preserved, no flag) waiting --> active: UserPromptSubmit / PreToolUse / PostToolUse active --> waiting: Stop (non-error, flag re-stamped) active --> waiting: Permission Notification (agent → waiting) active --> waiting: Esc cancel (watchdog marker or idle timeout) active --> error: Stop (stop_reason=error) active --> error: API error (watchdog) waiting --> error: API error (watchdog) error --> active: UserPromptSubmit / PreToolUse (recovery) waiting --> completed: SessionEnd active --> completed: SessionEnd error --> error: SessionEnd (preserves error) waiting --> abandoned: Stale > DASHBOARD_STALE_MINUTES active --> abandoned: Stale > DASHBOARD_STALE_MINUTES completed --> active: Session resumed error --> active: Session resumed abandoned --> active: Session resumed completed --> [*] error --> [*] abandoned --> [*]

Session status lifecycle. waiting is a UI overlay — persisted as active with awaiting_input_since set and awaiting_reason recording why (session_start, stop, notification, or interrupted). SessionEnd preserves error state. Error recovery requires UserPromptSubmit or PreToolUse. The watchdog also recovers Esc-cancelled turns (marker or idle-timeout path) and moves the session to Waiting. The dashboard surfaces the reason on every Waiting badge (hover tooltip, plus an inline chip on the Sessions table and session detail header) and in a waiting-for-input banner on the session detail page.

◈ Architecture

Data Flow

Event Ingestion Pipeline

flowchart TD CC["Claude Code"] -->|stdin JSON| HH["hook-handler.js
parse JSON,
add hook_type"] HH -->|POST| API["Express
/api/hooks/event"] API --> TX["SQLite txn:
ensureSession,
process by hook_type,
insertEvent, COMMIT"] TX --> RULES["per hook_type:
stamp / clear awaiting flag,
track current tool,
working / waiting / error,
complete subagents,
SessionEnd → all done"] RULES -->|broadcast| WS["WebSocket:
agent_updated,
new_event"] WS -->|publish| UI["React Client
re-renders"]

Complete event ingestion from hook fire to browser re-render

Client Data Loading Pattern

Initial load + WebSocket subscription lifecycle

◈ Architecture

Server Architecture

graph TD INDEX["server/index.js
Express app + HTTP server"] DB["server/db.js
SQLite + prepared statements
better-sqlite3 / node:sqlite fallback"] WS["server/websocket.js
WS server + heartbeat"] HOOKS["routes/hooks.js
Hook event processing"] SESSIONS["routes/sessions.js
Session CRUD"] AGENTS["routes/agents.js
Agent CRUD"] EVENTS["routes/events.js
Event listing"] STATS["routes/stats.js
Aggregate queries"] ANALYTICS["routes/analytics.js
Analytics metrics"] PRICING["routes/pricing.js
Pricing CRUD + cost calc"] SETTINGS_R["routes/settings.js
System info + data mgmt"] WORKFLOWS_R["routes/workflows.js
Workflow visualizations"] INDEX --> DB INDEX --> WS INDEX --> HOOKS INDEX --> SESSIONS INDEX --> AGENTS INDEX --> EVENTS INDEX --> STATS INDEX --> ANALYTICS INDEX --> PRICING INDEX --> SETTINGS_R INDEX --> WORKFLOWS_R HOOKS --> DB HOOKS --> WS SESSIONS --> DB SESSIONS --> WS AGENTS --> DB AGENTS --> WS EVENTS --> DB STATS --> DB ANALYTICS --> DB PRICING --> DB SETTINGS_R --> DB WORKFLOWS_R --> DB

Server module dependency graph

Server Modules

Module	Responsibility
`server/index.js`	Express app setup, middleware (CORS, JSON 1MB limit), route mounting, static serving in production, HTTP server, auto-hook installation on startup
`server/db.js`	SQLite connection, WAL/FK pragmas, schema migrations (`CREATE TABLE IF NOT EXISTS`), all prepared statements as a reusable `stmts` object. Tries `better-sqlite3` first, falls back to built-in `node:sqlite` via `compat-sqlite.js`
`server/compat-sqlite.js`	Compatibility wrapper giving Node.js built-in `node:sqlite` (`DatabaseSync`) the same API as `better-sqlite3` — pragma, transaction, prepare. Used as automatic fallback on Node 22+
`server/websocket.js`	WebSocket server on `/ws` path, 30s ping/pong heartbeat, typed `broadcast(type, data)` function
`routes/hooks.js`	Core event processing inside SQLite transactions. Auto-creates sessions/agents. Switch-case dispatch by hook type. Extracts token usage from Stop events.
`routes/sessions.js`	CRUD with pagination. GET includes agent count via LEFT JOIN. POST is idempotent on session ID.
`routes/agents.js`	CRUD with status/session_id filtering. PATCH broadcasts `agent_updated`.
`routes/events.js`	Read-only event listing with session_id filter and pagination.
`routes/stats.js`	Single aggregate query — total/active counts, status distributions, WS connection count.
`routes/analytics.js`	Extended analytics — token totals, tool usage counts, daily event/session trends, agent type distribution, event type breakdown, average events per session.
`routes/pricing.js`	Model pricing CRUD (list, upsert, delete). Per-session and global cost calculation with pattern-based model matching and specificity sorting.
`routes/settings.js`	System info (DB size, row counts, hook status, server uptime). Data export as one versioned JSON bundle and matching import/restore (`POST /api/settings/import` via `server/lib/data-transfer.js` — idempotent, session-atomic, non-destructive; consolidate machines). Session cleanup (abandon stale active sessions, purge old completed sessions). Clear all data. Reset pricing to defaults. Reinstall hooks.
`routes/workflows.js`	Aggregate workflow visualization data (agent orchestration, tool transitions, collaboration networks, workflow patterns, model delegation, error propagation, concurrency, session complexity, compaction impact). Accepts `?status=active\|completed` filter. Per-session drill-in with agent tree, tool timeline, and events.

◈ Architecture

Client Architecture

graph TD APP["App.tsx — Router + WebSocket"]:::root LAYOUT["Layout.tsx — Sidebar + Outlet"] SIDEBAR["Sidebar.tsx — Nav + Connection Status"] DASH["Dashboard.tsx"] KANBAN["KanbanBoard.tsx"] SESS["Sessions.tsx"] DETAIL["SessionDetail.tsx"] ACTIVITY["ActivityFeed.tsx"] ANALYTICS["Analytics.tsx"] WORKFLOWS["Workflows.tsx"] SETTINGS_P["Settings.tsx"] SC["StatCard x4"] AC["AgentCard list"] COL["Agents: 4 columns\n(working/waiting/completed/error)\nSessions: 5 columns\n(active/waiting/completed/error/abandoned)"] AC2["AgentCard list"] APP --> LAYOUT LAYOUT --> SIDEBAR LAYOUT --> DASH LAYOUT --> KANBAN LAYOUT --> SESS LAYOUT --> DETAIL LAYOUT --> ACTIVITY LAYOUT --> ANALYTICS LAYOUT --> WORKFLOWS LAYOUT --> SETTINGS_P DASH --> SC DASH --> AC KANBAN --> COL COL --> AC2 classDef root fill:#6366f1,stroke:#818cf8,color:#fff

React component tree

Client Routes

/ Dashboard — stats, active agents, recent events

/kanban KanbanBoard — agent status columns

/sessions Sessions — searchable, filterable table

/sessions/:id SessionDetail — agents + full event timeline

/activity ActivityFeed — real-time streaming event log

/analytics Analytics — token usage, heatmap (day-of-week aligned), tool charts, donut charts

/workflows Workflows — D3.js visualizations, cross-filtering, status filter, session drill-in

/settings Settings — model pricing, hook status, data export, session cleanup

Key Client Modules

Module	Purpose
`lib/api.ts`	Typed fetch wrapper — one method per REST endpoint. All return typed promises.
`lib/types.ts`	TypeScript interfaces: `Session`, `Agent`, `DashboardEvent`, `Stats`, `Analytics`, `WSMessage`, plus all workflow-related types (`WorkflowData`, `SessionDrillIn`, etc). Status config maps.
`lib/eventBus.ts`	Set-based pub/sub. `subscribe(fn)` returns an unsubscribe function for clean useEffect teardown.
`lib/format.ts`	Date/time formatting helpers — relative time, duration, ISO display.
`hooks/useWebSocket.ts`	Auto-reconnecting WebSocket React hook. 2-second reconnect interval. Publishes messages to eventBus.

PWA & Service Worker

The dashboard is a Progressive Web App with its own manifest.json and Service Worker (client/public/sw.js). The landing page and wiki are also independent PWAs with separate manifests and service workers.

Surface	Manifest	Service Worker	Strategy
Dashboard	`client/public/manifest.json`	`client/public/sw.js`	Precaches app shell. Cache-first for static assets (JS/CSS bundles). Network-first for navigation with offline fallback. Skips `/api/*`, `/ws`, and Vite HMR. Preserves push notification handlers.
Landing page	`manifest.json` (root)	`sw.js` (root)	Precaches HTML shell, favicon, OG image. Lazy-caches screenshot PNGs on first view. Network-first HTML, cache-first assets.
Wiki	`wiki/manifest.json`	`wiki/sw.js`	Precaches `index.html`, `style.css`, `script.js`. Fully offline after one visit.

All three SWs call skipWaiting() on install and delete stale caches on activate (keyed by version strings like dashboard-v1). Manifests use SVG icons (favicon.svg) with sizes="any". iOS standalone mode is enabled via apple-mobile-web-app-capable meta tags.

◈ Architecture

State Management

The client deliberately avoids Redux / Zustand / Context. Each page owns its data and lifecycle. WebSocket events trigger a reload or append — no complex state merging.

graph TD REST["REST API\n(initial load)"]:::source WSM["WebSocket Messages\n(real-time updates)"]:::source EB["eventBus\n(Set-based pub/sub)"]:::bus US1["useState\nDashboard"] US2["useState\nKanbanBoard"] US3["useState\nSessions"] US4["useState\nSessionDetail"] US5["useState\nActivityFeed"] US6["useState\nAnalytics"] US8["useState\nWorkflows"] US7["useState\nSettings"] REST --> US1 REST --> US2 REST --> US3 REST --> US4 REST --> US5 REST --> US6 REST --> US8 REST --> US7 WSM --> EB EB --> US1 EB --> US2 EB --> US3 EB --> US4 EB --> US5 EB --> US6 EB --> US8 EB --> US7 classDef source fill:#6366f1,stroke:#818cf8,color:#fff classDef bus fill:#f59e0b,stroke:#fbbf24,color:#000

Each page pulls initial data from REST then subscribes to eventBus for live updates

ℹ

No global store — by design

There is no cross-page shared state. Each page fetches and owns exactly the data it displays. This simplifies debugging and avoids stale-closure hazards that are common with global stores in long-running WebSocket apps.

◈ Data

Database Design

erDiagram sessions ||--o{ agents : has sessions ||--o{ events : has sessions ||--o{ token_usage : tracks agents ||--o{ events : generates agents ||--o{ agents : spawns_subagents sessions { TEXT id PK "UUID" TEXT name "Human-readable label" TEXT status "active|completed|error|abandoned" TEXT cwd "Working directory path" TEXT model "Claude model ID" TEXT started_at "ISO 8601" TEXT ended_at "ISO 8601 or NULL" TEXT metadata "JSON blob" TEXT awaiting_input_since "ISO 8601 or NULL — Waiting flag" TEXT awaiting_reason "notification|stop|session_start|interrupted or NULL" } agents { TEXT id PK "UUID or session_id-main" TEXT session_id FK TEXT name "Display name" TEXT type "main|subagent" TEXT subagent_type "Explore|general-purpose|etc" TEXT status "working|waiting|completed|error" TEXT task "Current task description" TEXT current_tool "Active tool or NULL" TEXT started_at "ISO 8601" TEXT ended_at "ISO 8601 or NULL" TEXT parent_agent_id FK "References agents.id" TEXT metadata "JSON blob" TEXT awaiting_input_since "ISO 8601 or NULL — main-agent Waiting flag" TEXT awaiting_reason "notification|stop|session_start|interrupted or NULL" } events { INTEGER id PK "Auto-increment" TEXT session_id FK TEXT agent_id FK TEXT event_type "PreToolUse|PostToolUse|Stop|etc" TEXT tool_name "Tool that fired the event" TEXT summary "Human-readable summary" TEXT data "Full event JSON" TEXT created_at "ISO 8601" } token_usage { TEXT session_id PK "Composite PK with model" TEXT model PK "Model identifier" INTEGER input_tokens INTEGER output_tokens INTEGER cache_read_tokens INTEGER cache_write_tokens } model_pricing { TEXT model_pattern PK "SQL LIKE pattern" TEXT display_name "Human-readable name" REAL input_per_mtok "USD per million input tokens" REAL output_per_mtok "USD per million output tokens" REAL cache_read_per_mtok "USD per million cache read tokens" REAL cache_write_per_mtok "USD per million cache write tokens" TEXT updated_at "ISO 8601" }

Entity Relationship Diagram — SQLite schema

Indexes

Index	Table	Column(s)	Purpose
`idx_agents_session`	agents	`session_id`	Fast agent lookup by session
`idx_agents_status`	agents	`status`	Kanban column queries
`idx_events_session`	events	`session_id`	Session detail event list
`idx_events_type`	events	`event_type`	Filter events by type
`idx_events_created`	events	`created_at DESC`	Activity feed ordering
`idx_sessions_status`	sessions	`status`	Status filter on sessions page
`idx_sessions_started`	sessions	`started_at DESC`	Default sort order

SQLite Configuration

Pragma	Value	Rationale
`journal_mode`	`WAL`	Concurrent reads during writes. Far better for read-heavy dashboards.
`foreign_keys`	`ON`	Referential integrity — prevents orphaned agents/events.
`busy_timeout`	`5000`	Wait up to 5s for write lock instead of failing immediately under load.

◈ Data

API Reference

All endpoints return JSON. Errors follow { "error": { "code", "message" } }. The OpenAPI 3.0 spec comprehensively documents every backend route - parameters, request/response schemas, field descriptions, and examples. It is served at /api/openapi.json (with a committed openapi.yaml mirror), rendered as interactive Swagger UI at /api/docs, and as a clean, read-optimized ReDoc reference at /api/redoc. ReDoc is self-hosted, so it works fully offline.

Interactive Swagger UI rendering the dashboard's OpenAPI 3.0 spec, with collapsible endpoint groups, request/response schemas, and try-it-out controls

📘 Swagger UI at /api/docs — auto-generated interactive playground for every REST endpoint. Try-it-out forms, request/response schema, auth headers, and curl snippets

ReDoc rendering the dashboard's OpenAPI 3.0 spec as a read-optimized three-panel reference, served self-hosted and fully offline

📗 ReDoc at /api/redoc — a self-hosted, read-optimized three-panel rendering of the same OpenAPI spec: deep-linkable sections, search, and full schema/example detail. Works entirely offline (no CDN)

Health

GET /api/health Returns { "status": "ok", "timestamp": "..." }

Sessions

GET /api/sessions List sessions with agent counts and per-session cost. Params: status, q (case-insensitive search across id/name/cwd), limit (default 50, max 10000), offset. Response includes total for paginators.

GET /api/sessions/:id Session detail with agents and events

POST /api/sessions Create session (idempotent on id)

PATCH /api/sessions/:id Update session status / metadata

Agents

GET /api/agents List agents — params: status, session_id, limit, offset

GET /api/agents/:id Single agent detail

POST /api/agents Create agent

PATCH /api/agents/:id Update agent status / task / current_tool

Events, Stats, Analytics

GET /api/events List events newest-first — params: session_id, limit, offset

GET /api/stats Aggregate counts + status distributions + WS connections

GET /api/analytics Token totals, tool usage, daily trends, agent types, event types, averages

Metrics

GET /api/metrics Live counters in Prometheus text-exposition format (v0.0.4): sessions and agents by status, total events, cumulative tokens by kind, connected WebSocket clients, configured remote sources, process uptime and RSS, and build version — so CCAM can be scraped into Prometheus / Grafana. Read-only, and — being under /api — behind the DNS-rebinding Host-header guard and the optional DASHBOARD_TOKEN: a non-loopback scraper (e.g. Prometheus in Docker via host.docker.internal) must be allowlisted with DASHBOARD_ALLOWED_HOSTS or the scrape returns 403 EBADHOST, and must send the token when one is set. A ready-to-run Prometheus + Grafana stack with a pre-provisioned "CCAM — Overview" dashboard lives in monitoring/ (cd monitoring && docker compose up -d; Grafana on port 3000).

Hooks Ingestion

POST /api/hooks/event Receive and process a Claude Code hook event (called by hook-handler.js)

Pricing

GET /api/pricing List all model pricing rules

PUT /api/pricing Create or update a pricing rule

DELETE /api/pricing/:pattern Delete a pricing rule

GET /api/pricing/cost Total cost across all sessions

GET /api/pricing/cost/:sessionId Cost breakdown for a specific session

Settings

GET /api/settings/info System info, DB stats, hook installation status

POST /api/settings/clear-data Delete all sessions, agents, events, token usage

POST /api/settings/reinstall-hooks Reinstall Claude Code hooks

POST /api/settings/reset-pricing Reset pricing rules to defaults

GET /api/settings/export Export all data (sessions, agents, events, tokens, workflows, runs, alert rules, pricing) as one versioned JSON download

POST /api/settings/import Restore an export bundle (multipart file or JSON { path }) — idempotent, non-destructive; consolidate machines

POST /api/settings/cleanup Abandon stale sessions (by hours), purge old data (by days)

Import History

GET /api/import/guide OS-aware paths, archive command, supported extensions, step-by-step instructions; includes live stats for the default ~/.claude/projects folder

POST /api/import/rescan Re-scan the default ~/.claude/projects directory; safe to re-run (idempotent via session-ID dedup)

POST /api/import/scan-path Scan any absolute directory (body { path }); tilde (~) is expanded; walks subdirectories recursively and imports every .jsonl found

POST /api/import/upload Multipart upload of .jsonl, .meta.json, .zip, .tar, .tar.gz, .tgz, .gz. Per-request staging dir, path-traversal and extraction-size guards. Returns 413 EXTRACTION_LIMIT_EXCEEDED on suspected bomb archives

Workflows

GET /api/workflows Aggregate workflow data — orchestration graphs, tool flows, effectiveness, patterns, model delegation, error propagation, concurrency, complexity, compaction impact. Accepts ?status=active|completed query param to filter by workflow status

GET /api/workflows/session/:id Per-session drill-in — agent tree, tool timeline, event details

Alerts

GET /api/alerts Fired-alert feed, newest first (?unacked=true, limit, offset; carries total and unacked counts)

POST /api/alerts/:id/ack Acknowledge one alert

POST /api/alerts/ack-all Acknowledge every unacked alert

GET /api/alerts/rules List alert rules

POST /api/alerts/rules Create a rule (event_pattern | inactivity | status_duration | token_threshold)

PATCH /api/alerts/rules/:id Update name / config / enabled / cooldown

DELETE /api/alerts/rules/:id Delete a rule and its fired-alert history

Webhooks

GET /api/webhooks/providers Supported providers + their config fields (drives the UI form)

GET /api/webhooks List webhook targets (URLs masked, secrets redacted)

POST /api/webhooks Create a target — 14 first-class providers (Slack, Discord, Teams, Google Chat, Mattermost, Rocket.Chat, Telegram, PagerDuty, Opsgenie, Splunk On-Call, Zapier, Make, n8n, Pipedream) + a generic JSON endpoint

PATCH /api/webhooks/:id Update name / url / enabled / secret / headers / config / rule scope (type is immutable)

DELETE /api/webhooks/:id Delete a target and its delivery log

POST /api/webhooks/:id/test Send a synthetic test alert and report the result

GET /api/webhooks/:id/deliveries Recent delivery log for a target

json — Hook event payload

{
  "hook_type": "PreToolUse",
  "data": {
    "session_id": "abc-123",
    "tool_name":  "Bash",
    "tool_input": { "command": "ls -la" }
  }
}

◈ Data

WebSocket Protocol

Property	Value
Path	`ws://localhost:4820/ws`
Protocol	Standard WebSocket (RFC 6455)
Heartbeat	Server pings every 30s — clients that don't pong are terminated
Reconnect	Client retries every 2 seconds on disconnect

Message Envelope

typescript

// All messages use this shape
{
  type:      "session_created" | "session_updated" |
             "agent_created"   | "agent_updated"   | "new_event";
  data:      Session | Agent | DashboardEvent;
  timestamp: string;  // ISO 8601
}

stateDiagram-v2 [*] --> Connecting: Component mounts Connecting --> Connected: onopen Connected --> Closed: onclose / onerror Closed --> Connecting: setTimeout 2000ms Connected --> [*]: Component unmounts Closed --> [*]: Component unmounts

Client WebSocket auto-reconnect state machine

◈ Integrations

Hook Integration

Hook Handler Design

scripts/hook-handler.js is a minimal, fail-safe forwarder. It always exits 0 so it can never block Claude Code regardless of server state.

flowchart TD START[Claude Code fires hook] --> STDIN[Read stdin to EOF] STDIN --> PARSE{Parse JSON?} PARSE -->|Success| POST["POST to 127.0.0.1:4820\n/api/hooks/event"] PARSE -->|Failure| WRAP["Wrap raw input payload"] WRAP --> POST POST --> RESP{Response?} RESP -->|200| EXIT0[exit 0] RESP -->|Error| EXIT0 RESP -->|Timeout 3s| DESTROY[Destroy request] DESTROY --> EXIT0 SAFETY["Safety net: setTimeout 5s"] --> EXIT0

hook-handler.js flow — always exits 0, never blocks Claude Code

Hook Installation Flow

flowchart TD START[Server startup or npm run install-hooks] --> READ{"~/.claude/settings.json\nexists?"} READ -->|Yes| PARSE[Parse JSON] READ -->|No| EMPTY[Start with empty object] PARSE --> CHECK[Ensure hooks section exists] EMPTY --> CHECK CHECK --> LOOP["For each hook type:\nSessionStart, PreToolUse, PostToolUse,\nStop, SubagentStop, Notification, SessionEnd"] LOOP --> EXISTS{"Our hook\nalready installed?"} EXISTS -->|Yes| UPDATE[Update command path] EXISTS -->|No| APPEND[Append to array] UPDATE --> NEXT{More hook types?} APPEND --> NEXT NEXT -->|Yes| LOOP NEXT -->|No| WRITE["Write settings.json\n(preserves all other hooks)"] WRITE --> DONE[Done]

Hook installation is idempotent — safe to run multiple times

◈ Integrations

Import Pipeline

Settings page Import History section showing rescan default folder, scan custom path, and upload controls with progress output

📥 Settings → Import History — rescan default paths, set a custom directory, or drag-and-drop .gz archives. Progress bar and result card show counts for every run

The dashboard ships with a first-class history importer that backfills sessions, agents, events, tokens, and costs from Claude Code JSONL transcripts. Live hook ingestion and manual import share the exact same parser — parseSessionFile + importSession in scripts/import-history.js — which is the architectural contract that guarantees imported token totals and cost values are identical to those captured in real time. Re-imports are idempotent: session IDs are the dedup key and compaction baseline_* columns preserve pre-compaction token totals.

Three Modes, One Pipeline

flowchart LR A1["Default folder\n~/.claude/projects"] -->|POST /api/import/rescan| R["server/routes/import.js"] A2["Custom folder\nany absolute path"] -->|POST /api/import/scan-path| R A3["Uploaded files\n.jsonl / .meta.json /\n.zip / .tar(.gz) / .gz"] -->|POST /api/import/upload\nmultipart| R R -->|archive extract\n+ path-traversal guard\n+ zip-bomb cap| X["server/lib/archive.js"] R -->|walks recursively| I["importFromDirectory\n(scripts/import-history.js)"] X --> I I -->|same pipeline\nas live hook ingestion| P["parseSessionFile +\nimportSession"] P -->|prepared statements,\none transaction| D[("SQLite\nsessions / agents / events /\ntoken_usage")] I -.->|import.progress\nthrottled ~150ms| W["WebSocket /ws"] W -.-> U["Settings → Import History\nprogress + result card"]

All three modes funnel into the same parser and DB transaction — imported numbers match live capture bit-for-bit

Upload Request Sequence

flowchart TD UI["Settings UI"] -->|upload| API["/api/import/upload"] API -->|middleware| M["multer (disk):
per-request tempDir,
fileFilter rejects
unsupported"] M --> EX["per file:
extractInto + safeJoin
(no absolute / ..),
enforce MAX_EXTRACT_BYTES"] EX --> DEC{"bomb /
oversized?"} DEC -->|yes| ERR["413
EXTRACTION_LIMIT_
EXCEEDED"] DEC -->|no| IMP["importFromDirectory:
collectJsonlFiles,
parseSessionFile,
importSession (1 txn)
→ SQLite"] IMP -->|progress| RESP["200: imported,
backfilled, skipped,
errors, rejected_files"] RESP --> CLEAN["rmTempDir (finally)"]

Upload path: multipart → safe extract → walk → parse → import — every temp dir reclaimed in finally

Idempotence & Cost Accuracy

flowchart LR A[Parse session JSONL] --> B{Session ID\nalready in DB?} B -->|no| C[Insert session,\nmain agent, events,\ntoken_usage] B -->|yes| D{Any new fields,\ntools, compactions,\nturn durations?} D -->|no| E[skipped = true] D -->|yes| F[Backfill: insert\nmissing events +\nenrich metadata] F --> G[backfilled = true] C --> H[replaceTokenUsage] F --> H H --> I{New input_tokens\n< existing?} I -->|yes\ncompaction occurred| J[Move existing into\nbaseline_* columns\nadd new on top] I -->|no| K[Overwrite with new totals]

The baseline_* columns make cost monotonic under re-imports — compacted sessions retain pre-compaction usage for billing

Supported Source Layouts

Layout	Example	Handling
Default Claude Code	`<proj>/<sid>.jsonl`	Session transcript — extracts tokens, compactions, tool uses, turn durations
Default subagent	`<proj>/<sid>/subagents/agent-*.jsonl`	Paired with parent on discovery via `findSessionSubagents`
Alternative subagent	`<proj>/subagents/<sid>/agent-*.jsonl`	Paired with parent on discovery (second layout probed automatically)
Orphan subagent	No parent JSONL in source, but `sid` exists in DB	`importFromDirectory` probes both layouts; attaches if the parent is found
Flat JSONL drop	`<root>/<sid>.jsonl`	Recognized as a loose session transcript
Archives	`.zip`, `.tar`, `.tar.gz`, `.tgz`	Extracted into a per-request temp dir, then walked by the same importer
Single-file gzip	`any.jsonl.gz`	Gunzipped in streaming mode with running byte-counter size cap

Safety Model

Threat	Mitigation
Path traversal via archive entries	`archive.safeJoin` resolves under the extraction root; any `..` or absolute path returns `null`
Zip / tar / gzip bombs	`MAX_EXTRACT_BYTES` (default 4 GB) enforced by running byte counter; aborts with `ExtractionLimitError` → HTTP 413
Per-file upload size abuse	multer `limits.fileSize = MAX_UPLOAD_BYTES` (default 1 GB)
Too many files per request	multer `limits.files = MAX_UPLOAD_FILES` (default 2000)
Unsupported file types	`fileFilter` drops them early and reports them in `rejected_files[]`
Concurrent upload temp-dir collisions	Per-request temp dir on `req._ccamUploadDir`; created in multer `destination`, reclaimed in `finally`
Arbitrary absolute path on `scan-path`	Validated: must be absolute (after `~` expansion), exist, and be a directory
Relative / traversal paths on `scan-path`	Rejected with `INVALID_INPUT`

Environment Variables

Variable	Default	Purpose
`CCAM_IMPORT_MAX_BYTES`	1 GB	Maximum size per uploaded file on `/api/import/upload`
`CCAM_IMPORT_MAX_FILES`	2000	Maximum files per upload request
`CCAM_IMPORT_MAX_EXTRACT_BYTES`	4 GB	Ceiling on total uncompressed bytes from any single archive (zip-bomb defense)

WebSocket Progress Events

Every import emits import.progress messages on /ws. Messages are throttled to at most one every ~150 ms to avoid flooding the channel on multi-thousand-session imports; the terminal complete and error frames are never throttled.

{
  "type": "import.progress",
  "timestamp": "2026-04-18T15:48:34.123Z",
  "data": {
    "importId": "upload-1729264114000",
    "phase": "parse",
    "source": "upload",
    "processed": 184,
    "total": 512,
    "current": "/tmp/ccam-import-work-xyz/project/<uuid>.jsonl",
    "counters": { "imported": 120, "backfilled": 40, "skipped": 20, "errors": 4 }
  }
}

Phases: start → scan → extract (upload only) → parse → complete, with error / extract_error replacing complete on failure.

◈ Integrations

Remote Data Sources

🛰️ Settings → Remote Data Sources — add machines by SSH destination, test connectivity, sync on demand or on a background schedule, and narrow the whole app with the global data-scope selector

Beyond local history, the dashboard can pull Claude Code activity from other machines over SSH and fold it into the same views as your local sessions. Each remote's ~/.claude/projects tree is mirrored via scp (or wsl.exe + tar when Claude runs in WSL on a Windows SSH host) into a sandboxed staging directory, fed through the exact same importer used for local history, and every imported session is tagged with the source it came from. A background poller (DASHBOARD_REMOTE_SYNC_MS, default 15000 ms) keeps remote data near-real-time — adding or re-enabling a source also triggers an immediate pull — and successful syncs broadcast remote_data.updated plus per-session session_created / session_updated frames. A global data scope selector lets you narrow the entire app to local sessions only, all sources, or a specific machine. Sources are managed in Settings → Remote Data Sources.

How It Works

Each remote is mirrored over SSH, imported through the local pipeline, and tagged by source — the poller (default 15 s) keeps it near-real-time

Setup

Add a source with an SSH destination — either a user@host pair or an alias defined in your ~/.ssh/config — plus an optional port, identity file, and remote home directory. Use Test connection to verify reachability before saving, then Sync now to pull immediately, or let the background poller do it on its own schedule. Removing a source can optionally purge the sessions it imported.

Data Scope

A global data-scope selector narrows the whole app: choose Local only, All sources, or one specific machine. The scope flows into every data view through a sources query parameter on the sessions, events, agents, stats, and analytics endpoints, and sessions imported from a remote carry a source badge so their origin is always visible.

Live Status

Because a remote session is pulled over SSH, it never emits live hooks to this dashboard, so remote sessions (source ≠ local) are excluded from every local liveness and stale sweep — the process liveness reap, the watchdog transcript scan, the startup cleanup, and the periodic abandon sweep all skip them. Instead, a remote session's live status is reconciled from the mirrored transcript on every sync: activity is judged from the newest event timestamp inside the JSONL (falling back to mirror mtime). If the last event is within DASHBOARD_REMOTE_ACTIVE_WINDOW_MS (default 10 minutes) the session stays active; once the mirror stops advancing for longer than that window, the session is reconciled to completed. This keeps a still-running remote session from being wrongly shown as Completed.

Security Model

Authentication defers entirely to the host's own SSH stack — ~/.ssh/config, ssh-agent, on-disk keys, and known_hosts. The dashboard stores no passwords or secrets; it only records the SSH destination and connection options you supply. StrictHostKeyChecking is left at its default, so the remote host must already be present in known_hosts — an unknown host fails the connection rather than being trusted blindly.

Environment Variables

Variable	Default	Purpose
`DASHBOARD_REMOTE_SYNC_MS`	`15000`	Interval for the background remote poller (add/re-enable also syncs immediately); `0` disables the poller while leaving manual sync available
`DASHBOARD_REMOTE_SYNC_TIMEOUT_MS`	`600000`	Ceiling on a single scp-and-import run before it is aborted
`DASHBOARD_REMOTE_TEST_TIMEOUT_MS`	`15000`	Timeout for the Test connection probe
`DASHBOARD_REMOTE_ACTIVE_WINDOW_MS`	`600000`	Freshness window (10 min) for a Remote Data Source session's live status — judged from the newest JSONL event timestamp in the mirror (mtime fallback); remote sessions receive no live hooks, so this replaces the local liveness and stale sweeps (which skip them)

API Endpoints

GET /api/remote-sources List configured sources with their status

POST /api/remote-sources Add a source — SSH destination plus optional port, identity file, and remote home

PATCH /api/remote-sources/:id Edit an existing source

DELETE /api/remote-sources/:id Remove a source; optionally purge the sessions it imported

POST /api/remote-sources/:id/test Test SSH reachability without importing anything

POST /api/remote-sources/:id/sync Trigger an immediate scp-and-import for that source

The sources query parameter is also accepted on /api/sessions, /api/events, /api/agents, /api/stats, and /api/analytics to scope any view to local, all, or a comma-separated list of source IDs.

WebSocket Events

Sync progress and connection state broadcast on /ws as remote_source.status messages, so the Settings panel and the source badges update live as a remote connects, syncs, or fails.

CLI

The same operations are available from the terminal via ccam remote-sources, with subcommands list, add, test <id>, sync [id] (all sources when no id is given), and rm <id> [--purge] to remove a source and optionally drop the sessions it imported.

◈ Integrations

MCP & Agent Extensions

In addition to dashboard telemetry, this project includes a production-grade local MCP server and complete extension scaffolding for both Claude Code and Codex. This gives agents a richer local tool surface while keeping all execution local-first. The MCP server supports three transport modes: stdio for host integration, HTTP+SSE for remote clients, and an interactive REPL for operator debugging.

MCP Server interactive REPL showing the MCP Tools banner, server info, and tool listing with colored output

🔧 MCP Server REPL — interactive tool invocation terminal with colored JSON output, argument prompts, error formatting, and session-aware context for rapid testing

graph LR subgraph Hosts["Agent Hosts"] CC["Claude Code"] CX["Codex"] RC["Remote Client"] OP["Operator"] end subgraph Layers["Instruction + Skill Layers"] CL["CLAUDE.md + .claude/rules + .claude/skills + .claude/agents"] AG["AGENTS.md + .codex/rules + .codex/skills + .codex/agents"] end subgraph Runtime["Local MCP Runtime"] MCP_STDIO["MCP Server (stdio)"] MCP_HTTP["MCP Server (HTTP+SSE :8819)"] MCP_REPL["MCP Server (REPL)"] API["Dashboard API\nhttp://127.0.0.1:4820"] end CC --> CL CX --> AG CC -->|"stdin/stdout"| MCP_STDIO RC -->|"POST /mcp · GET /sse"| MCP_HTTP OP -->|"interactive CLI"| MCP_REPL MCP_STDIO -->|"REST"| API MCP_HTTP -->|"REST"| API MCP_REPL -->|"REST"| API

Local extension architecture: host instructions + skills + multi-transport MCP sidecar

Local MCP Server Runtime

The mcp/ package exposes dashboard-oriented tools for AI agents across three transport modes. Mutation and destructive operations are policy-gated by environment variables and disabled by default. HTTP mode serves both Streamable HTTP (protocol 2025-11-25) and legacy SSE (protocol 2024-11-05). REPL mode provides tab-completed interactive tool invocation with colored output and JSON syntax highlighting.

Component	Location	Notes
MCP source	`mcp/src/`	TypeScript server, tools, policy guards, transport layer, CLI UI
MCP build output	`mcp/build/`	Compiled JavaScript runtime for all transport modes
MCP docs	`mcp/README.md`	Tool catalog, architecture diagrams, host integration examples, REPL guide
Transport layer	`mcp/src/transports/`	HTTP+SSE server, interactive REPL, tool handler collector
CLI UI	`mcp/src/ui/`	ANSI banner, colors, formatter with tables, boxes, JSON highlighting
Runtime commands	`npm run mcp:start\|start:http\|start:repl\|dev\|dev:http\|dev:repl`	Start MCP in stdio, HTTP+SSE, or REPL mode (production or dev)

Agent Extension Layout

Target	Files	Purpose
Claude Code	`CLAUDE.md`, `.claude/rules/`	Persistent project instructions + path-scoped coding rules
Claude Code Skills	`.claude/skills/`	Reusable workflows (onboarding, shipping, MCP ops, live debugging)
Claude Code Subagents	`.claude/agents/`	Specialized reviewers for backend, frontend, and MCP code paths
Codex Base Instructions	`AGENTS.md`, `.codex/rules/default.rules`	Project-wide guidance + execution policy defaults
Codex Skills	`.codex/skills/`	Task-specific skills aligned to this repository
Codex Agents	`.codex/agents/`	Reusable custom-agent templates for implementation and review

Root Helper Scripts

Script	Role
`scripts/hook-handler.js`	Receives Claude hook payloads over stdin and forwards them to dashboard API
`scripts/install-hooks.js`	Writes/updates hook registration in `~/.claude/settings.json`
`scripts/import-history.js`	Batch history importer used by server startup auto-import, the `/api/import/` routes, and the `import-history` CLI. Exposes `importAllSessions()` for the default projects dir and the generalized `importFromDirectory(dbModule, rootDir, {onProgress})` which walks any directory recursively, classifies session vs subagent JSONLs (probes both `<proj>/<sid>/subagents/` and `<proj>/subagents/<sid>/` layouts), and funnels everything through the shared `parseSessionFile` + `importSession` pipeline — identical to live ingest. Re-import is fully incremental: a per-event-type high-water mark (`MAX(created_at) GROUP BY event_type` for the session) drives `ts > cutoff[type]` dedup for Stop / PostToolUse / TurnDuration / ToolError, so long-running sessions whose transcripts grow across multiple days keep receiving new events on every re-run. `sessions.ended_at` is rolled forward when the JSONL has progressed past the stored value, and message-count metadata is refreshed on every pass. Session-ID dedup and `baseline_` preservation keep token totals stable. Extracts tokens, API errors, turn durations, thinking blocks, usage extras, and per-subagent breakdowns
`server/routes/import.js`	Express router for Import History. Four endpoints: `GET /api/import/guide` (OS-aware instructions + default-dir stats), `POST /api/import/rescan` (default `~/.claude/projects`), `POST /api/import/scan-path` (arbitrary absolute dir with `~` expansion), `POST /api/import/upload` (multer multipart). Each request uses a per-request temp dir reclaimed in `finally`. Progress broadcast as throttled `import.progress` WebSocket messages. Limits tunable via `CCAM_IMPORT_MAX_BYTES`, `CCAM_IMPORT_MAX_FILES`, `CCAM_IMPORT_MAX_EXTRACT_BYTES`
`server/lib/archive.js`	Safe archive extraction: `.zip` via `adm-zip`, `.tar`/`.tar.gz`/`.tgz` via `tar`, plain `.gz` streaming via `zlib`. Every entry validated through `safeJoin` which rejects absolute paths and `..` traversal before any bytes are written. Enforces a hard `MAX_EXTRACT_BYTES` cap (default 4 GB) with `ExtractionLimitError` surfaced as HTTP 413 — defense against zip/tar/gzip bombs
`scripts/seed.js`	Loads deterministic demo data for testing and demos
`scripts/clear-data.js`	Removes persisted rows while preserving schema

◈ Integrations

Plugin Marketplace

The Agent Monitor ships with an official Claude Code plugin marketplace containing ten production-ready plugins (53 skills, 14 agents, 30 slash commands, 3 CLI tools, 3 hook configs, and 1 MCP server). These extend Claude Code with skills, agents, hooks, CLI tools, and MCP integration — all grounded in the real data model (token tracking with compaction baselines, cost calculation via pattern-matched pricing rules, workflow intelligence with 11 datasets per session, and session metadata including thinking blocks, turn counts, and inference geography).

Installation

bash

# Add the marketplace
claude plugin marketplace add hoangsonww/Claude-Code-Agent-Monitor

# Install individual plugins
claude plugin install ccam-analytics@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-cost-guard@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-productivity@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-devtools@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-insights@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-sessions@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-workflows@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-quality@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-config@hoangsonww-claude-code-agent-monitor
claude plugin install ccam-dashboard@hoangsonww-claude-code-agent-monitor

Available Plugins

Plugin	Skills	Agent	CLI Tools	Focus
ccam-analytics	session-report, cost-breakdown, usage-trends, productivity-score	analytics-advisor	`ccam-stats`	Token usage (4 types + baselines), cost via pricing engine, daily trends, productivity scoring
ccam-cost-guard	budget-set, spend-forecast, cost-alert, model-savings, daily-budget-check	budget-sentinel	—	Budget guardrails: set budgets, forecast week/month-end spend, cost-threshold alerts, model-routing savings (fail-safe Stop hook)
ccam-productivity	daily-standup, weekly-report, sprint-summary, workflow-optimizer	productivity-coach	—	Standup reports, sprint tracking, workflow optimization via 11 workflow intelligence datasets
ccam-devtools	session-debug, hook-diagnostics, data-export, health-check	issue-triager	`ccam-doctor`, `ccam-export`	Session debugging, hook diagnostics, data export (JSON/CSV), system health
ccam-insights	pattern-detect, anomaly-alert, optimization-suggest, session-compare	insights-advisor	—	Pattern detection via tool flow transitions, anomaly alerting, optimization, session comparison
ccam-sessions	session-search, session-timeline, transcript-replay, cwd-rollup, session-cleanup	session-investigator	—	Session forensics: search, timeline, transcript replay, per-cwd rollup, cleanup
ccam-workflows	dag-map, delegation-audit, concurrency-report, error-propagation, fleet-runs	orchestration-analyst	—	Multi-agent orchestration & fleet intelligence: DAG map, delegation audit, concurrency, error propagation, fleet runs (11-dataset workflow intelligence API)
ccam-quality	error-scan, api-error-report, hook-failure-audit, slo-check, regression-alert	reliability-engineer	—	Reliability & SLOs: error scan, API-error report, hook-failure audit, SLO check, regression alert
ccam-config	config-audit, memory-review, skill-inventory, mcp-audit, hook-inventory	config-auditor	—	Claude Code config & memory governance: config audit, memory review, skill/MCP/hook inventory (via the Config Explorer API)
ccam-dashboard	dashboard-status, quick-stats	—	—	Dashboard connector with MCP integration and one-line metric summaries

Skill Usage Examples

claude code

# Analytics — session report with per-model token breakdown + cost
/ccam-analytics:session-report latest

# Analytics — cost breakdown with cache efficiency analysis
/ccam-analytics:cost-breakdown this week

# Productivity — daily standup grouped by project
/ccam-productivity:daily-standup today

# Productivity — workflow optimization using workflow intelligence API
/ccam-productivity:workflow-optimizer analyze

# DevTools — debug a session's full event chain
/ccam-devtools:session-debug errors

# Insights — detect tool flow patterns and anti-patterns
/ccam-insights:pattern-detect tools

# Dashboard — one-line metrics summary
/ccam-dashboard:quick-stats

CLI Tools

bash

# Quick terminal dashboard
ccam-stats                  # Sessions, costs (per-model), tokens (with baselines)
ccam-stats --cost           # Cost summary with matched pricing rules
ccam-stats --tokens         # Token usage including compaction baselines
ccam-stats --json           # Raw JSON output

# System diagnostics
ccam-doctor                 # Full diagnostic: API, endpoints, database, hooks, freshness
ccam-doctor --quick         # Basic connectivity check

# Data export
ccam-export sessions --format csv --limit 500
ccam-export all --output backup.json

Plugin Architecture

Each plugin follows the official Claude Code plugin specification. The marketplace manifest at .claude-plugin/marketplace.json catalogs all ten plugins. Each plugin directory contains:

text

plugins/ccam-{name}/
├── .claude-plugin/plugin.json    # Plugin manifest (name, version, description)
├── skills/{skill-name}/SKILL.md  # Skill definitions with $ARGUMENTS
├── commands/{command-name}.md    # Slash command definitions
├── agents/{agent-name}.md        # Agent definitions (model, tools, instructions)
├── hooks/hooks.json              # Event hooks (fail-safe, non-blocking)
├── bin/{cli-tool}                # CLI scripts (added to PATH)
├── .mcp.json                     # MCP server configuration (dashboard plugin)
└── settings.json                 # Plugin settings (dashboard plugin)

Data Model Reference

All plugins query the Agent Monitor API at http://localhost:4820. Key capabilities they leverage:

Capability	Details
Token tracking	4 types (input, output, cache_read, cache_write) + 4 compaction baselines per model per session
Cost calculation	`(tokens / 1M) × rate_per_mtok` for each type; longest pattern match wins
Session metadata	`thinking_blocks`, `turn_count`, `total_turn_duration_ms`, `usage_extras` (service_tier, speed, inference_geo)
Event types	PreToolUse, PostToolUse, Stop, SubagentStop, SessionStart, SessionEnd, Notification, Compaction, APIError, TurnDuration
Workflow intelligence	11 datasets: stats, orchestration (DAG), toolFlow, effectiveness, patterns, modelDelegation, errorPropagation, concurrency, complexity, compaction, cooccurrence
Agent hierarchy	Recursive parent/child tree with subagent_type, depth tracking via recursive CTE

📖 Full documentation: docs/plugins.md

◈ Integrations

Statusline Utility

CLI statusline showing model, user, git branch, context window bar, and token counts

🖥️ Statusline — always-visible bar showing context window usage, token counts, active model, git branch, and session ID. Configurable segments with theme support

The statusline/ directory contains a standalone CLI statusline for Claude Code — completely independent of the web dashboard. It renders a color-coded bar at the bottom of the Claude Code terminal showing context window usage, per-direction token counts, session cost in USD, and git branch.

terminal output

nguyens6@host ~/agent-dashboard/client | Sonnet 4.6 | main | ████████░░ 79% | 3↑ 2↓ 156586c | $0.4231

Segment	Source	Color Logic
Model	`data.model.display_name`	Always cyan
User	`$USERNAME` / `$USER`	Always green
Working Dir	`data.workspace.current_dir`	Always yellow, `~` prefix for home
Git Branch	`git symbolic-ref --short HEAD`	Always magenta, hidden outside git repos
Context Bar	`data.context_window.used_percentage`	Green < 50%, Yellow 50–79%, Red ≥ 80%
Token Counts	`data.context_window.current_usage`	Green `↑` input, cyan `↓` output, dim `c` cache reads
Session Cost	`data.cost.total_cost_usd`	Green < $5, Yellow $5–$20, Red ≥ $20 (shown on API and subscription plans)

Statusline rendering pipeline — invoked on each Claude Code update

Installation

Add this to ~/.claude/settings.json:

json

{
  "statusLine": {
    "type": "command",
    "command": "bash \"/absolute/path/to/statusline/statusline-command.sh\""
  }
}

ℹ

No dependencies required

The statusline uses only Python 3.6+ stdlib (sys, json, os, subprocess). It fails silently on empty input or JSON errors and never blocks Claude Code.

◈ Integrations

VS Code Extension

🔌 Sidebar — live health, analytics, and deep navigation links

The Claude Code Agent Monitor is a premium, high-fidelity extension designed to minimize context switching for AI engineers. It brings the full power of the dashboard directly into VS Code, allowing you to monitor complex subagent orchestration without ever leaving your active code file.

Detailed Components

Live Monitoring Sidebar

A dedicated Activity Bar view that performs background polling every 5 seconds. Includes a real-time Agent Health monitor tracking all 5 states (Working, Connected, Idle, Completed, Error) with native VS Code theme-aware icons and colors.

Interactive Analytics & Usage

Aggregates data from multiple API endpoints to display high-signal metrics directly in the sidebar:

Token Consumption: Scaled tracking from 1k to 1.0B+ tokens.
Live Cost Estimates: Automatic USD cost calculation based on model pricing rules.
Event Frequency: Total events, daily sessions, and subagent spawning rates.

Embedded Dashboard & Deep Navigation

Renders the full React application within a native webview tab. Supports Deep Linking: one-click jump from the sidebar directly to specific views like the Kanban Board, Analytics Hub, or your Last 10 Sessions.

Smart Auto-Detection & Connection

Seamlessly scans ports 5173 (Vite Dev) and 4820 (Production) on localhost. Automatically toggles between Online and Offline modes in the sidebar as you start or stop your local server.

✓

Zero-Config Setup

The extension is designed to be plug-and-play. Once your server is running, the extension automatically discovers the API and begins streaming telemetry — no manual URL configuration required.

📖 Full developer guide: vscode-extension/README.md

◈ Integrations

Desktop App (macOS & Windows)

The dashboard ships as an optional native desktop application — a desktop/ workspace that wraps the existing server and client into a macOS .app (distributed as a .dmg) and a Windows .exe (an NSIS installer plus a no-install portable build) you install once and forget. desktop/ is a sibling workspace to client/, server/, mcp/, and vscode-extension/, built with Electron 35. It embeds the Express server in-process — it require()s server/index.js directly in the same Node runtime as the Electron main process (no child process, no IPC) — and renders the already-built React client in a BrowserWindow. Everything you see in the browser at localhost:4820 lives inside this window, with native OS lifecycle on top.

Claude Code Monitor running as a native desktop app

🍎🪟 The full dashboard, natively on macOS and Windows — same React client, same Express server, real BrowserWindow. Menu-bar / notification-area (tray) icon included. Shipped as a macOS DMG and a Windows EXE (macOS shown) — see DESKTOP.md.

Claude Code Monitor running as a native desktop app on Windows, showing the Activity Feed, native Windows window menu, and Tabby companion

🪟 The same dashboard as a native Windows app — real BrowserWindow with the native Windows window menu, live Activity Feed, and the Tabby companion. A notification-area (system tray) icon sits beside the clock for quick access.

ℹ

One-line mental model

Electron is a window onto the same code. The desktop app does not reimplement the dashboard — it hosts the exact server and client the standalone deployment runs. The only change outside desktop/ is a behavior-preserving refactor of server/index.js: its post-listen bootstrap was extracted into an exported startBackgroundServices() so the embedded server runs exactly what node server/index.js runs.

In-Process Architecture

The Electron main process hosts the embedded server and manages the window, tray, and menus. The renderer is just Chromium loading http://127.0.0.1:<port> — the same origin a normal browser would use.

flowchart LR OS["macOS Login Items / Windows HKCU Run\nlaunch at login"]:::mid --> MAIN["Electron Main Process\n(Node 22 / Electron 35)"]:::accent MAIN --> HOST["server-host.ts\nport discovery + adopt"]:::mid HOST -->|"require() in-process"| SRV["server/index.js\nExpress + WS + SQLite"]:::mid MAIN --> TRAY["Menu-bar / notification-area tray\n+ native app menu"]:::mid MAIN --> WIN["BrowserWindow\nReact dashboard"]:::ui SRV -->|"http + ws on 127.0.0.1:port"| WIN HOOKS["Claude Code hooks"]:::mid -->|"POST /api/hooks/event"| SRV classDef mid fill:#1a1a2b,stroke:#2e2e48,color:#e2e2f0 classDef accent fill:#6366f1,stroke:#818cf8,color:#fff classDef ui fill:#10b981,stroke:#34d399,color:#fff

The desktop app embeds the Express server in-process — no child process, no IPC

What It Adds

📍

Menu-Bar / Notification-Area (Tray) Icon

An always-on tray icon — the macOS menu bar (a tinted template glyph) or the Windows notification area (the colored icon.ico). A single click (left or right) opens a dropdown with a live status snapshot queried straight from SQLite at click time — server port, active sessions, working agents, events today — followed by Open Dashboard, Open in Browser, Restart Server, Show Logs, Open at Login (toggle), and Quit. The snapshot rows are clickable — they open the dashboard. The menu is rebuilt on each open so every value is current.

🍎🪟

Native Application Menu

A standard native application menu — About, Open at Login, File, Edit, View, Window, Help — with ⌘R / Ctrl+R wired to View ▸ reload. External links open in the system browser, never inside Electron. The File ▸ Open Dashboard item (⌘1) is macOS-only; on Windows/Linux the window-attached menu can't reopen a hidden window, so reopen from the tray's Open Dashboard.

🔁

Auto-Start at Login

Flip Open at Login in the tray or app menu — both platforms go through Electron's first-party app.*LoginItemSettings API. On macOS it registers via the modern SMAppService API and appears under System Settings → General → Login Items; on Windows it writes a per-user HKCU\Software\Microsoft\Windows\CurrentVersion\Run entry, visible in Task Manager → Startup. When the app is launched at login it starts tray-only, with no window jumping into view (on Windows the login launch is detected via a --ccam-hidden argument).

🪟

Close Hides, Server Stays Up

Closing the window hides it — the embedded server keeps running, the tray icon stays, and the dock / taskbar icon stays too (a clickable "still alive" indicator). Quit (⌘Q / Ctrl+Q, app menu, or tray → Quit) pops a confirmation modal — press the Quit button or hit ⌘Q / Ctrl+Q a second time to skip the prompt — and only then does the embedded server shut down, closing SQLite cleanly with a WAL checkpoint and removing this PID's entry from the discovery file.

🔀

Runs Alongside the Web Dashboard

Launch the desktop app and npm run dev at the same time and both stay real-time. Each server appends its {port, pid, startedAt} entry to ~/.claude/.agent-dashboard.json on startup; the Claude Code hook handler reads that list and fan-outs every event to every live entry in parallel. Stale entries self-evict via a PID liveness check on read, so a crashed server can't misroute events to a dead port.

🔒

Single-Instance Lock

Double-launching the app just focuses the existing window — no second server, no port collision, on every platform. The lock is acquired via requestSingleInstanceLock() before any server boots.

🪝

First-Boot Bootstrap

On its first owned-server boot the app auto-installs the Claude Code hooks into ~/.claude/settings.json and starts the background services (update scheduler, config watcher, orphaned-run reconciliation) — so an install-only user (DMG or EXE) gets events flowing without ever running npm run install-hooks from a checkout.

Data Persistence & CLI Reliability

Two packaging realities — a read-only application bundle / install directory and (on macOS) the minimal PATH a Finder-launched app inherits — are handled automatically so installs survive updates and the Run Claude feature works out of the box on both macOS and Windows.

ℹ

Your data survives reinstalls and updates

The SQLite database and VAPID keys live in a per-user app-data directory outside the application bundle / install dir — ~/Library/Application Support/Claude Code Monitor/data/ on macOS, %APPDATA%\Claude Code Monitor\data\ on Windows. server-host.ts points DASHBOARD_DATA_DIR at that per-user directory on boot. Because a packaged, code-signed, or app-translocated bundle is read-only, older builds that stored the database inside the bundle broke History Import; with the data directory now in app-data, your imported history and events persist across app reinstalls and updates (the Windows NSIS uninstaller keeps this data by default). After upgrading from a pre-fix build, re-run Import History → Rescan once to bridge the one-time gap.

ℹ

The claude CLI is found automatically

A Finder- or Dock-launched macOS app inherits only launchd's minimal PATH, not your login shell's. At startup shell-path.ts recovers the user's login-shell PATH so the Run Claude feature can locate and spawn the claude CLI. (On Windows the process already inherits the user PATH, so no recovery step is needed.) If it still cannot be found, make sure claude is a real executable on your PATH — a shell alias or function cannot be spawned — and check the user PATH resolved line in the desktop log (~/Library/Logs/Claude Code Monitor/desktop.log on macOS, %APPDATA%\Claude Code Monitor\logs\desktop.log on Windows).

Port Discovery

On launch the Electron main process picks a free port. If a healthy dashboard server already answers /api/health on port 4820 (for example, you ran npm start in a terminal), the app adopts that server instead of starting a second one — no double-binding, no SQLite contention. An adopted server is not owned by the app, so quitting leaves it running.

Step	Port choice
Adopt	A healthy server already on `4820` is adopted as-is
Preferred	`4820` when free
Fallback	The first free port in `4821`–`4829`
Last resort	A random high port when all of the above are taken

How to Get It

Three ways to obtain the desktop app — the latest GitHub Release (best for most users), a per-commit CI artifact (fresher than the latest release), or a local build.

Option A — download the latest GitHub Release (recommended)

Open Releases → latest and download the asset for your platform. The macOS and Windows Desktop CI jobs auto-publish a new vX.Y.Z release every time the version in package.json is bumped on master, so this link always points at the current build. Releases are public — no GitHub sign-in required.

Platform	Asset	Notes
macOS (Apple Silicon)	`ClaudeCodeMonitor-<ver>-arm64.dmg`	Drag the `.app` into `/Applications`
macOS (Intel)	`ClaudeCodeMonitor-<ver>-x64.dmg`	Drag the `.app` into `/Applications`
Windows (installer)	`ClaudeCodeMonitor-Setup-<ver>-x64.exe`	NSIS installer — per-user, no admin elevation
Windows (portable)	`ClaudeCodeMonitor-<ver>-x64-portable.exe`	Run without installing

Option B — per-commit CI artifact

Want a build straight off the tip of master, ahead of the next tagged release? Every green run of the 🍎 macOS Desktop (DMG) job on macos-latest uploads both per-arch DMGs as the ClaudeCodeMonitor-dmg workflow artifact, and the 🪟 Windows Desktop (EXE) job on windows-latest uploads the installer + portable EXEs as the ClaudeCodeMonitor-win artifact. Open the latest passing run , scroll to its Artifacts section, and download ClaudeCodeMonitor-dmg or ClaudeCodeMonitor-win. (GitHub sign-in required; 14-day retention.)

Option C — build locally

From the project root, after git clone:

bash

# 1. Install root + client + vscode-extension deps
npm run setup

# 2. Build the React client (the SPA the Electron window loads)
npm run build

# 3. Install Electron + electron-builder under desktop/
#    (also fetches better-sqlite3 as a prebuilt Electron binary)
#    Preflights the native better-sqlite3 build; on failure it prints
#    copy-pasteable setup help (incl. a no-toolchain alternative) and exits non-zero
npm run desktop:install

# 4a. ON macOS — build a DMG, pick ONE:
npm run desktop:dmg:arm64    # Apple Silicon only — fast (~1 min), recommended
npm run desktop:dmg:x64      # Intel only — fast
npm run desktop:dmg          # both per-arch DMGs (arm64 + x64) — for release, SLOWER

# 4b. ON Windows — build an EXE, pick ONE:
npm run desktop:win          # NSIS installer → ClaudeCodeMonitor-Setup--x64.exe
npm run desktop:win:portable # no-install portable → ClaudeCodeMonitor--x64-portable.exe

# electron-builder packages for the HOST OS — build DMGs on a Mac, EXEs on Windows.
# desktop:dmg:arm64 / desktop:dmg:x64 each emit ONE arch-labelled DMG (the volume
# title states the arch, e.g. "Claude Code Monitor (Apple Silicon)"). desktop:dmg
# builds BOTH per-arch DMGs at once (arm64 + x64) — the release pair.
open desktop/release/ClaudeCodeMonitor-*-arm64.dmg   # match the arch you built

⚠

Use the arch-specific build on your own machine

desktop:dmg is slower because it builds the full app tree twice (once per architecture) and ad-hoc-signs each — emitting two separate per-arch DMGs (…-arm64.dmg + …-x64.dmg). There is no @electron/universal merge; the release ships the two per-arch DMGs. For running on a single Mac, use desktop:dmg:arm64 (Apple Silicon) or desktop:dmg:x64 (Intel) — one architecture, finishing in roughly a minute. CI runs desktop:dmg and uploads both DMGs as ClaudeCodeMonitor-dmg, so you rarely need to build them yourself.

Install

macOS

Open the DMG

Double-click the downloaded .dmg to mount it

Drag to Applications

Drag Claude Code Monitor.app into your Applications folder

Clear Quarantine

Run xattr -cr on the app to get past Gatekeeper (see below)

Launch

Open the app — the tray icon appears and the dashboard window loads

⚠

Gatekeeper warning on first launch

The DMG is ad-hoc signed by default — that is all the project can offer without a paid Apple Developer ID. macOS warns the first time you open it ("Apple could not verify…"). Strip the quarantine attribute to get past it:

bash

# After dragging the app into /Applications:
xattr -cr "/Applications/Claude Code Monitor.app"

Alternatively, open System Settings → Privacy & Security, find the blocked app, and click Open Anyway. Code signing and Apple notarization are opt-in for the maintainer — when configured, this warning goes away for everyone.

Windows

Run the Installer

Run ClaudeCodeMonitor-Setup-<ver>-x64.exe — a per-user NSIS install (no admin), or run the *-portable.exe to skip installing

Clear SmartScreen

The EXE is unsigned by default, so SmartScreen may warn — click More info → Run anyway

Launch

Open from the Start menu / desktop shortcut — the notification-area (tray) icon appears and the dashboard window loads

Windows NSIS installer step 1 — Choose Installation Options

1️⃣ NSIS installer, step 1 — Choose Installation Options: pick per-user setup and optional shortcuts.

Windows NSIS installer step 2 — Choose Install Location

2️⃣ NSIS installer, step 2 — Choose Install Location: defaults to %LOCALAPPDATA%\Programs\Claude Code Monitor, or point it anywhere.

Windows NSIS installer step 3 — Completing Setup

3️⃣ NSIS installer, step 3 — Completing Setup: click Finish to launch the app and drop the tray icon in the notification area.

⚠

SmartScreen warning on first launch

The installer and portable EXE are unsigned by default — that is all the project can offer without a paid code-signing certificate. Windows SmartScreen may show "Windows protected your PC" the first time you run it; click More info → Run anyway. The installer lays the app down per-user under %LOCALAPPDATA%\Programs\Claude Code Monitor (and lets you choose the install directory) and sets an AppUserModelId (com.hoangsonww.ccam.desktop) so native toast notifications are attributed correctly and the window groups under one taskbar entry.

ℹ

Bundle size

The DMG is roughly 80 MB, about 250 MB installed on disk — the standard Electron tax; the Windows installer is comparable. The app runs natively on macOS and Windows; Linux is tracked as a follow-up. Logs live at ~/Library/Logs/Claude Code Monitor/desktop.log on macOS or %APPDATA%\Claude Code Monitor\logs\desktop.log on Windows (reach them from the tray menu → Show Logs).

📖 User-facing guide: DESKTOP.md · architecture & contributor reference: desktop/README.md

◈ Integrations

Settings Page

⚙️ Settings — model pricing editor, hook installation toggle, JSON data export, session cleanup, browser notification preferences, and system info panel with DB stats

The /settings route provides a comprehensive management interface with six sections:

💲

Model Pricing

Editable table of per-model pricing rules. Each Claude model variant has its own explicit pattern (e.g., claude-opus-4-6%). Rates cover input, output, cache read, and cache write tokens. Each rule's editor also has a collapsible Introductory rates block — a YYYY-MM-DD promo cutoff plus per-category intro prices (input / output / cache-read / cache-write 5m & 1h); an empty date means no promo, and any future model-launch promo needs no code change. Reset to defaults or add custom models. The section header carries an info popover (the i icon) that explains how rule lookup works (first matching pattern wins), the SQL-style % wildcard syntax with concrete examples (claude-opus-4-7%, claude-%-haiku, exact ids), and reminds the user that prices must be updated manually when Anthropic publishes new rates — already-stored sessions keep the price applied at ingest time. The CLAUDE_HOME panel and Import History flow are fully i18n-driven across en/vi/zh.

🪝

Hook Configuration

Shows per-hook installation status (SessionStart, PreToolUse, PostToolUse, Stop, SubagentStop, Notification, SessionEnd). One-click reinstall if hooks are missing or outdated. Validates paths and permissions automatically.

🗄️

Data Management

View database row counts and size. Session cleanup: abandon stale active sessions after N hours, purge old completed sessions after N days. Danger zone: clear all data with confirmation dialog to prevent accidental loss.

📤

Data Export

Download all sessions, agents, events, token usage, and pricing rules as a single JSON file for backup or analysis. Includes full event history, model metadata, and cost breakdowns in one portable archive.

🖧

System Health

Dedicated Health tab on the Dashboard with a composite health score (weighted from success rate, cache hit rate, error rate, and heap usage), storage engine donut chart, tool invocation frequency bars, subagent effectiveness, model token distribution, and compaction impact — all with cursor-following tooltips and 5-second auto-refresh.

🔔

Notification Preferences

Configure native browser notifications with per-event toggles for session starts, completions, errors, and subagent spawns. Automatic permission management with test-send button and graceful fallback when denied.

ℹ

Per-model pricing — no catch-all grouping

Each Claude model variant (e.g., Opus 4.6 vs Opus 4.1) has its own explicit pricing pattern because different model versions have different rates. The cost engine uses specificity sorting — longer patterns match before shorter ones.

◈ Integrations

Claude Config Explorer

🧰 Claude Config Explorer — the /cc-config page: 12 tabs (skills, subagents, slash commands, output styles, plugins, marketplaces, MCP servers, hooks, keybindings, settings, memory), a scope filter (user / project / all), search, and safe edit affordances on the low-risk surfaces

Claude Config Explorer — Skills tab with searchable skill list and edit actions

🧩 Claude Config Explorer · Skills — the Skills tab lists every discovered skill (user, project, and plugin) with description and source, is searchable, and opens any SKILL.md for a timestamp-backed edit via the shared editor modal

A first-class configuration inspector at /cc-config that reads every surface Claude Code knows about — skills, subagents, slash commands, output styles, plugins, marketplaces, MCP servers, hooks, settings, memory, keybindings, and statusline scripts — from the same on-disk layout the CLI uses. The Overview tab summarizes counts and filesystem roots; every other tab drills into one surface with search, scope filtering (user / project / all), and — where safe — create / edit / delete with mandatory timestamped backups before every mutation. A cc-watcher on CLAUDE_HOME plus cc_config_changed WebSocket messages keep the page live when files change outside the dashboard.

🗂️

12 discovery tabs

Skills, Subagents, Slash commands, Output styles, Plugins (with per-plugin contributes counts and plugin.json metadata), Marketplaces, MCP servers, Hooks (plus ~/.claude/hooks/ script listing), Keybindings, Settings (Current configuration summary + per-file JSON), and Memory (user/project CLAUDE.md + per-project auto-memory). Statusline config and script previews live under Settings.

✏️

Safe mutations

PUT / DELETE /api/cc-config/file handles skills, agents, commands, output-styles, CLAUDE.md, and per-project auto-memory (scope: "auto-memory"). Writes are atomic (temp file + rename), names are validated against a strict regex, resolved paths are containment-checked, and content is capped at 256 KiB. Every overwrite or delete creates a backup under cc-config-backups/ — outside the directories Claude Code scans — so deleted artifacts cannot reappear as ghost entries.

⌨️

Keybindings editor

The Keybindings tab includes a structured inline editor for ~/.claude/keybindings.json. Add or remove contexts and bindings, edit keys and actions in place, and save via PUT /api/cc-config/keybindings. The server read-modify-writes the file: top-level metadata ($schema, $docs) is preserved, duplicate contexts/keys are rejected, and the file is backed up first. Unlike settings.json, the live CLI does not rewrite keybindings mid-session, so dashboard edits are safe.

🧠

Memory store

The Memory tab surfaces user + project CLAUDE.md and the per-project file-based memory store — every *.md under ~/.claude/projects/<slug>/memory/ (a MEMORY.md index plus one file per remembered fact, often 100+). Files are grouped by project in collapsible sections, searchable, and editable. Clickable index links in MEMORY.md jump to (scroll + highlight) the matching fact file.

🔒

Read-only by design

Plugins, MCP servers, hooks-in-settings, and live settings.json files are intentionally not writable from the dashboard — they race with the running claude CLI. Those tabs show explainer banners with copy-able claude plugin …, claude mcp …, and claude config … commands instead of edit buttons. Secret-like settings keys are redacted server-side.

🔄

Live refresh

Successful mutations broadcast cc_config_changed over the WebSocket so every open /cc-config tab refetches without polling. A filesystem watcher on CLAUDE_HOME (cc-watcher.js) detects external edits (e.g. you changed a skill in your editor) and triggers the same refresh path. The Backups modal lists every timestamped copy and builds mv restore commands.

Editable vs read-only

Surface	Dashboard	How to change (if read-only)
Skills · Subagents · Commands · Output styles	Create / edit / delete (`PUT`/`DELETE /file`)	—
`CLAUDE.md` · auto-memory `*.md`	Create / edit / delete (`PUT`/`DELETE /file`)	—
`keybindings.json`	Structured inline editor (`PUT /keybindings`)	—
Plugins · marketplaces	Read-only inventory	`claude plugin install\|uninstall …`
MCP servers	Read-only inventory	`claude mcp add\|remove …`
Hooks · settings.json	Read-only (redacted secrets)	`claude config` · `/config` · `$EDITOR`

API endpoints

GET /api/cc-config/overview Filesystem roots + aggregate counts across every surface

GET /api/cc-config/{skills,agents,commands,…} Per-tab discovery lists; most accept ?scope=user|project|all

GET /api/cc-config/file?path=… Single-file viewer (path-contained to allowed roots)

PUT /api/cc-config/file Create/overwrite a text artifact — backs up first, atomic write

DELETE /api/cc-config/file Backup-then-delete (skill dirs backed up whole)

PUT /api/cc-config/keybindings Structured overwrite of keybindings.json — preserves metadata, backs up first

GET /api/cc-config/backups Timestamped backup listing for the recovery modal

json — PUT /api/cc-config/keybindings body

{
  "groups": [
    {
      "context": "Global",
      "bindings": [
        { "key": "ctrl+t", "action": "toggleTodos" },
        { "key": "ctrl+n", "action": "newThing" }
      ]
    },
    {
      "context": "Chat",
      "bindings": [{ "key": "escape", "action": "cancel" }]
    }
  ]
}

ℹ

Backups live outside Claude Code's scan paths

Every mutation writes a timestamped copy under <root>/cc-config-backups/<type>/ (or <memory-dir>/.cc-config-backups/auto-memory/ for per-project memory files) before touching the original. Skill folders are copied whole so bundled assets survive in the backup. The UI's Backups modal shows every copy and prints a one-line mv restore command.

⚠

Do not edit settings.json from here

The live claude process rewrites settings.json and ~/.claude.json constantly (model changes, plugin toggles, MCP registration). A dashboard write would race and lose state — or worse, break hook ingestion. Use claude config set …, /config, or hand-edit only when no claude process is running.

◈ Integrations

Alerts & Webhooks

🔔 Settings · Alerts — the rules-based alerting engine, a live fired-alert feed, and outbound webhook channels (14 first-class providers + a generic JSON endpoint) managed together in one place

Turns the dashboard from passive viewing into active monitoring. A rules-based alerting engine evaluates the live event stream server-side, and fired alerts fan out to outbound webhook channels. Everything lives in one place — Settings → Alerts — behind a segmented control with three tabs: Rules (what triggers an alert), Channels (where alerts are delivered), and Activity (the live fired-alert feed with acknowledge / acknowledge-all).

📐

Rule types

Four condition types: event pattern (match event_type / tool_name / a summary substring, optionally requiring ≥ N matches within a rolling window — e.g. "5 errors in 2 minutes"), inactivity (an active session goes quiet for N minutes), status duration (an agent is stuck in working / waiting for N minutes), and token threshold (a session's cumulative tokens cross a limit). Each rule has a configurable cooldown that dedups repeat alerts per (rule, session, agent).

⚙️

Evaluation engine

Event-driven rules (event_pattern, token_threshold) run on every hook ingest — after the transaction commits and the response is sent, fully try/catch-guarded, so alerting can never slow or fail hook delivery. Time-based rules (inactivity, status_duration) run on an unref'd 60-second sweep. Enabled rules are cached in memory and invalidated on every edit. Fired alerts persist to alert_events and broadcast an alert_triggered WebSocket message.

🔗

14 first-class providers

Slack, Discord, Microsoft Teams, Google Chat, Mattermost, Rocket.Chat, Telegram, PagerDuty, Opsgenie, Splunk On-Call, Zapier, Make, n8n, and Pipedream — plus a generic JSON endpoint. A declarative provider registry describes each one's payload formatter, URL resolution, auth headers, and credential fields, so adding a provider is a single server-side entry that surfaces in the UI with no front-end change.

📡

Delivery engine

Each delivery POSTs with an AbortController timeout and bounded retry/backoff (retries transport errors, 429, and 5xx — never other 4xx), then records the attempt-chain in webhook_deliveries. A provider can also veto a 2xx whose body signals failure (Splunk On-Call returns 200 with result:"failure"). Delivery is detached and fail-safe — it never throws into, slows, or blocks the alert path.

🔒

Security

Target URLs are masked (host + last 4 chars), and secrets / credential fields (routing keys, API keys, bot tokens) plus custom-header values are redacted in every API response — the full URL and secrets are stored server-side and never leave it. Generic endpoints support optional HMAC-SHA256 body signing (X-Webhook-Signature + X-Webhook-Timestamp) so receivers can verify authenticity.

🧭

Guided setup

Every alert-rule field has a help tooltip — the event-type, tool-name, and summary-contains fields include example chips of real hook events and built-in tool names. Each webhook provider ships a collapsible step-by-step setup guide linking to the official docs. A one-click "Send test" probe fires a synthetic alert and reports the delivery result inline, and targets can be scoped to specific rules. Fully localized (en / zh / vi / ko).

Provider payloads

Provider(s)	Payload format	URL / credentials
Slack	Block Kit (header + section + context)	Incoming Webhook URL
Discord	Rich embed	Webhook URL
Microsoft Teams	Adaptive Card in a Workflows `message` envelope	Power Automate Workflows URL
Google Chat	Text message (basic markdown)	Space webhook URL
Mattermost · Rocket.Chat	Slack-style legacy attachments	Incoming Webhook URL
Telegram	Bot API `sendMessage` (HTML)	Bot token + chat ID (URL derived)
PagerDuty	Events API v2 trigger (with `dedup_key`)	Routing key (URL prefilled)
Opsgenie	Alert API	API key (GenieKey header) + region
Splunk On-Call	VictorOps REST	REST endpoint URL (key embedded)
Zapier · Make · n8n · Pipedream · generic	Stable `{ event, alert }` JSON envelope	Endpoint URL (+ optional HMAC & headers)

json — generic webhook payload

{
  "event": "alert.triggered",
  "source": "claude-code-agent-monitor",
  "sent_at": "2026-06-13T17:00:00.000Z",
  "alert": {
    "id": 42,
    "rule_name": "Too many errors",
    "rule_type": "event_pattern",
    "session_id": "sess-1",
    "agent_id": "agent-1",
    "message": "5 matching events in 2 min (threshold 5)",
    "details": { "observed_count": 5 },
    "triggered_at": "2026-06-13T17:00:00.000Z"
  }
}

ℹ

Additive & non-blocking by design

Two new tables — webhook_targets (config; survives Clear Data like alert rules) and webhook_deliveries (audit log) — with no changes to existing tables, response shapes, or WebSocket message types. Webhook dispatch is fire-and-forget off the alert path, so a slow or failing endpoint can never slow or break alert firing or hook ingestion.

⚠

Provider setup steps can drift

Microsoft retired classic Office 365 connectors in 2025, so Teams uses an Adaptive Card delivered via Power Automate Workflows. More broadly, provider setup UIs change often — the in-app guides say so and link to each provider's official docs. Always confirm against the source.

◈ Integrations

Update Notifier

Update modal showing commits-behind count and copy-to-clipboard command

⬆ Update Notifier — version comparison modal with one-click copy of the update command. No automatic self-restart; you stay in control of when upgrades happen

A detection-only subsystem that tells the user when the dashboard's git checkout is behind the canonical default branch. Branch- and fork-aware: if an upstream remote is configured (the standard convention for forks), it takes priority over origin; the chosen remote's master / main / HEAD is the comparison ref. The printed command adapts to the user's situation — git pull --ff-only only when their branch actually tracks the canonical ref, otherwise git fetch (with a fast-forward merge in the fork case). The server never pulls or restarts itself — the user runs the command in a terminal — so the mechanism cannot break dev sessions, pm2/systemd/launchd/Docker supervision, or leave orphaned processes.

The same status is available from any terminal: ccam update-check asks the server for the branch- and fork-aware comparison, prints the behind-by count and the copy-paste update command, and triggers the update_status WebSocket broadcast so any open dashboard tab refreshes at the same time.

flowchart LR S["Server start"]:::mid --> SCHED["Scheduler\nevery 5 min"]:::mid SCHED --> FETCH["git fetch\n120s timeout"]:::mid FETCH --> CMP["Compare HEAD\nvs upstream"]:::mid CMP -->|"behind"| CHANGED["Fingerprint\ncompared"]:::mid CMP -->|"current"| IDLE["Idle"]:::mid CHANGED -->|"changed"| WS["Broadcast\nupdate_status"]:::accent CHANGED -->|"same"| IDLE WS --> UI["Modal and\nSidebar badge"]:::ui CHECK["POST check"]:::mid --> FETCH STATUS["GET status"]:::mid --> CMP classDef mid fill:#1a1a2b,stroke:#2e2e48,color:#e2e2f0 classDef accent fill:#6366f1,stroke:#818cf8,color:#fff classDef ui fill:#10b981,stroke:#34d399,color:#fff

Detection pipeline from scheduler to UI

🛰

Non-Blocking Detection

A shell-less git fetch with a 120-second timeout, followed by a rev-list against the tracked upstream. Each call runs from server/lib/update-check.js and returns a structured payload — never throws — so a flaky remote can't stall the dashboard.

⏱

5-min Scheduler

update-scheduler.js polls every five minutes with .unref() timers so it never blocks shutdown, de-duplicates with a fingerprint over the status payload, and announces up-to-date → behind transitions in a framed stdout block. Disable entirely with DASHBOARD_UPDATE_CHECK=0.

📋

Situation-Aware Command

Each status payload carries a manual_command shaped for the user's actual situation: git pull --ff-only on a tracked canonical branch, git fetch && git merge --ff-only for forks where local tracks the wrong remote, and a plain git fetch on a feature branch where pulling would update the wrong branch. Install / build steps are appended only when the working tree is actually being rewritten.

🎯

Two UI Surfaces

A modal opens automatically when upstream is ahead; ESC or a backdrop click dismisses it. A persistent sidebar button stays in the footer — emerald when behind, amber when the last check errored — so users can always trigger a fresh check on demand.

🛡

Soft Failure Semantics

Non-git installs, no remotes configured, offline fetches, and unresolvable upstream refs all return tagged payloads instead of throwing. The sidebar badge turns amber on fetch errors and the modal stays suppressed until a successful check arrives — no spinners, no stuck state.

🧠

Dismissal Memory

Dismissal is keyed by the upstream SHA in localStorage, so closing the modal silences it only for that commit — a newer upstream commit re-opens it automatically. Clicking the sidebar button is an explicit intent signal and clears the stored dismissal before firing a fresh check.

API Surface

Endpoint	Purpose
`GET /api/updates/status`	Read-only check — runs `git fetch`, compares, returns the payload.
`POST /api/updates/check`	Same check, and broadcasts `update_status` over WebSocket so every connected client re-syncs at once.

ℹ

Detection-only by design

There is no POST /api/updates/apply and no in-process restart helper. A process cannot reliably replace itself without an external supervisor, and npm run dev, npm start, pm2, systemd, launchd, and Docker each need different restart logic. Detection-only keeps the mechanism portable across every supervisor and OS, and leaves the dashboard's lifecycle owned by whatever started it. The user runs the printed command in their own shell.

◈ Integrations

Connection Status

Connection details modal with throughput sparkline, top event types, and recent activity

◈ Connection Status — sidebar-launched details modal with WebSocket endpoint, connection uptime, 60-second throughput sparkline, top event-type breakdown, and recent activity list

The Live / Disconnected pill in the sidebar footer opens a small details panel about the dashboard's WebSocket transport. It surfaces the active ws:// endpoint, how long the current socket has been up, total events received, the top event types as a horizontal bar chart, a 60-second throughput sparkline, and the most recent 8 events as an activity list. Cumulative stats (totals, type breakdown, recent list) persist across reloads via localStorage under sidebar-connection-stats; the rolling sparkline and "connected since" timer are intentionally ephemeral since they only make sense relative to "now". A Reset button clears everything on demand.

Implementation note: per-event state lives in useRef buffers on the sidebar so the WS firehose never re-renders the navigation tree — the modal does its own one-second tick to sample the refs while open. Writes are throttled (single-flight timer, 2 s window) and flushed on pagehide / visibilitychange so the latest events aren't lost to the throttle window. The modal itself is portalled to document.body so the sidebar's stacking context can't trap it.

◈ Integrations

Prometheus & Grafana

Grafana CCAM — Overview dashboard with live fleet metrics

📊 Grafana · CCAM — Overview — default home dashboard after login (admin / admin on port 3000): fleet snapshot, database totals, breakdown charts, and rates from live /api/metrics scrapes — no sample or synthetic data

🔥 Prometheus · CCAM console — static HTML at /consoles/index.html (Prometheus 3.x compatible): live metric cards, session/token tables, and one-click Graph drill-down links that query the Prometheus HTTP API directly

📈 Prometheus · Graph — ad-hoc PromQL against scraped CCAM series; starter expressions ship in the CCAM console and monitoring/README.md, with derived rates in monitoring/prometheus/ccam-rules.yml

CCAM exposes a read-only Prometheus text-exposition endpoint at GET /api/metrics — sessions and agents by status, cumulative events and tokens, connected WebSocket clients, configured remote sources, process uptime/RSS, and build version — so the same dashboard you use for Claude Code monitoring can be scraped into your own observability stack. A turnkey stack lives in monitoring/: npm-managed binaries (no Homebrew/apt) or optional Docker Compose, with Grafana datasource and four dashboards auto-provisioned on first start.

Quick start (npm): run the dashboard on loopback, then npm run monitoring:install once and npm run monitoring:up. Open Grafana at http://localhost:3000 and Prometheus at http://localhost:9090. The pre-built CCAM console is at http://localhost:9090/consoles/index.html (also reachable from the Prometheus UI under Consoles). For an all-Docker path use npm run monitoring:docker:up or npm run docker:full:up for dashboard + Prometheus + Grafana together. Verify with npm run monitoring:verify — the ccam scrape target should read UP.

📊

Four Grafana boards

CCAM — Overview (default home), CCAM — Sessions & Agents, CCAM — Tokens & Events, and CCAM — Platform — all provisioned from monitoring/grafana/dashboards/ with PromQL against your live scrape. Credentials default to admin / admin via monitoring/grafana.defaults.env.

🔥

Prometheus console

Prometheus 3.x dropped the legacy console template libraries, so CCAM ships a static HTML console at monitoring/prometheus/consoles/index.html that fetches /api/v1/query directly — live cards, session tables, and Graph links without Go-template errors on /consoles/index.html.

📈

Recording rules

monitoring/prometheus/ccam-rules.yml derives convenient rates and aggregates from raw ccam_* series (e.g. rate(ccam_tokens_total[5m])) so Grafana panels and ad-hoc Graph queries stay fast without hand-rolling every expression.

🛡

Scrape & auth guard

Loopback scrapes (127.0.0.1:4820) work out of the box. When Prometheus runs in Docker and reaches the host via host.docker.internal, add that host to DASHBOARD_ALLOWED_HOSTS or the scrape returns 403 EBADHOST. If DASHBOARD_TOKEN is set, scrapers must send the same bearer token.

ℹ

Full reference

Commands, Docker vs npm paths, dashboard UIDs, starter PromQL, and troubleshooting (Grafana login reset, target DOWN) are documented in monitoring/README.md and the API reference entry for GET /api/metrics.

◈ Integrations

Internationalization (i18n)

The entire UI ships in four languages — English, 简体中文, Tiếng Việt, and 한국어 — built on i18next + react-i18next with i18next-browser-languagedetector. Coverage is end-to-end: every page, chart tooltip, Settings flow, Workflow narrative, Config Explorer tab, Run page, and the Alerts rule-help tooltips + webhook setup guides are translated. Switch languages from the sidebar (EN / 中文 / VI / 한국어) — the choice persists in localStorage.

🗂️

Namespaced resources

Translations are split into per-area JSON namespaces (common, nav, dashboard, sessions, analytics, workflows, settings, kanban, run, ccConfig, alerts, errors, updates) under client/src/i18n/locales/<lng>/. Components load only the namespaces they need via useTranslation("…").

🔎

Detection & fallback

Language is detected from localStorage (i18nextLng) then the browser's navigator setting, and the choice is cached back to localStorage. fallbackLng is English and nonExplicitSupportedLngs resolves regional tags (e.g. vi-VN → vi), so any unmapped key falls back gracefully rather than rendering a raw key.

🔢

Locale-aware formatting

Numbers, costs, dates, and relative times format against the active locale via a shared getCurrentLocale() helper, and plurals use i18next's _one / _other suffixes. Interpolated values ({{count}}, {{provider}}, …) keep sentences natural across languages.

🔤

Technical terms preserved

Domain terms that are proper nouns or code stay untranslated in every locale — Agent, Subagent, hook event names (PostToolUse), tool names (Bash), and webhook provider names (Slack, PagerDuty). Only the surrounding prose is localized, so instructions stay accurate.

ℹ

Adding a language

Copy client/src/i18n/locales/en/ to a new locale folder, translate the JSON values (leaving keys and technical terms intact), then register the bundle and add the tag to supportedLngs in client/src/i18n/index.ts. Missing keys fall back to English automatically, so even a partial translation ships cleanly.

◈ Components & UI

🐾 Tabby — Reactive Cat Companion

Tabby is a cute SVG cat companion pinned to the edges of every page of the dashboard. It is always present and turns the live session stream into glanceable, ambient feedback — calm when idle, alert when something needs attention, and celebratory when a run finishes. Tabby is built entirely on the existing eventBus WebSocket stream: no new backend, no API key, and no new dependencies. The component lives in client/src/components/Tabby/ and can be toggled on or off in Settings page.

📥 Tabby Companion — a cute SVG cat in the edges of every page, reacting in real time to the live session stream with eight distinct moods and animations, auto-surfacing speech bubbles for notable events, and serving as the gateway to a status panel and Ask box

Reactive Mascot — Eight Moods

Tabby derives one of eight moods from the live session WebSocket stream, each with its own animation. The eyes track your cursor, and the active mood drives a distinct motion cue.

Mood	When it appears	Animation
Idle	Nothing notable happening	Gentle tail flick
Watching	Sessions active, observing the stream	Ear perk, cursor-tracking eyes
Happy	A run completed successfully	Sparkle
Worried	Something looks off	Head bob
Stuck	A session appears blocked	Shake + alert `!`
Thinking	Work in progress	Head bob
Sleeping	Quiet for a while	Zzz
Disconnected	WebSocket offline	Calm, dimmed state

Auto-Surface Speech Bubbles

Notable events — session started or finished, errors, and run completed — automatically surface a speech bubble. Bubbles are throttled and coalesced so bursts of events never spam you, and they can be muted on demand. Everything reflects in real time over the existing eventBus WebSocket channel, with no polling and no extra services.

The ⌘B Panel

Click the cat — or press ⌘B / Ctrl+B — to open Tabby's panel (Esc closes it). The panel groups a live status line, quick actions, and an Ask box.

Live status line: N live · M errored · connection state, updated from cached data.
Quick actions: jump to Run Claude, Activity, Sessions, or errored sessions; mute bubbles; clear alerts.
Ask box: answers simple status questions locally from cached data (“what's running”, “any errors”, “status”).

Ask → Run Claude Handoff

The Ask box answers status questions instantly and offline from cached data. For anything beyond a simple status question, Tabby hands off to the existing Run Claude page (/run?prompt=...) to spawn a real Claude Code session — so there is never a separate model call, key, or service to manage.

Accessibility & Resilience

Fully keyboard operable: ⌘B / Ctrl+B to open, Esc to close.
Status and bubbles announce via aria-live for screen readers.
Respects prefers-reduced-motion to calm animations.
Degrades gracefully to a calm, dimmed disconnected state when offline.

◈ Operations

Deployment Modes

graph LR subgraph dev["Development — 2 processes"] D_CMD["npm run dev\n(concurrently)"] D_SRV["node --watch server/index.js\nport 4820 — auto-restart"] D_VITE["vite dev server\nport 5173 — HMR"] D_BROWSER["Browser"] D_CMD --> D_SRV D_CMD --> D_VITE D_BROWSER --> D_VITE D_VITE -->|"proxy /api + /ws"| D_SRV end subgraph prod["Production — 1 process"] P_BUILD["npm run build\n(tsc + vite build)"] P_DIST["client/dist/\nstatic files"] P_START["npm start"] P_SRV["node server/index.js\nport 4820 — serves dist"] P_BROWSER["Browser"] P_BUILD --> P_DIST P_START --> P_SRV P_BROWSER --> P_SRV end

Development vs production deployment topology

Aspect	Development	Production
Processes	2 (Express + Vite)	1 (Express only)
Client URL	`http://localhost:5173`	`http://localhost:4820`
API proxy	Vite proxies `/api` + `/ws` to :4820	Same origin, no proxy
File watching	`node --watch` + Vite HMR	None
Source maps	Inline	External files

ℹ

A third way to run: the Desktop App (macOS & Windows)

Beyond development and standalone production, the dashboard also ships as a native desktop app — a macOS .app and a Windows .exe — that embeds the same production server in-process, no terminal required. See the Desktop App (macOS & Windows) section for download, build, and install instructions.

Container Runtime (Docker / Podman)

The production image is OCI-compatible and works with both Docker and Podman. The server listens on 4820, reads legacy Claude history from a read-only mount, and persists SQLite data under /app/data.

graph LR subgraph build["Multi-stage image build"] C1["server-deps\nnode:22-alpine\nnpm ci --omit=dev"] C2["client-build\nvite build"] C3["runtime image\nnode server/index.js"] C1 --> C3 C2 --> C3 end subgraph runtime["Container runtime"] V1["~/.claude\nmounted read-only"] V2["agent-monitor-data\npersistent volume"] PORT["localhost:4820"] C3 --> PORT V1 --> C3 V2 --> C3 end

Container image build and runtime mounts

bash

# Docker Compose
docker compose up -d --build

# Podman Compose
CLAUDE_HOME="$HOME/.claude" podman compose up -d --build

# Stop the stack
docker compose down
# or
podman compose down

Mount	Purpose
`~/.claude:/root/.claude:ro`	Read historical Claude session files for import without modifying them
`agent-monitor-data:/app/data`	Persist the SQLite database across rebuilds and container restarts

⚠

Hooks still run on the host

Claude Code fires hooks from the host machine, not from inside the container. After the container is healthy on http://localhost:4820, run npm run install-hooks on the host so hook events post back to the containerized server.

◈ Operations

Docker / Podman

A multi-stage Dockerfile and docker-compose.yml are included. Both Docker and Podman are fully supported — the image is OCI-compliant.

Quick Start

# Docker Compose
docker compose up -d --build

# Podman Compose
CLAUDE_HOME="$HOME/.claude" podman compose up -d --build

Plain Docker / Podman (no Compose)

# Build the image
docker build -t agent-monitor .
# — or —
podman build -t agent-monitor .

# Run the container
docker run -d --name agent-monitor \
  -p 127.0.0.1:4820:4820 \
  -v "$HOME/.claude:/root/.claude:ro" \
  -v agent-monitor-data:/app/data \
  agent-monitor

Volume Mounts

Mount	Purpose
`~/.claude:/root/.claude:ro`	Read-only access to legacy session history for automatic import on startup
`agent-monitor-data:/app/data`	Persists the SQLite database across container restarts

ℹ

Local-only by default

The image bakes DASHBOARD_HOST=0.0.0.0 (the server binds loopback by default, but a container's loopback is a separate namespace the published port cannot reach) and DASHBOARD_DATA_DIR=/app/data (the read-only ~/.claude mount cannot hold the SQLite data dir). The trust boundary is the host port publish: the examples use -p 127.0.0.1:4820:4820, so the dashboard is local-only. To expose it on a LAN, publish on 0.0.0.0 (-p 4820:4820) and set DASHBOARD_TOKEN.

Multi-Stage Build

The Dockerfile uses three stages to minimize the final image size:

Stage	Purpose
server-deps	Installs production `node_modules` on `node:22-alpine`. `better-sqlite3` is optional — if prebuilds are unavailable, the server falls back to built-in `node:sqlite`
client-build	Runs `npm ci` + `vite build` to produce optimized static assets
runtime	Clean `node:22-alpine` with only `node_modules`, server code, and `client/dist`

⚠

Hook note

Claude Code hooks run on the host, not inside the container. The containerized server receives hook events via HTTP on localhost:4820. Run npm run install-hooks on the host after starting the container.

◈ Operations

Performance Characteristics

<200ms

Server startup

<50ms

Hook-to-broadcast latency

200 KB

JS bundle (63 KB gzipped)

50k/s

SQLite inserts/sec (WAL)

Metric	Value	Notes
Server startup	< 200ms	SQLite opens instantly; schema migration is idempotent
Hook latency	< 50ms	Transaction + broadcast, no async I/O beyond SQLite
Client JS bundle	200 KB / 63 KB gzip	CSS: 17 KB / 4 KB gzip
WebSocket latency	< 5ms	Local loopback, JSON serialization only
SQLite write throughput	~50,000 inserts/sec	WAL mode on SSD; far exceeds any hook event rate
Max events before slowdown	~1M rows	Pagination prevents full-table scans
Server memory	~30 MB	SQLite in-process, no ORM overhead
Client memory	~15 MB	React + Tailwind, minimal runtime deps

◈ Operations

Security Considerations

Area	Approach
SQL injection	All queries use prepared statements with parameterized values — no string interpolation
Request size	Express JSON body parser limited to 1MB
Input validation	Required fields checked before DB operations; CHECK constraints on status enums
Hook safety	Hook handler always exits 0; 5s max lifetime; uses `127.0.0.1` not external hosts
CORS	Restricted to loopback origins, so cross-origin pages can't read responses; no-Origin clients like curl still work
Authentication	Off by default since the loopback bind is the trust boundary; set `DASHBOARD_TOKEN` to require a bearer token on every `/api/*` request and the WebSocket when exposing on a LAN.
Secrets	No API keys, tokens, or credentials stored or transmitted anywhere
Dependency surface	5 runtime server deps, 6 runtime client deps (includes D3.js for Workflows) — minimal attack surface

◈ Operations

Troubleshooting

No sessions appearing after starting Claude Code

Check 1 — Is the server running?

bash

curl http://localhost:4820/api/health
# Expected: {"status":"ok","timestamp":"..."}

Check 2 — Are hooks installed?

bash

# Open ~/.claude/settings.json and confirm it contains "hook-handler.js"
# If not, re-run:
npm run install-hooks

Check 3 — Start a new Claude Code session

Hooks only apply to sessions started after installation. Restart Claude Code after starting the dashboard.

Check 4 — Is Node.js in PATH?

On some systems the shell environment when Claude Code fires hooks may not include the full PATH. Test with node --version. If not found, use the absolute path to node in the hook command.

Common Issues

Problem	Solution
`better-sqlite3` errors during install	This is non-fatal — `better-sqlite3` is an optional dependency. On Node 22+ the server automatically falls back to built-in `node:sqlite`. On older Node versions, install Python 3 + C++ build tools, then run `npm rebuild better-sqlite3`. For the desktop app, the `desktop:install` preflight prints copy-pasteable per-OS setup guidance (incl. a no-toolchain alternative) when the native build fails.
Dashboard shows "Disconnected"	Server is not running. Start it with `npm run dev`. Client auto-reconnects every 2s.
Events Today shows 0	Ensure you are on the latest version (timezone bug was fixed). Restart the server.
Port 4820 already in use	Run `DASHBOARD_PORT=4821 npm run dev`, update Vite proxy in `client/vite.config.ts`, and re-run `npm run install-hooks`.
Stale seed data shown	Run `npm run clear-data` to wipe all rows, then restart.
Hooks show validation error about matcher	Ensure you're on the latest version — the hook format was updated to use `matcher: "*"` string (not object).
"SQLite backend not available" on startup	Neither `better-sqlite3` nor `node:sqlite` could load. Upgrade to Node.js 22+ (recommended), or install Python 3 + C++ build tools and run `npm rebuild better-sqlite3`.
Docker container runs but no sessions appear	Hooks run on the host, not inside the container. Run `npm run install-hooks` on the host after the container starts. Verify hooks in `~/.claude/settings.json` point to `localhost:4820`.

◈ Reference

Technology Choices

Technology	Why This Over Alternatives
SQLite (better-sqlite3 / node:sqlite)	Zero-config, embedded, no server process. WAL mode gives concurrent reads. Synchronous API is simpler than async for this use case. `better-sqlite3` is preferred when prebuilds are available; falls back to Node.js built-in `node:sqlite` on Node 22+ when the native module cannot be compiled.
Express	Battle-tested, minimal, well-understood. Fastify would be overkill; raw `http` module would require too much boilerplate for routing.
ws	Fastest, most lightweight WebSocket library for Node. No Socket.IO overhead needed — we only push typed JSON messages one-way.
React 18	Stable, widely known, strong TypeScript support. No Server Components or RSC needed for a client-rendered local SPA.
Vite 6	Fast builds, native ESM, excellent dev experience. Proxy config handles the dev server split cleanly with no ejection.
Tailwind CSS	Utility-first approach keeps styles colocated with markup. No CSS module boilerplate. Custom dark theme config for the dark UI.
React Router 6	Standard routing for React SPAs. Layout routes with `<Outlet>` give clean shell composition without prop drilling.
Lucide React	Tree-shakeable icon library — only imports what's used (~20 icons). No heavy icon font.
TypeScript Strict	Catches null/undefined bugs at compile time. `noUncheckedIndexedAccess` prevents array bounds issues in analytics aggregations.
D3.js + d3-sankey	Industry-standard data visualization library. Powers the Workflows page's 11 interactive sections — DAG layouts, Sankey diagrams, force-directed graphs, bubble charts, and swim-lane timelines. No wrapper libraries needed; direct SVG rendering keeps bundle impact minimal.
Python (statusline)	Available on virtually all systems. Handles ANSI and JSON natively with stdlib only. No install step required.