24 KiB
Hermes Web UI: Full Parity Roadmap
Goal: Full 1:1 parity with the Hermes CLI experience via a clean dark web UI. Everything you can do from the CLI terminal, you can do from this UI.
Last updated: v0.50.44 (April 14, 2026) — 1195 tests, 1195 passing Local delta: enabling password from Settings keeps the current browser signed in; the former Assistant Reply Language enhancement has been removed. Tests: 1059 total (1059 passing, 0 failures) Source: /
Sprint History (Completed)
Current Architecture Status
| Layer | Location | Status |
|---|---|---|
| Python server | /server.py (~165 lines) + api/ modules (~5000 lines) | Thin shell + QuietHTTPServer + auth middleware + business logic in api/ |
| HTML template | /static/index.html (~600 lines) | Served from disk |
| CSS | /static/style.css (~1050 lines) | Served from disk, incl. mobile responsive, KaTeX, table styles |
| JavaScript | /static/{ui,workspace,sessions,messages,panels,boot,commands,icons,i18n,login}.js | 10 modules, ~7100 lines total |
| Docker | Dockerfile, docker-compose.yml, .dockerignore | python:3.12-slim, multi-arch (amd64+arm64) |
| CI/CD | .github/workflows/release.yml | Auto-release + GHCR publish on tag push |
| Runtime state | ~/.hermes/webui-mvp/sessions/ | Session JSON files |
| Test server | Port 8788 (conftest.py), port 8789 (browser sanity) | Isolated, wiped per run |
| Production server | Port 8787 | SSH tunnel from Mac |
Feature Parity Checklist
Chat and Agent
- Send messages, get SSE-streaming responses
- Switch models per session (10 models, grouped by provider)
- Composer-scoped model picker in footer (moved from sidebar to align with per-conversation model selection)
- Multi-provider API support: use any Hermes agent API provider (OpenAI, Anthropic, Google, etc.) directly, not just OpenRouter (Sprint 11)
- Custom endpoint model discovery: auto-detect models from Ollama, LM Studio, and other local LLM servers via base_url (PR #18)
- Upload files to workspace (drag-drop, click, clipboard paste)
- File tray with remove button
- Tool progress shown inline in the conversation via live tool cards
- Approval card for dangerous commands (Allow once/session/always, Deny)
- Approval polling + SSE-pushed approval events
- INFLIGHT guard: switch sessions mid-request without losing response
- Session restores from localStorage on page load
- Reconnect banner if page reloaded mid-stream
- Copy message to clipboard (hover icon on each bubble)
- Edit last user message and regenerate
- Branch/fork conversation (Wave 3)
- Token/cost estimate per message (Sprint 23)
Tool Visibility
- Tool progress in live tool cards (kept out of the composer/footer chrome)
- Approval card with all 4 choices
- Tool call cards inline (collapsed, show name/args/result)
Workspace / Files
- Workspace panel defaults closed and opens only for active browsing or preview
- Browse workspace directory tree with type icons
- Preview text/code files (read-only)
- Preview markdown files (rendered, tables supported)
- Preview image files (PNG, JPG, GIF, SVG, WEBP inline)
- Edit files inline (Edit button, Enter to save, Escape to cancel)
- Create new file (+ button in panel header)
- Delete file (hover trash, confirmation modal)
- File name truncation with tooltip for long names
- Right panel resizable (drag inner edge)
- Syntax highlighted code preview (Prism.js)
- Rename file (Sprint 14)
- Create folder (Sprint 14)
- Shared app modal for confirm/input flows (Sprint 33)
Sessions
- Create session (+ button or Cmd/Ctrl+K)
- Load session (click in sidebar)
- Delete session (hover trash, toast, correct fallback)
- Auto-title from first user message
- Rename session title (double-click in sidebar, Enter saves, Escape cancels)
- Filter/search sessions by title (live filter box)
- Date group headers (Today / Yesterday / Earlier)
- Download session as Markdown transcript
- Export session as JSON (full messages + metadata)
- Session inherits last-used workspace on creation
- Session content search (search message text across sessions)
- Session tags / labels (Sprint 14)
- Archive sessions (Sprint 14)
- Clear conversation (wipe messages, keep session) (Wave 3)
- Import session from JSON (Sprint 12)
- Pin/star sessions to top of list (Sprint 12)
- Duplicate session (Sprint 13)
- Session projects / folders (Sprint 15)
Workspace Management
- Add workspace with path validation (must be existing directory)
- Remove workspace
- Rename workspace display name
- Quick-switch workspace from topbar dropdown
- Sidebar live workspace display (name + path, updates in real time)
- New sessions inherit last used workspace
- Workspace list persists to workspaces.json
- Workspace reorder (drag) (Wave 2)
Scheduled Tasks (Cron)
- View all cron jobs (Tasks sidebar tab)
- View last run output per job (auto-loaded on expand)
- Expand job to see prompt, schedule, last output
- Run job manually (Run now button)
- Pause / Resume job
- Create cron job from UI (+ New job form with name, schedule, prompt, delivery)
- Edit existing cron job
- Delete cron job
- View full cron run history (expandable per job)
- Skill picker in cron create form (Sprint 23)
Skills
- List all skills grouped by category (Skills sidebar tab)
- Search/filter skills by name, description, category
- View full SKILL.md content in right preview panel
- Create skill
- Edit skill
- Delete skill
- View skill linked files (Sprint 23)
Memory
- View personal notes (MEMORY.md) rendered as markdown (Memory tab)
- View user profile (USER.md) rendered as markdown (Memory tab)
- Last-modified timestamp on each section
- Add/edit memory entry inline
Configuration
- Settings panel (default model, default workspace) (Sprint 12)
- Send key preference (Enter or Ctrl+Enter) (Sprint 17)
- Password authentication (Sprint 19)
- Enable/disable toolsets per session (deferred)
Notifications
- Cron job completion alerts (Sprint 13)
- Background agent error alerts (Sprint 13)
Workspace
- Breadcrumb navigation in subdirectories (Sprint 17)
- Workspace tree view with expand/collapse (Sprint 18, Issue #22)
- File preview auto-close on directory navigation (Sprint 18)
Slash Commands
- Command registry + autocomplete dropdown (Sprint 17)
- Built-in: /help, /clear, /model, /workspace, /new (Sprint 17)
Security
- Password auth with signed cookies (Sprint 19, Issue #23)
- Security headers (X-Content-Type-Options, X-Frame-Options) (Sprint 19)
- POST body size limit (20MB) (Sprint 19)
Thinking / Reasoning
- Collapsible thinking cards for extended-thinking models (Sprint 18)
Voice
- Voice input via Web Speech API (Sprint 20)
Mobile
- Mobile responsive layout — hamburger sidebar, sidebar tabs on phones, files slide-over (Sprint 21 + later mobile nav simplification)
Profiles
- Multi-profile support — create, switch, delete profiles (Sprint 22, Issue #28)
Advanced / Future
- Subagent session tree -- show subagent hierarchy in sidebar with expand/collapse (PR #75)
- Specialized tool card renderers -- diff viewer, terminal output, todo checklist views (PR #75)
- Streaming performance -- rAF-throttled token rendering (Sprint 24, PR #81)
- Workspace git detection -- branch name and dirty status badge (Sprint 24, PR #82)
- Collapsible date groups -- click group headers to collapse (Sprint 24, PR #80)
- Context usage indicator -- compact circular badge in composer footer (Sprint 24, PR #83; refreshed April 10, 2026)
- LLM-generated session titles -- auto-title via small model instead of first-message substring (PR #75)
- Workspace git detection -- show branch name, dirty status in workspace header (PR #75)
- Clarify dialog -- agent can ask clarifying questions that block until user responds (PR #75)
- Gateway approval polling -- support blocking approvals from messaging gateway (PR #75)
- Unified session storage -- SessionDB shared between webui and CLI (PR #75)
- TTS playback of responses (deferred)
- Background task cancel (composer footer stop button)
- Code execution cell (deferred)
- Desktop application (Sprint 25, PLANNED)
- Pluggable UI themes -- Dark, Light, Slate, Solarized, Monokai, Nord (Sprint 26, v0.34)
- Extended slash command / skill integration (deferred)
- Virtual scroll for large lists (deferred)
Sprint 7: Wave 2 Core -- Cron/Skill/Memory CRUD + Session Content Search (COMPLETED)
Theme: "Wave 2 Core -- Cron/Skill/Memory CRUD + Session Content Search"
Track A: Bug Fixes
| Item | Description |
|---|---|
| Activity bar sizing | Activity bar sometimes overlaps first message on short viewports |
| Model dropdown sync | Model chip in topbar sometimes shows stale model after session switch |
| Cron output truncation | Long cron output in the tasks panel overflows its container |
Track B: Features
| Feature | What | Value |
|---|---|---|
| Session content search | Search message text across all sessions, not just titles. GET /api/sessions/search already does title search; extend to message content with a configurable depth limit | High: the single most-requested nav feature after rename |
| Cron edit + delete | Edit an existing cron job (name, schedule, prompt, delivery) inline in the tasks panel. Delete with confirm. POST /api/crons/update and /api/crons/delete | High: closes the cron CRUD gap (create was Sprint 6) |
| Skill create + edit | A "New skill" form in the Skills panel. Name, category, SKILL.md content in a textarea editor. Save calls POST /api/skills/save (writes to ~/.hermes/skills/). Edit opens existing skill in the same editor | High: biggest remaining CLI gap after cron |
Track C: Architecture
| Item | What |
|---|---|
| Phase E: app.js module split (start) | Split app.js (1332 lines) into logical modules: sessions.js, chat.js, workspace.js, panels.js, ui.js. Serve via ES module imports in index.html. This is Phase E completion. |
| Health endpoint improvement | Add active_streams, uptime_seconds to /health response (Phase G) |
| Git init | git init , first commit, push to private GitHub repo |
Tests
- ~20 new pytest tests (cron update/delete, skill save, session content search)
- TESTING.md: Sections 29-31 (cron edit, skill edit, session search)
- Estimated total after Sprint 7: ~126
Wave 2: Full CRUD and Interaction Parity
Status: In progress. Sprint 6 completed cron create and workspace management. Remaining Wave 2 items targeted for Sprints 7-8.
Sprint 2.0: Workspace Management (COMPLETE Sprint 5+6)
All workspace features delivered: add/validate/remove/rename workspaces, topbar quick-switch, sidebar live display, new sessions inherit last workspace. See Sprint 5 completed section.
Sprint 2.1: Cron Job Management (Partial -- Sprint 7 for remaining)
- View all jobs (Sprint 3)
- Run / pause / resume (Sprint 3)
- Create job from UI (Sprint 6)
- Edit job
- Delete job
- Full cron run history
Sprint 2.2: Skill Management (Partial -- Sprint 7 for remaining)
- List all skills with categories (Sprint 3)
- View SKILL.md content (Sprint 3)
- Create skill
- Edit skill
- Delete skill
Sprint 2.3: Memory Write (Sprint 7)
- View notes + profile (Sprint 3)
- Edit notes inline
Sprint 2.4: Todo Management (Wave 2)
- View current todo list (sidebar Todo panel, parsed from session history)
Sprint 2.5: Session Content Search (Sprint 7)
- Session title search (Sprint 4)
- Message content search across sessions
Sprint 2.6: Session Rename (COMPLETE Sprint 4)
Double-click any session title in the left sidebar to edit inline. Enter saves, Escape cancels. Topbar updates immediately.
Completed Waves (Summary)
| Wave | Theme | Key Deliverables |
|---|---|---|
| Wave 2 | Full CRUD + Interaction | Cron/skill/memory CRUD, session search, workspace management, session rename |
| Wave 3 | Power Features | Tool call cards, multi-model dropdown, resizable panels, file actions, conversation controls |
| Wave 4 | Settings + Notifications | Settings panel, cron alerts, background error banner |
| Wave 5 | Session Continuity | Session tags, archive, projects/folders |
| Wave 6 | Agentic Features | Background task cancel, voice input (Web Speech API) |
| Wave 7 | Production Hardening | Password auth, security headers, mobile responsive, Docker + GHCR CI |
User Requested Features
Community-requested enhancements tracked from GitHub issues. All shipped.
| Feature | Issue | Shipped | Sprint |
|---|---|---|---|
| Workspace tree view | #22 | Done | Sprint 18 |
| Docker container + GHCR images | #7 | Done | Sprint 21 + v0.28.1 CI |
| Authentication | #23 | Done | Sprint 19 |
| Send key / personalization | #26 | Done | Sprint 17 |
| Multi-profile support | #28 | Done | Sprint 22 |
| Mobile responsive UI | #21 | Done | Sprint 21 |
| Profile creation in Docker | #44 | Done | v0.27 |