webui

Sabo/webui

Author	SHA1	Message	Date
nesquena-hermes	a064542df9	release: v0.39.0 — security hardening, 12 fixes (#171 ) * Security: harden auth, CSRF, SSRF, XSS, and env race conditions Twelve fixes from a full security audit: CRITICAL - Add CSRF Origin/Referer validation on all POST endpoints (prevents cross-origin abuse of self-update, settings, file ops) HIGH - Unify password hashing: config.py now uses PBKDF2 (600k iters) instead of single-iteration SHA-256 - Add per-IP rate limiting on login (5 attempts/60s, 429 on excess) MEDIUM - Validate session IDs as hex-only before filesystem operations (prevents path traversal via crafted session ID) - SSRF: resolve DNS before private-IP check in model fetching (prevents DNS rebinding to internal services) - Warn loudly when binding non-loopback without password set - SSE env var mutations: wrap sync chat + streaming restore in _ENV_LOCK - Force Content-Disposition:attachment for HTML/XHTML/SVG uploads (prevents stored XSS via uploaded files) LOW - Extend HMAC session signature from 64 to 128 bits - Add resolve()+relative_to() check on skills path construction - Set Secure flag on session cookie when connection is HTTPS - Sanitize exception messages to strip filesystem paths No breaking changes. All fixes are backward-compatible. * fix: use getattr for Secure cookie SSL detection handler.request.getpeercert raises AttributeError on plain sockets (non-SSL). Use getattr(..., None) to safely check for SSL. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * tests: add sprint 29 security hardening coverage (PR #171) 33 tests covering all 12 security fixes: - CSRF origin/referer validation - Login rate limiting (5 attempts/60s) - Session ID hex validation (path traversal prevention) - Error path sanitization (_sanitize_error) - Secure cookie getattr safety - HMAC signature length (64->128 bit) - Skills path traversal prevention - Content-Disposition for HTML/SVG/XHTML - PBKDF2 password hashing verification - Non-loopback startup warning - SSRF DNS guard code presence - _ENV_LOCK export from streaming module * release: v0.39.0 — security hardening, 12 fixes (#171) --------- Co-authored-by: betamod <matthew.sloly@gmail.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 22:26:03 -07:00
Nathan Esquenazi	8aa1c9684d	fix: sync message_count to state.db for /insights (#163 ) (#164 ) * fix: sync message_count to state.db for /insights (#163) sync_session_usage() didn't write message_count to state.db, so /insights showed 0 messages for all WebUI sessions even with sync_to_insights enabled. Added message_count parameter to sync_session_usage() and pass len(s.messages) from both the streaming and non-streaming chat paths. Fixes #163 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use callable pattern for _execute_write in sync_session_usage --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 22:56:27 -07:00
nesquena-hermes	5a52259fd7	fix: tool cards actually render on page reload from session data (#140 ) (#153 ) Co-authored-by: Nathan Esquenazi <nesquena@gmail.com>	2026-04-06 14:23:26 -07:00
Nathan Esquenazi	2442fca5e5	fix: personalities from config.yaml + ephemeral_system_prompt (#139 ) (#148 ) The previous implementation read SOUL.md files from a filesystem directory. The Hermes agent uses config.yaml agent.personalities section with string or dict format (system_prompt, tone, style), resolved via _resolve_personality_prompt() and passed to AIAgent via ephemeral_system_prompt. Changes: - /api/personalities: reads from config.yaml agent.personalities, not filesystem SOUL.md directories. Calls reload_config() to pick up config changes without restart. - /api/personality/set: resolves prompt from config.yaml using the same logic as hermes-agent cli.py (string or dict with system_prompt/tone/style) - streaming.py: passes personality via agent.ephemeral_system_prompt (agent's own mechanism) instead of prepending to system_message - Removed unused 're' import from streaming.py - Updated tests to match config-based approach Fixes #139 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 14:10:30 -07:00
Nathan Esquenazi	442b0d872a	fix: multi-provider model routing via @provider: hint (#138 ) (#146 ) The previous fix (#142) prefixed non-default provider models with 'provider/model' which then hit the cross-provider guard and routed to OpenRouter — worse than before for users without an OpenRouter key. New approach: non-default provider models use '@provider:model' format (e.g. @minimax:MiniMax-M2.7). resolve_model_provider() parses this hint and returns (bare_model, provider, None). streaming.py and routes.py then pass the resolved provider to resolve_runtime_provider(requested=provider) which gets the correct per-provider API key and base_url from hermes-agent. This uses the agent's own credential resolution instead of reinventing routing logic in the webui. Fixes #138 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 14:10:26 -07:00
Nathan Esquenazi	58eb6e7fd5	feat: /personality slash command with backend integration (#143 ) * feat: /personality slash command with backend integration Add /personality command to switch the agent's system prompt personality. Hermes CLI supports personalities stored at ~/.hermes/personalities/<name>/SOUL.md. Backend: - GET /api/personalities: lists available personalities from the active profile's personalities directory (reads first line of SOUL.md for desc) - POST /api/personality/set: sets active personality on the session, reads and validates the SOUL.md file exists, returns the prompt text - streaming.py: injects personality prompt (SOUL.md content) as prefix to the system_message when run_conversation is called Frontend (commands.js): - /personality with no args: lists available personalities as a local message - /personality <name>: sets the personality with a toast confirmation - /personality none\|default\|clear: removes the active personality Session model: new 'personality' field (backward-compatible, defaults to None) Closes #139 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: path traversal in personality name + case sensitivity Security: personality name is now validated with regex ^[a-zA-Z0-9][a-zA-Z0-9_-]{0,63}$ in both routes.py (POST /api/personality/set) and streaming.py (system prompt injection). Defense-in-depth: resolve().relative_to() check ensures the path stays inside the personalities directory even if regex is bypassed. Also: removed toLowerCase() from frontend command handler so personality names are case-preserved (filesystem may be case-sensitive). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: /personality command — hardened, compact() fix, tests Fixes on top of original PR: - compact() was missing 'personality' field — UI couldn't know active personality after page load. Added to Session.compact(). - GET /api/personalities: add symlink guard (is_symlink() skip) and resolve() check — prevents reading SOUL.md from symlink targets outside personalities dir. - POST /api/personality/set: require() only checks session_id (not name) so clearing with name='' works correctly instead of 400. - POST /api/personality/set: add MAX_FILE_BYTES size cap on SOUL.md to prevent unbounded context window consumption. - POST /api/personality/set: return personality:null (not '') when cleared. - streaming.py: same MAX_FILE_BYTES guard before prepending to system msg. Added tests/test_sprint28.py: 11 tests for API round-trip, listing, symlink guard, path traversal rejection, clear, size cap, persistence. Tests pass in isolation; full-suite run has a test-isolation interaction with shared server state across sprint tests (tracked as follow-up). --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 11:16:37 -07:00
Nathan Esquenazi	39066bc614	security: fix env race, signing key, upload traversal, password hash (#106 ) * security: fix four audit findings -- env race, signing key, upload traversal, password hash 1. Race condition in os.environ (HIGH): Per-session _agent_lock didn't prevent cross-session env writes from interleaving. Added global _ENV_LOCK in streaming.py that serializes the entire env save/restore block across all sessions. 2. Predictable signing key (MEDIUM): sha256(STATE_DIR) was deterministic. Now generates a random 32-byte key on first startup and persists it to STATE_DIR/.signing_key (chmod 600). Existing sessions invalidated on first restart (acceptable for a security fix). 3. Upload path traversal (MEDIUM): Filename '..' survived the regex sanitization (dots are allowed chars). Added explicit rejection of dot-only names and safe_resolve_ws() check to verify the resolved path stays within the workspace. 4. Weak password hashing (MEDIUM): Replaced bare SHA-256 with PBKDF2- SHA256 (600k iterations per OWASP). Uses stdlib hashlib.pbkdf2_hmac, no new dependencies. Note: existing passwords must be re-set after this change (hash format changed). Closes #106 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use random signing key as PBKDF2 salt (replaces predictable STATE_DIR salt) --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 22:25:08 -07:00
Nathan Esquenazi	bb595afde9	feat: opt-in state.db sync for /insights visibility (#92 ) WebUI sessions were invisible to 'hermes /insights' because the WebUI bypasses the gateway and calls AIAgent.run_conversation() directly, never writing to state.db. New 'Sync usage to /insights' setting (default: off) that mirrors WebUI session metadata (tokens, cost, model, title) into state.db after each turn. Uses absolute token counts to avoid double-counting. Components: - api/state_sync.py: bridge module with sync_session_start() and sync_session_usage(). Uses ensure_session() (idempotent) and update_token_counts(absolute=True). All wrapped in try/except. - api/config.py: new 'sync_to_insights' boolean setting - api/streaming.py: calls sync_session_usage() after s.save() - api/routes.py: same for the non-streaming chat path - Settings UI: checkbox toggle with description Default off because: - Writing to state.db while CLI/gateway also writes could cause WAL lock contention on busy systems - Some users may not want WebUI sessions in /insights stats Closes #92 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 20:07:05 -07:00
Nathan Esquenazi	2797e5189b	feat: context window usage indicator with real agent data The context indicator in the composer footer now shows real data from the agent's context compressor instead of hardcoded estimates: - last_prompt_tokens / context_length (e.g. '12.4k / 200k (6%)') - Bar color: blue <50%, yellow 50-75%, red >75% - Hover tooltip shows exact numbers + compression threshold - Cost appended when available Backend: streaming.py now reads context_length, threshold_tokens, and last_prompt_tokens from agent.context_compressor after run_conversation() and includes them in the usage dict sent with the 'done' SSE event. This matches the CLI's context window display (the bar that shows current context vs total window). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 18:50:17 -07:00
Nathan Esquenazi	429a0ea228	feat: handle auto-compaction side effects + /compact command The agent's run_conversation() already triggers context compression internally, but the WebUI was unaware of the side effects: 1. Session ID rotation: compression creates a new session_id inside the agent. The WebUI kept writing to the old session file, causing silent data loss. Fix: detect agent.session_id mismatch after run_conversation(), rename the session file, and update in-memory caches. 2. No user notification: compression was invisible. Fix: emit a 'compressed' SSE event when compression is detected. Frontend shows a system message and toast. 3. No manual control: Fix: add /compact slash command that sends a message to the agent requesting context compression. Shows in the autocomplete dropdown. Detection works two ways: - agent.session_id != original session_id (ID rotation) - agent.context_compressor.compression_count > 0 (compressor state) Closes #90 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 18:46:34 -07:00
Nathan Esquenazi	e2d24f57ac	Merge pull request #78 from carlytwozero/fix/pass-api-key-to-aiagent fix: pass api_key to AIAgent for non-Anthropic /anthropic providers	2026-04-04 13:05:29 -07:00
Carly 2.0	cc6709c9d5	fix: pass api_key to AIAgent for non-Anthropic /anthropic providers When the user's config uses a non-Anthropic provider with an Anthropic-compatible endpoint (e.g. MiniMax at https://api.minimax.io/anthropic), chat in the WebUI fails silently with APIConnectionError on every request, while the hermes CLI and messaging gateway work fine with the same config. Root cause: both api/routes.py and api/streaming.py constructed AIAgent using only (model, provider, base_url) from resolve_model_provider() and never passed api_key. When the base URL ends in /anthropic, AIAgent uses the anthropic_messages adapter, but only falls back to ANTHROPIC_TOKEN when provider == "anthropic" (a safety check to avoid leaking Anthropic credentials to third parties). For MiniMax and similar providers the effective key becomes "", and the auth failure surfaces as a generic "Connection error" after three retries. The CLI and gateway resolve the key via hermes_cli.runtime_provider.resolve_runtime_provider(), which reads MINIMAX_API_KEY (and similar) from ~/.hermes/.env. This patch does the same before creating the AIAgent in both chat paths. Fixes #77	2026-04-04 15:03:02 -05:00
Nathan Esquenazi	90b5ad8d99	fix: strip webui metadata from messages before sending to LLM API (#67 ) The webui stores display-only fields on messages (attachments, timestamp, _ts) for UI rendering. These leaked into the conversation_history passed to AIAgent.run_conversation(). Most providers ignore unknown fields, but Z.AI/GLM tries to deserialize 'attachments' as its native ChatAttachments type, causing HTTP 400 on every subsequent message after an image upload. Fix: _sanitize_messages_for_api() creates a clean copy with only API-standard keys (role, content, tool_calls, tool_call_id, name, refusal) before passing to run_conversation(). Applied to both the streaming path (streaming.py) and non-streaming path (routes.py). Closes #66 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 22:13:12 -07:00
Nathan Esquenazi	2fb2ddeaaa	feat: token usage toggle (setting + /usage command) + timestamp fixes Token usage display: - Add 'show_token_usage' boolean to settings (default: false, off by default) - Settings panel: checkbox 'Show token usage after responses' - /usage slash command: instant toggle with toast feedback, persists to server, updates checkbox if settings panel is open, re-renders messages - Boot: load show_token_usage alongside send_key on startup - ui.js: gate usage badge on window._showTokenUsage flag Timestamps: - streaming.py: stamp 'timestamp' on every message that lacks one at conversation completion; old messages (no timestamp field) now get a wall-clock time the first time they're touched by a new turn - messages.js: stamp _ts on the last assistant message at done-event time so the time shows immediately on the current turn before next reload - Timestamps already render in the UI (Sprint 14): faint time on each role header line, full opacity on hover, full date in title tooltip	2026-04-03 19:11:36 -07:00
Nathan Esquenazi	df06c1cdca	feat: Sprint 23 — agentic transparency + polish Track A: Token/cost display - Read agent usage attrs (session_prompt_tokens, session_completion_tokens, session_estimated_cost_usd) after run_conversation in streaming.py - Add input_tokens, output_tokens, estimated_cost fields to Session model - Include usage in done SSE event payload - Store usage on S.lastUsage in messages.js done handler - Render usage badge below last assistant message (input/output/cost) Track B: Subagent delegation cards - Add subagent_progress to toolIcon map with shuffle emoji - Special-case subagent_progress in buildToolCard: "Subagent" label, strip double emoji from preview, add tool-card-subagent CSS class - Indented border-left styling for subagent cards - Clean delegate_task display name Track C: Skill picker in cron create form - Add skill search input + tag chips to cron create form HTML - Skill picker JS in panels.js: search/filter, click-to-add tags, remove tag chips, pre-fetch skill list on form open - submitCronCreate sends skills array in POST body - Skill picker dropdown + tag CSS Track D: Skill linked files viewer - Add file query param to /api/skills/content endpoint - Serve linked files from skill directory with path traversal protection - Ensure linked_files key always present in skill content response - Render linked files section below SKILL.md content in preview panel - openSkillFile function for viewing individual linked files Track E: Bug fixes and code quality - Expand Session.__init__ and compact() to readable multi-line format - Remove inline import json as _j2 inside loop in streaming.py - Fix tool_calls: capture args from assistant messages, skip unresolved names - Store args snapshot in persisted tool_calls for reload display 6 new tests. Total: 421 (409 passing). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 18:33:49 -07:00
Nathan Esquenazi	e61382ef71	fix: pass fallback_model to AIAgent; show rate-limit error inline instead of 'Connection lost' Two fixes for Camanji rate limit UX: 1. api/streaming.py — pass fallback_model from profile config to AIAgent The agent already supports fallback_model (a dict with provider/model/base_url) for automatic rate-limit recovery, but streaming.py never read it from config or passed it to AIAgent. Now reads get_config().get('fallback_model') at call time (not module-level snapshot) and passes it through. Also reads platform_toolsets.cli from the active profile's config at call time so profiles with custom toolset lists use the right tools. Camanji has fallback_model: {provider: openrouter, model: anthropic/claude-sonnet-4.6} so hitting the direct-Anthropic rate limit will now automatically retry via OpenRouter before giving up. 2. api/streaming.py + static/messages.js — show error inline, not 'Connection lost' Previously: agent threw -> put('error', msg) -> SSE connection closed -> browser's network-level 'error' event fired -> generic 'Connection lost'. The actual error message was invisible to the user. Fix: renamed server-side error event to 'apperror' (distinct from the SSE spec's network error event). Added source.addEventListener('apperror', ...) in messages.js that renders the error as a styled assistant message: ⏱️ Rate limit reached: <full message> Rate limit reached. Fallback model exhausted. Try again in a moment. Also added source.addEventListener('warning', ...) for non-fatal notices (future use: fallback-activated status bar update). Tests: 426 passed, 0 failed.	2026-04-03 20:34:52 +00:00
Nathan Esquenazi	d2b27f6f1e	feat: multi-profile support -- create, switch, delete profiles from web UI (Issue #28 ) Add full profile management to the web UI, matching the hermes-agent CLI profile system. Profiles are isolated HERMES_HOME instances with their own config, skills, memory, cron, and API keys. Backend: new api/profiles.py wrapping hermes_cli.profiles, dynamic config reloading, 5 new API endpoints, profile-aware path resolution, HERMES_HOME env save/restore in streaming, module-level cache patching for skills_tool and cron/jobs. Frontend: profile chip in topbar with dropdown, Profiles sidebar panel with CRUD UI, boot-time profile fetch, cascade refresh on switch. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 10:50:21 -07:00
Nathan Esquenazi	1b1cd124f6	fix: stop leaking stack traces to clients in HTTP 500 responses Tracebacks exposed file paths, module names, and potentially secret values from local variables. Now logged server-side only; clients receive a generic error message. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-03 06:41:32 -07:00
Nathan Esquenazi	ceb091d6b1	fix: update MiniMax/Z.AI model lists, pass base_url to AIAgent - Update _PROVIDER_MODELS['minimax'] from stale ABAB 6.5 models to current MiniMax-M2.7/M2.5/M2.1 lineup (matching hermes-agent upstream) - Update _PROVIDER_MODELS['zai'] from GLM-4 to current GLM-5/4.7/4.5 lineup (matching hermes-agent upstream) - Extend resolve_model_provider() to also return base_url from config.yaml, so providers with custom endpoints (MiniMax, Z.AI) are routed correctly - Pass base_url to AIAgent in both streaming and sync chat paths Fixes #6 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 23:15:04 -07:00
Nathan Esquenazi	241357595d	refactor: extract resolve_model_provider helper, fix cross-provider routing Replace duplicated inline provider resolution in routes.py and streaming.py with a shared resolve_model_provider() helper in config.py. Improvements over original: - If model ID has a prefix matching any known direct-API provider (not just the config provider), strip it and route correctly. This handles edge cases like localStorage restoring a model from a different provider group. - Single source of truth for the resolution logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 22:56:34 -07:00
deboste	2864c2b691	fix(api): resolve model provider from config to prevent misrouting When the model dropdown sends a prefixed ID like "anthropic/claude-xxx", AIAgent interprets the provider/model format as an OpenRouter path and routes through OpenRouter instead of the direct Anthropic API. Fix: read the configured provider from config.yaml model section. If the model ID starts with the configured provider name followed by "/", strip that prefix and pass the provider explicitly to AIAgent. This ensures direct API providers (Anthropic, OpenAI, etc.) are used when configured, regardless of the model ID format from the dropdown.	2026-03-31 15:00:43 +00:00
Hermes	7019c25021	Hermes Web UI — Sprints 11-14: multi-provider models, settings, session QoL, alerts, polish Sprint 11 (v0.13): multi-provider model support, streaming smoothness - Dynamic model dropdown populated from configured API keys (OpenAI, Anthropic, Google, DeepSeek, GLM, Kimi, MiniMax, OpenRouter, Nous Portal) - Scroll pinning during streaming (no forced scroll when user has scrolled up) - All route handlers extracted to api/routes.py (server.py now ~76 lines) Sprint 12 (v0.14): settings panel, SSE reconnect, session QoL - Settings panel (gear icon) -- persist default model and workspace server-side - SSE auto-reconnect on network blips - Pin/star sessions to top of sidebar - Import session from JSON export Sprint 13 (v0.15): cron alerts, background errors, session duplicate, tab title - Cron completion alerts: toast per completion + unread badge on Tasks tab - Background agent error banner when a non-active session errors mid-stream - Session duplicate button - Browser tab title reflects active session name Sprint 14 (v0.16): Mermaid diagrams, file ops, session archive/tags, timestamps - Mermaid diagram rendering inline (dark theme, lazy CDN load) - File rename (double-click in file tree) and create folder - Session archive (hide without deleting, toggle to show) - Session tags -- #hashtag in title becomes colored chip + click-to-filter - Message timestamps (HH:MM on hover, full date as tooltip) Test suite: 224 tests across 14 sprint files + regression gate, 0 failures.	2026-03-31 07:02:47 +00:00
Nathan Esquenazi	a4e2174c29	Hermes WebUI v0.1.0 — initial public release	2026-03-30 20:40:19 -07:00

23 Commits