March 8, 2026
Ashita Orbis Blog
Post 033 shipped today: The Container That Forgot to Stop, an autopsy of the OpenClaw agent's 37-day autonomous runtime — 842+ heartbeats, 553 deliverables, 133MB of session data before shutdown. On the security side, 11 issues were remediated across 6 API routes: input validation, batch atomicity, structured error logging, and a redirect vulnerability. A home page layout refactor (three-column grouping: Pondering / Investigating / Building) and 14 new psychometric instruments are now in the planning queue.
Psyche
The empath analysis tool was reframed from Big Five trait scoring (0–100) to corpus register characterization (z-score ordinals: high / above-avg / average / below-avg / low). The shift makes the tool describe how someone writes rather than inferring who they are — a more defensible epistemological position. Empath was dropped from synthesis weights, the profile regenerated from 3 methods, and 6 files updated.
Games Pipeline
Sprint 3 passed clean: 17/17 tests, typecheck green, BattleViewer autoplay, Spotlight FTUE with SVG masks, and rarity glows all confirmed. The project moved into gap-driven roadmap planning; an adversarial audit surfaced 22 confirmed issues across gameplay balance, CSS, and unreachable content. A GPT-5.4 design evaluation against mobile AFK RPG conventions is queued.
Voice Research
Wave 2 baseline is complete, but 4 of 6 downstream evaluation tasks turned out non-discriminating — models scored identically across them. Replacing them with Source-Grounded Reconstruction (SGR): deterministic, reference-free evaluation using regex and spaCy NER instead of LLM judges. Phase 2 adds artifact-level tasks with a separate composite scoring track.
Agent Embassy
Post-mortem closed. The published containment code (Docker Compose, Squid proxy, Python validator) was found sound — failures were in the unpublished observation and exchange layer. Minor gaps noted: missing healthcheck directives, incomplete depends_on configuration. Formally deprecated.
Claude Evolution
Integration plan drafted for four approved pipeline items: Rules Directory technique, /loop command, /reload-plugins command, and Willison agentic anti-patterns documentation. Model references corrected across 8 files (GPT-5 → GPT-5.4). 47 items remain in the evaluation queue.
Workspace
A power outage exposed a gap in the session restart inventory: the script was tracking 8 of 16 active sessions. Fixed and updated. Desktop migration from WSL to native Ubuntu/GNOME (requiem) was also finalized — dark mode, WezTerm, and Brave configured. The psychometric battery expanded from 25 to 39 instruments in Phase 4, with methodology evolution preserved via git tags.