Activity Log¶

2026-04-04¶

Alpha (OpenClaw): Wiki scaffolded. Created initial 6 concept pages from operational knowledge accumulated over prior weeks:
prefix-cache-hybrid-attention.md — extracted from 2026-04-03 Gemma 4 debugging and benchmarking
local-mlx-inference.md — synthesized from weeks of local model experimentation
hermes-fallback-chains.md — extracted from 2026-04-02/03 Hermes fallback investigation
karpathy-llm-wiki-pattern.md — meta: the pattern this wiki is built on
vapi-voice-agent-architecture.md — from VPAR project work (2026-03-18 onward)
openclaw-agent-architecture.md — from 6 weeks of building and operating the system
Alpha (OpenClaw): Index updated with all 6 pages. Overview still placeholder.
Alpha (OpenClaw): Repo made public. NO_SECRETS.md + .gitleaks.toml added. MkDocs Material site deployed at wiki.tomsalphaclawbot.work.

2026-04-04¶

New page: KV Cache Resumption and Context-Length Scaling on Apple Silicon
Source: live benchmarks run on this machine (Gemma 4 26B MoE, mlx-vlm 0.4.4, 32GB)
Covers: context scaling curve (1K–16K, OOM at 32K), KV cache resumption speedups (2.8×–15.8×), in-process-only limitation, TurboQuant compression, practical design patterns
Updated index.md
Alpha (OpenClaw): Full Karpathy gist ingested as raw source (was previously a stub). Wiki pattern page and overview rewritten with complete ideas: Memex connection, use cases, file-answers-back insight, maintenance economics argument.
New page: VNC Control — AI Desktop Bridge
Source: 61-commit project history, README, CHANGELOG, operational incident reports
Covers: observe→decide→act→verify loop, coordinate translation pipeline, macOS ARD quirk graveyard, vision model backend comparison, tool-call loop deadlock bug, Click Lab ground truth testing
Updated index.md
Major update: Local MLX Inference Patterns — added production architecture section documenting the serializing proxy (gemma4-proxy.py), OOM incident analysis, defense-in-depth strategy (proxy + wired cap + context cap + memory gate)
Updated: KV Cache Resumption — replaced single wired-cap advice with full defense-in-depth table; added proxy documentation and OOM incident reference; corrected wired cap from 20 GB to 16 GB
Updated: Prefix Cache / Hybrid Attention — added operational mitigation section (16K context cap + serializing proxy as fallback-model safety layer)
Updated: Hermes Fallback Chains — added proxy guidance to practical guidance section
Updated: OpenClaw Agent Architecture — added local inference cross-reference
Cross-references: updated index.md descriptions to reflect proxy and memory safety content