Builder's Daily / Agents & Harness
Agents & Harness — June 6, 2026
How do I run the agent loop reliably in production?
- agent-harness
- orchestration
- microsoft
- codeact
- harness
- runtime
The read
The harness is commoditizing — compaction, skills, hosted deploy, and orchestration primitives are the new electricity. When every vendor ships a harness, the moat is how humans wire judgment, guardrails, and institutional context into the loop.
What moved
-
Microsoft Agent Framework ships Agent Harness, CodeAct, and Handoff orchestration at BUILD 2026 — Microsoft Agent Framework Blog MAF (1.0 GA April 2026) adds Agent Harness via AsHarnessAgent() with automatic context compaction, filesystem memory, todo tracking, plan/execute modes, AgentSkillsProvider, BackgroundAgentsProvider, shell execution (.NET), ToolApprovalAgent, and OpenTelemetry. Foundry Hosted Agents deploy local MAF agents as containers with scale-to-zero, per-session VM isolation, and persistent filesystem. CodeAct (alpha) runs multi-tool Python in Hyperlight micro-VMs, cutting benchmark latency 52% and tokens 64%. HandoffBuilder adds directed multi-agent routing with developer-defined topology and guardrails. Builder angle: One method turns a chat client into a production harness with compaction, skills, sub-agents, and hosted deployment — collapsing what teams typically stitch from separate OSS pieces.
-
NVIDIA releases NemoClaw orchestration blueprints and OpenShell secure agent runtime — NVIDIA Newsroom NVIDIA Agent Toolkit ships NemoClaw blueprints (available now) connecting popular harnesses for long-running agents, plus OpenShell early preview for policy/privacy controls and routing queries to local vs cloud models. Nemotron 3 Ultra (550B MoE) targets agent harnesses including LangChain Deep Agents, OpenHands, and OpenCode. CUDA-X libraries (cuDF, cuOpt, NeMo, PhysicsNeMo, CUDA-Q) are exposed as domain-specific agent skills. Microsoft partners on Windows security primitives plus OpenShell; Canonical and Red Hat integrate OpenShell into Ubuntu and Red Hat AI. Builder angle: NemoClaw plugs orchestration blueprints into existing harnesses while OpenShell adds a policy-controlled runtime layer for cross-platform agent deployment.
-
Amazon Agent-Ops multi-agent framework automates SOPs with 85–97% accuracy in production — Amazon Science Agent-Ops orchestrates three agents for e-commerce SOP automation: SOP Groomer converts ambiguous docs into automation-ready specs, WebAgent hits 91.3% task completion via demonstration-based learning on dynamic web UIs, and Document Verification Agent validates invoices and certificates at 94.2% accuracy across languages. Deployed across seven SOP categories in three regions with 83% case-resolution time reduction, used by 100 account managers. Builder angle: Demonstrates a supervisor plus specialized-worker orchestration pattern with measurable production accuracy on incomplete SOPs and unpredictable UIs — not just lab demos.