Builder's Daily / Agents & Harness

Agents & Harness — June 6, 2026

How do I run the agent loop reliably in production?

The read

The harness is commoditizing — compaction, skills, hosted deploy, and orchestration primitives are the new electricity. When every vendor ships a harness, the moat is how humans wire judgment, guardrails, and institutional context into the loop.

What moved

  • Microsoft Agent Framework ships Agent Harness, CodeAct, and Handoff orchestration at BUILD 2026 (Microsoft Agent Framework Blog)
    MAF (1.0 GA April 2026) adds Agent Harness via AsHarnessAgent() with automatic context compaction, filesystem memory, todo tracking, plan/execute modes, AgentSkillsProvider, BackgroundAgentsProvider, shell execution (.NET), ToolApprovalAgent, and OpenTelemetry. Foundry Hosted Agents deploy local MAF agents as containers with scale-to-zero, per-session VM isolation, and persistent filesystem. CodeAct (alpha) runs multi-tool Python in Hyperlight micro-VMs, cutting benchmark latency 52% and tokens 64%. HandoffBuilder adds directed multi-agent routing with developer-defined topology and guardrails.
    Builder angle: One method turns a chat client into a production harness with compaction, skills, sub-agents, and hosted deployment, collapsing what teams typically stitch together from separate OSS pieces.

  • NVIDIA releases NemoClaw orchestration blueprints and OpenShell secure agent runtime (NVIDIA Newsroom)
    NVIDIA Agent Toolkit ships NemoClaw blueprints (available now) connecting popular harnesses for long-running agents, plus an OpenShell early preview for policy/privacy controls and routing queries to local vs cloud models. Nemotron 3 Ultra (550B MoE) targets agent harnesses including LangChain Deep Agents, OpenHands, and OpenCode. CUDA-X libraries (cuDF, cuOpt, NeMo, PhysicsNeMo, CUDA-Q) are exposed as domain-specific agent skills. Microsoft partners on Windows security primitives plus OpenShell; Canonical and Red Hat integrate OpenShell into Ubuntu and Red Hat AI.
    Builder angle: NemoClaw plugs orchestration blueprints into existing harnesses while OpenShell adds a policy-controlled runtime layer for cross-platform agent deployment.

  • Amazon Agent-Ops multi-agent framework automates SOPs with 85–97% accuracy in production (Amazon Science)
    Agent-Ops orchestrates three agents for e-commerce SOP automation: SOP Groomer converts ambiguous docs into automation-ready specs, WebAgent hits 91.3% task completion via demonstration-based learning on dynamic web UIs, and Document Verification Agent validates invoices and certificates at 94.2% accuracy across languages. Deployed across seven SOP categories in three regions with 83% case-resolution time reduction, used by 100 account managers.
    Builder angle: Demonstrates a supervisor plus specialized-worker orchestration pattern with measurable production accuracy on incomplete SOPs and unpredictable UIs, not just lab demos.
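
To make the recurring pattern in these items concrete, here is a minimal Python sketch of a supervisor that routes a task between specialized workers over a developer-defined handoff topology, with a hop budget as a simple guardrail. It is an illustration only: the `Supervisor` class, the worker functions, and the topology are hypothetical names invented here, and none of this reflects the actual MAF HandoffBuilder, NemoClaw, or Agent-Ops APIs.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set


@dataclass
class Task:
    payload: str
    trail: List[str] = field(default_factory=list)  # agents visited, in order


# A worker mutates the task and names the next agent, or returns None when done.
Worker = Callable[[Task], Optional[str]]


class Supervisor:
    """Hypothetical router: enforces an explicit handoff topology and a hop budget."""

    def __init__(self, workers: Dict[str, Worker],
                 allowed_handoffs: Dict[str, Set[str]], max_hops: int = 5):
        self.workers = workers
        self.allowed_handoffs = allowed_handoffs
        self.max_hops = max_hops

    def run(self, start: str, task: Task) -> Task:
        current = start
        for _ in range(self.max_hops):
            task.trail.append(current)
            nxt = self.workers[current](task)
            if nxt is None:
                return task  # the worker declared the task finished
            # Guardrail: reject any handoff the developer did not declare.
            if nxt not in self.allowed_handoffs.get(current, set()):
                raise PermissionError(f"handoff {current} -> {nxt} is not allowed")
            current = nxt
        raise RuntimeError("hop budget exhausted before the task finished")


# Illustrative stand-ins for specialized workers (no model calls are made).
def groomer(task: Task) -> Optional[str]:
    task.payload = task.payload.strip()
    return "web"


def web(task: Task) -> Optional[str]:
    task.payload += " | executed"
    return "verifier"


def verifier(task: Task) -> Optional[str]:
    task.payload += " | verified"
    return None


WORKERS: Dict[str, Worker] = {"groomer": groomer, "web": web, "verifier": verifier}
TOPOLOGY: Dict[str, Set[str]] = {"groomer": {"web"}, "web": {"verifier"}, "verifier": set()}

if __name__ == "__main__":
    done = Supervisor(WORKERS, TOPOLOGY).run("groomer", Task("  Refund Request "))
    print(done.trail)  # ['groomer', 'web', 'verifier']
```

Keeping the topology as data rather than burying it in each worker means a reviewer can audit which handoffs are possible without reading any agent logic, which is one plausible way to wire the human judgment and guardrails described above into the loop.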
