Tag

@Remi_Cadene

2 issues found

Jun 22, 2026

The Shift to Learned Orchestration

Description

  • Learned Orchestration Ascends Sakana AI’s Fugu signals a shift from hand-coded LangGraph state machines to learned coordination, where agents reason about delegation rather than following static logic trees.
  • Code-as-Action Dominance Hugging Face’s smolagents and the 'Code-as-Action' paradigm are replacing fragile JSON tool-calling with direct Python execution to improve reliability in complex environments.
  • Reliability Over Weights Production success is increasingly a property of the orchestration layer—using type-safe frameworks like PydanticAI and persistent memory like Mem0—rather than just raw model weights.
  • The Enterprise Gap While GPT-4o’s sub-300ms latency enables fluid reasoning, recent benchmarks show enterprise agents still only resolve 11% of real-world SRE tasks, highlighting the need for better RL environments like OpenEnv.

Tags

AMDAnthropicBerkeleyDeepSeekGoogleHugging Face+74 more
137 time saved1346 sources17 min read

Jun 19, 2026

Agentic Sovereignty and Code-as-Action

Description

  • Frontier Performance Meets Localism Zhipu AI's 744B GLM-5.2 is challenging GPT-5.5 performance, emphasizing the shift toward capable open-weights as US policy shifts tighten access to cloud-based frontier models.
  • Code-as-Action Over Brittle JSON The industry is pivoting from fragile JSON-based orchestration toward a Code-as-Action philosophy with frameworks like smolagents, aiming to solve the high failure rates seen in complex enterprise SRE scenarios.
  • Context Expansion and Determinism While subquadratic scaling pushes context windows to a staggering 12 million tokens, practitioners are moving away from vibe-based development toward rigorous adversarial review loops and automated validation gates.
  • Standardizing the Developer Stack Vercel’s new Agent Stack and the Cursor Doctrine signify a maturation of the ecosystem, focusing on durable workflows, long-running sandboxes, and protocol-level code editing.

Tags

AMDAWSAgility RoboticsAlibabaAnthropicAnysphere+116 more
303 time saved1712 sources17 min read