Foundational Papers

Key papers that shaped the field of Harness Engineering and AI agent runtimes.

Agent Architectures

  • ReAct: Synergizing Reasoning and Acting in Language Models (2022) Yao et al. | Paper The reasoning + acting loop that underlies most modern agent harnesses.

  • Toolformer: Language Models Can Teach Themselves to Use Tools (2023) Schick et al. | Paper Foundation for tool-use in agent harnesses.

  • Generative Agents: Interactive Simulacra of Human Behavior (2023) Park et al. | Paper Memory architecture for persistent agent identities.

  • The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling (2024) Masterman et al. | Paper Comprehensive survey of agent architecture patterns.

Memory & Context

  • MemGPT: Towards LLMs as Operating Systems (2023) Packer et al. | Paper Virtual context management inspired by OS memory hierarchy.

  • Reflexion: Language Agents with Verbal Reinforcement Learning (2023) Shinn et al. | Paper Self-reflection as a memory and learning mechanism.

Multi-Agent Systems

  • AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation (2023) Wu et al. | Paper Multi-agent conversation framework design patterns.

  • AgentVerse: Facilitating Multi-Agent Collaboration (2023) Chen et al. | Paper Coordination patterns for multi-agent systems.

Harness Engineering (Emerging)

  • Harrison Chase: "Your Harness, Your Memory" (2026) Blog Post | Thread The essay that put "Harness Engineering" into mainstream discourse.

  • steipete: OpenClaw Strict Mode & Harness Pluggability (2026) Thread Runtime-level constraints vs prompt-level constraints.


This list is actively maintained. Submit papers via PR or open an issue with the paper label.