Loop Engineering

The engineering discipline of systems that self-improve through feedback.

Closed feedback loops — observe, act, evaluate, update, repeat — made structured, mathematical, benchmarkable, and fully engineerable.

Read the Manifesto · Explore Patterns · Run the Stack · Onboarding Paths

🚀 The Paradigm Shift

Era	Focus	Optimized Unit	Cognitive Ceiling
2020–2023	Prompt Engineering	Single turn, in-context cues	No closure, state loss
2023–2024	Context Engineering	Static retrieval-augmented memory	Unchanged parameters, no iteration
2024–2025	Agent Engineering	Autonomous delegation & tools	No systemic evaluation, feedback-blind
2025+	Loop Engineering	Closed dynamical feedback loops	Unbounded, self-directed systems

The Hierarchy of Optimization:

Prompt engineering optimizes a single interaction.

Agent engineering optimizes an autonomous actor.

Loop engineering optimizes the entire closed system to get better over time through feedback.

The Loop Engineering dividend

Loop engineering optimizes the whole closed system, which makes loops cheaper to run, faster to ship, impossible to hand-wave, and built to get better every iteration.

Figure: Loop Engineering benefits across tokens, speed, CI cost, diagnosability, comparability, traces, and schema drift.

Benefit	What you get	Why teams care
Lean context	Combine + minify + budget specs down to 34% of raw YAML	More room for actual work in the window — not boilerplate
Minutes, not weeks	Golden path → valid LSS → scored loop in ~15 minutes	Stop reinventing loop config every sprint
Zero-dollar CI	SimEnv + ReplayEnv run 545 LoopNet trajectories with $0 API spend	Catch regressions before they hit prod invoices
Shared failure language	`fail.*` taxonomy across data, runtime, and bench	Post-mortems that actually transfer between teams
Public receipts	LoopBench 19 tasks · 4 suites · LES-ranked leaderboard	"It worked in the demo" is no longer a career strategy
Production visibility	LTF traces ~70% leaner than raw chat dumps	SREs see iteration quality, not megabytes of prompts
One spec layer	Pin `lss@1.1.0` once — LoopGym, LoopBench, LoopNet agree	Zero schema drift across five repos
Harness freedom	Claude Code, Cursor, LangGraph, CrewAI, Codex, Aider…	Keep your agent stack — add closure on top

The pitch in one line: ship loops that cost less per turn, score on a leaderboard, replay for free, fail with names, and compound improvement — not another prompt doc lost in Notion.

Leaner specs (measured)

Ship one flat spec instead of dragging multiple YAML files into context. Measured with le-loopforge 0.5.0 on the research → code → debug library trio.

Baseline = 3 separate library specs (3,255 est. tokens). Lower is leaner.

Path	Command	Tokens	vs baseline
Separate library YAMLs	load 3 files into context	3,255	100%
Flat combine	`loop combine --library research-agent,coding-agent,autonomous-debugger`	2,750	84%
LSS-min JSON	`loopctl spec minify combined.yaml`	1,414	43%
Budgeted combine	`loop quick --max-tokens 1200 --library …`	1,101	34%

Same LSS structure. Same evaluators. Same termination contracts. Just less noise between your agent and the job.
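The "vs baseline" column is each path's token count divided by the 3,255-token baseline, rounded to a whole percent. A minimal sketch of that arithmetic, using only the counts from the table (the names `paths` and `percent_of_baseline` are illustrative and not part of any Loop Engineering tool):

```python
# Token counts copied from the table; the baseline is the three separate library YAMLs.
BASELINE_TOKENS = 3255

paths = {
    "Separate library YAMLs": 3255,
    "Flat combine": 2750,
    "LSS-min JSON": 1414,
    "Budgeted combine": 1101,
}


def percent_of_baseline(tokens: int, baseline: int = BASELINE_TOKENS) -> int:
    """Return `tokens` as a whole-number percentage of `baseline`."""
    return round(100 * tokens / baseline)


ratios = {name: percent_of_baseline(tokens) for name, tokens in paths.items()}

for name, pct in ratios.items():
    print(f"{name}: {paths[name]:,} tokens = {pct}% of baseline")
```

Read the other way, the budgeted combine frees roughly two thirds of the tokens the three separate files would occupy.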

⚡ Quick Start: 30-Second Setup

Install the entire Loop Engineering toolchain with a single command:

pip install "le-loop-stack>=0.4.0"

Run your first scored, compressed loop:

# Scaffold a loop spec from an English intent
loopforge intent "Create a code-repair loop with a test-runner evaluator" -o mapped.yaml --suggest-level

# Minify it into LSS-min JSON (saves 40–60% of prompt context space)
loopctl spec minify mapped.yaml --json

# Estimate tokens & score its structural LES
loopctl score --spec mapped.yaml --json
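For CI, the three commands above could be chained from a script. The sketch below is a hypothetical wrapper, not part of the toolchain: `QUICKSTART_STEPS` and `run_quickstart` are names introduced here, and it assumes `loopforge` and `loopctl` are on `PATH` after the install. It defaults to a dry run that only returns the command lines, because the output format of each command is not described in this README.

```python
import shlex
import shutil
import subprocess

# The same three commands as the quick start, as argument lists.
QUICKSTART_STEPS = [
    ["loopforge", "intent",
     "Create a code-repair loop with a test-runner evaluator",
     "-o", "mapped.yaml", "--suggest-level"],
    ["loopctl", "spec", "minify", "mapped.yaml", "--json"],
    ["loopctl", "score", "--spec", "mapped.yaml", "--json"],
]


def run_quickstart(steps=QUICKSTART_STEPS, dry_run=True):
    """Run the steps in order and stop at the first failure.

    With dry_run=True nothing is executed; the shell-quoted command
    lines are returned so they can be logged or reviewed.
    """
    planned = []
    for argv in steps:
        planned.append(shlex.join(argv))
        if dry_run:
            continue
        if shutil.which(argv[0]) is None:
            raise FileNotFoundError(f"{argv[0]} is not on PATH")
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(argv, check=True)
    return planned


for line in run_quickstart():
    print(line)
```

Calling `run_quickstart(dry_run=False)` would execute the commands for real; keeping the steps as data makes it easy to swap in a different intent string.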

🧩 Core Ecosystem Pillars

Pillar	Focus Area	Key Artifacts
Theory	Foundational conceptual rigor	13 Fundamentals · 6-Level Taxonomy · 14 Design Patterns
Method	Closed-loop lifecycle governance	D-D-M-I-S Framework (Design, Diagnose, Measure, Improve, Scale)
Standards	Interoperable specification models	LSS 1.1 (Composition blocks) · LES 1.0 (Loop Effectiveness Score)
Evidence	Real-world validation & history	Case Studies (AlphaGo, Toyota TPS, PR pipelines, coding agents)
Runtime	Execution, scoring, and benchmarks	Dataset registries, replay sandboxes, and the public scorecard

This repository serves as the narrative and theoretical home for the loop engineering movement. Machine-readable specifications and governance rules live in the canonical Loop Core Engineering repository.

📦 The Published Stack

Everything below is live, synchronized, and published across GitHub and PyPI. Version registry: ECOSYSTEM_VERSIONS.md.

flowchart TD
  classDef primary fill:#18181b,stroke:#27272a,stroke-width:2px,color:#ffffff;
  classDef highlight fill:#f4f4f5,stroke:#18181b,stroke-width:2px,color:#18181b;
  classDef standard fill:#ffffff,stroke:#e4e4e7,stroke-width:1.5px,color:#18181b;

  DOCS[["◆ Loop Engineering <br/>(You are here)<br/>Manifesto · Patterns · Case Studies"]]:::primary
  FORGE["⚙ LoopForge<br/>pip install le-loopforge"]:::standard
  CTL["loopctl CLI<br/>pip install le-loopctl"]:::standard
  CORE[["◆ Loop Core Engineering<br/>LSS Spec · LES Spec · Validators"]]:::highlight
  NET[("■ LoopNet v0.2<br/>545 trajectories")]:::standard
  GYM["◆ LoopGym<br/>pip install loopgym"]:::standard
  BENCH["▲ LoopBench<br/>pip install loopbench"]:::standard

  DOCS --> FORGE
  FORGE --> CTL
  FORGE --> CORE
  CORE --> NET
  CORE --> GYM
  NET --> GYM
  GYM --> BENCH
  CORE --> BENCH
  FORGE --> GYM

Repository	Focus	Purpose & Links
LoopForge	Creation	Scaffold valid LSS specs from patterns · loopforge/ · `pip install le-loopforge` · loopctl · Golden Path
Loop Core Engineering	Specs & Governance	The constitutional foundation, schemas, and validators · GitHub →
LoopNet	Dataset	Ground truth loop executions and trajectories · GitHub → · Hugging Face →
LoopGym	Runtime	Sandboxed simulation environment to run and replay loops · GitHub → · `pip install loopgym`
LoopBench	Benchmarks	Continuous, public community scoreboard · GitHub → · `pip install loopbench`

◆ Complete Install Map: ECOSYSTEM.md
◆ Ecosystem Governance: CANONICAL-SOURCE.md
◆ PyPI Registry Naming Rules: PYPI_NAMING.md

📐 The Loop, Formally

Every loop is structured as a closed dynamical system:

       Observe
          │
          ▼
        Decide
          │
          ▼
         Act
          │
          ▼
       Evaluate
          │
          ▼
     Update State
          │
          └───────────(repeat)───────────► [Observe]

Formally, each loop is the tuple: $$\mathcal{L} = (S, A, O, T, E, M, \tau)$$

Where:

$S$ : State space of the system
$A$ : Action space of the loop workers
$O$ : Observation space (feedback signals)
$T$ : Transition function ($T : S \times A \to S$)
$E$ : Evaluator models (generate scores and rewards)
$M$ : Memory representation (episodic and parameter state)
$\tau$ : Termination conditions and criteria

→ Detailed breakdown: What is a loop?
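To make the tuple concrete, here is a minimal sketch of a loop runner. It is an illustration under stated assumptions, not the LoopGym runtime: the `Loop` class, its field names, and the toy "move toward a target" task are all invented for this example. Memory $M$ is reduced to a list of past scores, and $\tau$ to a predicate over the latest score and the iteration count.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List


@dataclass
class Loop:
    observe: Callable[[Any], Any]               # S -> O
    decide: Callable[[Any, List[float]], Any]   # (O, M) -> A
    transition: Callable[[Any, Any], Any]       # T: S x A -> S
    evaluate: Callable[[Any], float]            # E: scores the new state
    terminate: Callable[[float, int], bool]     # tau: (score, iteration) -> stop?
    memory: List[float] = field(default_factory=list)  # M: score history

    def run(self, state: Any) -> Any:
        iteration = 0
        while True:
            observation = self.observe(state)               # Observe
            action = self.decide(observation, self.memory)  # Decide
            state = self.transition(state, action)          # Act
            score = self.evaluate(state)                    # Evaluate
            self.memory.append(score)                       # Update State
            iteration += 1
            if self.terminate(score, iteration):
                return state


# Toy task: start at 0.0 and close half of the remaining gap to TARGET each turn.
TARGET = 10.0
loop = Loop(
    observe=lambda x: TARGET - x,                  # feedback signal: remaining error
    decide=lambda error, memory: error / 2,        # action: step by half the error
    transition=lambda x, step: x + step,
    evaluate=lambda x: -abs(TARGET - x),           # higher (closer to 0) is better
    terminate=lambda score, i: score > -0.01 or i >= 50,
)
final_state = loop.run(0.0)
print(final_state, len(loop.memory))
```

In this toy run the error halves every iteration, so the loop stops after 10 iterations with the state within 0.01 of the target. The second clause of `terminate` plays the role of a `max_iterations` guard.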

Declaring Loops in LSS (Loop Specification Schema)

LSS provides a declarative, machine-readable format to define the architecture, inputs, and constraints of any loop.

loop_name: code-repair-loop
version: "1.1"
objective: "Fix failing tests with minimal diff"
workers:
  - role: implementer
evaluators:
  - type: test_suite
termination_conditions:
  - type: all_tests_pass
  - type: max_iterations
    value: 10

→ LSS 1.1 Specification
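As a sketch of how a runtime might read the termination block of the spec above, the snippet below mirrors the YAML as a plain Python dict and checks the two conditions. Two assumptions are made here and are not confirmed by this README: that the six top-level keys in the example are all required, and that termination conditions are OR-ed, with the first matching condition reported. The helper names `missing_keys` and `termination_reason` are hypothetical. Consult the LSS 1.1 specification for the actual rules.

```python
# The code-repair-loop example, mirrored as a dict (what a YAML loader would return).
SPEC = {
    "loop_name": "code-repair-loop",
    "version": "1.1",
    "objective": "Fix failing tests with minimal diff",
    "workers": [{"role": "implementer"}],
    "evaluators": [{"type": "test_suite"}],
    "termination_conditions": [
        {"type": "all_tests_pass"},
        {"type": "max_iterations", "value": 10},
    ],
}

# Assumption: every top-level key used in the example is required.
REQUIRED_KEYS = (
    "loop_name", "version", "objective",
    "workers", "evaluators", "termination_conditions",
)


def missing_keys(spec: dict) -> list:
    """Return the required top-level keys that are absent from the spec."""
    return [key for key in REQUIRED_KEYS if key not in spec]


def termination_reason(spec: dict, iteration: int, all_tests_pass: bool):
    """Return the type of the first satisfied condition, or None to keep looping."""
    for condition in spec["termination_conditions"]:
        kind = condition["type"]
        if kind == "all_tests_pass" and all_tests_pass:
            return kind
        if kind == "max_iterations" and iteration >= condition["value"]:
            return kind
    return None


print(missing_keys(SPEC))
print(termination_reason(SPEC, iteration=3, all_tests_pass=False))
print(termination_reason(SPEC, iteration=10, all_tests_pass=False))
```

Listing `all_tests_pass` before `max_iterations` means a run that succeeds on its final allowed iteration is reported as a success rather than as a budget exhaustion.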

🔌 Building with Agent Harnesses

You do not need to replace your existing agent stack. Map your existing agent loop, monitor its trajectories, and benchmark its performance in minutes.

Harness / Platform	Integration Guide	Target Framework
Claude Code	integrate/CLAUDE_CODE.md	Anthropic CLI agent
OpenAI Codex	integrate/CODEX.md	Codex code models
LangGraph	examples/integrate-langgraph/	LangChain Graphs
CrewAI	examples/integrate-crewai/	Role-playing Multi-agent swarms
Cursor	integrate/CURSOR.md	Cursor IDE Composer & Agent
OpenAI Agents SDK	integrate/OPENAI_AGENTS.md	OpenAI Swarm/Agents
Aider	integrate/AIDER.md	CLI git-integrated coding agent
Gemini CLI	integrate/GEMINI_CLI.md	Google Generative AI

→ View Full Integration Hub

🧭 Onboarding Paths

Profile	Recommended Onboarding Path	Expected Time
The Theorist	Manifesto → Fundamentals	~2 hours
The Builder	Golden Path v6 → `pip install le-loop-stack` → Integration Hub	~15 min
The Practitioner	Loop Playground → Live Leaderboard	~30 min
The Researcher	Paper Series → LoopNet v0.2 → Case Studies	~1 day
The Architect	D-D-M-I-S Framework → LES scoring	~2 hours

🏢 Repository Architecture

Path	Purpose	Key Artifacts
`manifesto/`	Founding Principles	The philosophy and paradigm of loop engineering
`fundamentals/`	Core Theory	13-topic detailed theoretical foundation of self-improving systems
`taxonomy/`	Classification	Six-level loop classification taxonomy
`patterns/`	Design Patterns	14 engineering patterns described as reusable LSS specs
`framework/`	Methodology	D-D-M-I-S procedural guide for building and deploying loops
`case-studies/`	Historical Evidence	Analyses of AlphaGo, Toyota TPS, GitHub PR engines, and coding loops
`loop-library/`	Spec Library	Production-grade reference loop YAML files
`loopforge/`	Creation Tools	Interactive scaffolding tools to map intents to LSS specs
`implementations/`	Code Examples	Minimal reference implementations in Python, LangGraph, and CrewAI
`research/`	Research Frontier	Active open problems, roadmaps, and paper series

📚 Reference Loop Library

A preview of pre-declared loops available in loop-library/:

Reference Spec	Level	Intent / Target Use Case
Research Agent	Level 2	Literature review & multi-source synthesis
Coding Agent	Level 3	Autonomous software feature implementation
Autonomous Debugger	Level 3	Test-driven localized software repair
Code → Debug (nested)	Level 4	Coding loop with nested recursive debugging
Scenario Swarm (parallel)	Level 4	SWARM decision rehearsal: 3 parallel perspectives with a unified merged forecast
Startup Validator	Level 2	PMF hypothesis verification and fast lean iterations

→ Browse the Full Spec Library · Master Checklist · Next Steps

🛠️ Ecosystem Toolchain

Unified tools to speed up loop design, execution, validation, and benchmarking.

Tool	Purpose	Source / Usage
`loopctl`	Unified CLI tool	`tools/loopctl.py` · Validate, score, level, and diagram LSS specs
`loopforge`	Spec generator	`loopforge/` · Scaffold complete LSS YAML files from text-based intents
`loop_validator`	Schema validator	`tools/loop_validator.py` · Local LSS schema verification
`daily_checkin`	Automated reporter	`scripts/daily_checkin.py` · Continuous deployment checks
`loop_diagram_generator`	Visualizer	`tools/loop_diagram_generator.py` · Auto-generate clean Mermaid diagrams from LSS YAML

🤝 Join the Community

We welcome contributions to LSS specs, new agent harnesses, case studies, benchmarks, and core tooling.

◆ Loop Playground — Create and test your first loop in the sandbox.
◆ Community Spotlight — Highlighted community loops and implementations.
◆ Reproduction Challenge — Replicate verified benchmark scores.
◆ Contributor Guidelines · Governance Model · Reproduction Manual

📝 Citation

@misc{loop-engineering-2026,
  title={Loop Engineering: The Discipline of Self-Improving Systems},
  author={Loop Engineering Community},
  year={2026},
  url={https://github.com/KanakMalpani/Loop-Engineering}
}

Feedback is the fundamental unit of intelligence.
Loop Engineering makes it engineerable.

MIT License
