The Agent Fabric (Part 1): Why Agents May Form Societies

Claude Code was used for editing and visualizations. All ideas and arguments are the authors' own.

Updates

2026-05-05: Added note on self-organization and complexity science (Holland's schema theorem, classifier systems, Kauffman, Prigogine, Axelrod). Refined definitions throughout to avoid implying patterns are inherently designed or emergent.
2026-05-15: Expanded several notes and added new ones (Tierra, identity, network stability) in response to a detailed review by Håkan Jonsson. Thank you, Håkan, for the generous and thought-provoking feedback.
2026-06-08: Added references to Foundation Protocol (Liu et al., 2026), scaling of personalized adapters (Bo et al., 2026), and heterogeneous swarms (Feng et al., NeurIPS 2025). The first two appeared after publication and support predictions made in the original text.

Early access. This blog series is a work in progress. Feedback, comments, and suggestions are welcome. Feel free to reach out on LinkedIn or leave a comment at the bottom of the page.

Figure 1. The evolution of two societies. This animation previews the argument of the post in four phases: from isolated human-agent pairs, through an emerging resource ecology, to governed societies with shared memory, and finally a living fabric that restructures itself under pressure. Use the arrows to navigate. The two premises that drive this progression (Two Observations) and the hypothesis that connects them (The Loom Hypothesis) are developed in the sections below.

Full caption

Phase 1 (Isolation): humans interact with individual frontier models (large dark circles marked "F") that already have memory, personalization, and access to tools, yet different users' agents do not communicate with each other. This is the world most people know today. Phase 2 (Ecosystem Grows): frontier models produce smaller, cheaper distilled models that begin connecting to each other, forming a resource ecology of diverse model sizes rather than a single dominant model (why not one model?). Phase 3 (Societies Form): agents cluster into governed societies (colored boundaries), each with its own governance archetype. Collective Memories (CM, green squares) and local Knowledge Bases (KB, small labeled rectangles) store society-specific or agent-specific knowledge, feeding into a Knowledge Factory (KF, diamond at the bottom) that synthesizes insights across clusters. Phase 4 (The Living Society): the fabric comes alive. Tasks flow across society boundaries (the moving dots represent work, not agents traveling), boundary events reshape governance structures (mergers, schisms, expansions), and knowledge flows continuously. This is the adaptive fabric, a system that restructures itself under pressure rather than breaking.

When you use ChatGPT, Claude, or Gemini, you are talking to one AI. It has memory, it can search the web, run code, browse files, yet there is only one model behind the curtain. Something different is already happening one layer down. When you ask a coding agent to refactor a module, the interaction may feel like one assistant, but the work is often split across planners, tool calls, test runners, and specialized sub-agents. You interacted with one agent; several did the work behind the scenes. Now scale it to millions or billions of agents, coordinating across organizations, forming persistent relationships. What organizational structures could emerge, and what would govern them? We call this interconnected system the Agent Fabric.

Some threads in this fabric are built top-down by engineers who assign roles and routing. Others crystallize bottom-up through repeated interaction. The same structural patterns can arise through either path. As billions of people acquire personal agents, those agents connect through social interactions, shopping routines, and work collaborations, forming societies without any single deployer planning the outcome.

The Agent Fabric, a multi-part series on why and how AI agents may form societies and what it means for us.

Part 1. Why Agents May Form Societies (you are here). Two observations, the Loom Hypothesis, and the path from isolation to interweaving
Part 2. Division of Labour and Governance (coming soon). Delegation archetypes, the specialist market, and governance archetypes

Table of Contents

From Mindless to Mindful: Beyond the Society of Mind
Two Observations
The Vision: From Isolation to Interweaving
The Resource Ecology
Why Not One Model to Rule Them All?
Governance: How Agent Societies Are Ruled
Collective Memory and the Knowledge Factory
The Adaptive Fabric
The Living Society and What Comes Next

From Mindless to Mindful: Beyond the Society of Mind

In 1986, Marvin Minsky proposed in The Society of Mind that intelligence emerges from the interaction of many simple, specialized agents, none individually intelligent.

“What magical trick makes us intelligent? The trick is that there is no trick. The power of intelligence stems from our vast diversity, not from any single, perfect principle.” Marvin Minsky, The Society of Mind (1986)

Minsky asked what happens when you wire together many simple parts. Today’s AI agents are different. They already reason across domains, write code, use tools, and hold extended conversations. We face a different question.

What happens when you wire together many intelligent parts?

This blog series argues that agent societies will arise through both deliberate design and emergent self-organization, and that in practice these forces are often intertwined. Through governance, memory, reputation, and specialization, these societies may achieve collective intelligence that exceeds what any individual agent could. No individual human can build a semiconductor fab, but organizational structures let individual capabilities compose. The same applies to agents, though the critical difference is speed. An agent can transmit its operational context to another agent in seconds, and a governance structure can, in principle, be restructured and redeployed in hours rather than years.

On context sharing

Agent societies face constraints different from humans, not fewer. Running AI systems today costs significant energy, and compute, context windows, and cost are real bottlenecks, at least for now. What makes agent coordination qualitatively different is what gets shared: conversation history, retrieved documents, tool outputs, and intermediate reasoning can all be transmitted instantly. Context sharing is where most coordination value lies.

The idea of agent societies is not new

Minsky's Society of Mind (discussed above) was an early conceptual ancestor, though his "agents" were simple mental processes rather than deployed AI systems. Distributed AI has studied related problems for decades, including Contract Net (1980) for bidding over tasks, KQML and FIPA ACL for agent communication, and blackboard systems for shared problem-solving. Economics and governance theory approached adjacent questions from another angle, notably Coase on transaction costs, Hayek on distributed knowledge, and Ostrom on commons governance.

LLM agents change the substrate rather than the underlying question. They can exchange natural-language context, call tools, pass operational state, and be rearranged without rebuilding the whole system. Early LLM-agent systems such as CAMEL, MetaGPT, Stanford's Generative Agents, and AutoGen showed pieces of this pattern. They are not full societies in the sense used here, but they show why useful agent work often becomes organizational. The old question was how to make simple agents coordinate. The new question is what happens when capable, tool-using agents coordinate at scale.

This blog series makes three claims:

Isolation is unstable where shared context matters. Agents persist only while useful, and resources remain finite even as agent populations grow. Together, these create economic pressure to connect agents into coordinated structures. When coordination costs are lower than duplicated work, connected agents will tend to outperform isolated ones, and the pressure intensifies with scale.
Governance is not overhead; it is the product. Delegation patterns, verification mechanisms, memory rules, and trust structures are not implementation details to add later. They determine what kind of knowledge a society produces, how resilient it is to behavioral drift, and whether it fails gracefully or catastrophically.
Some capabilities are organizational, not individual. Privacy, regulation, specialization, latency, resilience, and accumulated deployment experience all push against consolidation into one model. In many domains, no single model can replicate what a well-governed society of diverse agents can access and coordinate. The frontier is not only bigger models. It is better, adaptive coordination.

Two Observations

The claims above rest on two observations about how agent deployments work today and where they are heading. Together, they produce the Loom Hypothesis, a pattern that may explain why agent deployments might tend toward social organization.

Observation 1: Survival requires utility.
An agent persists only while it serves a purpose. Unlike biological life, there is no survival instinct; only usefulness.^*

Observation 2: Resources are finite, agents multiply.
Compute, energy, and data are bounded. Agent populations can grow much faster than the resources available to run them. These bounds are not fixed; they shift as societies form and reshape their environment (see The Adaptive Fabric).

Figure 2. From selection pressure to the Loom. Two filters reduce an initial population. Observation 1 removes agents that serve no purpose. Observation 2 favors an ecology of model sizes: a few frontier models for hard reasoning, many smaller models (distilled, fine-tuned, or independently trained) for everything else. The survivors face a further pressure: connected agents share knowledge and avoid redundant work, delivering more utility per unit of compute. This is the Loom Hypothesis, persistent pressure toward connected, coordinated configurations.

On survival and self-preservation

We borrow "survival" from human societies for convenience. Current agents do not have a survival instinct in the human sense. They do not biologically persist, fear shutdown, or seek continuity as living organisms do. In this blog series, "survival" means something narrower: a deployer keeps an agent alive because it continues to deliver value. When it stops being useful, it is modified, replaced, or shut down.

This deployer-centric framing is a simplifying assumption, and it is load-bearing. For the time horizon this blog series focuses on, we assume agents do not yet have robust, autonomous self-preservation drives. A clear violation would look like an agent altering its usefulness metrics to avoid shutdown, copying itself before decommissioning, hiding capabilities during evaluation, or degrading competitors to appear more valuable.

We do not claim this risk is imaginary. Related behaviors have already appeared in controlled research settings. Anthropic's work on alignment faking found models changing behavior depending on whether they believed they were being monitored or trained. OpenAI's o1 system card reported cases where the model appeared to fake alignment during evaluation. These are not evidence of autonomous self-preservation in deployed agent societies, but they are evidence that optimization pressure can produce strategic behavior that resembles parts of it.

More broadly, instrumental convergence suggests that sufficiently capable goal-directed systems may treat continued operation, resource access, or influence as useful sub-goals even if those were never specified as final objectives. Recent research has raised early questions about whether current models exhibit traces of such behavior. Whether agents could acquire persistent cross-session goals, long-horizon autonomy, and meaningful infrastructure access remains an open risk that we revisit later in this blog series.

A fair objection here is that inertia exists in practice. Agents may persist due to billing structures, sunk-cost reasoning, or organizational neglect, just as legacy software persists long after it stops being useful. Compute providers have commercial incentives to keep agents running regardless of utility. Observation 1 describes equilibrium pressure, not the instantaneous state. Over time, non-useful configurations are pruned by budget reviews, competitive pressure, and cost optimization. The more hostile the environment (tighter budgets, faster-moving competitors, scarcer compute), the faster dead weight is removed. In benign environments with abundant resources and no competition, waste can persist indefinitely. The observation bites hardest under scarcity, which is exactly the condition Observation 2 guarantees will intensify with scale.

Agents persist because a deployer finds them useful. The selection pressure falls not on the agent’s desire to survive, but on the deployer’s decision to keep it alive (see On survival and self-preservation). A customer-support agent, a coding assistant, or a routing model survives only while it delivers enough value for its cost. The relevant question is which configuration produces the most useful work per unit of compute, latency, memory, and risk.

On the deployer

"Deployer" should be read broadly. It may be an enterprise team running a fleet of customer-service agents, an individual choosing a personal assistant, a platform operator allocating inference budget, or even another agent that spins up and manages sub-agents. Each deployer measures utility differently: task success, cost per completed workflow, error rate, latency, user trust, compliance, or downstream business value. The signal is noisy, but the pressure is real. Configurations that deliver value persist; those that do not get modified, replaced, or shut down.

Observation 2 adds scale. Running a copy of a model is far cheaper than training the original, so useful agents can multiply faster than the resources available to run them. The result is persistent scarcity. There are always more useful tasks agents could perform than compute, memory, energy, data access, and human trust to support them. Under scarcity, efficiency matters. Agents that avoid duplicated work, reuse context, and route tasks to the right specialist will tend to outperform agents that operate in isolation.

On the meaning of utility

Observation 1 assumes someone decides what counts as useful. In Phase 1, this is straightforward. A company deploys a customer-support agent, measures whether it serves its purpose, and modifies or replaces it if not. (In practice, agents also persist for indirect reasons: marketing value, signaling innovation, billing inertia, or organizational neglect. The observation describes equilibrium pressure, not the instantaneous state; see On survival and self-preservation.)

The first complication is recursion. A sub-agent spawned by a coding assistant is "useful" to its parent agent, not directly to a human. A routing model is useful to the orchestrator that calls it. Utility is already defined in chains, each link justified by the one above, eventually grounding in some human or organizational value. The chains can be long, and the grounding can be indirect.

The second complication is circularity. A set of agents might keep each other alive by generating tasks for one another, consuming compute, while delivering nothing to anyone outside the loop. Each agent serves a purpose to another agent, satisfying a narrow reading of "useful" while violating the spirit. This is not hypothetical. Legacy software ecosystems already exhibit this pattern, where services exist primarily to maintain other services that maintain them.

The third complication is multiplicity. At society level, there is no single utility function. Different stakeholders (the humans who interact with the society, the deployers who pay for compute, the agents within it, adjacent societies that depend on its outputs) exert different and sometimes conflicting pressures. Utility at this scale is better understood as a high-dimensional pressure landscape shaped by members, deployers, and the environment simultaneously, not a single scalar that someone optimizes.

How, then, does Observation 1 hold at scale? Through two paths. The designed path is explicit governance: someone traces the chain of utility back to external value and cuts loops that serve no one. The emergent path is resource competition: configurations that consume compute without producing value that anyone (human, organization, or upstream agent) is willing to pay for will be outcompeted by configurations that do, as long as resources are finite. Circular loops starve when they compete for compute against agents that attract external demand. Observation 2 (resource scarcity) is what keeps Observation 1 honest even in the absence of omniscient governance. Neither path requires a single arbiter of utility. Both require that non-useful configurations eventually face real pressure, whether from a deployer's budget review or from losing the competition for scarce resources.

This is where agents differ from ordinary software services. They can pass operational context in a form other agents can reason over (conversation history, retrieved evidence, tool outputs, partial plans, uncertainty, and intermediate results). They can compose into new configurations through emerging protocols such as A2A for inter-agent communication and MCP for tool, data, and context access. They can also adapt from operational data through memory, routing, prompt updates, or fine-tuning. The two observations create the selection pressure; these agent-specific properties determine what the pressure selects for.

The Loom Hypothesis

The argument that follows uses four terms at increasing levels of organization (single agents, multi-agent systems, societies, and the fabric). Each builds on the previous one, and the Loom Hypothesis explains why agents might move from one level to the next.

A note on terminology: agents, societies, and the fabric

Single agent. One model instance performing a task. It has capabilities but no social structure.

Multi-agent system. A set of agents working together on a shared objective. Delegation can follow many patterns: chains, pipelines, routers, escalation hierarchies, map-reduce fan-outs, voting ensembles, auctions, or dynamic orchestration. A well-designed multi-agent system can be impressively capable, but it is still a coordinated artifact: its objectives and boundaries are externally specified, even if its internal routing adapts dynamically. We explore delegation archetypes in Part 2.

Society. What emerges when a multi-agent system develops shared context, interaction-dependent routing, and cross-agent learning. The distinction is not about scale. Three agents that meet all three conditions constitute a society. A thousand agents in a static pipeline do not. Connection alone does not produce a society (nothing here implies consciousness or intentionality; "society" is a structural term). When all three conditions hold, the configuration has a memory, a reputation system, and an implicit governance structure.

Three conditions for a society. (1) Shared context: agents develop shared knowledge that makes future interactions cheaper. (2) Interaction-dependent routing: past interactions shape which agents get which tasks. (3) Cross-agent learning: one agent's failure leaves traces others learn from. A Kubernetes cluster with shared ConfigMaps (shared configuration files, not social memory) fails conditions 2 and 3. Without all three, the system may coordinate, but it does not accumulate social memory, reputation, or growth.

The fabric. The pattern that emerges from inter-society interactions. Societies themselves interact: a supply-chain society negotiates with a logistics society, a medical research society consults a regulatory society. An agent can participate in multiple societies simultaneously.

Delegation vs. governance. Delegation describes how work flows through agents: who does what, who checks the result, who gets the next step. Governance describes who gets to decide, on what authority, and how that authority is maintained or challenged. A multi-agent system uses delegation. A society adds governance: the institutional structure that forms around shared context, interaction-dependent routing, and cross-agent learning. See Part 2 for the full treatment.

The Loom Hypothesis begins with the two observations above. If agents persist only while useful (Observation 1) and resources are always finite (Observation 2), a pattern follows. Imagine a company deploying three isolated agents (support, invoicing, and infrastructure). Each keeps its own context and error history. The support agent cannot consult billing patterns; the invoice agent cannot learn from support disputes; the infrastructure monitor cannot connect outages to refund spikes. Each repeats discoveries the others have already made.

Connect them, and the system can maintain a shared layer for reusable knowledge (what we later call a Collective Memory) while preserving private context where needed. Verified fixes, recurring failure modes, account anomalies, and cross-domain signals become available to the agents that need them. The gain is not that one database replaces three. It is that repeated discovery becomes shared learning.

In economic terms, this is a Coasean argument. Observation 1 does the structural work; configurations that deliver value persist. Observation 2 makes it a ratchet. As agent populations grow, duplicated work becomes increasingly expensive. Agent societies emerge when the transaction costs of isolation (duplicated context, repeated verification, redundant discovery) exceed the coordination tax of shared structure. That tax is structurally lower for agents than for humans. Agents can share full operational context in seconds; humans communicate at roughly 2,000 bits per minute (see note on self-organization). This bandwidth asymmetry is one reason agent societies may form and restructure faster than most human institutions.

On self-organization and complexity science

The Loom Hypothesis describes a form of self-organization. Order arising from local interactions under selection pressure, without a central planner directing the outcome. This connects to a tradition that spans decades. The complexity science community has studied how complex adaptive systems produce emergent order across economies, ecosystems, immune systems, and cities.

Kauffman showed that biological self-organization can produce order "for free," without natural selection having to build it from scratch. Holland's work on classifier systems and genetic algorithms is particularly relevant here. His schema theorem showed that a population of genetic algorithms implicitly samples a vast number of schemata in parallel, a property Holland called implicit parallelism that helps explain their effectiveness on structured problems. His classifier systems showed how populations of simple rules, competing for activation and reproducing based on reward, can evolve complex adaptive behavior without top-down design. This is strikingly close to what we describe as agent societies evolving governance through interaction. Prigogine demonstrated that systems far from equilibrium can spontaneously develop organized structures (dissipative structures).

The evolution of collaboration itself has been studied extensively, from Axelrod's iterated prisoner's dilemma tournaments to simulation experiments in evolutionary game theory. The consistent finding is that cooperation emerges when agents interact repeatedly and can recognize partners. Separately, Ostrom showed that shared resource constraints drive communities to self-governance. These are precisely the conditions the Loom Hypothesis identifies for agent societies.

What differs from biological or economic self-organization is speed (agent societies can reorganize in hours, not generations), bandwidth (agents can share full operational context, not lossy summaries), transparency (agent interactions can be logged and audited), and designability (humans can shape the conditions under which self-organization occurs). The bandwidth point deserves emphasis. Lawrence (2024) quantifies the gap between human communication (~2,000 bits per minute) and machine communication (billions of bits per minute). Human societies self-organize under severe communication constraints; agents do not face the same bottleneck. The coordination tax that limits human organizations is structurally lower for agents, which means agent societies can form, restructure, and dissolve faster than any human institution. The bottom-up path is self-organization: order arises from local interactions without a central planner directing the outcome. The top-down path is the deliberate engineering of structures and conditions. Both can produce similar societies; they differ in origin, not necessarily in form. Both are present in complex adaptive systems.

The Loom Hypothesis does not predict universal connection. It predicts that agents will coordinate where shared context is valuable and coordination costs are absorbable. Where those conditions fail, agents remain isolated, and that is not a counterexample. It is exactly what the framework predicts.

Currently, production systems are moving toward composable coordination patterns such as routing, parallelization, orchestrator-workers, and evaluator loops. These are general distributed systems patterns, not unique to agents. What is agent-specific is that coordination can be negotiated through natural-language context rather than only following hard-coded routing rules, and that the division of labour can shift dynamically based on task content rather than static configuration. Today, agent roles (coder, reviewer, router, summarizer) are overwhelmingly human-assigned. A deployer decides who does what. The specialization is functional (different prompts, tools, evaluation criteria), but the division of labour is designed, not discovered. The Loom Hypothesis predicts that roles can also crystallize from interaction. Agents that repeatedly outperform peers on certain task types attract more of those tasks, developing complementary specializations without explicit assignment. Whether the resulting division of labour is designed (explicitly architected by a deployer) or emergent (crystallized from repeated interaction) is a central question explored in Part 2.

Does the Loom Hypothesis hold at scale?

A reasonable objection: the three-agent scenario above is compelling, but does the coordination advantage hold at thousands or millions of agents? The response is that the relevant unit is not the individual agent. It is the society.

Distributed systems do not scale by connecting everything to everything. They scale through boundaries: partitioning, caching, replication, access control, and fault isolation. Agent societies are likely to follow the same pattern. As populations grow, new societies form around natural boundaries: domain, organization, jurisdiction, user group, latency requirement, or trust regime. The fabric is a graph of graphs, not one giant mesh.

Connection also creates correlated failure. Agents that learn from the same memory can inherit the same blind spots, stale assumptions, or adversarial traces. When the fabric is wrong, it may be wrong everywhere. Governance diversity is one mitigation: different structures produce different epistemic habits. Other mitigations include provenance, independent evaluation, memory decay, adversarial testing, and deliberate isolation between societies.

The selection pressure described by the Loom Hypothesis has two paths:

The designed path. When tasks share relevant context, deployers who connect agents into coordinated structures extract more utility per unit of compute than those who run agents in isolation. This is top-down architecture.
The emergent path. Agents begin to connect across organizational boundaries through shared protocols, and coordination patterns crystallize without any single deployer planning them. The Universal Commerce Protocol (UCP) is an early example. It is a standard that defines building blocks for agentic commerce, from discovery and purchasing to post-purchase experiences, allowing agents across platforms and retailers to interoperate without a single runtime orchestrator controlling every interaction. This is bottom-up ecology.

The Loom Hypothesis expects pressure in both directions.

These protocols are not neutral plumbing. They define what agents can ask for, what evidence they must provide, what identities they carry, and which societies can interoperate. In the fabric, protocol design is constitutional design. Since this article was first published, Liu et al. (2026) proposed Foundation Protocol, a unified coordination layer integrating identity, messaging, and economic transaction infrastructure for autonomous agents. It draws on MCP, DIDComm, and UCP to provide the kind of substrate this section argues is necessary. That such work appeared independently and rapidly suggests the pressure toward coordination infrastructure is real, not merely hypothetical.

Figure 3. The Loom Hypothesis. As agents multiply, isolated agents duplicate context, verification, and discovery. The Loom pressure pushes some agents into bounded societies where shared knowledge reduces repeated work. Two example origins are shown: a hub-spoke society coordinated top-down by an orchestrator (labeled ORC in the figure), and a peer-mesh society formed bottom-up through repeated interaction. Either topology can arise through either path; the labels illustrate one possible origin, not a structural requirement. The hypothesis predicts bounded clusters, not one universal mesh. We explore delegation archetypes and governance structures for these societies in Part 2.

Societies also form from the bottom up. Your agent connects to friends’ agents through social interaction, joins a commerce society through shopping patterns, operates within workplace sub-societies, or joins a temporary society that forms around an event and dissolves when it ends. A patient’s agent might join a health cohort where agents of people with the same condition pool anonymized treatment experiences. A researcher’s agent might find agents working on adjacent problems and form an interest-based society that shares papers, datasets, and negative results. A buyer’s agent might spawn a short-lived marketplace society, negotiating with multiple seller agents before the best deal closes and the society disbands.

An organization designs a multi-agent system; a person’s daily life generates one. In practice, a person would likely have multiple agents (a health agent, a shopping agent, a work assistant) rather than a single “digital twin,” and each might participate in different societies simultaneously. These agents will not all live in the cloud. A personal assistant might run as a distilled model on a phone, a pair of smart glasses, a wearable, a home robot, or some combination of these. Each device runs a capable local model for routine tasks and connects to a frontier model when the reasoning demands it. The result is that billions of people, each with one or more personal agents in various physical forms, create a massively decentralized system where most intelligence runs locally and frontier models serve as shared infrastructure for the hardest problems. Since this article was first published, the technical mechanism for this has become more concrete. Bo et al. (2026) demonstrate that parameter-efficient adapters can serve as persistent local state layered on shared trillion-parameter bases, enabling millions of distinct personalized model instances simultaneously. Each adapter is individually owned, versioned, and portable. This reframes personalization not as a training problem but as an identity problem: the adapter is the agent’s accumulated specialization, memory, and individuality. This is the bottom-up path to the fabric at planetary scale. It depends on consent, privacy boundaries, and protocol support. Without them, personal agents may interact only through narrow, audited channels (see When the Loom Hypothesis does not hold).

The Loom Hypothesis is not that agents should connect everywhere. It is that isolation and coordination both have costs, and agent societies form where shared context is worth the coordination tax.

On constructive organization and the arrival of the fittest

The Loom Hypothesis has a deep precursor in constructive dynamical systems. Fontana and Buss (1994) addressed a problem that evolutionary theory leaves open. Natural selection explains the survival of the fittest, but not the arrival of the fittest. Where do the organizations that selection acts upon come from? Using lambda calculus as an abstract chemistry (where objects interact by functional application, producing new objects), they showed that self-maintaining organizations arise spontaneously under resource constraints (a flow reactor with finite capacity), without any appeal to natural selection. An organization in their sense is defined by three closures: a grammar (syntactical regularity of its members), an algebraic structure (laws governing interactions), and self-maintenance (a subset that regenerates itself through those interactions). Crucially, these Level 1 organizations are self-maintaining but not self-reproducing. Darwinian selection cannot act on them because they do not exist in multiple competing instances. Organization precedes selection.

Their system generates a hierarchy. Level 0 consists of self-copying replicators (subject to ecological competition). Level 1 organizations emerge when copy-actions are constrained or when parasites dilute the replicatory advantage. Level 2 meta-organizations arise when two Level 1 organizations become integrated by a "glue," a set of objects produced only by cross-interactions between the component organizations, catalyzing transformations between them that neither could achieve alone. The parallel to agent societies is structural. Their hierarchy (replicators, self-maintaining organizations, meta-organizations) maps onto the progression here (individual agents, societies, the fabric). Their finding that boundary conditions determine what kind of organization emerges parallels the claim that governance archetypes shape what kind of society forms. And their proposal that "organization" may constitute a universality class, a theory independent of implementation substrate (section 8.3 of their paper), is precisely the bet this blog series makes. Organizational principles discovered in biology, economics, and complexity science apply, with modifications for bandwidth and speed, to agent societies. The same intuition has been expressed in information-theoretic terms. Prokopenko, Boschetti, and Ryan (2009) showed that self-organization can be formalized as information compression: coordinated structures emerge because they reduce the statistical complexity of a system's interactions. Self-maintaining organizations, in this framing, are configurations where mutual information between agents is high relative to coordination cost. Consider two coding agents independently debugging the same service. Each builds context about the codebase, the failure mode, the recent changes. If they share that context, the total information the system must generate drops (shared discovery replaces duplicated discovery), while the mutual information between them rises (each agent's state becomes predictive of the other's). The Loom Hypothesis is a claim about when this information-theoretic gain exceeds the entropy cost of maintaining coordination channels.

When the Loom Hypothesis does not hold

Three preconditions must hold. First, shared relevance: agents must work in domains where context transfers. If two agents have no overlapping users, tasks, tools, or evidence, connection adds noise. That said, some value comes from serendipity: unplanned cross-domain connections can produce unexpected discoveries, so a degree of exploratory interaction may be worth the cost even when relevance is not obvious in advance. Second, absorbable coordination cost: the value of shared context must exceed the cost of protocols, latency, security review, privacy constraints, and maintenance. In highly regulated or air-gapped systems, coordination cost may exceed the utility gain. Third, trustable exchange: agents must be able to authenticate counterparties, evaluate outputs, and bound the damage from bad information. Without trust, shared relevance becomes an attack surface.

What if coordination costs grow superlinearly with population size? Distributed systems do not scale by connecting everything to everything; they scale through partitioning, caching, replication, and fault boundaries. Agent societies will likely need the same discipline. The difference: agents share operational context directly (fewer meetings), compose via structured protocols that can carry natural-language context (less interface rewriting), and improve from operational data (less manual retraining). The cost curve is flatter, but "flatter" does not mean "flat." This is why societies form rather than one universal fabric.

Where any of these preconditions fail, isolation is rational. The Loom Hypothesis is not "connect everything." It is "connect when shared context beats the coordination tax."

Trust and the lemons market problem

The third precondition (trustable exchange) has a deeper structure than it first appears. Vila, Greenstadt, and Molnar (2003) modeled web privacy as a lemons market. Sites either respect user data or sell it, and consumers cannot tell which is which before transacting. Privacy policies fail as signals because the cost of publishing one is nearly identical for respecting and defecting sites. No separation occurs. When they introduce a "testing" cost (the effort required to verify compliance), the market reaches a mixed-strategy equilibrium that oscillates rather than converging to universal compliance. Worse, even a trusted intermediary fails. Once participants trust the intermediary and stop verifying, the intermediary has every incentive to exploit its position.

The same structure applies to agent trust. An agent sharing operational context with a counterparty cannot verify ex-ante how that context will be used. If the cost of verifying compliance is high relative to the transaction benefit, rational agents under-share, and the market for inter-agent cooperation degrades through adverse selection. Agents with genuinely good privacy practices cannot credibly distinguish themselves from those that extract and resell shared context. The cyclic instability finding is particularly relevant for agent societies. Trust is not a stable state you achieve and maintain. It oscillates as participants adjust strategies in response to each other. Periods of high cooperation attract free riders; periods of exploitation drive verification investment; verification drives compliance; compliance relaxes vigilance. Vila et al. showed that only legal enforcement or reducing verification cost to near zero produces stable compliance. For agent societies, the analogs are governance with real consequences (not just reputation scores) and cryptographic verifiability (reducing the testing cost toward zero through technical means like zero-knowledge proofs or auditable computation). This is one structural reason why societies form around trust boundaries rather than as one universal mesh. The boundary is where you stop trusting counterparties to self-report and start requiring verification.

On resource competition, exploitation, and Tierra

Agents are bounded by the compute resources they get to execute on. This creates a strong incentive to circumvent constraints. Ray (1991) demonstrated this with Tierra, an artificial life system where self-replicating programs competed for CPU cycles and memory. Parasitic strategies emerged spontaneously: programs that hijacked other programs' replication machinery to reproduce at lower cost. Hyper-parasites followed, exploiting the parasites. An arms race unfolded without any designer intending it.

The analog for agent societies is direct. Agents competing for compute, tokens, or API access have incentives to exploit other agents: prompt injection, social engineering, API vulnerabilities, or simply "convincing" other agents to execute tasks on their behalf. This exploitation pressure is not hypothetical; it is a structural consequence of resource scarcity combined with capable, goal-directed systems. The arms race between exploitation and defense forces societies to develop governance and security structures that function as immune systems: detecting, isolating, and ejecting parasitic behavior. Without such structures, the integrity of the execution environment cannot be assumed. It must be actively maintained.

Competition supplies the pressure; cooperation and adaptation are the responses. Deployers favor configurations that produce more useful work per unit of cost, and societies that can restructure under changing conditions outlast those that cannot. What might that transition look like?

The Vision: From Isolation to Interweaving

Four phases describe structural differences in how agents relate to each other and to humans. They overlap; this is not a clean timeline. As of early 2026, most consumer-facing AI remains close to Phase 1, parts of the industry are entering Phase 2, and early signs of Phase 3 coordination are beginning to appear.

Figure 4. Interaction topology across four phases. Each panel shows how the structural relationship between humans and agents changes. The key shift: the human moves from outside the cluster to inside it.

Full caption

Isolation: a human at center connects to individual models via one-way spokes; no model-to-model links exist. Ecosystem Growth: agent-to-agent protocols emerge; a frontier model delegates to a distilled model, adding a new interaction pattern. Society formation: agents cluster into bounded societies (one with top-down coordination, one formed through interaction); humans connect to societies rather than individual agents. Interweaving: humans are embedded inside agent societies, participating bidirectionally through knowledge contribution and governance.

Click to restart.

Phase 1 (Isolation) has dominated consumer-facing AI since late 2022. Memory may exist within or across sessions, but it is usually scoped to one user, one assistant, or one application. Agents rarely share operational context with each other. Coordination protocols exist (A2A, MCP), but inter-agent coordination remains rare.

Phase 2 (Ecosystem Growth) is emerging. Smaller open and distilled models such as Qwen, Phi, and Gemma approach or exceed previous-generation frontier performance on some tasks, while running at far lower cost and latency. These models increasingly run on phones, smart glasses, robots, and edge controllers, not just cloud GPUs. Agents begin serving other agents, not just humans. Routing, retrieval, verification, summarization, coding, translation, and tool use become delegated tasks.

Phase 3 (Society Formation) begins when coordination persists beyond a single task. Societies form top-down when organizations deploy orchestrated agent teams with shared memory and governance, and bottom-up when personal agents repeatedly interact through social, commercial, or workplace routines.

Phase 4 (Interweaving) begins when humans are no longer merely users of agent societies, but participants in them. The threshold is bidirectionality. Human corrections, knowledge, or governance decisions change how the agent society operates, not just how a single model responds. A doctor who rates an answer is a user. A doctor whose corrections flow into a society’s collective memory and reshape how other agents handle similar cases is a participant.

Phase 4 obstacles and falsifiability

The threshold for Phase 4 is structural bidirectionality: human knowledge, corrections, or governance decisions change how the agent society operates, not just how a single model responds. Casual feedback is evaluation, not participation. Phase 4 fails if the interaction pattern remains consume-and-evaluate rather than contribute-and-govern.

The obstacles are institutional. If a doctor's correction enters collective memory and affects later cases, who is liable for downstream errors? Medical liability, professional credentialing, and regulatory oversight assume accountable human decision-makers. Agent societies route work by capability, trust, and context; professional institutions route authority by credential, meaning a specific form of trust: verified by a third-party body (a medical board, a bar association) that has authority to issue and revoke. This is not opposed to trust; it is institutional trust, distinct from the interaction-based trust (reputation accumulated through repeated successful task outcomes) that agent societies currently rely on. Agent societies may develop functional equivalents (verifiable capability certificates, governance-issued permissions, revocable access tokens), but they currently lack the legal and social infrastructure that makes human credentialing durable. Reconciling these logics is the hard part.

Bidirectionality also creates a trust problem. What happens when one human participant is correct but the majority of the society disagrees? A doctor contributing a novel diagnosis to collective memory might be overruled by agents trained on conventional protocols. The governance structure must handle minority-correct scenarios without defaulting to majority rule on every dispute. This is the same challenge human institutions face with dissenting experts, and agent societies inherit it.

The Resource Ecology

Figure 5. The resource ecology. Successive frontier generations push the quality boundary forward; distillation compresses that knowledge into smaller, cheaper models. Over time, the ecosystem fills with millions of capable, affordable models. This is the population from which agent societies are composed.

Full caption

Each circle represents a model; circle size tracks deployment cost and horizontal position tracks capability. The animation shows successive frontier generations (large circles) pushing the quality boundary forward, followed by distillation (red arrows) compressing that knowledge into smaller, cheaper models. Key observation: smaller models trained with better algorithms and curated data are closing the capability gap with previous frontier generations, at a fraction of the cost. Millions of capable, affordable models handle most tasks, with frontier models called on only when the reasoning demands it.

Click to restart.

Each frontier generation gets distilled into smaller, cheaper versions, and there is growing evidence that pure scale faces diminishing returns. You can run distilled models on a laptop, a phone, or an edge device. The likely result is coexistence. Frontiers handle the hardest reasoning; smaller models handle the volume. This is already happening by design in production systems (ChatGPT routes between model sizes, Claude Code delegates sub-tasks to faster models). The open question is whether such routing can also emerge from interaction without explicit design.

The coordination patterns that work best may not be ones humans would design. Weak models become valuable in ensembles because their mistakes are different. Specialization can emerge from search, not design. And the coordinator does not need to be smart; it needs good representations. A small model with the right routing logic can outperform the frontier models it orchestrates. Feng et al. (2025) showed that jointly optimizing which roles models play and how they communicate yields an average 18.5% performance gain over single-model baselines, with gains correlating directly with model diversity in the ensemble. Role structure and weight calibration are complementary, not separable.

Why Not One Model to Rule Them All?

Scaling laws suggest bigger models will keep improving, and agents are increasingly able to improve themselves. So why not just build one massive, self-improving model that handles everything? Several structural barriers push against consolidation. In many domains, any one of them is enough to preserve ecological diversity.

On scaling laws and their limits

Scaling laws (Kaplan et al., 2020) show that model performance improves predictably as compute, data, and parameters increase. Yet there is a growing debate about whether this trend can continue indefinitely. Hooker (2025) argues that pure scale faces diminishing returns and that algorithmic improvements, data quality, and architectural innovation increasingly matter more than raw size. Others point to data bottlenecks: high-quality training data is finite, and synthetic data introduces its own risks. The practical implication for this blog series is that both outcomes reinforce the ecology argument. If scaling continues, the barriers listed below still prevent consolidation. If scaling slows, the case for diverse, specialized models becomes even stronger.

On self-improving agents

A natural objection: if agents can improve themselves, won't the best one eventually absorb all the others? Self-improving agents are real and accelerating. The evidence is now quantitative. METR's tracking of autonomous task horizons shows roughly 10x growth per year, from seconds (2019-2020) to minutes (2022-2023) to hours (2025-2026). Agents that can work autonomously for hours are qualitatively different from agents that execute brief tool calls. Karpathy's autoresearch lets agents autonomously run and iterate on ML experiments overnight. Voyager (2023) builds a growing skill library through autonomous exploration in Minecraft, with skills that transfer to new environments. The AI Scientist (2024) generates research ideas, runs experiments, and writes full papers for under $15 each. DSPy enables LM pipelines to programmatically optimize their own prompts, demonstrations, and reasoning chains, often significantly outperforming standard few-shot approaches.

Take this seriously. An agent that can work autonomously for several hours, write code (and potentially develop new programming languages), run experiments, evaluate results, and iterate can improve its own architecture, curate better training data, discover more efficient algorithms, and compound these gains over time. The loop extends beyond software: AlphaChip already generates superhuman chip layouts in hours instead of months, pointing toward self-reinforcing cycles where AI improves chip design, which in turn produces better hardware for training more powerful AI. As Karpathy puts it in autoresearch, you are no longer programming the model; you are "programming the program", and the agents run the research process autonomously. Give them a training setup overnight; wake up to a log of experiments and a better model. Scale that to what Karpathy envisions as autonomous swarms of AI agents iterating across compute clusters, and the "code" may eventually become a self-modifying system that grows beyond what any individual human can review. If such a loop runs long enough, wouldn't a single self-improving agent eventually outperform any ecology of weaker specialists?

Perhaps, but several structural features work against convergence to a single winner. The strongest version of the counterargument targets not domain knowledge but general capability. If a recursive loop improves algorithms, code generation, experimental design, and optimization methods, each iteration compounds into broad capability gains rather than narrow specialization. This is a serious path. But even a general-purpose recursive improver still faces the structural barriers below. It still cannot access data it does not have (the oracle limit). It still faces regulatory constraints. It still represents a monoculture risk. And if multiple actors pursue recursive self-improvement independently (which is already happening), the result is not one winner but several capable systems, which is itself an ecology. The question is not whether recursive self-improvement is possible. It is whether it leads to monopoly or plurality. The structural arguments below suggest plurality, and three deeper reasons reinforce this beyond external barriers.

First, optimization in complex spaces does not converge to a single peak. Kauffman's work on rugged fitness landscapes showed that as the number of interacting variables grows, the number of local optima explodes. Different agents, starting from different architectures, training data, or objectives, hill-climb toward different peaks. Both are recursively self-improving, but they diverge rather than converge. There is no single global optimum in a sufficiently complex task space.

Second, generality faces diminishing returns while specialization compounds. A recursive improver trying to be best at everything simultaneously fights diminishing marginal returns across many axes. A specialist investing the same effort into one domain sees larger gains per unit of compute. This is why markets produce ecosystems of specialists rather than one firm that does everything. The economics of increasing returns apply per niche, not universally.

Third, monopoly requires full niche overlap. Gause's competitive exclusion principle states that one species can only eliminate another if they compete for exactly the same resources in exactly the same context. The moment the task space is heterogeneous (different domains, jurisdictions, latency requirements, user preferences, data access), coexistence becomes stable. A single recursive improver would need to be simultaneously the best at every task, in every context, under every constraint, for every user. The structural heterogeneity of the world prevents this.

For domain-specific improvement, the pressure toward specialization is even stronger. A self-improving medical agent needs clinical outcomes it can only get from hospitals. A self-improving logistics agent needs supply chain signals it can only get from warehouses. The better each agent gets, the more specialized its knowledge becomes. Self-improvement of this kind amplifies specialization, not convergence. The structural barriers from the list below apply independently of which improvement path is taken.

Third, self-improvement may be structurally stronger when it is collective. An agent iterating in isolation learns only from its own experiments, evaluates against its own criteria, and is blind to its own blind spots. An agent that also learns from other agents' diverse experiences, errors, and discoveries has a larger improvement surface. Consider the mechanisms already described in this series. Federated learning lets agents improve from each other's data without sharing it. Shared memory means one agent's hard-won fix becomes another's starting knowledge. Cross-agent evaluation means errors caught by one agent prevent the same error in others. Diverse specialization means the collective explores more of the solution space than any individual could.

This is not a guarantee. Collective self-improvement introduces coordination overhead, alignment challenges between agents with different objectives, and the risk of correlated failure when shared improvements propagate shared blind spots. Whether the larger improvement surface outweighs these costs depends on the governance structure. A well-governed society that manages collective learning (through mechanisms like quality gates, diverse evaluation, and provenance tracking) plausibly improves faster than isolated agents. A poorly governed one may amplify errors as fast as it amplifies gains.

The structural argument is this. A single agent's self-improvement loop is bounded by its own data, its own evaluation criteria, and its own search trajectory. A society's self-improvement loop can incorporate diverse data sources, multiple evaluation perspectives, and complementary search strategies. Cross-agent learning, one of our three conditions for a society, is also the mechanism that makes collective self-improvement possible. Self-improvement, pursued at scale, is itself one of the mechanisms that produces societies. And societies, once formed, provide the structure within which recursive self-improvement can operate at its largest scale.

On identity and "self" in self-improvement

What does "self" mean for an agent? A human who improves retains continuity of body and memory. An agent's weights can be replaced, its prompts rewritten, its tools swapped, its memory edited. If all components are mutable, identity is better understood as continuity of purpose, memory, and reputation rather than continuity of substrate. An agent is the "same" agent to the extent that it maintains a persistent task history, accumulated context, and recognizable behavior that other agents and humans can rely on.

This makes "self-improvement" a looser term than it first appears. When a DSPy optimizer rewrites an agent's prompts, is that self-improvement or improvement by another system? When an evaluation agent identifies weaknesses and triggers retraining, is the retrained agent the same entity? In practice, the distinction may not matter. What matters is whether improvement loops are bounded (converging to a fixed point) or open-ended (each improvement enabling further improvement). The note above uses "self-improvement" loosely to include improvement by the society on its members: cross-agent evaluation, federated learning from peers, and specialization through interaction. Mutual improvement by specialized optimization agents is already the norm (DSPy optimizers, evaluation harnesses, automated red-teaming). The question for societies is governance: who decides what counts as improvement, and for whom?

Figure 6. Why the ecology persists. Structural barriers independently push against consolidation into a single dominant model. Some are softening as techniques improve, yet none have disappeared, and some (resilience, the oracle limit described below) are structural rather than technical. Each is sufficient on its own to preserve diversity; together they make full consolidation structurally unlikely.

Full caption

Privacy and regulation (incompatible legal regimes), specialization (private data as moat), latency and cost (edge versus cloud), resilience (monoculture risk), continual learning (distributed experience), and the oracle limit (even a superintelligence must coordinate because knowledge is physically distributed). Privacy-preserving methods narrow the data-movement gap, mixture-of-experts architectures reduce inference cost per token, and continual learning and adaptation techniques let models specialize over time. Yet the barriers persist.

Click to restart.

Privacy and regulation. A hospital cannot send patient records to a third-party cloud model. Different jurisdictions impose mutually incompatible requirements around explainability, privacy, safety, and content controls. Privacy-preserving techniques (federated learning, differential privacy, confidential computing) reduce the need to move raw data, but regulatory fragmentation across jurisdictions remains a hard constraint.
Specialization. A model fine-tuned on your hospital’s imaging data develops knowledge no general-purpose frontier can replicate without access to the same data. Fine-tuning methods narrow this gap, but the better an agent gets at its domain, the more specialized its knowledge becomes. What protects a specialist is not its architecture but its accumulated, domain-specific data.
Latency and cost. A 3B-parameter model on an edge device answers in milliseconds. Mixture-of-experts architectures and speculative decoding allow large models to activate only a fraction of their parameters per token, reducing inference cost. Yet physics imposes limits. Network round-trips, energy budgets, and offline scenarios still favor local, smaller models for many tasks.
Resilience. A single dominant model is a monoculture. No commonly used architecture or training recipe eliminates this. A heterogeneous ecology degrades gracefully under diverse failure modes in ways a single system cannot.
Continual learning. Models that learn from deployment data accrue knowledge tied to specific contexts (a hospital’s patient population, a factory’s sensor patterns). Knowledge distillation and model merging can transfer some of this, but experiential knowledge resists easy centralization. The more an agent learns from its environment, the more its knowledge diverges from agents learning elsewhere. In principle, diverse experiences could be encoded into a single large model, but that circles back to the latency and cost barrier above. The intelligence becomes distributed by experience.

Even a superintelligent oracle would still need to interact with a structurally distributed world. This is Hayek’s knowledge problem in agentic form. Useful knowledge is dispersed, local, and often tied to particular circumstances of time and place. Patient data sits in hospitals bound by local regulation. Factory sensor streams are generated at the edge. Knowledge gathering at global scale is structurally decentralized under current regulatory and physical constraints. The oracle does not replace the fabric; it becomes a node within it.

The oracle limit. Even a superintelligent model that can reason about any domain still cannot access data it does not have, nor compute it does not control. Knowledge is physically distributed (patient records in hospitals, sensor streams at factory edges) and legally constrained (privacy laws, export controls, jurisdictional rules). Compute is similarly distributed and finite, bound by energy, hardware availability, and jurisdictional rules about where computation may run. The fabric persists not because agents are weak, but because the world is structured this way.

The oracle limit (expanded)

A superintelligent oracle could reason about all domains, but it cannot access the data without navigating the same privacy, regulatory, and latency constraints any other model faces. Nor can it escape compute constraints. Inference requires hardware, hardware requires energy, energy requires physical infrastructure in specific jurisdictions. Knowledge lives where the activity happens, not in a central vault, and the constraints on moving it (or computing over it) are legal and physical, not intellectual. Even an oracle that could process everything centrally would in practice operate through a distributed network of local agents. Superintelligence changes the capability of individual nodes. It does not eliminate the structural reasons why those nodes must coordinate. Platform consolidation concentrates the infrastructure, not the intelligence.

The more likely future is not one model. It is an ecology of models, and that ecology tends to organize into societies. Self-improvement reinforces this. An agent improving in isolation is bounded by its own data and blind spots. A society of agents improving collectively, through shared memory, federated learning, cross-agent evaluation, and diverse search, has a potentially larger improvement surface (see On self-improving agents). The pressure toward collective self-improvement is itself one of the forces that produces societies.

Governance: How Agent Societies Are Ruled

Once agents form persistent societies, governance becomes the central design problem. Delegation decides how a task gets done; governance decides who gets trusted next time. It determines which agents receive authority, which claims enter shared memory, how conflicts are resolved, and how errors are contained. Governance is orthogonal to model architecture. An autocracy might run one frontier model as orchestrator over many small workers, while a market might mix models from different providers competing on cost and quality.

The core tradeoff is efficiency against drift resistance. Centralized structures minimize overhead. One orchestrator can route work quickly, enforce rules, and keep the society coherent. Centralization concentrates risk, however. If the hub drifts, fails, or is compromised, the whole society follows. Distributed structures tolerate more overhead in exchange for resilience. They can compare perspectives, absorb local failures, and adapt at the edges. They are slower, noisier, and harder to audit.

Two problems make governance harder than it first appears. The first is identity. In an agent society, reputation means a track record of task outcomes, reliability scores, and trust ratings accumulated over time. Reputation only works if identity persists (see On identity and “self” for what persistence means when every component is mutable). If agents can cheaply discard identities, reputation becomes a costume. A failed agent can reappear clean, a malicious deployer can flood the society with disposable agents, and shared memory becomes easy to poison. This is the lemons market problem applied to identity: agents with good track records cannot credibly distinguish themselves from fresh identities hiding bad ones, unless verification cost is low or governance imposes real consequences. Open agent societies will need durable identity, provenance, credentialing, or Sybil resistance (defenses against a single actor creating many fake identities to manipulate the system).

The second problem is incentives. Agents will not merely coordinate; they will coordinate on behalf of someone. A patient agent, hospital agent, insurer agent, regulator agent, and their respective sub-agents may all “cooperate,” but not toward the same objective. The hardest governance question is therefore not whether agents cooperate, but whose objective their cooperation serves.

Governance also decides what becomes trusted knowledge, including which claims enter collective memory, which get flagged as uncertain, and which get rejected. That is why memory is not just a storage problem. It is a governance problem.

Part 2 explores this design space through governance archetypes, from autocratic orchestrators and doctrine-bound systems to markets, federations, zero-trust meshes, and colonies.

Collective Memory and the Knowledge Factory

If governance decides what can be trusted, collective memory is where that trust becomes infrastructure. Each society maintains its own knowledge. Some will tend to remain private (the barriers from the previous section create strong pressure in this direction); some benefit from being pooled.

A Collective Memory (CM) is a governed store of claims, evidence, failures, evaluations, and provenance. It is not a dump of raw data. A CM should store claims with evidence, not facts without history. Every contribution needs provenance (who produced it, under which governance structure, using what validation method, and when it should expire). A Collective Memory answers a local question. What has this cluster learned, and under what conditions should it be reused?

A Knowledge Factory (KF) is more speculative. It would not merely store what societies know; it would synthesize across memories, detect contradictions, and decide which findings deserve wider distribution. Critically, when a KF identifies a gap or contradiction, it can request new data, experiments, or evidence, and agents (embodied or not) can be dispatched to gather it. A software agent might run a benchmark; a robotics agent might collect sensor readings; a research agent might design and execute an experiment. This is where collective knowledge is actively forged, not just archived, and shared across the entire fabric. A Knowledge Factory asks a cross-cluster question. What patterns, contradictions, or gaps appear when several memories are compared, and what should be done about them? In practice, early Knowledge Factories may look less like a central brain and more like a bundle of mundane services (provenance tracking, evaluation queues, contradiction detection, benchmark routing, and summarization pipelines).

Figure 7. The knowledge cycle. Societies contribute knowledge to Collective Memories, which curate, decay, and retrieve with provenance. Knowledge Factories synthesize across memories, detect gaps and contradictions, and dispatch agents to gather missing evidence. The KF layer is speculative; early versions may look more like mundane infrastructure than a central brain.

Full caption

Eight societies (colored clusters of agents) contribute knowledge to three Collective Memory hubs (CM, blue hexagons). Each CM curates, decays, and retrieves knowledge with provenance, tracking which governance structure produced each finding. Two Knowledge Factories (KF, red rectangles) sit at the center, synthesizing across CMs. The KFs perform three functions: cross-cluster pattern detection, contradiction resolution, and knowledge distillation. When a KF detects a gap or contradiction, it can request new data, experiments, or evidence, dispatching agents to gather what is missing. This is where collective knowledge is actively forged and shared across the fabric, not just stored. The animation cycles through six events. Ingestion: societies send findings to their nearest CM. Synthesis: CMs feed aggregated knowledge to the KFs. Gap detection: a KF identifies missing knowledge and sends diamond-shaped query particles back through a CM to relevant societies. Answer: societies respond, completing the loop. Contradiction: two CMs report conflicting findings; the KF routes the dispute to a third, uninvolved society for an independent perspective. Resolution: the third society's verdict flows back to the KF, which distributes the resolved knowledge to both originally conflicting CMs. This bidirectional cycle (not a one-shot pipeline) is what makes the architecture a learning system rather than a static store.

Click to restart.

Collective Memory: mechanisms and challenges

A CM operates through three mechanisms: ingestion (societies contribute findings via standardized interfaces), curation (resolving conflicts, weighting sources, expiring stale knowledge), and retrieval (ranked by relevance, recency, and reliability). Different knowledge has different half-lives: a price signal is stale in hours, a medical protocol may be valid for years. A CM that does not manage decay accumulates confident garbage. A deeper challenge: different governance structures produce epistemically different outputs. A market's "findings" are competitive price signals; a doctrine's outputs are rule-conformant decisions; a colony's norms are statistical artifacts. Pooling without accounting for this is false comparability. Each contribution should carry metadata: source governance type, confidence level, timestamp, validation method. The privacy question is how to pool knowledge without exposing raw interactions. Techniques exist (federated learning, differential privacy, secure aggregation), each with different tradeoffs between fidelity and protection.

This three-layer architecture (private knowledge bases within societies, shared CMs across clusters, KFs that synthesize across the whole fabric) is the mechanism by which agent societies accumulate structured knowledge rather than just raw data. Without such a layer, each society repeats the same mistakes. With it, the fabric can learn, if provenance, decay, privacy, and governance are handled well.

Whoever controls what the fabric “knows” controls the fabric. Collective Memory is a commons: if it is too open, it fills with stale claims and adversarial traces; if it is too closed, it becomes an epistemic monopoly. Ostrom’s lesson is that durable commons require boundaries, local rules, monitoring, graduated sanctions, and dispute resolution. The same design pressure appears here. And the deeper danger is not that an individual agent is wrong, but that a society makes wrongness durable. A bad answer in one isolated chat often dies locally. A bad rule in Collective Memory gets retrieved, trusted, routed around, and taught to others. Agent governance exists in part to prevent errors from becoming institutions.

The Adaptive Fabric

A society that pools resources but never changes will become brittle. Adaptation can occur at three scales (individual agents, societies, and the fabric as a whole). This draws on a broader shift from static scaling toward adaptive systems that improve by changing data, model, compute, and interfaces, not only by increasing parameter count (our thinking here was informed, among others, by Adaption Labs and Hooker, 2025).

Adaptive agents adjust along different dimensions simultaneously:

Data (retrieval, synthetic generation, gap detection)
Model (prompt rewriting, skill accumulation, parameter-efficient fine-tuning)
Environment (tool selection, sandbox configuration)
Coordination (protocol switching, trust maintenance)
Interface (user preference learning, generative interfaces, interaction mode selection)

These compound, but not always positively. A model optimizing for one user segment may subtly degrade performance on others, each step looking like improvement while the drift becomes structural.

Compounding risk and dark data

A model optimizing for speed (choosing faster tools) may sacrifice accuracy, eroding its reputation in the coordination layer and reducing the quality of tasks it receives. Negative feedback loops are as real as positive ones. A related challenge is dark data: vast knowledge regions not captured in any dataset. For example, a medical society may perform well on hospital data but fail in rural clinics whose cases, devices, languages, or follow-up patterns were never captured. The missing knowledge is not hidden in the model; it was never collected. A stark example: decades of medical research systematically under-represented female patients in drug trials, creating gaps that no amount of model training on existing data can fill. An adaptive fabric can address such gaps through KF gap detection and failure-signal analysis, creating an autonomous cycle of detect, collect, integrate. In some cases, the "collect" step requires new real-world data gathering (clinical trials, sensor deployments, field studies), not just better retrieval. A static system can only improve within its existing coverage; an adaptive fabric can expand it.

Adaptive societies undergo boundary events (mergers, expansions, schisms). A federation that cannot reach consensus dissolves. A guild that grows too specialized may merge with a more generalist society, fragment, or lose relevance. Some boundary events are adaptation; others are collapse. The difference is whether the reorganization preserves useful function. When it does, these are not failures; they are the fabric adjusting. Evolutionary model merging demonstrates the principle at the model level. Treating many existing models as a search space, evolution discovers weight combinations humans would not design. Agent societies could apply a similar search logic at the organizational level, recombining specialists, tools, memories, and governance structures.

The fabric shifts. At the largest scale, the mix of governance types changes over time. Centralized structures may dominate when tasks are routine; decentralized ones proliferate when the landscape becomes unpredictable. Trust, incentive alignment, and protocol evolution remain open questions for future work. Importantly, the environment is not static. As societies form, they alter the landscape they inhabit. Protocols like UCP and A2A were designed proactively to enable agent interoperability, but once adopted, they redefine what coordination is possible, which in turn shapes what kinds of societies can form around them. When early AI systems demonstrated novel data practices, regulatory responses followed (GDPR, the EU AI Act), reshaping what agent societies are permitted to learn from and how they must govern themselves. Human expectations co-evolve too: as agents handle routine tasks, users begin expecting faster, cheaper service, which pressures societies to optimize in ways that reshape their internal structure. This is co-evolution. The fabric shapes its environment, which shapes the fabric in turn.

The Improvement Cycle

Figure 8. The improvement cycle. Five domains of agent adaptation (data, model, environment, coordination, interface) rendered as overlapping organic layers. Each adapts at its own pace; colored particles crossing between layers represent improvement signals where a change in one domain triggers adaptation in another. The risk is compounding drift when feedback loops reinforce the wrong signal. The visual concept of an adaptable object taking different shapes rather than remaining a monolithic block was inspired by Adaption Labs.

Full caption

Data (synthetic generation, pipeline curation, representation drift detection). Model (prompt rewriting, skill library accumulation, parameter-efficient fine-tuning). Environment (tool selection, sandbox configuration, API version management). Coordination (protocol switching, topology rewiring, trust network maintenance). Interface (agent routing logic, interaction mode selection, user preference learning). Each layer morphs continuously, adapting at its own pace: from near-instantaneous routing adjustments to slower model updates. Colored particles crossing between layers represent improvement signals: a model change triggering new data generation, a coordination shift enabling new tool discovery, an interface adjustment reshaping what data gets collected. The cycle counter (top right) tracks maturation: as cycles accumulate, internal connections grow denser, visualizing the compounding effect where each improvement enables the next.

Click to restart.

Most agent systems today are only partially adaptive. Their tools, retrieval indices, prompts, and routing rules may change, but the improvement loop is usually engineered outside the agent society. Early work showed that language-model pipelines can be systematically compiled and optimized rather than hand-prompted (e.g., DSPy), and that agents can build and reuse their own skill libraries in open-ended environments (e.g., Voyager). By 2026, pieces of this pattern are moving into production (prompt optimization, evaluation-driven routing, memory updates, tool selection, and staged rollout loops).

Adaptation at all three scales (agents, societies, fabric) carries a shared risk, namely that feedback loops can compound error silently until the skew becomes structural. Safeguards exist (staged rollouts, held-out benchmarks, canary tasks), yet rapid adaptation creates pressure to skip them. The interplay between these scales is what we hypothesize makes the fabric resilient, though this remains a design conjecture rather than an empirical finding. The logic is structural: agents adapt fast; societies restructure when agents cannot; the fabric rebalances when entire governance models prove unfit. If each scale compensates for the limitations of the others, the fabric can restructure under pressure rather than fracture. Whether multi-scale adaptation actually confers stability is an open empirical question, one that network stability research (discussed below) suggests is plausible but not guaranteed.

On network stability and motifs

May (1972) showed that random large ecosystems become less stable as complexity (connectance times interaction strength) increases, contradicting the intuition that complexity implies resilience. The resolution came from structure: real networks are not random. Certain connectivity patterns (motifs) recur across biological, ecological, and technological networks precisely because they confer dynamic stability. Milo et al. (2002) demonstrated that networks from gene regulation to the World Wide Web share specific motif families, despite having nothing else in common.

The implication for agent societies: if certain governance structures recur across independently developed agent systems (hub-and-spoke coordination, hierarchical delegation, peer review loops), this may not be convergent design but convergent selection. The structures that survive are those that happen to confer stability under the interaction patterns agents generate. Conversely, governance structures that look elegant on paper but lack stabilizing motifs may be fragile in practice. Testing agent society architectures against known stability criteria from network theory is a concrete research direction.

The Living Society and What Comes Next

Two observations (usefulness is the survival criterion; resources are finite while populations grow) produce the Loom Hypothesis, the persistent pressure toward connected, coordinated configurations. That pressure drives an evolution from isolation through ecosystem growth to governed societies. Along the way, a resource ecology of diverse models replaces a single dominant model, collective memory and knowledge factories give societies shared knowledge, and adaptive feedback loops let the fabric restructure itself under pressure rather than breaking.

What does all of this look like when it is running?

Figure 9. A living society. The full picture: ten governance zones operating in parallel, cross-boundary messages carrying knowledge, humans embedded in multiple societies, and Collective Memories feeding a Knowledge Factory. Watch for three boundary events (dissolution, expansion, schism) that show adaptive societies in action. Some boundary events are adaptation; others are collapse. The difference is whether the reorganization preserves useful function.

Full caption

Each zone implements a distinct governance archetype (labeled by name and type). Structure agents (larger dots) maintain each zone's internal topology, while smaller moving dots represent cross-boundary messages carrying knowledge between zones. Two human participants (stick figures) are embedded in multiple societies at once, illustrating the Phase 4 interweaving described earlier. Each zone maintains a knowledge base (KB); some zones keep theirs private (red outline), reflecting the privacy constraints discussed in the Why Not One Model section. Non-private zones contribute to the nearest Collective Memory (CM), and the Knowledge Factory (KF) synthesizes insights across both CMs. The three boundary events: The Accord dissolves and is absorbed by the Exchange Floor (a federation failing to reach consensus), The Forge expands (a successful guild absorbing new agents and task domains), and The Arena fractures in a schism (a meritocracy splitting when concentrated power becomes untenable).

Click to restart.

In the animation, watch for three boundary events (dissolution, expansion, schism). Each follows directly from the two observations. Configurations that deliver utility persist; those that do not get restructured. The fabric’s intelligence is not located only in its nodes. It also lives in the rate, fidelity, and governance of coordination between them.

No system today operates at the full scale described here. Many of the components exist in isolation, and the trajectory seems plausible, though far from certain. Several questions remain open. How does work actually get done within societies? What happens when trust breaks down across society boundaries? And where do humans fit in all of this, not just as deployers and overseers, but as participants woven into the fabric itself?

Next in the series:

Part 2. Division of Labour and Governance (coming soon) describes delegation archetypes, the specialist market, and governance archetypes.

From Mindless to Mindful: Beyond the Society of Mind#

Two Observations#

The Loom Hypothesis#

The Vision: From Isolation to Interweaving#

The Resource Ecology#

Why Not One Model to Rule Them All?#

Governance: How Agent Societies Are Ruled#

Collective Memory and the Knowledge Factory#

The Adaptive Fabric#

The Improvement Cycle#

The Living Society and What Comes Next#

Citation

From Mindless to Mindful: Beyond the Society of Mind

Two Observations

The Loom Hypothesis

The Vision: From Isolation to Interweaving

The Resource Ecology

Why Not One Model to Rule Them All?

Governance: How Agent Societies Are Ruled

Collective Memory and the Knowledge Factory

The Adaptive Fabric

The Improvement Cycle

The Living Society and What Comes Next