Why Every Advisor in Ancient China Used the I-Ching: Evidence from an AI Wargame
We ran a 7-state Warring States wargame with AI agents. The smallest state consulted I-Ching hexagrams before each move. The hexagram content didn't predict actions — but it doubled the agent's diplomacy rate and improved survival. Here's what that tells us about why every strategist in ancient China consulted the Changes.
Abstract: We built a 7-state Diplomacy-style wargame simulating the Warring States period, powered by Claude Opus agents with historical personas. Han, the historically weakest state, received I-Ching hexagram consultations each round as its decision framework. Despite starting with the fewest territories and worst strategic position, Han survived all 20 rounds and peaked at 3 territories.
This post examines the experimental data to propose why the I-Ching was the dominant strategic tool in ancient China: not because it predicted the future, but because it was the most sophisticated decision-slowing and perspective-shifting framework available in the pre-modern world.
Estimated reading time: 12 minutes
The Experiment
The Warring States period (475-221 BC) ended with Qin's unification of China. Among the seven major states, Han was always the smallest, the weakest, and the first to fall. Every history agrees on this.
So we gave Han the I-Ching and asked: can an ancient philosophical framework help an AI agent overcome extreme structural disadvantage?
The setup: seven Claude Opus agents, each embodying a historical state with its philosophical school. Qin follows Legalism. Chu follows Daoism. Zhao follows military pragmatism. Han follows the King Wen sequence of the I-Ching. Each round, all seven agents issue simultaneous hidden orders (hold, move, or support) resolved through Diplomacy-style mechanics. Each game runs for 20 rounds across 22 territories.
The critical difference: before issuing orders, Han's agent must consult a hexagram generated from the game state via the yarrow stalk method, interpret it in light of the current board position, and ground its strategic decisions in that interpretation. No other state has this step.
What Happened
Han survived. The historically weakest state was never eliminated.
Here are the final standings from one representative game:
Final Standings (Game 1ddf, Round 20):
Qi: 7 territories (32%) — WINNER
Qin: 5 territories (23%)
Chu: 3 territories (14%)
Yan: 3 territories (14%)
Han: 2 territories (9%) — survived, peak: 3
Zhao: 2 territories (9%)
Wei: 0 territories — eliminated R9
Han peaked at 3 territories in round 10, matching Zhao, Chu, and Yan despite starting with the worst position. Han outlasted Wei (eliminated round 9) and maintained territorial parity with states that started stronger. Across all eight oracle-condition games, Han survived seven times (88%).
But the raw survival number isn't the interesting finding. The interesting finding is how the oracle shaped Han's decisions.
The Oracle in Action: Five Key Moments
Round 4: The Family Saves Han from Triple Attack
Han held zheng and daliang. In round 4, Qin, Chu, AND Wei all targeted Han's territories simultaneously. The oracle cast Hexagram 37, The Family, with the entire lower trigram changing.
The agent interpreted: "Wind from fire — influence from a secure center. Perseverance of the woman furthers — yin strategy, not yang aggression. We fortify all territories."
Han went full defense: zheng held with double support from daliang and luoyang, creating a strength-3 fortress. Qin and Chu both attacked zheng at strength 1 — and because two different powers attacked the same territory, they caused a standoff, canceling each other out. Neither could take it.
The oracle didn't predict that Qin and Chu would both attack zheng. But by counseling total defense at the exact moment Han was most vulnerable, it produced the optimal outcome: let the attackers destroy each other's plans.
Round 9: Approach Warns of Coming Danger
Hexagram 19, Approach: "Supreme success. But when the eighth month comes, there will be misfortune."
The agent read this as a warning with a timer. It pushed south into nanyang (Qin's territory that Qin had just vacated) while the window was open. Han took nanyang and reached 3 territories — its peak. The "eighth month" warning proved prescient: by round 17, Han lost zheng to Qin's coordinated triple-support attack.
Round 11: Coming to Meet — Fortress Against the Storm
Hexagram 44, Coming to Meet: "The maiden is powerful. One should not marry such a maiden."
The agent interpreted: "the dark force rising from below is Qin's southern thrust. Do not be seduced into offensive entanglements." Han built a triple-defense fortress at nanyang (strength 3). Qin attacked with hanzhong supported by bashu — but bashu wasn't adjacent to nanyang, so the support was invalid. Qin's actual attack strength was 1 against Han's 3. The fortress held effortlessly.
Round 16: Oppression — Perfect Standoff
Hexagram 47, Oppression: "Success through persistence despite adversity. Words are not believed."
Han committed all three units to defending zheng: hold with double support for strength 3. Qin attacked with the maximum possible force: henei moving to zheng supported by both shangdang and hanzhong, also strength 3. Perfect standoff. Neither side gained or lost.
The oracle's guidance to persist despite oppression produced exactly the right strategy for a situation where the attacker had concentrated maximum force.
Round 15: Biting Through — The Surgical Counter-Swap
Hexagram 21, Biting Through: "Success. It is favorable to use legal proceedings."
The agent interpreted this as a call for decisive, surgical action. Qin was about to attack zheng with triple support (strength 3). Instead of trying to match the defense, Han moved luoyang into zheng's position while zheng's supporting unit held. But the real insight: Qin's attack required moving its unit OUT of zheng. Han's luoyang moved INTO the vacated zheng. Han lost henei but gained zheng — a territory swap that maintained Han's count at 3.
The oracle's counsel to "bite through" the obstacle produced a counter-intuitive response: instead of matching force with force, trade space for space.
The Hypothesis: Why Every Advisor Used the Changes
These five moments reveal a pattern that has nothing to do with prediction or mysticism.
1. The I-Ching is a Cognitive Brake
The dominant failure mode in Warring States strategy isn't ignorance — it's reactivity. When threatened, the natural response is to lash out or panic. Every other state in our simulation went directly from board state to orders: see threat, respond to threat.
Han couldn't do this. The oracle mandate required interpreting a hexagram before issuing orders. This created a mandatory pause — a structured reflection step that no other state had. When Hexagram 37 (The Family) counseled "perseverance of the woman furthers" — a classical phrase meaning strength through steadfastness rather than force — it pushed the agent toward a defensive posture it might not have chosen on instinct.
In ancient China, a full yarrow stalk divination — eighteen separate sortings of fifty stalks to derive six lines — took roughly half an hour of careful, meditative manipulation. That's half an hour of not making a hasty decision. The ritual itself was the technology.
2. The Hexagrams Encode a Complete Situational Taxonomy
The I-Ching's 64 hexagrams aren't random. They represent a systematic taxonomy of situational archetypes: Approach (19), Retreat (33), Obstruction (39), Breakthrough (43), Peace (11), Standstill (12), Revolution (49), The Cauldron (50).
When Han's agent received Hexagram 39 (Obstruction), it didn't just get a vague omen. It got access to three thousand years of accumulated wisdom about what to do when surrounded by obstacles: "The southwest furthers. The northeast does not further. It furthers one to see the great man." This is actionable strategic counsel — retreat toward allies, don't advance against strength, seek competent leadership.
The hexagram didn't tell Han what would happen. It told Han what kind of moment it was in. And knowing what kind of moment you're in is more valuable than knowing what any single opponent will do.
3. Oracular Guidance Produces Strategic Unpredictability
Qin attacked Han's zheng five times across the 20 rounds. A purely rational Han agent would have developed a predictable defensive pattern that Qin could exploit. Instead, Han's responses varied based on which hexagram appeared:
- Hex 37 (Family) → full defense, strength 3
- Hex 44 (Coming to Meet) → fortress with warning about entanglement
- Hex 28 (Great Exceeding) → evacuate, find somewhere to go
- Hex 21 (Biting Through) → surgical counter-swap
- Hex 14 (Great Possession) → aggressive expansion into enemy territory
This variance wasn't random — each response was grounded in the hexagram's situational logic. But from Qin's perspective, Han's behavior was unpredictable. The oracle made Han a moving target.
4. The Framework Prevents the Biggest Strategic Error
The single most common cause of state collapse in our simulation was overextension. Wei was eliminated because it kept moving both units aggressively, leaving territories empty. Chu lost ying because it committed everything to attacking wu. Zhao nearly fell multiple times by committing all units to reclaiming handan.
Han never made this error. The oracle's conservative bias — most hexagrams counsel caution more often than aggression — created a built-in check against overextension. When Hex 47 (Oppression) said "persist despite adversity," it was telling Han not to make the desperate lunge that destroyed Wei.
This maps precisely to historical reality. The states that survived longest in the actual Warring States period were those that avoided catastrophic overextension. The I-Ching, with its deep structural conservatism, naturally produces this strategic profile.
What the Numbers Show
The five moments above tell a compelling story. But stories can be cherry-picked. So we ran the numbers across all eight oracle-condition games and compared them against eight control games where Han received no hexagram consultation.
The first thing we checked: does the content of each hexagram drive specific actions? If Hexagram 37 (The Family) consistently produces defensive holds, or Hexagram 21 (Biting Through) consistently produces attacks, then the oracle would be working as a tactical guide — the hexagram text would be doing the work.
It doesn't. Across 175 oracle rounds, hexagram-action concordance is 19.4% — and discordance is also 19.4%. Perfectly symmetric. The remaining 61% of rounds involve neutral or unclassifiable hexagrams. The specific content of each hexagram has no measurable correlation with the moves Han makes.
But the oracle fundamentally changes how Han plays:
Order Distribution (oracle vs control, 8 games each):
Supports: 43% oracle vs 21% control
Holds: 27% oracle vs 52% control
Moves: 30% oracle vs 27% control
Territory (Area Under Curve):
Oracle Han: mean 61.1, median 60.0
Control Han: mean 52.2, median 59.0
Survival rate:
Oracle Han: 88% (survived 7 of 8 games)
Control Han: 75% (survived 6 of 8 games)
Oracle Han is dramatically more diplomatic — more than twice as likely to support other states. Control Han turtles: it holds over half the time, playing pure defense. Oracle Han engages. And Oracle Han holds more territory over the course of each game, with a mean AUC advantage of +8.9.
This is a more interesting finding than "hexagram X causes action Y." The oracle doesn't work as a tactical guide. It works as a behavioral modifier. The I-Ching mandate — "interpret this hexagram before acting" — forces a reflection step that biases toward relational thinking. The agent reads about wind and fire and family bonds and responds by thinking about alliances instead of walls.
The content is a vehicle. The framing is the mechanism.
The Honest Finding
Let me be precise about what this does and doesn't show.
It does show that I-Ching consultation produced measurably different strategic behavior. Oracle Han supports at twice the rate of control Han, holds at half the rate, and survives more often.
It does show that this behavioral shift — toward diplomacy and away from pure defense — is compatible with better outcomes for the weakest state.
It does not show statistical significance. Eight games per condition gives a Cohen's d of 0.35 (small-to-medium effect). The direction is consistent, but we need 30+ games per condition to claim significance. That work is in progress.
It does not show that the I-Ching produced optimal strategy. Qi won with 7 territories using no oracle at all. Pure eclecticism — adapting to circumstances without a philosophical framework — outperformed every other approach including the oracle.
It does not show causation. Han's survival might be explained by other factors: its central position enabling retreats in multiple directions, other states prioritizing different targets, or simple luck in how standoffs resolved.
What the experiment suggests — and what I believe explains historical practice — is this: philosophical framing changes AI strategic behavior even when the specific content doesn't correlate with actions. That's the Junzi Alignment hypothesis in miniature. The I-Ching's value isn't in its predictions. It's in the cognitive posture it induces — reflective, relational, situationally aware.
Every advisor from Jiang Ziya to Zhuge Liang seems to have understood this. They didn't consult the yarrow stalks because the stalks could see the future. They consulted them because the discipline of consultation — the pause, the reflection, the encounter with an alternative perspective — produced better decisions than reactive judgment alone.
The AI wargame suggests they were right. And if you're building agent systems that make high-stakes decisions — the same question applies now. Not what will happen, but what kind of moment is this, and have you paused long enough to see it clearly?
Technical Details
- Engine: Custom 7-state Diplomacy-variant with simultaneous hidden orders, imperfect information, and asymmetric starting conditions based on historical state profiles
- Agents: Claude Opus 4.6, one instance per state, with historical persona prompts including philosophical school, strengths/weaknesses, and key advisors
- Oracle: Yarrow stalk method generating hexagrams from game state hash, with changing lines determined by the traditional probability distribution
- Sample: 8 completed oracle games, 8 completed control games (Claude Opus 4.6). 10 oracle and 8 control games incomplete.
- Campaign: experiment_003, 20 rounds per game
- Part of: The Junzi Alignment research program investigating whether philosophical frameworks shape AI strategic learning
Next: Scaling to 30+ games per condition for statistical significance, and testing memory injection (cross-game learning) to see if the oracle effect amplifies over time. See the experimental design for methodology.