Quotulatiousness

May 17, 2026

QotD: Battlefield morale and cohesion in movies/games versus real history

Filed under: Europe, Gaming, History, Media, Military, Quotations — Tags: , , , , , — Nicholas @ 01:00

I’ve focused on game morale systems here, but of course this blends over into film as well, where the “mooks” often charge the heroes seemingly utterly heedless of their losses – frequently despite the fact that the last identical group of mooks to do so just got taken apart before their very eyes. And invariably they do this until they are so beaten that they switch to the other binary state, simply running away.

Actual armies have far more than two states of morale and behaved in far more dynamic, unpredictable and interesting ways!

The first problem with this “binary model” of morale is that it assumes just a single factor (“leadership” or “morale”) but in practice we ought to be thinking about at least two different ingredients here: morale and cohesion.

Morale is the commitment the combatants have to their leadership and their cause. To simplify a bit, we might say that soldiers with good morale believe three things: that their cause is a worthy one, that they are on the road to success and that their leaders have a good (enough) plan to achieve final victory. Poor morale can result from a breakdown in any of those three elements: troops might for instance believe both in their goal and its eventual possibility but not in their leaders to produce it (this seems to have been the case, for instance, in the French Mutiny of 1917). On the other hand, regardless of the charisma of leaders, few people come to a war intending to die in it; if the cause appears impossible, morale will sink regardless. And armies that do not believe in the cause at all are extremely difficult to motivate by other means.

On the other hand cohesion is the force that holds a specific unit together through the power of the bonds holding the individual combatants to each other and/or to their (generally junior or non-commissioned) officers. There are a lot of ways to build that cohesion: people are generally unwilling to abandon neighbors, close friends and relatives, for one. They are also reluctant to expose themselves to shame at home for having done so; shame is one of the few things people fear as much, if not more than, death. For armies that can’t rely on that sort of organic cohesion, it can be built by reconstructing the soldier’s unit as his primary social group. Drill can do this: it creates an experience of shared suffering and achievement which bonds the soldiers together creating strong “artificial” cohesion.

These two ingredients have different roots, but they also function differently. The formulation that has always stuck with me is one from James McPherson’s For Cause and Comrades: Why Men Fought in the Civil War (1998): morale (McPherson discusses it under the heading of “the Cause”) will get men into uniform, it will sustain them on large marches and cold nights and it will get them to the battle, but it will not get them through the battle. Instead, cohesion (the “comrades” of the title) gets men through the terror of actual combat, when fear has driven “the cause” far from mind. But of course cohesion isn’t enough on its own either, since it provides no reason to advance or attack or really to do anything at all except stick together.

Adding further complication to this, morale and cohesion are not, as they often exist in games, inherent properties of a unit, but rather emergent properties of the interactions of a whole bunch of individuals. In a strategy game, units exist primarily as extension of the player’s will; in film units typically exist as extensions of their commander’s or the main character’s will (note how common it is that right as the hero begins winning his duel with the villain, so too his army begins winning the battle). But of course actual armies are composed of lots of humans, each with their own individual will and agency.

Those humans are continually making calculations about risks, goals and survival. It’s not hard here to see why, by the by, morale won’t carry troops through high risk conditions: if your only goal is to survive to experience the end-state of the war, then it is always in your interest to let someone else do the dying; it doesn’t serve your end to stay in a high risk position. By contrast, if you are held there by the fear of shame if your close comrades see you run, that still applies. Thus these calculations get progressively more “primal” as the sense of danger rises (fear makes a mess of those higher brain functions), but they do not stop.

Bret Devereaux, “Collections: Total Generalship: Commanding Pre-Modern Armies, Part IIIC: Morale and Cohesion”, A Collection of Unmitigated Pedantry, 2022-07-01.

May 11, 2026

QotD: The cultural importance of the church in early Medieval Europe

We should start by charting the broad outlines of the place of the medieval church in Western Europe. I should start off by noting that this is a huge topic – as will swiftly become clear, there was almost no part of society in which the Church did not play a significant role – and I will only be offering a broad-strokes overview here, sufficient to provide a basis of comparison for [Game of Thrones]. Most of this discussion will principally concern the Latin Church (what today is the Catholic Church) in the West. Since this discussion is – importantly! – about the state of affairs before the reformation, I will tend to refer to the Latin Church simply as “the Church” for brevity’s sake.

The very first thing to note is that the Church (in this case, both the Latin West and the Greek East) pre-dated the Middle Ages themselves. The Church arrived in the Middle Ages as relic of the Roman Imperial past. It inherited Roman Imperial organization – the diocese, for instance, derived from the boundaries of Roman super-provinces called dioceses (Greek: διοίκησις). Unlike the new medieval aristocracy, which tended to rule from fortified estates in the countryside, the Church remained centered in towns and cities, many of which had been major centers under the Romans. As the Roman provincial administration collapsed, it largely fell to the Church – as one of the few surviving literate institutions – to replace some of the core functions, like record keeping and the preservation of literature and learning. This was less true [in] Scandinavia and Eastern Europe, places where the Church was a relative late-comer, but for most of Western Europe, the Church was not some new institution grafted on to a pre-existing society (as it had been under the Romans), but rather part of the bedrock cultural foundation upon which that new society was constructed (fellow pedants! – please note carefully the phrase part of in the previous sentence; I am aware there were other things).

That said, the institutional power of the Church (and here we really do mean what would be the Roman Catholic Church) begins to change dramatically in the 11th century, right as we enter the High Middle Ages, and continues for the next several centuries (keeping in mind that Game of Thrones and A Song of Ice and Fire really evoke the High and Late Middle Ages, rather than the Early period). In short, the institutional heft of the Church grows dramatically. Quite a few things begin happening which are linked together: the Popes begin trying to wrest control over the Church’s hierarchy (specifically, the investiture of bishops) from secular rulers. Clerical celibacy was more stringently enforced. The Church intruded into warfare (as we’ve discussed with the Peace of God / Truce of God movements). It began to more directly attempt to regulate marriage, especially among the powerful (marriage was a made a sacrament in 1184). By the 1300s, this included keeping detailed records in many parts of France about births, deaths and marriages, in part to ensure no one married a close relative.

(And, of course, for those of you thinking, “wait, isn’t this also the period of the Crusades – military expeditions called by and at least nominally (but not in practice) under the auspices of the Pope?” Yes, it is, and that’s not an accident either).

In my experience teaching this, it is the next step that baffles my students the most. This vast increase in the institutional power of the Church was made possible, not by armies or shrewd real-politic (though both were involved), but by belief. The primary weapon wielded by Popes in this effort was the threat of excommunication, which (under Catholic doctrine) cut off the excommunicated individual or community from salvation, potentially damning them for all eternity. But of course that threat is only real if you believe the Pope has that power. And therein is the key point: most of Europe did believe. As I tell my students, it is safe to assume, as a general matter, that people in the past believed their own religion. Of course there are exceptions, but the general rule remains.

In the conflicts that arose – because, as you might imagine, secular rulers were unwilling to give up their prerogatives – it did not actually much matter if the king or emperor believed in the power of excommunication, because no one rules alone. Thus when the Holy Roman Emperor Henry IV was excommunicated by Pope Gregory VII in 1076, the audience for this act wasn’t Henry himself (who had already declared Gregory illegitimate anyway). It was directed at all of Henry’s vassals and supporters, releasing them from their oaths of allegiance and essentially saying, “stick with this guy, and he’ll take you to hell with him”. It worked, sparking a major rebellion and forcing Henry to humiliatingly apologize the following year.

(History note: this would be “round 1” in a multi-round fight that wasn’t settled until 1122 with the Concordat of Worms; in the end the Papacy mostly won, sharply limiting the Holy Roman Emperors’ power over their bishops).

Bret Devereaux, “New Acquisitions: How It Wasn’t: Game of Thrones and the Middle Ages, Part II”, A Collection of Unmitigated Pedantry, 2019-06-04.

May 5, 2026

QotD: Why China never adopted war elephants

Filed under: China, History, Military, Quotations — Tags: , , — Nicholas @ 01:00

If I have any readers familiar with the armies of China during the Warring States, Han Dynasty or Three Kingdoms Period, they may have already guessed my conclusion for China. China never flirted with the war elephant the way the great powers of the ancient Mediterranean did, although the Han in particular had far greater resources than any of these save imperial Rome and far easier access to elephants to boot. Chinese emperors received elephants and elephant handlers often enough as tribute or spoils from war. And yet, no war elephants. As Trautmann (2015) notes, “the absence of the war elephant in China is … the result of a deliberate choice”.

Trautmann (2015) finds the solution in land-use patterns: China had simply converted so much of its pasture and forest to crop-land, in a densely settled city-and-agriculture land-use pattern that incorporating large numbers of elephants was not just prohibitive, but also culturally foreign. And there’s something to this, though I don’t buy it completely. Absolutely, Chinese land-use patterns would make elephants a lot more expensive to maintain than in India or even Rome. Highly productive farmland would likely have to be turned over to elephant pasture. That said, Chinese rulers had embraced the chariot and cavalry, so such things could be done, if the military or political calculus made them worth doing. But they weren’t done.

Instead, I tend to think that the same basic calculus that applied for Rome applies neatly for China – elephants fare poorly in societies with access to large numbers of disciplined infantrymen who can be trained in anti-elephant tactics. And this was certainly true of China, which had disciplined infantry to spare. Also, Han armies seem to have relied on close integration of missile weapons and polearms, meaning that they had the same sort of integrated light infantry support that the legion of the Roman Republic did. Later Chinese armies, as Trautmann briefly notes, had no problem defeating elephants in battle.

As with Rome, in China, elephants seem to have been a military solution looking for a problem to solve – and never found it. For one Chinese dynasty after another, the major military threats were either peer competitors (during periods of political fragmentation) whose disciplined infantry armies were no more vulnerable to elephants than Rome’s, or else steppe nomads. Given the tremendous logistical difficulties of operating even small armies out on the open steppe, attempting to take war elephants there would have been the height of stupidity. Elephants weren’t going to stop the Mongols – to be fair, not much stopped the Mongols (we’ll get into India, Mughals and elephants next time).

Bret Devereaux, “Collections: War Elephants, Part II: Elephants against Wolves”, A Collection of Unmitigated Pedantry, 2019-08-02.

April 29, 2026

QotD: The battlefield role of the general in pre-modern battles

Filed under: History, Military, Quotations — Tags: , , , — Nicholas @ 01:00

We have to start not with tactics or the physics of shouting orders, but with cultural expectations. First, we need to establish some foundations here. First, in a pre-modern battle (arguably in any battle) morale is the most critical element of the battle; battles are not won by killing all of the enemies, but by making the enemies run away. They are thus won and lost in the minds of the soldiers (whose minds are, of course, heavily influenced by the likelihood that they will be killed or the battle lost, which is why all of the tactics still matter). Second, and we’ve actually discussed this before, it is important to remember that the average soldier in the army likely has no idea if the plan of battle is good or not or even if the battle is going well or not; he cannot see those things because his vision is likely blocked by all of his fellow soldiers all around him and because (as discussed last time) the battlefield is so large that even with unobstructed vision it would be hard to get a sense of it.

So instead of assessing a battle plan – which they cannot observe – soldiers tend to assess battle commanders. And they are going to assess commanders not against abstract first principles (nor can they just check their character sheet to see how many “stars” they have next to “command”), but against their idea of what a “good general” looks like. And that idea is – as we’re about to demonstrate – going to be pretty dependent on their culture because different cultures import very different assumptions about war. As I noted back in the Helm’s Deep series, “an American general who slaughtered a goat in front of his army before battle would not reassure his men; a Greek general who failed to do so might well panic them.” An extreme example to be sure, but not an absurd one. In essence then, a general who does the things his culture expects from him is effectively performing leadership as we’ve defined it above.

But the inverse of this expectation held by the soldiers is that generals are not generally free to command however they’d like, even if they wanted to (though of course most generals are going to have the same culturally embedded sense of what good generalship is as their soldiers). Precisely because a general knows his soldiers are watching him for signs that he is their idea of a “good general”, the general is under pressure to perform generalship, whatever that may look like in this cultural context. That is going to be particularly true because almost all of the common models of generalship demand that the general be conspicuous, be available to be seen and observed by his soldiers. As a result, cultural ideals are going to heavily constrain what the general can do on the battlefield, especially if they demand that the general engage personally in combat.

Different sorts of generals

We can actually get a sense of a good part of the range simply by detailing the different expectations for generalship in ancient Greek, Macedonian and Roman societies and how they evolved (which has the added benefit of sticking within my area of expertise!).

On one end, we have what we might call the “warrior-hero general”. This is, for instance, the style of leadership that shows up in Homer (particularly in the Iliad), but this model is common more broadly. For Homer, the leaders were among the promachoi – “fore-fighters”, who fought in the front ranks or even beyond them, skirmishing with the enemy in the space between their formations (which makes more sense, spatially, if you imagine Homeric armies mostly engaging in longer range missile exchanges in pitched battle like many “first system” armies).

The idea here is not (as with the heroes of Homer) that the warrior-hero general simply defeats the army on his own, but rather that he is motivating his soldiers by his own conspicuous bravery, “leading by example”. This kind of leadership, of course, isn’t limited to just Homer; you may recall Bertran de Born praising it as well:

    And I am as well pleased by a lord
    when he is first in the attack,
    armed, upon his horse, unafraid,
    so he makes his men take heart
    by his own brave lordliness.

On the opposite end of the spectrum, there is the pure “general as commander” ideal, where the commanding general (who may have subordinates, of course, who may even in later armies have “general” in the name of their rank) is expected to stay well clear of the actual fighting and instead be a coordinating figure. This style […] is fairly rare in the pre-gunpowder era, but becomes common afterwards. Because in this model the general’s role is seen primarily in terms of coordinating various independently maneuvering elements of an army; a general that is “stuck in” personally cannot do this effectively. And it may seem strange, but violating these norms with excessive bravery can provoke a negative response in the army; confederate general Robert E. Lee attempted to advance with an attack by the Texas Brigade at the Battle of the Wilderness (May 6, 1864) only to have his own soldiers refuse to advance until he retired to a more protected position. Of course this sort of pure coordination model is common in tactical video games which only infrequently put the player-as-general on the battlefield (or even if the “general” of the army is represented on the battlefield, the survival of that figure is in no way connected to the player’s ability to coordinate the army).

In practice, pre-modern (which is to say, pre-gunpowder) generals almost never adopt this pure coordination model of generalship. The issue here is that effective control of a gunpowder army both demands and allows for a lot more coordination. Because units are not in melee contact, engagements are less decisive (units advance, receive fire, break, fall back and then often reform to advance again; by contrast a formation defeated in a shock engagement tends not to reform because it is chased by the troops that defeated it), giving more space for units to maneuver in substantially longer battles. Moreover, units under fire can maneuver, whereas units in shock generally cannot, which is to say that a formation receiving musket or artillery fire can still be controlled and moved about the field, but a unit receiving sword strikes is largely beyond effective command except for “retreat!”

In between these two extremes sits variations on what Wheeler terms a “battle manager”, which is a bit more complex and we’ll return to it in a moment.

What I want to note here is that these expectations are going to impact where the general is on the battlefield and thus what he can do to exert command. A general in a culture which expects its leaders to be at the front leading the army has the advantage of being seen by at least some of his soldiers (indeed that is the point – they need to see him performing heroic leadership), but once engaged, he cannot go anywhere or command anyone. This is also true, by the by, in cultures where the general is expected to be on foot to show that they share in the difficulties and dangers of the infantry; this is fairly rare but for much of the Archaic and Classical periods, this was expected of Greek generals. Even if a general on foot isn’t in combat directly, their ability to see or move about the battlefield is going to be extremely limited.

On the flipside, a general who is following the “commander” ideal is likely to be in the rear, perhaps in an elevated position for observation. The obvious limitation here is that such a commander is going to struggle to display leadership because no one can see them (everyone is facing towards the enemy, after all). But that also impacts their ability to command – no one is looking at them so if they want to change their plans on the fly they need to send word somehow to subordinate officers who are with or in front of the battle line who can then use their visibility to communicate those orders to the troops.

Bret Devereaux, “Collections: Total Generalship: Commanding Pre-Modern Armies, Part II: Commands”, A Collection of Unmitigated Pedantry, 2022-06-03.

April 23, 2026

QotD: The problems of a “no first use” nuclear weapons policy

Now, you might ask at this point: why not defuse some of this tension with a “no first use” policy – openly declare that you won’t be the first to use nuclear weapons even in a non-nuclear conflict?

For the United States during the Cold War, the problem with declaring a “no first use” policy was the worry that it would essentially serve as a “green light” for conventional Soviet military action in Europe. Recall, after all, that the Soviet military was stronger in conventional forces in Europe during the Cold War and that episodes like the Berlin Blockade (and resultant Berlin Airlift) seemed to confirm Soviet interest in expanding their control over central Europe. At the same time, the Soviet use of military force to crush the Hungarian Revolution (1956) and the Prague Spring (1968) continued to reaffirm that the USSR had no intention of letting Central or Eastern Europe choose their own fates – this was an empire that ruled by domination and intended to expand if it could.

The solution to blocking that expansion was NATO, the North Atlantic Treaty Organization. Not because NATO collectively could defeat the USSR in a conventional war – the general assumption was that they probably couldn’t – but because NATO’s article 5 clause pledging mutual defense essentially meant that the nuclear powers of NATO (Britain, the United States, and France) pledged to defend the territory of all NATO members with nuclear weapons. But just like deterrence, mutual defense alliances are based on the perception that all members will defend each other. Declaring that the United States wouldn’t use nuclear weapons first would essentially be telling the Germans, “we’ll fight for you, but we won’t use our most powerful weapons for you” in the event of a conventional war; it would be creating a giant unacceptable asterisk next to that mutual defense clause.

So the United States had to be committed to at least the possibility that it would respond to a conventional military assault on West Germany with nuclear retaliation (often envisaged as a “tactical” use of nuclear weapons – that is, using smaller nuclear weapons against enemy military formations. That said, even in the 1950s, Bernard Brodie was already warning that restraining the escalation to general use of nuclear weapons once a tactical nuclear weapon was used would be practically impossible).

Bret Devereaux, “Collections: Nuclear Deterrence 101”, A Collection of Unmitigated Pedantry, 2022-03-11.

April 17, 2026

QotD: The decline of cities in the late western Roman Empire

The ancient Mediterranean was a world of cities and in the eastern Mediterranean at least, it had been long before the Roman period. By the beginning of the Roman Republic (509 BC), the pattern of organization was broadly similar in Italy, Sicily, coastal North Africa, Egypt, the Levant, Mesopotamia, Anatolia and Greece: agricultural land was broken up into the territory of cities (so that each city consisted of both its urban core but also its agricultural hinterland). Those cities might then either be independent, as with the poleis of Greece and the various communities of pre-Roman Italy, or be the basic administrative units of larger empires, as in the Persian Empire (or later Roman Italy). And so, while most people still lived in the countryside, most of that countryside was in turn attached to an urban center which was the center of political, economic, religious and cultural life.

This was the world the Romans knew and the world they were most comfortable governing. Consequently, while the Romans were utterly uninterested in “civilizing” anyone, when they conquered areas which weren’t urbanized, they tended to found cities or encourage local urbanization in order to create the administrative structures through which the Romans could extract revenue most efficiently.

As mentioned above, the Romans generally wanted these cities to be mostly self-governing. While at conquest, the Romans found themselves managing a bewildering array of different styles of local urban government, over time a mix of Roman administrative preference and cultural diffusion tended to produce a fairly similar set of civic institutions. City governments, which also administered their rural countryside, were run by a town council which consisted of the wealthiest notables of the town – the curiales – in much the same way that the Roman upper-class had dominated the running of the city during the Republic. Roman authority generally protected the curiales and their wealth from the sorts of popular uprisings that tempered many Greek oligarchies in the classical period and in return the curiales managed the population and the collection of taxes for the Romans.

The curiales both managed the town affairs and were also expected to use their own wealth to fund public activity and works: maintain temples and baths, fund religious rituals and festivals, and so on. Through the first and second century, that process was mostly responsible for providing the cities of the Roman Empire with the impressive collection of often still-visible public works they boasted: baths, theaters, amphitheaters, aqueducts, temples, courthouses, public spaces and so on. While some of these structures were little more than the public posturing of the elites, many of them were open to the general public and will have represented, in as much as anything before the industrial revolution could, meaningful improvements in the lives of regular people.

While most of the wealth of any of these cities was derived from the rents and taxes extracted from their agricultural hinterlands, these cities also substantially lived off of trade and markets. Because the local city typically housed the local market, they were the obvious point for local products to enter the stream of provincial-wide or empire-wide trade or for distant imports to reach their final customers. We’ll come back to this next time when we discuss trade and the economy, but for now I want to note that this trade provided a fair bit of the economic vitality of these cities but also that it did in fact reach down beyond mere luxury goods into the basic staples that even the relatively poor might buy.

The decline and fall of these Roman cities is most extensively described in J.H.W.G. Liebeschuetz’ aptly titled, Decline and Fall of the Roman City (2001). Given his title, as you might imagine, Liebeschuetz is in the “decline and fall” camp, arguing that the classical city which defined the Roman world largely did not survive it. Regional patterns differ, with Liebescheutz identifying three “patterns”: I) Western and Central Anatolia, II) Syria, Palestine and Arabia, III) the west, including North Africa).

We’ll deal with the situation in the east in just a moment, so let’s focus here on the cities of the west, which were at the start generally smaller, less wealthy and generally far younger than those of the east (with some exceptions in Italy). Decline sets in fastest and is most severe in Britain, with the final collapse of the cities coming as early as the 360s, whereas in North Africa, the classical city doesn’t seem to tip into decline until after 400.

While each individual region and indeed each city will have been subject to its own unique conditions, a few basic causes seem to have been active everywhere to some degree. First, the crisis of the third century seems to have fundamentally disrupted empire-wide Roman trade, which then stabilized at a lower level for the fourth century, before declining precipitously in the fifth. That first decline seems to have been somewhat offset by the increased demands of imperial administration and in particular the centralized taxation in-kind and movement of goods which had to move through cities. Peter Brown describes the late Roman state as, “the crude but vigorous pump which had ensured the circulation of goods in an otherwise primitive economy” (The Rise of Western Christendom, 2nd ed., 13). We’ll return to this when we discuss the shape of the economy next time, but for now it works as a crude, but vigorous description of that facet of the late Roman economy.

At the same time, as Liebescheutz describes, the role of the curiales steadily atrophies in the fourth century. On the one hand, much of the authority and power of being on the council was steadily eroded as those functions were pulled upwards into the imperial bureaucracy. At the same time, members of the curial class who sought imperial office could get immunities from the progressively more severe taxation which otherwise often fell on the curiales and so the imperial elite often crowded out the curiales when it came to wealth and prestige in the community. As they lost both control and responsibility for their cities, the curiales‘ investment in public works and monumental architecture also ceased (though local elites do invest in church-building and monastic foundations), leading to the decay of the physical urban centers.

Finally, the warfare of the fifth century had its impact, though as Liebescheutz notes, it cannot be presented as a sole cause simply because many urban areas were already clearly in decline when conflict hit. In the case of Britain, the cities were gone by 420, decades before the arrival of any invaders. Nevertheless, political instability and violence in the fifth century seems to have delivered death-blows to ailing communities, especially in the Balkans and along the Rhine.

The end result was that in the West, urbanism declined severely between the fourth and sixth centuries. Rome, once a city of a million people, collapsed down to a population of just 80,000. Arles, which had been a thriving Roman city with an amphitheater, an aqueduct, a chariot-racing track, a theater and full city walls shrunk so severely that the remains of the city moved inside its amphitheater, repurposing it as a new set of city walls, with the town square in the middle and houses built in the stands. While many towns survived in their new, shrunken and impoverished form, urbanism in Europe outside of the Eastern Roman Empire would largely have to be reinvented during the High Middle Ages, (though with some key institutional survivals from the Roman era and often rising out of the diminished remains of Roman cities). Instead, the society of the early Middle Ages was overwhelmingly rural in both population and focus. If on politics we have a bit of a mix between decline and continuity, when it comes to the cities that made up the old political system, the “decline and fall” knight strikes a clear blow: the system of social organization that characterized the ancient world practically vanished and would have to be redeveloped centuries later. The institutions that had maintained it (like the curiales) largely vanished, replaced in some cases by local “notables” and in other cases by ruralization.

Bret Devereaux, “Collections: Rome: Decline and Fall? Part II: Institutions”, A Collection of Unmitigated Pedantry, 2022-01-28.

April 11, 2026

QotD: Kingdoms were not “nations” in the Middle Ages

Filed under: Europe, History, Media, Quotations — Tags: , , , , — Nicholas @ 01:00

Medieval kingdoms and Early Modern states were both built around the personal holdings of individual rulers. For instance, to talk of “Austria” or “Burgundy” in the 1400s as states/countries/governments is to engage in a degree of anachronism. There was no Austrian state, merely the collection of lands either owned or controlled by whoever the reigning Habsburg was at the time. Likewise, Burgundy in, say, 1440 was not a coherent entity, it was simply the collection of lands that Philip the Good (Duke of Burgundy, but also Duke of Brabant, Limburg, Luxemburg and Lothier, Count of Artois, Flanders, Charolais, Haniaut, Holland and Zeeland, and the Margrave of Namur). The “kingdom” was thus not a permanent, durable entity so much as a collection of possessions the same way my personal “library” is not permanent building but just a term for “books I happen to own right now”.

It is thus a bit odd that the regions of Westeros are seen by its inhabitants as being clear and unchanging. For instance, the Reach has borders, those borders do not move and everyone in those borders is loyal to House Tyrell. This is not how medieval rule works. The borders of, say, France, shifted over time (some places we consider “obviously” part of France were added only quite late, like French Flanders or Provence) as the ability of the French king to control those regions changed. For long periods of the Middle Ages, large parts of France were effectively controlled by the Kings of England (because they were also Dukes of this or that French duchy).

The idea that France, or Germany or Italy was a distinct, permanent entity with its own existence apart from a given royal family – more than just a space on a map – which comprised a people, their language and the government of those people, this is a modern phenomenon. Indeed, one may argue, it – that is, the nation-state – is the modern phenomenon.

Bret Devereaux, “New Acquisitions: How It Wasn’t: Game of Thrones and the Middle Ages, Part III”, A Collection of Unmitigated Pedantry, 2019-06-12.

April 5, 2026

QotD: The structure of a typical polis government

Filed under: Europe, Government, Greece, History, Quotations — Tags: , , — Nicholas @ 01:00

The Greek term for the structure of a polis government was its politeia (πολιτεία), which would could mean the government (the way we would say “the state”) or the structure of that government (its “constitution”) or the rights and conditions of the citizenry (in the sense of “citizenship”); as with the many meanings of polis, the many meanings of politeia all shade into each other and are understood as blended.

Because this week we’re interested in the politeia of a polis, that’s going to mean we’re mostly focused on the politai, the citizens, who we discussed last time as one of the key building blocks of the polis. Now, as we noted last time, it’s important to keep in mind that the politai are not all of the people in the polis or indeed even very many of them: women, children, resident foreigners, native members of non-citizen free underclasses and slaves were all set outside the politai and often had no means of gaining entrance. We’re going to talk about all of those folks in more depth in the third part, where we’ll look at the status layer-cake of polis society. But for now I just want to note that all of those people are there, even if they won’t figure very prominently in this discussion of the structures of polis government.

Now we’ll explore this question of how a polis was governed: first laying out the standard elements of a polis constitution, which as we’ll discuss were surprisingly similar from one polis to the next. Then we’ll deal with variations in how those elements are structured, which the Greeks understood to define the differences in the three kinds of constitution that a polis might normally have: oligarchy, democracy and tyranny. Then […] we’ll look at what sort of magistrates a polis might have and what their jobs might be as well as the structure of the legal system a polis might have.

THis is going to mean that we’re discussing the “constitutions” of poleis, but I want to be really clear here at the start that these are almost never written constitutions. So when I say “constitution”, understand that we mean this in the broad sense of “the actual makeup of the state’s institutions” rather than in the narrow sense of “a formal set of instructions for the running of the state”. Some poleis did actually have the latter (the oldest we have that I know of is a constitution established by Ptolemy I Soter for Kyrene in 322; the fact that this is a constitution dictated by a king to a subordinated polis should signal how odd it is), but they seem to have been very rare.

Bret Devereaux, “Collections: How to Polis, 101, Part IIa: Politeia in the Polis”, A Collection of Unmitigated Pedantry, 2023-03-17.

March 30, 2026

QotD: The Revenge of the Archaeologists

Filed under: Europe, History, Quotations, Science — Tags: , , , , — Nicholas @ 01:00

Before we dive into the evidence, I want to speak briefly to the nature of the evidence for these topics. “Historian” is often an odd sort of field because while there is a core discipline and skill set that basically all historians are going to have (focused on reading texts critically and assessing arguments and evidence), beyond this almost all historians end up acquiring other skill sets, often from other fields, depending on what they are investigating. I, for instance, work on military history and so I need to have some mastery of military theory, whereas an intellectual historian might instead have some training in philosophy.

It is thus relevant that over the past half-century or so, it has so happened that effectively all ancient historians have had to develop a strong grasp of archaeological data; we don’t all necessarily learn to do the excavation work, of course (that’s what archaeologists do), but pretty much all ancient historians at this point are going to have to be able to read a site or artifact report as well as have a good theoretical grasp of what kinds of questions archaeology can be used to answer and how it can be used to answer those questions. This happened in ancient history in particular for two reasons: first, archaeology was a field effectively invented to better understand the classical past (which is now of course also used to understand the past in other periods and places) so it has been at work the longest there, but also because the sources for ancient history are so few. As I like to say, the problem for the modern historian is taking a sip of meaning from the fire-hose of evidence they have; but the challenge of an ancient historian is finding water in the desert. Archaeological data was a sudden, working well in that desert and much of the last two decades of ancient history has been built around it. Other fields of history are still processing their much larger quantity of texts; why dig so deep a well when you live next to a running river?

The result, in ancient history, has been what I tend to refer to as “the revenge of the archaeologists”. Not, mind you, revenge on medievalists, but in fact revenge on a very specific ancient historian and classicist, Moses Finley. Moses Finley was, from the 1950s to the 1980s, one of the most prominent classicists and his work touched on many fields, including the study of the ancient economy. Finley, writing in the 1960s was generally skeptical of the ability of archaeology to provide useful answers about the ancient economy (he preferred to understand the question by probing the mentalities of the Greek and Roman elite). Archaeology, Finley thought, was frequently over-interpreted and could never give a representative sample anyway; as he quipped in his 1965 article “Technical Innovation and Economic Progress in the Ancient World”, “we are too often victims of that great curse of archaeology, the indestructibility of pots”, a line for which, as far as I can tell, he is still quite unforgiven by some archaeologists.

As if in response, the archaeologists have spent the subsequent almost-six-decades proving again and again the tremendous value of their discipline by, among other things, utterly burying Finley’s The Ancient Economy (1973) under a mountain of archaeological data. It turns out the mentalités of aristocrats who largely hated merchants were not a good barometer of the activities of those merchants.

But you may now guess how this is going to play out in the discussion of Late Antiquity. The ancient historians come to the question ready to think in archaeological terms and ask what archaeological data can do to clear up these questions. Scholars of Late Antiquity trained as medievalists on the other hand, may or may not be well versed in archaeological methods or data (to be clear, some medievalists very much are versed, including prominent voices on the “change and continuity” side of this debate! But it is also very possible to be a “pure text” medievalist in a way that I don’t think I know a “pure text” ancient historian younger than sixty) because their field has not been forced, by dint of the paucity of sources, to revolve so heavily around archaeological data and because the archaeological data on the Middle Ages is not yet as voluminous as that on Classical Antiquity.

As I noted in the first post, beginning in the 1970s, what James O’Donnell calls the “reformation in Late Antique studies” launched a long overdue reassessment of Late Antiquity and the impact of the Fall of Rome – what we’ve called the “change and continuity” argument. I bring up all of this to note that the “counter-reformation” – what we’re calling the “decline and fall” argument – that really emerges beginning in the 90s is in many ways an extension of the “revenge of the archaeologists” in Classical studies (and especially the ancient economy) into the field of Late Antiquity. Indeed some of the scholars are the same (e.g. Willem Jongman) and many of them enter the debate on Late Antiquity as an extension of the debate about the Roman economy (in part demanding that “change and continuity” Late Antique scholars acknowledge things now generally considered “proved” by ancient historians about the earlier Roman economy).

In my own experience, particularly in more informal conversations, the methodological difference that interaction creates between ancient historians – for whom it has long been almost entirely settled that in a “fight” between archaeological evidence and effectively any other kind, the archaeological evidence “wins” – and medievalists for whom archaeology is a much less central part of their method (in part because their textual sources are more extensive) can lead to situations where the two sides of the debate talk past each other.

But when it comes to questions of demographics, economics and the conditions of life for the sort of people who rarely figure in our sources, archaeological evidence – although it is often incomplete and hard to interpret – offers the possibility of decisive answers to questions that otherwise would have to live entirely within the realm of speculation.

Bret Devereaux, “Collections: Rome: Decline and Fall? Part III: Things”, A Collection of Unmitigated Pedantry, 2022-02-11.

March 24, 2026

QotD: Citizens of a polis

Filed under: Europe, Greece, History, Quotations — Tags: , , , , — Nicholas @ 01:00

A polis is most importantly made up of the citizens, the politai (singular polites (πολίτης), plural politai (πολῖται)); indeed, Aristotle says this too in his Politics (Arist. Pol. 1274b): “for the state [polis] is an assembly of citizens [politai].” Now we are used to the idea that most people in a country are citizens of it, but the idea of the politai is much narrower. In its fundamental meaning a polites is a person engaged in the running of the polis; it is an idea defined by political participation. The politai were adult, citizen men; women, children, the enslaved and free non-citizens were all excluded from this group. A bit of demographic math might suggest that a modest polis with 2000 inhabitants might thus have just 300-400 politai.

Not everyone born in a polis was a member of the politai. Women could be of citizen status (and thus able to bear citizen children in poleis where that was required), but they could not be citizens at all. Being the male child of citizen parents was generally the core requirement of citizenship and in a democratic polis that was generally enough, but oligarchic poleis typically imposed wealth qualifications for political participation so not everyone born to citizens might themselves be a polites if they ended up too poor to meet the requirements. The terms astos and aste (ἀστός and ἀστή), “townsman” and “townswoman” respectively, might be used to make this distinction between the politai and people who were “merely” natives of the polis but barred for whatever reason from political participation. These distinctions become a lot more meaningful when you realize the point Aristotle is making defining the polis this way: if the polis is a community of politai then the residents of a polis (the physical space) who are not citizens are not members of the polis (not merely, we might imagine, non-participatory members).

Now the politai themselves also existed in subdivisions. We’ve mentioned division into demes or neighborhoods; while notionally geographic, demes could become hereditary (and indeed did become so in Athens). In Sparta and some poleis on Crete, citizens were divided into mess groups (syssitia or andreia). But by far the most common and important such division was into “tribes” or phylai (φυλαί, sing. φυλή), inherited kinship groups that often formed the largest subdivision of the politai of a polis, with even very small poleis having attested divisions into phylai in some cases (e.g. Delos as noted by M.H. Hansen in “Civic Subdivisions” in the Inventory). The politai might also be subdivided by other groupings like phratria (brotherhoods) and indeed a polis might have multiple such groupings, either neatly nested (as in Athens’ demes sorted into thirty trittyes sorted into ten phylai to make up the citizen body) or they might confusingly cross-cut each other.

There’s another key distinction between the politai – or at least men who might be politai – which isn’t a legal distinction but nevertheless matters for understanding how the Greeks imagined civic governance: the distinction between the few (hoi oligoi) and the many (hoi polloi). The few were the economic elite of the politai – the wealthy landowners – and the dominant group in oligarchies. A few terms might signify this group: “the few” (οἱ ὀλίγοι – hoi oligoi) or “the best” (οἱ ἄριστοι – hoi aristoi), or “the rich” (οἱ πλούσιοι – hoi plousioi) and can also be part of the meaning of the appellation “beautiful and good” (καλὸς κἀγαθός = καλὸς καὶ ἀγαθός – kalos kagathos) which translates more idiomatically to something like “gentleman” with an implication of both good conduct (especially in war) and high status. At its broadest reach, the few might consist of those politai with enough wealth to serve as hoplites, though it seems in most cases this group is understood much more narrowly and might be defined by heredity in addition to wealth in some cases.

In contrast to the few were, of course, the many. Once again a few terms might signify this group: “the many” (οἱ πολλοί – hoi polloi or οἱ πλῆθος – hoi plethos) or “the poor” (οἱ ἀποροῖ – hoi aporoi) or the people (δῆμος – demos), the last of which gives us the word democracy – rule by the demos. At its narrowest extent, these are all of the people too poor to serve as hoplites but who would otherwise be politai; in fact in a democracy they are politai, but in closed oligarchies they may not be. More broadly the concept of the demos can encompass all of the politai, both wealthy and poor, especially in a democratic context. Nevertheless the Greeks often understand these two groups as oppositional and non-overlapping: the politai composed of “the few”, with money and high status lineages and “the many”, without that, but with far greater raw numbers.

As we’ll see, it is that distinction – between “the few” and “the many” which the Greeks used to define the different forms of polis government, what they called a politeia (πολιτεία), which we might translate as “constitution” with the caveat that these are not written constitutions. And that’s where we’ll go next: now that we have our subdivisions, we’ll discuss next week the different ways they are organized and governed.

Bret Devereaux, “Collections: How to Polis, 101: Component Parts”, A Collection of Unmitigated Pedantry, 2023-03-10.

March 18, 2026

QotD: Feeding a Roman Consular army

Filed under: Europe, History, Military, Quotations — Tags: , , , — Nicholas @ 01:00

So now we have our entire “campaign community” of men, women and animals. And so it might be worth doing some quick calculations to get a sense now of exactly what a community of this size is going to require. For a general sense of scale, we’ll consider the demands of a standard Roman army of the Middle Republic: two legions plus matching allied detachments, totaling around 19,200 soldiers (16,800 infantry, 2,400 cavalry).

Let’s deal with animals next. Each contubernium (“tent group”) of six soldiers likely had its own mule, so that’s 3,200 mules for the army, plus some additional number for the siege train and any army supplies; perhaps around 5,000 total (see Roth, op. cit. on this). On top of this we have horses for the cavalry; this will be rather more than 2,400 since spare horses will have been a necessity on campaign. Judging by Roman barley rations for cavalrymen (presumably intended to feed the horse) it seems a good guess that each cavalryman had one spare; for later medieval armies the number of spares would be substantially higher (at least three per rider). But for our lean army of Romans, that’s just 4,800 horses. An early modern army might require quite a few less mules (replacing them with wagons), but at the same time it is also probably hauling both field artillery and siege guns which demand a tremendous number of draft animals (mostly horses). My sense is that in the end this tends to leave the early modern army needing more animals overall.

Next the non-combatants. The mules will need drivers and the cavalrymen likely also have grooms to handle their horses, which suggests something like 3,400 calones [slaves or servants] as an absolute minimum simply to handle the animals. Roth (op. cit., 114) figures one non-combatant per four combatants in a Roman army, while Erdkamp (op. cit. 42) figures 1:5. Those figures would include not merely enslaved calones but also sutlers, slave-dealers, and women in the “campaign community”. Taking the lower estimate we might then figure something like 4,000 non-combatants for a “lean” Roman army, with many armies being more loaded up on non-combatants than even this. And while estimating the number of non-combatants for Roman armies is tricky, we actually have some figures for pre-modern armies to give a reference. Parker (op. cit. 252) notes units of the Army of Flanders (between 1577 and 1620) as high as 53% non-combatants, including women in the campaign community; one Walloon tercio in 1629 was 28% camp women on the march. It is tempting to compare these but caution is necessary here – both Roth’s and Erdkamp’s estimates are heavily informed by more modern armies so the argument would be circular: the estimates for the Romans look like later armies because later armies were used to calibrate estimates for the Romans.

That gives us an army now of 19,200 soldiers, 4,000 non-combatants, 5,000 mules and 4,800 horses. Roman rations were pretty ample and it seems likely that many of the calones did not eat so well but the ranges are fairly narrow; we can work with an average 1.25kg daily ration per person normally, with the absolute minimum being the 0.83kg daily grain ration following Polybius (Plb. 6.39.12-14, on this note Erdkamp op. cit. 33-42) if the army was short on supplies or needed to move fast eating only those buccelatum [hardtack] biscuits. That’s a normal consumption of 29,000kg per day for the humans, with the minimum restricted diet of 19,256kg for short periods. Then we need about 2.25kg of feed for each mule and about 4.5kg of feed for each horse (we’re assuming grazing and water are easily available), which adds up to 11,250kg for the mules and 21,600kg for the horses.

And at last we now have the scale of our problem: our lean army of 19,200 fighting men consumes an astounding 61,850kg (68.18 US tons) of food daily. It also consumes staggering amounts of water and firewood. In order to move this army or sustain it in place it is thus necessary to ensure a massive and relatively continuous supply of food to the army. Failure to do that will result in the army falling apart long before it comes anywhere close to the enemy.

Bret Devereaux, “Collections: Logistics, How Did They Do It, Part I: The Problem”, A Collection of Unmitigated Pedantry, 2022-07-15.

March 12, 2026

QotD: Roman armies of the middle and late Republic

Filed under: Europe, History, Military, Quotations — Tags: , , , — Nicholas @ 01:00

Polybius remarks both on the superior flexibility of Roman soldiers (18.31.9-11) and the intensity and effectiveness of Roman rewards and punishments (6.35-38). Josephus, a Greek-speaking Jewish man from the province of Judaea who first rebelled against the Romans and then switched sides offers the most famous endorsement of Roman drills, “Nor would one be mistaken to say that their drills are bloodless battles, and their battles bloody drills” (BJ 3.5.1).

It is hard to tell if the Roman triple-line (triplex acies) fighting system created the demand for synchronized discipline or if the Romans, having already developed a tradition of drill and synchronized discipline, adopted a fighting style that leveraged that advantage. Probably a bit of both, but in any event our evidence for the Roman army before the very late third century is very poor. By the time we truly see the Roman army clearly (c. 225 BC) the system seems to already [have been] in place for some time.

A Roman consular army was a complex machine. It was composed of an infantry line of two legions (in the center) and two socii “wings” (alae) to each side, along with cavalry detachments covering the flanks. Each of those infantry blocks (two legions, two alae) in turn was broken down into thirty separate maneuvering units (called maniples, generally consisting of 120 men; half as many for the triarii), which were in turn subdivided into centuries, but centuries didn’t really maneuver independently. In front of this was a light infantry screening force (the velites). So notionally there were in the heavy infantry of a standard two-legion consular army something like 120 different “chess pieces” that notionally the general could move around on their own and thus notionally the legion was capable of fairly complex tactical maneuvers.

You may have noted that word “notionally” because now we get into the limits of drill and synchronized discipline, because this isn’t a system for limitless tactical flexibility of the sort one gets in video games. Instead, recall that the idea here is to create coordinated movement and fighting (the synchronized discipline) through rigorous, repeated practice (drill). Of course one needs to practice specific things. Some of those things are going to be obvious: a drill for marching forward, or for turning the unit or for advancing on the charge.

In the Roman case, a “standard” battle involved the successive engagement and potentially retreat of each heavy infantry line: first the hastati (the first line) formed a solid line (filling the gaps) and attacked and then, if unsuccessful, retreated and the next line (the principes) would try and so on. Those maneuvers would need to be practiced: forming up, then having each maniple close the gap (we don’t quite know how they did this, but see below), the attack itself (which also involved usually throwing pila – heavy javelins), then retreat behind the next line if things went poorly. It’s also pretty clear from a battle like Cynoscephelae (197) or Bibracte (58) that individual maniples or cohorts (the Romans start using the larger 480-man-cohort as the basic maneuver unit during the second century BC) could be “driven” over the battlefield to a degree so there were probably drills for wheeling and turning.

Now even in this “standard” battle there is a lot of movement: maniples need to open and close gaps, advance and retreat and so on. This is what I mean by saying this army is a complex machine: it has a lot of moving parts that need to move together. The men in a maniple need to move together to make that mutually-supporting line and the maniples need to move together with each other to cover flanks and allow retreats. In terms of how the individual men moved, I’ve tended to think in terms of a “flow” model akin to this video of South Korean riot police training, rather than the clunkier Spartacus (1960) model.

But once an army has practiced all of these drills, it creates the opportunity for great improvisation and more complex tactics as well. Commanders, both the general but also his subordinates, can tell a unit to perform a particular maneuver that they have drilled, assuming the communication infrastructure exists in terms of instruments, standard shouted commands and battle standards (and note [that] Roman methods of battlefield communication were relatively well developed). That, for instance, allowed Aemilius Paullus to give orders to his first legion at Pydna for each of those maneuver units to either push forward or give ground independently, presenting the Macedonian phalanx with a tactical problem (an unevenly resisting line) it did not have a good solution for (Plut. Aem. 20.8-10). Having good junior officers […] was required but it wasn’t enough – those officers needed units which were already sufficiently drilled so that their orders (to press hard or retreat and reform in this case) could actually be carried out by soldiers for whom the response to those calls had become natural through that very drill.

At the same time I don’t want to give the wrong impression: even for the Romans battles where there was this sort of on-the-field improvising led by the general were uncommon (though not extremely rare). For the majority of battles, the legionary “machine” simply pushed forward in its standard way, even when – as at Cannae (216) – pushing forward normally proved to be disastrous. Just because an army can fight flexibly doesn’t mean it will or even that it should.

Bret Devereaux, “Collections: Total Generalship: Commanding Pre-Modern Armies, Part IIIa”, A Collection of Unmitigated Pedantry, 2022-06-17.

March 6, 2026

QotD: Operations, strategy, and tactics

Filed under: History, Military, Quotations — Tags: , , — Nicholas @ 01:00

Operations is the middle layer of military analysis, below strategy and above tactics. Operations concerns the movement of forces (often over multiple lines of advance to fully utilize the transportation network available) and their logistical support. Fundamentally, operations are about getting forces to the objectives specified in your strategy with sufficient supply to sustain themselves, so that once there they can employ your tactics to achieve victory. The specific task of crafting operations which will achieve a set of strategic objectives is called “operational art” in US doctrine. Operational failures typically manifest as logistics and maneuver failures – particularly operational plans with unreasonable timetables – both of which have been particularly in evidence in the initial Russian invasion [of Ukraine in 2022].

[…]

Strategy is the upper layer of military analysis. Fundamentally strategy concerns the identification of final objectives, the way those objectives can be achieved and the resources to be used to achieve those objectives; these three components of strategy in US doctrine are termed “Ends, Ways, and Means” respectively. Strategy is thus the “big picture” thinking behind an action, including the decisions to both commence hostilities and end them.

[…]

Tactics are the lowest layer of military analysis. Tactics concern the methods to be used to win battles. Things like flanking, suppressive fire, ambushes, etc. are tactics. A military’s tactical system is often spelled out in doctrine. In theory, operations is designed to deliver forces to battles in such a way (positioning, comparative force, etc.) that their tactics can win those battles, while strategy should aim to ensure that winning those particular battles will achieve the desired political end (whatever concessions are desired). It is important to distinguish actions which are strategy (designed to directly produce a desired end to the conflict) from those which are merely tactical (designed to achieve a local success or advantage in a given engagement). It is important when assessing failures in war to distinguish between strategic failures (typically a failure to come up with realistic goals and the means to reach them), operational failures (e.g. logistics failures or unreasonable maneuver timetables) and tactical failures (e.g. failure to use combined arms effectively).

Bret Devereaux, “Miscellanea: A Very Short Glossary of Military Terminology”, A Collection of Unmitigated Pedantry, 2022-03-25.

February 28, 2026

QotD: The “Balance of Terror” in the missile age

The advance of missile and rocket technology in the late 1950s started to change the strategic picture; the significance of Sputnik (launched in 1957) was always that if the USSR could orbit a small satellite around the Earth, they could do the same with a nuclear weapon. By 1959, both the USA and the USSR had mounted nuclear warheads on intercontinental ballistic missiles (ICBMs), fulfilling Brodie’s prophecy that nuclear weapons would accelerate the development of longer-range and harder to intercept platforms: now the platforms had effectively infinite range and were effectively impossible to intercept.

This also meant that a devastating nuclear “first strike” could now be delivered before an opponent would know it was coming, or at least on extremely short notice. A nuclear power could no longer count on having enough warning to get its nuclear weapons off before the enemy’s nuclear strike had arrived. Bernard Brodie grappled with these problems in Strategy in the Missile Age (1959) but let’s focus on a different theorist, Albert Wohlstetter, also with the RAND Corporation, who wrote The Delicate Balance of Terror (1958) the year prior.

Wohlstetter argued that deterrence was not assured, but was in fact fragile: any development which allowed one party to break the other’s nuclear strike capability (e.g. the ability to deliver your strike so powerfully that the enemy’s retaliation was impossible) would encourage that power to strike in the window of vulnerability. Wohstetter, writing in the post-Sputnik shock, saw the likelihood that the USSR’s momentary advantage in missile technology would create such a moment of vulnerability for the United States.

Like Brodie, Wohlstetter concluded that the only way to avoid being the victim of a nuclear first strike (that having the enemy hit you with their nukes) was being able to credibly deliver a second strike. This is an important distinction that is often misunderstood; there is a tendency to read these theorists (Dr. Strangelove does this to a degree and influences public perception on this point) as planning for a “winnable” nuclear war (and some did, just not these fellows here), but indeed the point is quite the opposite: they assume nuclear war is fundamentally unwinnable and to be avoided, but that the only way to avoid it successfully is through deterrence and deterrence can only be maintained if the second strike (that is, your retaliation after your opponent’s nuclear weapons have already gone off) can be assured. Consequently, planning for nuclear war is the only way to avoid nuclear war – a point we’ll come back to.

Wohlstetter identifies six hurdles that must be overcome in order to provide a durable, credible second strike system – and remember, it is the perception of the system, not its reality that matters (though reality may be the best way to create perception). Such systems need to be stable in peacetime (and Wohlstetter notes that stability is both in the sense of being able to work in the event after a period of peace, but also such that they do not cause unintended escalation; he thus warns against, for instance, just keeping lots of nuclear-armed bombers in the air all of the time), they must be able to survive the enemy’s initial nuclear strikes, it must be possible to decide to retaliate and communicate that to the units with the nuclear weapons, then they must be able to reach enemy territory, then they have to penetrate enemy defenses, and finally they have to be powerful enough to guarantee that whatever fraction do penetrate those defenses are powerful enough to inflict irrecoverable damage.

You can think of these hurdles as a series of filters. You start a conflict with a certain number of systems and then each hurdle filters some of them out. Some may not work in the event, some may be destroyed by the enemy attack, some may be out of communication, some may be intercepted by enemy defenses. You need enough at the end to do so much damage that it would never be worth it to sustain such damage.

This is the logic behind the otherwise preposterously large nuclear arsenals of the United States and the Russian Federation (inherited from the USSR). In order to sustain your nuclear deterrent, you need more weapons than you would need in the event because you are planning for scenarios in which some large number of weapons are lost in the enemy’s first strike. At the same time, as you overbuild nuclear weapons to counter this, you both look more like you are planning a first strike and your opponent has to estimate that a larger portion of their nuclear arsenal may be destroyed in that (theoretical) first strike, which means they too need more missiles.

What I want to note about this logic is that it neatly explains why nuclear disarmament is so hard: nuclear weapons are, in a deterrence scenario, both necessary and useless. Necessary, because your nuclear arsenal is the only thing which can deter an enemy with nuclear weapons, but that very deterrence renders the weapons useless in the sense that you are trying to avoid any scenario in which you use them. If one side unilaterally disarmed, nuclear weapons would suddenly become useful – if only one side has them, well, they are the “absolute” weapon, able to make up for essentially any deficiency in conventional strength – and once useful, they would be used. Humanity has never once developed a useful weapon they would not use in extremis; and war is the land of in extremis.

Thus the absurd-sounding conclusion to fairly solid chain of logic: to avoid the use of nuclear weapons, you have to build so many nuclear weapons that it is impossible for a nuclear-armed opponent to destroy them all in a first strike, ensuring your second-strike lands. You build extra missiles for the purpose of not having to fire them.

(I should note here that these concerns were not the only things driving the US and USSR’s buildup of nuclear weapons. Often politics and a lack of clear information contributed as well. In the 1960s, US fears of a “missile gap” – which were unfounded and which many of the politicians pushing them knew were unfounded – were used to push for more investment in the US’s nuclear arsenal despite the United States already having at that time a stronger position in terms of nuclear weapons. In the 1970s and 1980s, the push for the development of precision guidance systems – partly driven by inter-agency rivalry in the USA and not designed to make a first strike possible – played a role in the massive Soviet nuclear buildup in that period; the USSR feared that precision systems might be designed for a “counter-force” first strike (that is a first strike targeting Soviet nuclear weapons themselves) and so built up to try to have enough missiles to ensure survivable second strike capability. This buildup, driven by concerns beyond even deterrence did lead to absurdities: when the SIOP (“Single Integrated Operational Plan”) for a nuclear war was assessed by General George Lee Butler in 1991, he declared it, “the single most absurd and irresponsible document I had ever reviewed in my life”. Having more warheads than targets had lead to the assignment of absurd amounts of nuclear firepower on increasingly trivial targets.)

All of this theory eventually filtered into American policy making in the form of “mutually assured destruction” (initially phrased as “assured destruction” by Secretary of Defense Robert McNamara in 1964). The idea here was, as we have laid out, that US nuclear forces would be designed to withstand a first nuclear strike still able to launch a retaliatory second strike of such scale that the attacker would be utterly destroyed; by doing so it was hoped that one would avoid nuclear war in general. Because different kinds of systems would have different survivability capabilities, it also led to procurement focused on a nuclear “triad” with nuclear systems split between land-based ICBMs in hardened silos, forward-deployed long-range bombers operating from bases in Europe and nuclear-armed missiles launched from submarines which could lurk off an enemy coast undetected. The idea here is that with a triad it would be impossible for an enemy to assure themselves that they could neutralize all of these systems, which assures the second strike, which assures the destruction, which deters the nuclear war you don’t want to have in the first place.

It is worth noting that while the United States and the USSR both developed such a nuclear triad, other nuclear powers have often seen this sort of secure, absolute second-strike capability as not being essential to create deterrence. The People’s Republic of China, for instance, has generally focused their resources on a fewer number of systems, confident that even with a smaller number of bombs, the risk of any of them striking an enemy city (typically an American city) would be enough to deter an enemy. As I’ve heard it phrased informally by one western observer, a strategy of, “one bomb and we’ll be sure to get it to L.A.” though of course that requires more than one bomb and one doubts the PRC phrases their doctrine so glibly (note that China is, in theory committed to developing a triad, they just haven’t bothered to actually really do so).

Bret Devereaux, “Collections: Nuclear Deterrence 101”, A Collection of Unmitigated Pedantry, 2022-03-11.

February 22, 2026

QotD: The shift from “motte-and-bailey” construction to stone castles

As we move to stone construction and especially full stone construction (which we’ll define as the point when at least one complete curtain wall – don’t worry, we’ll define that in a second – is in stone) in the 12th century, we’re beginning to contemplate a different kind of defense. The wooden motte and bailey, as we’ve seen, mostly served to resist both raids and “hasty” assaults, thus forcing less coordinated or numerous attackers to set in to starve the castle out or go home. But stone walls are a much larger investment in time and resources; they also require a fair bit more careful design in order to be structurally sound. For all of that expense, the builder wants quite a bit of a security, and in the design of stone castles it is hard not to notice increasing attention towards resisting a deliberate assault; stone castles of the 12th century and beyond are increasingly being designed to stand up to the best that the “small army” playbook can throw at them. Of course it is no accident that this is coming at the same time that medieval European population and wealth is beginning to increase more rapidly, leaving political authorities (read: the high nobility) with both the resources for impressive new castles (although generally the number of castles falls during this period – fewer, stronger castles) and at the same time with more resources to invest in the expertise of siegecraft (meaning that an attacker is more likely to have fancy tools like towers, catapults and better coordination to use them).

To talk about how these designs work, we need to clear some terminology. The (typically thin) wall that runs the circuit of the castle and encloses the bailey is called a “curtain wall“. In stone castles, there may be multiple curtain walls, arranged concentrically (a design that seems to emerge in the Near East and makes its way to Europe in the 13th century via the crusades); the outermost complete circuit (the primary wall, as it were) is called the enceinte. Increasingly, the keep in stone castles is moved into the bailey (that is, it sits at the center of the castle rather than off to one side), although of course stone versions of motte and bailey designs exist. In some castle design systems, with stone the keep itself drops away, since the stone walls and towers often provided themselves enough space to house the necessary peacetime functions; in Germany there often was no keep (that is, no core structure that contained the core of the fortified house), but there often was a bergfriede, a smaller but still tall “fighting tower” to serve the tactical role of the keep (an elevated, core position of last-resort in a defense-in-depth arrangement) without the peacetime role.

While the wooden palisade curtain walls of earlier motte and bailey castles often lacked many defensive features (though sometimes you’d have towers and gatehouses to provide fighting positions around the gates), stone castles tend to have lots of projecting towers which stick out from the curtain wall. The value of projecting towers is that soldiers up on those towers have clear lines of fire running down the walls, allowing them to target enemies at the base of the curtain wall (the term for this sort of fire is “enfilade” fire – when you are being hit in the side). Clearly what is being envisaged here is the ability to engage enemies doing things like undermining the base of walls or setting up ladders or other scaling devices.

The curtain walls themselves also become fighting positions. Whether on a tower or on the wall itself, the term for the fighting position at the top is a “battlement”. Battlements often have a jagged “tooth” pattern of gaps to provide firing positions; the term for the overall system is crenellation; the areas which have stone are merlons, while the gaps to fire through are crenals. The walkway behind both atop the wall is the chemin de ronde, allure or “wall-walk”. One problem with using the walls themselves as fighting positions is that it is very hard to engage enemies directly beneath the wall or along it without leaning out beyond the protection of the wall and exposing yourself to enemy fire. The older solution to this were wooden, shed-like projections from the wall called “hoarding”; these were temporary, built when a siege was expected. During the crusades, European armies encountered Near Eastern fortification design which instead used stone overhangs (with the merlons on the outside) with gaps through which one might fire (or just drop things) directly down at the base of the wall; these are called machicolations and were swiftly adopted to replace hoardings, since machicolations were safer from both literal fire (wood burns, stone does not) and catapult fire, and also permanent. All of this work on the walls and the towers is designed to allow a small number of defenders to exchange fire effectively with a large number of attackers, and in so doing to keep those attackers from being able to “set up shop” beneath the walls.

[I]t is worth noting something about the amount of fire being developed by these projecting towers: the goal is to prevent the enemy operating safely at the wall’s base, not to prohibit approaches to the wall. These defenses simply aren’t designed to support that much fire, which makes sense: castle garrisons were generally quite small, often dozens or a few hundred men. While Hollywood loves sieges where all of the walls of the castle are lined with soldiers multiple ranks deep, more often the problem for the defender was having enough soldiers just to watch the whole perimeter around the clock (recall the example at Antioch: Bohemond only needs one traitor to access Antioch because one of its defensive towers was regularly defended by only one guy at night). It is actually not hard to see that merely by looking at the battlements: notice in the images here so far often how spaced out the merlons of the crenellation are. The idea here isn’t maximizing fire for a given length of wall but protecting a relatively small number of combatants on the wall. As we’ll see, that is a significant design choice: castle design assumes the enemy will reach the walls and aims to prevent escalade once they are there; later in this series we’ll see defenses designed to prohibit effective approach itself.

As with the simpler motte and bailey, stone castles often employ a system of defense in depth to raise the cost of an attack. At minimum, generally, that system consists of a moat (either wet or dry), the main curtain walls (with their towers and gatehouses) and then a central keep. Larger castles, especially in the 13th century and beyond, adopting cues from castle design in the Levant (via the crusades) employed multiple concentric rings of walls. Generally these were set up so that the central ring was taller, either by dint of terrain (as with a castle set on a hill) or by building taller walls, than the outer ring. The idea here seems not to be stacking fire on approaching enemies, but ensuring that the inner ring could dominate the outer ring if the latter fell to attackers; defenders could fire down on attackers who would lack cover (since the merlons of the outer ring would face the other way). As an aside, the concern to be firing down is less about the energy imparted by a falling arrow (though this is more meaningful with javelins or thrown rocks) and more about a firing position that denies enemies cover by shooting down at them (think about attackers, for instance, crossing a dry moat – if your wall is the right height and the edges of the moat are carefully angled, you can set up a situation where the ditch never actually offers the attackers any usable cover, but you need to be high up to do it!).

Speaking of the moat, this is a common defensive element (essentially just a big ditch!) which often gets left out of pop culture depictions of castles and siege warfare, but it accomplishes so many things at such a low cost premium. Even assuming the moat is “dry”! For attackers on foot (say, with ladders) looking to approach the wall, the moat is an obstacle that slows them down without potentially providing any additional cover (it is also likely to disorder an attack). For sappers (attackers looking to tunnel under the walls and then collapse the tunnel to generate a breach), the depth of the ditch forces them to dig deeper, which in turn raises the demands in both labor and engineering to dig their tunnel. For any attack with siege engines (towers, rams, or covered protective housings made so that the wall can be approached safely), the moat is an obstruction that has to be filled in before those engines can move forward – a task which in turn broadcasts the intended route well in advance, giving the defenders a lot of time to prepare.

Well-built stone castles of this sort were stunningly resistant to assault, even with relatively small garrisons (dozens or a few hundred, not thousands). That said, building them was very expensive; maintaining them wasn’t cheap either. For both castles and fortified cities, one ubiquitous element in warfare of the period (and in the ancient period too, by the by) was the rush when war was in the offing to repair castle and town walls, dig out the moat and to clear buildings that during peace had been built int he firing lines of the castle or city walls.

Bret Devereaux, “Collections: Fortification, Part III: Castling”, A Collection of Unmitigated Pedantry, 2021-12-10.

Older Posts »

Powered by WordPress