Quotulatiousness

January 22, 2024

QotD: Mao’s theory of “protracted war” as adapted to Vietnamese conditions by Võ Nguyên Giáp

The primary architect of Vietnam’s strategy, initially against French colonial forces and then later against the United States and the US-backed South Vietnamese (Republic of Vietnam or RVN) government was Võ Nguyên Giáp.

Giáp was facing a different set of challenges in Vietnam facing either France or the United States which required the framework of protracted war to be modified. First, it must have been immediately apparent that it would never be possible for a Vietnamese-based army to match the conventional military capability of its enemies, pound-for-pound. Mao could imagine that at some point the Red Army would be able to win an all-out, head-on-head fight with the Nationalists, but the gap between French and American capabilities and Vietnamese Communist capabilities was so much wider.

At the same time, trading space for time wasn’t going to be much of an option either. China, of course, is a very large country, with many regions that are both vast, difficult to move in, and sparsely populated. It was thus possible for Mao to have his bases in places where Nationalist armies literally could not reach. That was never going to be possible in Vietnam, a country in which almost the entire landmass is within 200 miles of the coast (most of it is far, far less than that) and which is about 4% the size of China.

So the theory is going to have to be adjusted, but the basic groundwork – protract the war, focus on will rather than firepower, grind your enemy down slowly and proceed in phases – remains.

I’m going to need to simplify here, but Giáp makes several key alterations to Mao’s model of protracted war. First, even more than Mao, the political element in the struggle was emphasized as part of the strategy, raised to equality as a concern with the military side and fused with the military operation; together they were termed dau tranh, roughly “the struggle”. Those political activities were divided into three main components. Action among one’s own people consisted of propaganda and motivation designed to reinforce the will of the populace that supported the effort and to gain recruits. Then, action among the enemy people – here meaning Vietnamese who were under the control of the French colonial government or South Vietnam and not yet recruited into the struggle – a mix of propaganda and violent action to gain converts and create dissension. Finally, action against the enemy military, which consisted of what we might define as terroristic violence used as message-sending to negatively impact enemy morale and to encourage Vietnamese who supported the opposition to stop doing so for their own safety.

Part of the reason the political element of this strategy was so important was that Giáp knew that casualty ratios, especially among guerrilla forces – on which, as we’ll see, Giáp would have to rely more heavily – would be very unfavorable. Thus effective recruitment and strong support among the populace was essential not merely to conceal guerrilla forces but also to replace the expected severe losses that came with fighting at such a dramatic disadvantage in industrial firepower.

That concern in turn shaped force-structure. Giáp theorized an essentially three-tier system of force structure. At the bottom were the “popular troops”, essentially politically agitated peasants. Lightly armed, minimally trained but with a lot of local knowledge about enemy dispositions, who exactly supports the enemy and the local terrain, these troops could both accomplish a lot of the political objectives and provide information as well as functioning as local guerrillas in their own villages. Casualties among popular troops were expected to be high as they were likely to “absorb” reprisals from the enemy for guerrilla actions. Experienced veterans of these popular troops could then be recruited up into the “regional troops”, trained men who could now be deployed away from their home villages as full-time guerrillas, and in larger groups. While popular troops were expected to take heavy casualties, regional troops were carefully husbanded for important operations or used to organize new units of popular troops. Collectively these two groups are what are often known in the United States as the Viet Cong, though historians tend to prefer their own name for themselves, the National Liberation Front (Mặt trận Dân tộc Giải phóng miền Nam Việt Nam, “National Liberation Front for South Vietnam”) or NLF. Finally, once the French were forced to leave and Giáp had a territorial base he could operate from in North Vietnam, there were conventional forces, the regular army – the People’s Army of Vietnam (PAVN) – which would build up and wait for that third-phase transition to conventional warfare.

The greater focus on the structure of courses operating in enemy territory reflected Giáp’s adjustment of how the first phase of the protracted war would be fought. Since he had no mountain bases to fall back to, the first phase relied much more on political operations in territory controlled by the enemy and guerrilla operations, once again using the local supportive population as the cover to allow guerrillas and political agitators (generally the same folks, cadres drawn from the regional troops to organize more popular troops) to move undetected. Guerrilla operations would compel the less-casualty-tolerant enemy to concentrate their forces out of a desire for force preservation, creating the second phase strategic stalemate and also clearing territory in which larger mobile forces could be brought together to engage in mobile warfare, eventually culminating in a shift in the third phase to conventional warfare using the regional and regular troops.

Finally, unlike Mao, who could envision (and achieve) a situation where he pushed the Nationalists out of the territories they used to recruit and supply their armies, the Vietnamese Communists had no hope (or desire) to directly attack France or the United States. Indeed, doing so would have been wildly counter-productive as it likely would have fortified French or American will to continue the conflict.

That limitation would, however, demand substantial flexibility in how the Vietnamese Communists moved through the three phases of protracted war. This was not something realized ahead of time, but something learned through painful lessons. Leadership in the Democratic Republic of Vietnam (DRV = North Vietnam) was a lot more split than among Mao’s post-Long-March Chinese Communist Party; another important figure, Lê Duẩn, who became general secretary in 1960, advocated for a strategy of “general offensive” paired with a “general uprising” – essentially jumping straight to the third phase. The effort to implement that strategy in 1964 nearly overran the South, with ARVN (Army of the Republic of Vietnam – the army of South Vietnam) being defeated by PAVN and NLF forces at the Battles of Bình Giã and Đồng Xoài (Dec. 1964 and June 1965, respectively), but this served to bring the United States more fully into the war – a tactical and operational victory that produced a massive strategic setback.

Lê Duẩn did it again in 1968 with the Tet Offensive, attempting a general uprising which, in an operational sense, mostly served to reveal NLF and PAVN formations, exposing them to US and ARVN firepower and thus to severe casualties, though politically and thus strategically the offensive ended up being a success because it undermined American will to continue the fight. American leaders had told the American public that the DRV and the NLF were largely defeated, broken forces – the sudden show of strength exposed those statements as lies, degrading support at home. Nevertheless, in the immediate term, the Tet Offensive’s failure on the ground nearly destroyed the NLF and forced the DRV to back down the phase-ladder to recover. Lê Duẩn actually did it again in 1972 with the Eastern Offensive when American ground troops were effectively gone, exposing his forces to American airpower and getting smashed up for his troubles.

It is difficult to see Lê Duẩn’s strategic impatience as much more than a series of blunders – but crucially Giáp’s framework allowed for recovery from these sorts of defeats. In each case, the NLF and PAVN forces were compelled to do something Mao’s model hadn’t really envisaged, which was to transition back down the phase system, dropping back to phase II or even phase I in response to failed transitions to phase III. By moving more flexibly between the phases (while retaining a focus on the conditions of eventual strategic victory), the DRV could recover from such blunders. I think Wayne Lee actually puts it quite well that whereas Mao’s plan relied on “many small victories” adding up to a large victory (without the quick decision of a single large victory), Giáp’s more flexible framework could survive many small defeats on the road to an eventual strategic victory when the will of the enemy to continue the conflict was exhausted.

Bret Devereaux, “Collections: How the Weak Can Win – A Primer on Protracted War”, A Collection of Unmitigated Pedantry, 2022-03-03.

January 17, 2024

QotD: Did Hari Seldon live in vain?

Filed under: Books, History, Quotations — Tags: , , , , , , — Nicholas @ 01:00

Both cliodynamics and psychohistory assume these differences and problems “come out in the wash” over a long enough period and a big enough sample. It doesn’t take much of a counterfactual thought experiment about how small changes by individuals could lead to enormous historical differences to see that they don’t. The defense that cliodynamics only deals in probabilities is little comfort here: in fact the apparent randomness (which one may argue is merely complexity on a scale that is beyond simulation) swallows the patterns. One could easily argue, for instance, that the extremely unlikely career of our fellow Temujin is a necessary cause (albeit merely one of many, several of which might be considered deeply improbable) for the fact that all commercial pilots and air traffic controllers worldwide have to learn a form of English (which one may well assume has its own structural knock-on effects in terms of the language used for business and from there the outsized cultural impact of English-speaking countries).1 No one in 1158 was likely to have supposed that English – a language at that time not even spoken by the English nobility (they spoke French)! – would become the first truly global lingua franca (and arguably the only one, though here caveats may overwhelm the claim) and thus the language of aviation. But that is precisely the kind of big structural change that is going to be really impactful on all sorts of other questions, like patterns of commerce, wealth, culture and influence.

Such complex causation defies general laws (even before we get into the fact that humans also observe history, which creates even more complexity) with such tremendous import from such unlikely events in an experiment which can only be run once without a control.

The other problem is evidence. Attempting to actually diagnose and model societies like this demands a lot of the data and not merely that you need a lot of it. You need consistent data which projects very far back in history which is accurate and fairly complete, so that it can be effectively compared. Trying, for instance, to compare ancient population estimates, which often have error bars of 100% or more, with modern, far more precise population estimates is bound to cause some real problems in teasing out clear correlations in data. The assumptions you make in tuning those ancient population figures can and will swallow any conclusions you might draw from the comparison with modern figures beyond the fairly obvious (there are more people now). But even the strongest administrative states now have tremendous difficulty getting good data on their own lower classes! Much of the “data” we think we have are themselves statistical estimates. The situation even in the very recent past is much worse and only degrades from there as one goes further back!

By way of example, I was stunned that Turchin figures he can identify “elite overproduction” and quantify wealth concentration into the deep past, including into the ancient world (Romans, late Bronze Age, etc). I study the Romans; their empire is only 2,000 years ago and moreover probably the single best-attested ancient society apart from perhaps Egypt or China (and even then I think Rome comes out quite solidly ahead). And even in that context, our estimates for the population of Roman Italy range from c. 5m to three to four times that much. Estimates for the size of the Roman budget under Augustus or Tiberius (again, by far the best attested period we have) range wildly (though within an order of magnitude, perhaps around 800 million sestertii). Even establishing a baseline for this society with the kind of precision that might let you measure important but modest increases in the size of the elite is functionally impossible with such limited data.

When it comes to elites, we have at best only one historical datapoint for the size of the top Roman census class (the ordo equester) and it’s in 225 BC, but as reported by Polybius in the 140s and also he may have done the math wrong and it also isn’t clear if he’s actually captured the size of the census class! We know in the imperial period what the minimum wealth requirement to be in the Senate was, but we don’t know what the average wealth of a senator was (we tend to hopefully assume that Pliny the Younger is broadly typically, but he might not have been!), nor do we know the size of the senatorial class itself (formed as a distinct class only in the empire), nor do we know how many households there were of senatorial wealth but which didn’t serve in the Senate because their members opted not to run for public office. One can, of course, make educated guesses for these things (it is often useful and important to do so), but they are estimates founded on guesses supported by suppositions; a castle of sand balanced atop other castles of sand. We can say with some confidence that the Late Roman Republic and the Early Roman Empire saw tremendous concentrations of wealth; can we quantity that with much accuracy? No, not really; we can make very rough estimates, but these are at the mercy of their simplifying assumptions.

And this is, to be clear, the very best attested ancient society and only about 2,000 years old at that. The data situation for other ancient societies can only be worse – unless, unless one begins by assuming elite overproduction is a general feature of complex, wealthy societies and then reads that conclusion backwards into what little data exists. But that isn’t historical research; it is merely elevating confirmation bias to a research methodology.

As noted, I have other nitpicks – particularly the tendency to present very old ideas as new discoveries, like secular cycles (Polybius, late 2nd century BC) or war as the foundation of complex societies (Heraclitus, d. c. 475 BC) without always seeming to appreciate just how old and how frequently recurring the idea is (such that it might, for instance, be the sort of intuitive idea many people might independently come up with, even if it was untrue or that it might be the kind of idea that historians had considered long ago and largely rejected for well established reasons) – but this will, I hope, suffice for a basic explanation of why I find the idea of this approach unsatisfying. This is, to be clear, not a rejection of the role of data or statistics in history, both of which can be tremendously important. Nor is it a rejection of the possible contributions of non-historians (who have important contributions to make), though I would ask that someone wading into the field familiarize themselves with it (perhaps by doing some traditional historical research), before declaring they had revolutionized the field. Rather it is an argument both that these things cannot replace more traditional historical methods and also that their employment, like the employment of any historical method, must come with a very strong dose of epistemic humility.

Psychohistory only works in science fiction where the author, as the god of his universe, can make it work. Today’s psychohistorians have no such power.

Bret Devereaux, “Fireside Friday: October 15, 2021”, A Collection of Unmitigated Pedantry, 2021-10-15.


    1. For those confused by the causation, the Mongols are considered the most likely vector for the transmission of gunpowder from China, where it was invented, to Europe. Needless to say, having a single polity spanning the entire Eurasian Steppe at the precise historical moment for this to occur sure seems like a low probability event! In any event, European mastery of gunpowder led directly into European naval dominance in the world’s oceans (its impact on land warfare dominance is much more complex) which in turn led to European dominance at sea. At the same time, the English emphasis on gunnery over boarding actions early in this period (gunpowder again) provided a key advantage which contributed to subsequent British naval dominance among European powers (and also the British navy’s “cult of gunnery” in evidence to the World Wars at least), which in turn allowed for the wide diffusion of English as a business and trade language. In turn, American and British prominence in the post-WWII global order made English the natural language for NATO and thus the ICAO convention that English be used universally for all aircraft communication.

January 12, 2024

QotD: Rome’s Italic “allies”

Filed under: Europe, History, Law, Military, Quotations — Tags: , , , , , — Nicholas @ 01:00

The Roman Republic spent its first two and a half centuries (or so) expanding fitfully through peninsular Italy (that is, Italy south of the Po River Valley, not including Sicily). This isn’t the place for a full discussion of the slow process of expanding Roman control (which wouldn’t be entirely completed until 272 with the surrender of Tarentum). The consensus position on the process is that it was one in which Rome exploited local rivalries to champion one side or the other making an ally of the one by intervening and the other by defeating and subjecting them (this view underlies the excellent M.P. Fronda, Between Rome and Carthage: Southern Italy During the Second Punic War (2010); E.T. Salmon, The Making of Roman Italy (1982) remains a valuable introduction to the topic). More recently, N. Terranato, The Early Roman Expansion into Italy (2019) has argued for something more based on horizontal elite networks and diplomacy, though this remains decidedly a minority opinion (I myself am rather closer to the consensus position, though Terranato has a point about the role of elite negotiation in the process).

The simple (and perhaps now increasingly dated) way I explain this to my students is that Rome follows the Goku Model of Imperialism: I beat you, therefore we are now friends. Defeated communities in Italy (the system is different outside of Italy) are made to join Rome’s alliance network as socii (“allies”), do not have tribute imposed on them, but must supply their soldiers to fight with Rome when Rome is at war, which is always.

It actually doesn’t matter for us how this expansion was accomplished; rather we’re interested in the sort of order the Romans set up when they did expand. The basic blueprint for how Rome interacted with the Italians may have emerged as early as 493 with the Foedus Cassianum, a peace treaty which ended a war between Rome and [the] Latin League (an alliance of ethnically Latin cities in Latium). To simplify quite a lot, the Roman “deal” with the communities of Italy which one by one came under Roman power went as follows:

  • All subject communities in Italy became socii (“allies”). This was true if Rome actually intervened to help you as your ally, or if Rome intervened against you and conquered your community.
  • The socii retained substantial internal autonomy (they kept their own laws, religions, language and customs), but could have no foreign policy except their alliance with Rome.
  • Whenever Rome went to war, the socii were required to send soldiers to assist Rome’s armies; the number of socii in Rome’s armies ranged from around half to perhaps as much as two thirds at some points (though the socii outnumbered the Romans in Italy about 3-to-1 in 225, so the Romans made more strenuous manpower demands on themselves than their allies).
  • Rome didn’t impose tribute on the socii, though the socii bore the cost of raising and paying their detachments of troops in war (except for food, which the Romans paid for, Plb. 6.39.14).
  • Rome goes to war every year.
  • No, seriously. Every. Year. From 509 to 31BC, the only exception was 241-235. That’s it. Six years of peace in 478 years of republic. The socii do not seem to have minded very much; they seem to have generally been as bellicose as the Romans and anyway …
  • The spoils of Roman victory were split between Rome and the socii. Consequently, as one scholar memorably put it, the Roman alliance was akin to, “a criminal operation which compensates its victims by enrolling them in the gang and inviting them to share to proceeds of future robberies” (T. Cornell, The Beginnings of Rome (1995)).
  • The alliance system included a ladder of potential relationships with Rome which the Romans might offer to loyal allies.

Now this isn’t a place for a long discussion of the Roman alliance system in Italy (that place is in the book I am writing), so I want us to focus more narrowly on the bolded points here and how they add up to significant changes in who counted as “Roman” over time. But I should note here that while I am calling this a Roman “alliance system” (because the Romans call these fellows socii, allies) this was by no means an equal arrangement: Rome declared the wars, commanded the armies and set the quotas for military service. The “allies” were thus allies in name only, but in practice subjects; nevertheless the Roman insistence on calling them allies and retaining the polite fiction that they were junior partners rather than subject communities, by doing things like sharing the loot and glory of victory, was a major contributor to Roman success (as we’ll see).

First, the Roman alliance system was split into what were essentially tiers of status. At the top were Roman citizens optimo iure (“full rights”, literally “with the best right”) often referred to on a community basis as civitas cum suffragio (“citizenship with the vote”). These were folks with the full benefits of Roman citizenship and the innermost core of the Roman polity, who could vote and (in theory, though for people of modest means, only in theory) run for office. Next were citizens non optimo iure, often referred to as having civitas sine suffragio (“citizenship without the vote”); they had all of the rights of Roman citizens except for political participation in Rome. This was almost always because they lived in communities well outside the city of Rome with their own local government (where they could vote); we’ll talk about how you get those communities in a second. That said, citizens without the vote still had the right to hold property in Roman territory and conduct business with the full protection of a Roman citizen (ius commercii) and the right to contract legal marriages with Roman citizens (ius conubii). They could do everything except for vote or run for offices in Rome itself.

Next down on the list were socii (allies) of Latin status (note this is a legal status and is entirely disconnected from Latin ethnicity; by the end of this post, Rome is going to be block-granting Latin status to Gauls in Cisalpine Gaul, for instance). Allies of Latin status got the benefits of the ius commercii, as well as the ability to move from one community with Latin status to another without losing their status. Unlike the citizens without the vote, they didn’t automatically get the right to contract legal marriages with Roman citizens, but in some cases the Romans granted that right to either individuals or entire communities (scholars differ on exactly how frequently those with Latin status would have conubium with Roman citizens; the traditional view is that this was a standard perk of Latin status, but see Roselaar, op. cit.). That said, the advantages of this status were considerable – particularly the ability to conduct business under Roman law rather than what the Romans called the “ius gentium” (“law of peoples”) which governed relations with foreigners (peregrini in Roman legal terms) and were less favorable (although free foreigners in Rome had somewhat better protections, on the whole, than free foreigners – like metics – in a Greek polis).

Finally, you had the socii who lacked these bells and whistles. That said, because their communities were allies of Rome in Italy (this system is not exported overseas), they were immune to tribute, Roman magistrates couldn’t make war on them and Roman armies would protect them in war – so they were still better off than a community that was purely of peregrini (or a community within one of Rome’s provinces; Italy was not a province, to be clear).

The key to this system is that socii who stayed loyal to Rome and dutifully supplied troops could be “upgraded” for their service, though in at least some cases, we know that socii opted not to accept Roman citizenship but instead chose to keep their status as their own community (the famous example of this were the allied soldiers of Praenesti, who refused Roman citizenship in 211, Liv. 23.20.2). Consequently, whole communities might inch closer to becoming Romans as a consequence of long service as Rome’s “allies” (most of whom, we must stress, were at one point or another, Rome’s Italian enemies who had been defeated and incorporated into Rome’s Italian alliance system).

But I mentioned spoils and everyone loves loot. When Rome beat you, in the moment after you lost, but before the Goku Model of Imperialism kicked in and you became friends, the Romans took your stuff. This might mean they very literally sacked your town and carried off objects of value, but it also – and for us more importantly – meant that the Romans seized land. That land would be added to the ager Romanus (the body of land in Italy held by Rome directly rather than belonging to one of Rome’s allies). But of course that land might be very far away from Rome which posed a problem – Rome was, after all, effectively a city-state; the whole point of having the socii-system is that Rome lacked both the means and the desire to directly govern far away communities. But the Romans didn’t want this land to stay vacant – they need the land to be full of farmers liable for conscription into Rome’s armies (there was a minimum property requirement for military service because you needed to be able to buy your own weapons so they had to be freeholding farmers, not enslaved workers). By the by, you can actually understand most of Rome’s decisions inside Italy if you just assume that the main objective of Roman aristocrats is to get bigger armies so they can win bigger battles and so burnish their political credentials back in Rome – that, and not general altruism (of which the Romans had fairly little), was the reason for Rome’s relatively generous alliance system.

The solution was for Rome to essentially plant little Mini-Me versions of itself on that newly taken land. This had some major advantages: first, it put farmers on that land who would be liable for conscription (typically placing them in carefully measured farming plots through a process known as centuriation), either as socii or as Roman citizens (typically without the vote). Second, it planted a loyal community in recently conquered territory which could act as a position of Roman control; notably, no Latin colony of this sort rebelled against Rome during the Second Punic War when Hannibal tried to get as many of the socii to cast off the Romans as he could.

What is important for what we are doing here is to note that the socii seem to have been permitted to contribute to the initial groups settling in these colonies and that these colonies were much more tightly tied to Rome, often having conubium – that right of intermarriage again – with Roman citizens. The consequence of this is that, by the late third century (when Rome is going to fight Carthage) the ager Romanus – the territory of Rome itself – comprises a big chunk of central Italy […] but the people who lived there as Roman citizens (with and without the vote) were not simply descendants of that initial Roman citizen body, but also a mix of people descended from communities of socii throughout Italy.

Bret Devereaux, “Collections: The Queen’s Latin or Who Were the Romans, Part II: Citizens and Allies”, A Collection of Unmitigated Pedantry, 2021-06-25.

January 8, 2024

QotD: Nomadic cultures’ territorial needs

Filed under: Americas, Asia, History, Quotations — Tags: , , , — Nicholas @ 01:00

This bears little resemblance to the strategic concerns of historical nomads. As a direct consequence of failing to understand the subsistence systems that nomads relied on, [George R.R.] Martin [in his descriptions of the Dothraki nomad culture] has also rendered their patterns of warfare functionally unintelligible.

The chief thing that nomads, both Great Plains Native Americans and Eurasian Steppe Nomads used violence to secure control of is the one thing the Dothraki never do: territory. To agrarian elites (who write most of our sources) and modern viewers, the vast expanses of grassland that nomads live on often look “empty” and “unused” (and thus not requiring protection), but that’s not correct at all. Those “empty” grasslands are very much in use; the nomads know this and are abundantly willing to defend those expanses of grass with lethal force to keep out interlopers. Remember: the knife’s edge of subsistence for nomads is very thin indeed, so it takes only a small disruption of the subsistence system to push the community into privation.

For the Eurasian Steppe nomad, the grass that isn’t near their encampment is in the process of regrowth for the season or year when it will be near their encampment and need to support their herds. Allowing some rival nomadic group to move their sheep and their horses over your grassland – eating the essential grass along the way – means that grass won’t be there for your sheep and your horses when you need it; and when the sheep starve, so will you. So if you are stronger than the foreign interloper, you will gather up all of your warriors and confront them directly. If you are weaker, you will gather your warriors and raid the interloper, trying to catch members of their group when they’re alone, to steal horses and sheep (we’ll come back to that); you are trying to inflict a cost for being on your territory so that they will go away and not come back.

The calculus for nomadic hunters like the Great Plains Native Americans is actually fairly similar. Land supports bison, bison support tribal groupings, so tribal groups defend access to land with violent reprisals against groups that stray into their territory or hunt “their” bison. And of course the reverse is true – these groups aren’t merely looking to hold on to their own territory, but to expand their subsistence base by taking new territory. Remember: the large tribe is the safe tribe; becoming the large tribe means having a larger subsistence base. And on either the plains or the steppe, the subsistence base is fundamentally measured in grass and the animals – be they herded sheep or wild bison – that grass supports. Both Secoy and McGinnis (op. cit.) are full of wars of these sorts on the Great Plains, where one group, gaining a momentary advantage, violently pushes others to gain greater territory (and thus food) for itself. For instance, Secoy (op. cit., 6-32) discusses how access to horses allowed the Plains Apache to rapidly violently expand over the southern Plains in the late 17th century, before being swept off of them by the fully nomadic Ute and Comanche in the first third of the 18th. As McGinnis notes (op. cit., 16ff), on the Northern Plains, prior to 1800 it initially was the Shoshone who were dominant and expanding, but around 1800 began to be pushed out by the Blackfoot, who in turn would, decades later, be pushed by the expanding Sioux.

This kind of warfare is different from the way that settled, agrarian armies take territory. Generally, the armies of agrarian states seek to seize (farm-) land with its population of farmers mostly intact and exert control both over the land and the people subsequently in order to extract the agricultural surplus. But generally (obviously there are notable exceptions) nomads both lack the administrative structures to exert that kind of control and are also very able to effectively resist that sort of control themselves (it is hard for even nomads to tax nomads), making “empire building” along agrarian lines difficult or undesirable (unless you are the Mongols). So instead, polities are trying to inflict losses (typically more through raiding and ambush than battle). Since rivals will tend to avoid areas that become unsafe due to frequent raiding, the successful tribe can essentially push back an opposing tribe with frequent raids. In extreme circumstances, a group may feel threatened enough to get up and move entirely – which of course creates conflict wherever they go, since their plan is to disposess the next group along the way of their territory.

Within that security context, larger scale groupings – alliances, confederations, and super-tribal “nations” – are common. On the Eurasian Steppe, such alliances tended to be personal, although there was a broad expectation that a given ethnic grouping would work together against other ethnic groupings (an expectation that Chinggis actually worked very hard, once he became the Great Khan of a multi-ethnic “Mongol” army, to break up through the decimal organization system; this reorganization is part of what made the Mongol Empire so much more successful than previous Steppe confederations). Likewise, even a cursory look at the Native Americans of the Great Plains produces both a set of standard enmities (the Sioux and the Crow, for instance) but also webs of peace agreements, treaties, alliances, confederations and so on. The presence of British, French, Spanish and American forces (both traders and military forces) fit naturally into that system; the Plains Apache allied with the Spanish against the Comanche, the Crow with the United States against the Sioux and so on. Such allies might not only help out in a conflict, but also deter war and raiding because their strength and friendship made lethal retaliation likely (don’t attack someone allied to Chinggis Khan and expect to survive the experience …).

Exactly none of that complexity appears with the Dothraki, who have no alliances, no peace agreements, no confederations and no territory to attack or defend. Instead, the Dothraki simply sail around the grass sea, fighting whenever they should chance to meet.

Bret Devereaux, “Collections: That Dothraki Horde, Part IV: Screamers and Howlers”, A Collection of Unmitigated Pedantry, 2021-01-08.

January 3, 2024

QotD: Iron and steel

Filed under: History, Quotations, Science, Technology — Tags: , , , , , — Nicholas @ 01:00

I don’t want to get too bogged down in the exact chemistry of how the introduction of carbon changes the metallic matrix of the iron; you are welcome to read about it. As the carbon content of the iron increases, the iron’s basic characteristics – its ductility and hardness (among others) – changes. Pure iron, when it takes a heavy impact, tends to deform (bend) to absorb that impact (it is ductile and soft). Increasing the carbon-content makes the iron harder, causing it to both resist bending more and also to hold an edge better (hardness is the key characteristic for holding an edge through use). In the right amount, the steel is springy, bending to absorb impacts but rapidly returning to its original shape. But too much carbon and the steel becomes too hard and not ductile enough, causing it to become brittle.

Compared to the other materials available for tools and weapons, high carbon “spring steel” was essentially the super-material of the pre-modern world. High carbon steel is dramatically harder than iron, such that a good steel blade will bite – often surprisingly deeply – into an iron blade without much damage to itself. Moreover, good steel can take fairly high energy impacts and simply bend to absorb the energy before springing back into its original shape (rather than, as with iron, having plastic deformation, where it bends, but doesn’t bend back – which is still better than breaking, but not much). And for armor, you may recall from our previous look at arrow penetration, a steel plate’s ability to resist puncture is much higher than the same plate made of iron (bronze, by the by, performs about as well as iron, assuming both are work hardened). of course, different applications still prefer different carbon contents; armor, for instance, tended to benefit from somewhat lower carbon content than a sword blade.

It is sometimes contended that the ancients did not know the difference between iron and steel. This is mostly a philological argument based on the infrequency of a technical distinction between the two in ancient languages. Latin authors will frequently use ferrum (iron) to mean both iron and steel; Greek will use σίδηρος (sideros, “iron”) much the same way. The problem here is that high literature in the ancient world – which is almost all of the literature we have – has a strong aversion to technical terms in general; it would do no good for an elite writer to display knowledge more becoming to a tradesman than a senator. That said in a handful of spots, Latin authors use chalybs (from the Greek χάλυψ) to mean steel, as distinct from iron.

More to the point, while our elite authors – who are, at most dilettantish observers of metallurgy, never active participants – may or may not know the difference, ancient artisans clearly did. As Tylecote (op. cit.) notes, we see surface carburization on tools as clearly as 1000 B.C. in the Levant and Egypt, although the extent of its use and intentionality is hard to gauge to due rust and damage. There is no such problem with Gallic metallurgy from at least the La Tène period (450 BCE – 50 B.C.) or Roman metallurgy from c. 200 B.C., because we see evidence of smiths quite deliberately varying carbon content over the different parts of sword-blades (more carbon in the edges, less in the core) through pattern welding, which itself can leave a tell-tale “streaky” appearance to the blade (these streaks can be faked, but there’s little point in faking them if they are not already understood to signify a better weapon). There can be little doubt that the smith who welds a steel edge to an iron core to make a sword blade understands that there is something different about that edge (especially since he cannot, as we can, precisely test the hardness of the two every time – he must know a method that generally produces harder metal and be working from that assumption; high carbon steel, properly produced, can be much harder than iron, as we’ll see).

That said, our ancient – or even medieval – smiths do not understand the chemistry of all of this, of course. Understanding the effects of carbuzation and how to harness that to make better tools must have been something learned through experience and experimentation, not from theoretical knowledge – a thing passed from master to apprentice, with only slight modification in each generation (though it is equally clear that techniques could move quite quickly over cultural boundaries, since smiths with an inferior technique need only imitate a superior one).

Bret Devereaux, “Collections: Iron, How Did They Make It, Part IVa: Steel Yourself”, A Collection of Unmitigated Pedantry, 2020-10-09.

December 18, 2023

QotD: A short history of the (long) Fifth Century

The chaotic nature of the fragmentation of the Western Roman Empire makes a short recounting of its history difficult but a sense of chronology and how this all played out is going to be necessary so I will try to just hit the highlights.

First, its important to understand that the Roman Empire of the fourth and fifth centuries was not the Roman Empire of the first and second centuries (all AD, to be clear). From 235 to 284, Rome had suffered a seemingly endless series of civil wars, waged against the backdrop of worsening security situations on the Rhine/Danube frontier and a peer conflict in the east against the Sassanid Empire. These wars clearly caused trade and economic disruptions as well as security problems and so the Roman Empire that emerges from the crisis under the rule of Diocletian (r. 284-305), while still powerful and rich by ancient standards, was not as powerful or as rich as in the first two centuries and also had substantially more difficult security problems. And the Romans subsequently are never quite able to shake the habit of regular civil wars.

One of Diocletian’s solutions to this problem was to attempt to split the job of running the empire between multiple emperors; Diocletian wanted a four emperor system (the “tetrarchy” or “rule of four”) but what stuck among his successors, particular Constantine (r. 306-337) and his family (who ruled till 363), was an east-west administrative divide, with one emperor in the east and one in the west, both in theory cooperating with each other ruling a single coherent empire. While this was supposed to be a purely administrative divide, in practice, as time went on, the two halves increasing had to make do with their own revenues, armies and administration; this proved catastrophic for the western half, which had less of all of these things (if you are wondering why the East didn’t ride to the rescue, the answer is that great power conflict with the Sassanids). In any event, with the death of Theodosius I in 395, the division of the empire became permanent; never again would one man rule both halves.

We’re going to focus here almost entirely on the western half of the empire […]

The situation on the Rhine/Danube frontier was complex. The peoples on the other side of the frontier were not strangers to Roman power; indeed they had been trading, interacting and occasionally raiding and fighting over the borders for some time. That was actually part of the Roman security problem: familiarity had begun to erode the Roman qualitative advantage which had allowed smaller professional Roman armies to consistently win fights on the frontier. The Germanic peoples on the other side had begun to adopt large political organizations (kingdoms, not tribes) and gained familiarity with Roman tactics and weapons. At the same time, population movements (particularly by the Huns) further east in Europe and on the Eurasian Steppe began creating pressure to push these “barbarians” into the empire. This was not necessarily a bad thing: the Romans, after conflict and plague in the late second and third centuries, needed troops and they needed farmers and these “barbarians” could supply both. But […] the Romans make a catastrophic mistake here: instead of reviving the Roman tradition of incorporation, they insisted on effectively permanent apartness for the new arrivals, even when they came – as most would – with initial Roman approval.

This problem blows up in 378 in an event – the Battle of Adrianople – which marks the beginning of the “decline and fall” and thus the start of our “long fifth century”. The Goths, a Germanic-language speaking people, pressured by the Huns had sought entry into Roman territory; the emperor in the East, Valens, agreed because he needed soldiers and farmers and the Goths might well be both. Local officials, however, mistreated the arriving Goth refugees leading to clashes and then a revolt; precisely because the Goths hadn’t been incorporated into the Roman military or civil system (they were settled with their own kings as “allies” – foederati – within Roman territory), when they revolted, they revolted as a united people under arms. The army sent to fight them, under Valens, engaged foolishly before reinforcements could arrive from the West and was defeated.

In the aftermath of the defeat, the Goths moved to settle in the Balkans and it would subsequently prove impossible for the Romans to move them out. Part of the reason for that was that the Romans themselves were hardly unified. I don’t want to get too deep in the weeds here except to note that usurpers and assassinations among the Roman elite are common in this period, which generally prevented any kind of unified Roman response. In particular, it leads Roman leaders (both generals and emperors) desperate for troops, often to fight civil wars against each other, to rely heavily on Gothic (and later other “barbarian”) war leaders. Those leaders, often the kings of their own peoples, were not generally looking to burn the empire down, but were looking to create a place for themselves in it and so understandably tended to militate for their own independence and recognition.

Indeed, it was in the context of these sorts of internal squabbles that Rome is first sacked, in 410 by the Visigothic leader Alaric. Alaric was not some wild-eyed barbarian freshly piled over the frontier, but a Roman commander who had joined the Roman army in 392 and probably rose to become king of the Visigoths as well in 395. Alaric had spent much of the decade before 410 alternately feuding with and working under Stilicho, a Romanized Vandal, who had been a key officer under the emperor Theodosius I (r. 379-395) and a major power-player after his death because he controlled Honorius, the young emperor in the West. Honorius’ decision to arrest and execute Stilicho in 408 seems to have precipitated Alaric’s move against Rome. Alaric’s aim was not to destroy Rome, but to get control of Honorius, in particular to get supplies and recognition from him.

That pattern: Roman emperors, generals and foederati kings – all notionally members of the Roman Empire – feuding, was the pattern that would steadily disassemble the Roman Empire in the west. Successful efforts to reassert the direct control of the emperors on foederati territory naturally created resentment among the foederati leaders but also dangerous rivalries in the imperial court; thus Flavius Aetius, a Roman general, after stopping Attila and assembling a coalition of Visigoths, Franks, Saxons and Burgundians, was assassinated by his own emperor, Valentinian III in 454, who was in turn promptly assassinated by Aetius’ supporters, leading to another crippling succession dispute in which the foederati leaders emerged as crucial power-brokers. Majorian (r. 457-461) looked during his reign like he might be able to reverse this fragmentation, but his efforts at reform offended the senatorial aristocracy in Rome, who then supported the foederati leader Ricimer (half-Seubic, half-Visigoth but also quite Romanized) in killing Majorian and putting the weak Libius Severus (r. 461-465) on the throne. The final act of all of this comes in 476 when another of these “barbarian” leaders, Odoacer, deposed the latest and weakest Roman emperor, the boy Romulus Augustus (generally called Romulus Augustulus – the “little” Augustus) and what was left of the Roman Empire in the west ceased to exist in practice (Odoacer offered to submit to the authority of the Roman Emperor in the East, though one doubts his real sincerity). Augustulus seems to have taken it fairly well – he retired to an estate in Campania originally built by the late Republican Roman general Lucius Licinius Lucullus and lived out his life there in leisure.

The point I want to draw out in all of this is that it is not the case that the Roman Empire in the west was swept over by some destructive military tide. Instead the process here is one in which the parts of the western Roman Empire steadily fragment apart as central control weakens: the empire isn’t destroyed from outside, but comes apart from within. While many of the key actors in that are the “barbarian” foederati generals and kings, many are Romans and indeed (as we’ll see next time) there were Romans on both sides of those fissures. Guy Halsall, in Barbarian Migrations and the Roman West (2007) makes this point, that the western Empire is taken apart by actors within the empire, who are largely committed to the empire, acting to enhance their own position within a system the end of which they could not imagine.

It is perhaps too much to suggest the Roman Empire merely drifted apart peacefully – there was quite a bit of violence here and actors in the old Roman “center” clearly recognized that something was coming apart and made violent efforts to put it back together (as Halsall notes, “The West did not drift hopelessly towards its inevitable fate. It went down kicking, gouging and screaming”) – but it tore apart from the inside rather than being violently overrun from the outside by wholly alien forces.

Bret Devereaux, “Collections: Rome: Decline and Fall? Part I: Words”, A Collection of Unmitigated Pedantry, 2022-01-14.

December 14, 2023

QotD: The rise of castles in early Medieval Europe

Filed under: Europe, History, Military, Quotations — Tags: , , , , — Nicholas @ 01:00

While fortifications obviously had existed a long time, when we talk about castles, what we really mean is a kind of fortified private residence which also served as a military base. This form of fortification really only becomes prominent (as distinct from older walled towns and cities) in 9th century, in part because the collapse of central authority (due in turn to the fragmentation of the Carolingian Empire) led to local notables fortifying their private residences. This process was, unsurprisingly, particularly rapid and pronounced in the borderlands of the various Carolingian splinter kingdoms (where there were peer threats from the other splinters) and in areas substantially exposed to Scandinavian (read: Viking) raiding. And so functionally, a castle is a fortified house, though of course large castles could encompass many other functions. In particular, the breakdown of central authority meant that these local aristocrats also represented much of the local government and administration, which they ran not through a civil bureaucracy but through their own households and so in consequence their house (broadly construed) was also the local administrative center.

Now, we can engage here in a bit of a relatable thought experiment: how extensively do your fortify your house (or apartment)? I’ll bet the answer is actually not “none” – chances are your front door locks and your windows are designed to be difficult to open from the outside. But how extensive those protections are vary by a number of factors: homes in high crime areas might be made more resistant (multiple deadbolts, solid exterior doors rather than fancy glass-pane doors, possibly even barred windows at ground level). Lots of neighbors can lower the level of threat for a break-in, as can raw obscurity (as in a house well out into the country). Houses with lots of very valuable things in them might invest in fancy security systems, or at least thief deterring signs announcing fancy security systems. And of course the owner’s ability to actually afford more security is a factor. In short, home defenses respond to local conditions aiming not for absolute security, but for a balance of security and cost: in safe places, home owners “consume” that security by investing less heavily in it, while homeowners who feel less safety invest more in achieving that balance, in as much as their resources allow. And so the amount of security for a house is not a universal standard but a complicated function of the local danger, the resources available and the individual home owner’s risk tolerance. Crucially, almost no one aims for absolute home security.

And I go through this thought process because in their own way the same concerns dictate how castles – or indeed, any fortification – is constructed, albeit of course a fortified house that aims to hold off small armies rather than thieves is going to have quite a bit more in the way of defenses than your average house. No fortification is ever designed to be absolutely impenetrable (or perhaps most correctly put, no wise fortress designer ever aims at absolute impenetrability; surely some foolish ones have tried). This is a fundamental mistake in assessing fortifications that gets made very often: concluding that because no fortification can be built to withstand every assault, that fortification itself is useless; but withstanding every assault is not the goal. The goal is not to absolutely prohibit every attack but merely to raise the cost of an attack above either a potential enemy’s willingness to invest (so they don’t bother) or above their ability to afford (so the attack is attempted and fails) and because all of this is very expensive the aim is often a sort of minimum acceptable margin of security against an “expected threat” (which might, mind you, still be a lot of security, especially if the “expected threat” is very high). This is true of the castle itself, if for no other reason than that resources are scarce and there are always other concerns competing for them, but also for every component of its defenses: individual towers, gates and walls are not designed to be impenetrable, merely difficult enough.

This is particularly true in castle design because the individuals building these castles often faced fairly sharp limitations in the resources at their disposal. Castles as a style of fortification emerge in a context of political fragmentation, in particular the collapse of the Carolingian Empire, which left even the notional large kingdoms (like the kingdom of France) internally fragmented. Castles were largely being built not by kings but by counts and dukes who held substantial landholdings but nothing like the resources of Charlemagne or Louis the Pious, much less the Romans or Assyrians. Moreover, the long economic and demographic upswing of the Middle Ages was only just beginning to gain momentum; the great cities of the Roman world had shrunk away and the total level of economic production declined, so the sum resources available to these rulers were lower. Finally, the loss of the late Roman bureaucracy (replaced by these fragmented realms running on an economic system best termed “manorialism”) meant that the political authorities (the nobility) often couldn’t even get a hold of a very large portion of the available economic production they did have. Consequently, castle construction is all about producing what security you can with as little labor, money and resources as possible (this is always true of any fortification, mind you, merely that in this period the resource constraints are much tighter).

Bret Devereaux, “Collections: Fortification, Part III: Castling”, A Collection of Unmitigated Pedantry, 2021-12-10.

December 11, 2023

Roman glossary

Filed under: Europe, Government, History, Religion — Tags: , , , , , — Nicholas @ 03:00

As I continue to post QotD entries drawn from Bret Devereaux’s fascinating historical blog A Collection of Unmitigated Pedantry (with Dr. Devereaux’s kind permission, I hasten to add), the number of specialized terms from the Roman Republic and Empire also expands. As some of these terms pop up in my shorter excerpts without immediate context, I think that a glossary for Rome is called for (similar to the Spartan glossary, as there’s a lot more Roman content coming up, it being Dr. Devereaux’s area of academic specialization) to help explain the terms that I think may need expansion in these excerpts from his longer posts. As usual, most of the information is drawn directly from ACOUP (often from more than one original post) and where I’ve felt the need to interpolate any additional information it is enclosed in [square brackets].


(more…)

December 10, 2023

QotD: Roman citizenship

Filed under: Europe, History, Law, Quotations — Tags: , , , , — Nicholas @ 01:00

As with other ancient self-governing citizen bodies, the populus Romanus (the Roman people – an idea that was defined by citizenship) restricted political participation to adult citizen males (actual office holding was further restricted to adult citizen males with military experience, Plb. 6.19.1-3). And we should note at the outset that citizenship was stratified both by legal status and also by wealth; the Roman Republic openly and actively counted the votes of the wealthy more heavily than those of the poor, for instance. So let us avoid the misimpression that Rome was an egalitarian society; it was not.

The most common way to become a Roman citizen was by birth, though the Roman law on this question is more complex and centers on the Roman legal concept of conubium – the right to marry and produce legally recognized heirs under Roman law. Conubium wasn’t a right held by an individual, but a status between two individuals (though Roman citizens could always marry other Roman citizens). In the event that a marriage was lawfully contracted, the children followed the legal status of their father; if no lawfully contracted marriage existed, the child followed the status of their mother (with some quirks; Ulpian, Reg. 5.2; Gaius, Inst. 1.56-7 – on the quirks and applicability in the Republic and conubium in general, see S.T. Roselaar, “The Concept of Conubium in the Roman Republic” in New Frontiers: Law and Society in the Roman World, ed. P.J. du Plessis (2013)).

Consequently the children of a Roman citizen male in a legal marriage would be Roman citizens and the children of a Roman citizen female out of wedlock would (in most cases; again, there are some quirks) be Roman citizens. Since the most common way for the parentage of a child to be certain is for the child to be born in a legal marriage and the vast majority of legal marriages are going to involve a citizen male husband, the practical result of that system is something very close to, but not quite exactly the same as, a “one parent” rule (in contrast to Athens’ two-parent rule). Notably, the bastard children of Roman women inherited their mother’s citizenship (though in some cases, it would be necessarily, legally, to conceal the status of the father for this to happen, see Roselaar, op. cit., and also B. Rawson, “Spruii and the Roman View of Illegitimacy” in Antichthon 23 (1989)), where in Athens, such a child would have been born a nothos and thus a metic – resident non-citizen foreigner.

The Romans might extend the right of conubium with Roman citizens to friendly non-citizen populations; Roselaar (op. cit.) argues this wasn’t a blanket right, but rather made on a community-by-community basis, but on a fairly large scale – e.g. extended to all of the Campanians in 188 B.C. Importantly, Roman colonial settlements in Italy seem to pretty much have always had this right, making it possible for those families to marry back into the citizen body, even in cases where setting up their own community had caused them to lose all or part of their Roman citizenship (in exchange for citizenship in the new community).

The other long-standing way to become a Roman citizen was to be enslaved by one and then freed. An enslaved person held by a Roman citizen who was then freed (or manumitted) became a libertus (or liberta), by custom immediately the client of their former owner (this would be made into law during the empire) and by law a Roman citizen, although their status as a freed person barred them from public office. Since they were Roman citizens (albeit with some legal disability), their children – assuming a validly contracted marriage – would be full free-born Roman citizens, with no legal disability. And, since freedmen and freedwomen were citizens, they also could contract valid marriages with other Roman citizens, including freeborn ones […]. While most enslaved people in the Roman world had little to no hope of ever being manumitted (enslaved workers, for instance, on large estates far from their owners), Roman economic and social customs functionally required a significant number of freed persons and so a meaningful number of new Roman citizens were always being minted in the background this way. Rome’s apparent liberality with admission into citizenship seems to have been a real curiosity to the Greek world.

These processes thus churned in the background, minting new Romans on the edges of the populus Romanus who subsequently became full members of the Roman community and thus shared fully in the Roman legal identity.

Bret Devereaux, “Collections: The Queen’s Latin or Who Were the Romans, Part II: Citizens and Allies”, A Collection of Unmitigated Pedantry, 2021-06-25.

December 6, 2023

QotD: What Charles VIII brought back from his Naples vacation

… the basic formula for what would become gunpowder (saltpeter, charcoal, and sulfur in a roughly 75%, 15%, 10% mixture) was clearly in use in China by 1040 when we have our first attested formula, though saltpeter had been being refined and used as an incendiary since at least 808. The first guns appear to be extrapolations from Chinese incendiary “fire lances” (just add rocks!) and the first Chinese cannon appear in 1128. Guns arrive in Europe around 1300; our first representation of a cannon is from 1326, while we hear about them used in sieges beginning in the 1330s; the Mongol conquest and the sudden unification of the Eurasian Steppe probably provided the route for gunpowder and guns to move from China to Europe. Notably, guns seem to have arrived as a complete technology: chemistry, ignition system, tube and projectile.

There was still a long “shaking out” period for the new technology: figuring out how to get enough saltpeter for gunpowder (now that’s a story we’ll come back to some day), how to build a large enough and strong enough metal barrel, and how to actually use the weapons (in sieges? against infantry? big guns? little guns?) and so on. By 1453, the Ottomans have a capable siege-train of gunpowder artillery. Mehmed II (r. 1444-1481) pummeled the walls of Constantinople with some 5,000 shots using some 55,000 pounds of gunpowder; at last Theodosius’ engineers had met their match.

And then, in 1494, Charles VIII invaded Italy – in a dispute over the throne of Naples – with the first proper mobile siege train in Christendom (not in Europe, mind you, because Mehmed had beat our boy Chuck here by a solid four decades). A lot of changes had been happening to make these guns more effective: longer barrels allowed for more power and accuracy, wheeled carriages made them more mobile, trunnions made elevation control easier and some limited degree of caliber standardization reduced windage and simplified supply (though standardization at this point remains quite limited).

The various Italian states, exactly none of whom were excited to see Charles attempting to claim the Kingdom of Naples, could have figured that the many castles and fortified cities of northern Italy were likely to slow Charles down, giving them plenty of time to finish up their own holidays before this obnoxious French tourist showed up. On the 19th of October, 1494, Charles showed up to besiege the fortress at Mordano – a fortress which might well have been expected to hold him up for weeks or even months; on October 20th, 1494, Charles sacked Mordano and massacred the inhabitants, after having blasted a breach with his guns. Florence promptly surrendered and Charles marched to Naples, taking it in 1495 (it surrendered too). Francesco Guicciardini phrased it thusly (trans. via Lee, Waging War, 228),

    They [Charles’ artillery] were planted against the Walls of a Town with such speed, the Space between the Shots was so little, and the Balls flew so quick, and were impelled with such Force, that as much Execution was done in a few Hours, as formerly, in Italy, in the like Number of Days.

The impacts of the sudden apparent obsolescence of European castles were considerable. The period from 1450 to 1550 sees a remarkable degree of state-consolidation in Europe (broadly construed) castles, and the power they gave the local nobility to resist the crown, had been one of the drivers of European fragmentation, though we should be careful not to overstate the gunpowder impact here: there are other reasons for a burst of state consolidation at this juncture. Of course that creates a run-away effect of its own, as states that consolidate have the resources to employ larger and more effective siege trains.

Now this strong reaction doesn’t happen everywhere or really anywhere outside of Europe. Part of that has to do with the way that castles were built. Because castles were designed to resist escalade, the walls needed to be built as high as possible, since that was the best way to resist attacks by ladders or towers. But of course, given a fixed amount of building resources, building high also means building thin (and European masonry techniques enabled tall-and-thin construction with walls essentially being constructed with a thick layer of fill material sandwiched between courses of stone). But “tall and thin”, while good against ladders, was a huge liability against cannon.

By contrast, city walls in China were often constructed using a rammed earth core. In essence, earth was piled up in courses and packed very tightly, and then sheathed in stone. This was a labor-intensive building style (but large cities and lots of state capacity meant that labor was available), and it meant the walls had to be made thick in order to be made tall since even rammed earth can only be piled up at an angle substantially less than vertical. But against cannon, the result was walls which were already massively thick, impossible to topple over and the earth-fill, unlike European stone-fill, could absorb some of the energy of the impact without cracking or shattering. Even if the stone shell was broken, the earth wouldn’t tumble out (because it was rammed), but would instead self-seal small gaps. And no attacker could hope that a few lucky hits to the base of a wall built like this would cause it to topple over, given how wide it is at the base. Consequently, European castle walls were vulnerable to cannon in a way that contemporary walls in many other places, such as China, were not. Again, path dependence in fortification matters, because of that antagonistic co-evolution.

In the event, in Italy, Charles’ Italian Vacation started to go badly almost immediately after Naples was taken. A united front against him, the League of Venice, formed in 1495 and fought Charles to a bloody draw at Fornovo in July, 1495. In the long-term, French involvement would draw in the Habsburgs, whose involvement would prevent the French from making permanent gains in a series of wars in Italy lasting well into the 1550s.

But more relevant for our topic was the tremendous shock of that first campaign and the sudden failure of defenses which had long been considered strong. The reader can, I’d argue, detect the continued light tremors of that shock as late as Machiavelli’s The Prince (1532, but perhaps written in some form by 1513). Meanwhile, Italian fortress designers were already at work retrofitting old castles and fortifications (and building new ones) to more effectively resist artillery. Their secret weapon? Geometry.

Bret Devereaux, “Collections: Fortification, Part IV: French Guns and Italian Lines”, A Collection of Unmitigated Pedantry, 2021-12-17.

November 28, 2023

QotD: The tactical problem of attacking WW1 trenches

Filed under: Europe, History, Military, Quotations, Weapons, WW1 — Tags: , , , , — Nicholas @ 01:00

The trench stalemate is the result of a fairly complicated interaction of weapons which created a novel tactical problem. The key technologies are machine guns, barbed wire and artillery (though as we’ll see, artillery almost ought to be listed here multiple times: the problems are artillery, machine guns, trenches, artillery, barbed wire, artillery, and artillery), but their interaction is not quite straight-forward. The best way to understand the problem is to walk through an idealized, “platonic form” of an attack over no man’s land and the problems it presents.

[…]

So, the first problem: artillery. Neither side starts the war in trenches. Rather the war starts with large armies, consisting mostly of infantry with rifles, backed up by smaller amounts of cavalry for scouting duties (who typically fight dismounted because this is 1914, not 1814) and substantial amounts of artillery, mostly smaller caliber direct-fire1 guns, maneuvering in the open, trying to do fancy things like flanking and enveloping attacks to win by movement rather than by brute attrition (though it is worth noting that this war of maneuver is also the period of the highest casualties on a per-day basis). The tremendous lethality of those weapons – both rifles that are accurate for hundreds of yards, machine guns that can deny entire areas of the battlefield to infantry and the artillery, which is utterly murderous against any infantry it can see and by far the most lethal part of the equation – all of that demands trenches. Trenches shield the infantry from all of that firepower. So you end up with parallel trenches, typically a few hundred yards apart as the armies settle in to defenses and maneuver breaks down (because the armies are large enough to occupy the entire front from the Alps to the Sea).

The new problem this creates, from the perspective of the defender, is how to defend these trenches. If enemies actually get close to them, they are very vulnerable because the soldier at the top of the trench has a huge advantage against enemies in the trench: he can fire down more easily, can throw grenades down very easily and also has an enormous mechanical advantage if the fight comes to bayonets and trench-knives, which it might. If you end up fighting at the lip of your trench against massed enemy infantry, you have almost certainly already lost. The defensive solution here, of course, are those machine guns which can deploy enough fire to prohibit enemies moving over no man’s land: put a bunch of those in strong-points in your trench line and you can prevent enemy infantry from reaching you.

Now the attacker has the problem: how to prevent the machine guns from making approach impossible. The popular conception here is that WWI generals didn’t “figure out” machine guns for a long time; that’s not quite true. By the end of 1914, most everyone seems to have recognized that attacking into machine guns without some way of shutting them down was futile. But generals who had done their studies already had the ready solution: the way to beat infantry defenses was with artillery and had been for centuries. Light, smaller, direct-fire guns wouldn’t work2 but heavy, indirect-fire howitzers could! Now landing a shell directly in a trench was hard and trenches were already being zig-zagged to prevent shell fragments flying down the whole line anyway, so actually annihilating the defenses wasn’t quite in the cards (though heavy shells designed to penetrate the ground with large high-explosive payloads could heave a hundred meters of trench along with all of their inhabitants up into the air at a stretch with predictably fatal results). But anyone fool enough to be standing out during a barrage would be killed, so your artillery could force enemy gunners to hide in deep dugouts designed to resist artillery. Machine gunners hiding in deep dugouts can’t fire their machine guns at your approaching infantry.

And now we have the “race to the parapet”. The attacker opens with a barrage, which has two purposes: silence enemy artillery (which could utterly ruin the attack if it isn’t knocked out) and second to disable the machine guns: knock out some directly, force the crews of the rest to flee underground. But attacking infantry can’t occupy a position its own artillery is shelling, so there is some gap between when the shells stop and when the attack arrives. In that gap, the defender is going to rush to set up their machine guns while the attacker rushes to get to the lip of the trench:first one to get into position is going to inflict a terrible slaughter on the other.

Now the defender begins to look for ways to slant the race to his advantage. One option is better dugouts and indeed there is fairly rapid development in sophistication here, with artillery-resistant shelters dug many meters underground, often reinforced with lots of concrete. Artillery which could have torn apart the long-prepared expensive fortresses of a few decades earlier struggle to actually kill all of the infantry in such positions (though they can bury them alive and men hiding in a dugout are, of course, not at the parapet ready to fire). The other option was to slow the enemy advance and here came barbed wire. One misconception to clear up here: the barbed wire here is not like you would see on a fence (like an animal pen, or as an anti-climb device at the top of a chain link fence), it is not a single wire or a set of parallel wires. Rather it is set out in giant coils, like massive hay-bales of barbed wire, or else strung in large numbers of interwoven strands held up with wooden or metal posts. And there isn’t merely one line of it, but multiple lines deep. If the attacker goes in with no preparation, the result will be sadly predictable even without machine guns: troops will get stuck at the wire (or worse yet, on the wire) and then get shot to pieces. But even if troops have wire-cutters, cutting the wire and clearing passages through it will still slow them down … and this is a race.

Bret Devereaux, “Collections: No Man’s Land, Part I: The Trench Stalemate”, A Collection of Unmitigated Pedantry, 2021-09-17.


    1. Direct fire here means the guns fire on a low trajectory; you are more or less pointing them where you want the shell to go and shooting straight at it, as you might with a traditional firearm.

    2. The problem with direct-fire artillery here is that you cannot effectively hide it in a trench (because it’s direct fire) and you can’t keep it well concealed, so in the event of an attack, the enemy is likely to begin by using their artillery to disable your artillery. The limitations of direct-fire guns hit the French particularly hard once the trench stalemate set in, because it reduced the usefulness of their very effective 75mm field gun (the famed “French 75” after which the modern cocktail is named [Forgotten Weapons did a video covering both]). That didn’t make direct-fire guns useless, but it put a lot more importance on much heavier indirect-fire artillery.

November 24, 2023

QotD: “Citizenship” in the ancient and classical world

Filed under: Europe, Greece, History, Quotations — Tags: , , , , , — Nicholas @ 01:00

… before we dive into how Roman citizenship worked, we need to have a baseline for how citizenship worked in most ancient polities so we can get a sense of the way Roman citizenship is typical and the ways that it is different. In the broader ancient Mediterranean world, citizenship was generally a feature of self-governing urban polities (“city-states”). Though I am going to use Athens as my “type model” here, citizenship was not exclusively Greek; Italic communities; Carthage seems to have had a very similar system. That said, our detailed knowledge of the laws of many of the smaller Greek poleis is very limited; we only know that the Athenian system was regarded more-or-less as “typical” (as opposed to Sparta, consistently regarded as unusual or strange, though even more closed to new entrants than Athens).

Citizenship status was clearly extremely important to the ancients whose communities had it. Greek and Roman writers, for instance, do not generally write that “Athens” or “Carthage” do something (go to war, make peace, etc), but rather that “the Athenians” or “the Carthaginians” do so – the body of citizens acts, not the state. Only citizens (or more correctly, adult citizen-males) were permitted to engage in direct political activity – voting, speaking in the assembly, or holding office – in a Greek polis; at Athens, for a non-citizen to do any of these things (or to pretend to be a citizen) carried the death penalty. This status was a jealously guarded one. It had other legal privileges; as early as Draco’s homicide law (laid down in 622/1) it is clear that there were legal advantages to Athenian citizenship. After Solon (Archon in 594), Athenian citizens became legally immune to being reduced to slavery; non-citizen foreigners who fell into debt were apparently not so protected (for more on this, see S. Lape, Race and Citizen Identity in the Classical Athenian Democracy (2010), 9ff). Citizenship, for those who had it, was likely the most important communal identity they had – certainly more so than linguistic or ethnic connections (an Athenian was an Athenian first, a Greek a distant second).

So then who got to be a citizen? At Athens, the rules changed a little over time. Solon’s reforms may mark the point at which citizenship became the controlling identity (Lape, op. cit. makes this argument). While Solon himself briefly opened up Athenian citizenship to migrants with useful skills, that door was soon slammed shut (Plut. Sol. 24.2); citizenship was largely limited to children with both a citizen father and a citizen mother (this seems to have been more flexible early on but was codified into law in 451/0 by Pericles). Bastards (the Greek term is nothoi) were barred from the citizenship at least from the reforms of Cleisthenes in 509/8. This exclusivity was not unique to Athens; recall that Spartiate status worked the same way (albeit covering an even smaller class of people). Likewise, our Latin sources on Carthage – no Carthaginian account of their government (or indeed any Carthaginian literature) survives – suggest that only Carthaginians whose descent could be traced to the founding settlers had full legal rights. Under the reforms of Cleisthenes (509/8), each Athenian, upon coming of age, had their claim to citizenship assessed by the members of their respective deme (a legally defined neighborhood) to determine if they were of Athenian citizen stock on both sides.

It is worth discussing the implication of those rules. The rules of Athenian citizenship imagine the citizen body as a collection of families, recreating itself, generation to generation, with perhaps occasional expulsions, but with minimal new entrants. Citizens only married other citizens because that was the only condition under which they could have valid citizen children. Such a policy creates a legally defined ethnic group that is – again, legally – incapable of incorporating or mixing with other groups. In this sense, Athenian citizenship, like most ancient citizenship, was radically exclusionary. Thousands of people lived permanently in Athens – resident foreigners called metics – with no hope of ever gaining Athenian citizenship, because there were no formal channels to ever do so.

(As an aside, it was possible for the Athenian citizenry to admit new members, but only by an act of the Ekklesia, the Athenian assembly. For a modern sense of what that means, imagine if it was only possible to become an American citizen by an act of Congress (good luck with the 60 votes to break a filibuster!) that names you, specifically as a new citizen. We don’t know exactly how many citizens were so admitted into the Athenian citizen body, but it was clearly very low – probably only a few hundred through the entire fourth century, for instance. In practice, this was a system where there were no formal mechanisms for naturalizing new citizens at all, that only occasionally made very specific exceptions for individuals or communities.)

In short, while there were occasional exceptions where the doorway to citizenship in a community might open briefly, in practice the citizen body in a Greek polis was a closed group of families which replaced themselves through the generations but did not admit new arrivals and instead prided themselves on the exclusive value of the status they held. The fact that the citizen body of these poleis couldn’t expand to admit new members or incorporate new communities but had become calcified and frozen would eventually doom the Greeks to lose their independence, since polities of such small size could not compete in the world of great kingdoms and empires that emerged with Philip II and Alexander.

Bret Devereaux, “Collections: The Queen’s Latin or Who Were the Romans, Part II: Citizens and Allies”, A Collection of Unmitigated Pedantry, 2021-06-25.

November 20, 2023

QotD: Flax and linen in the ancient and medieval world

Linen fabrics are produced from the fibers of the flax plant, Linum usitatissimum. This common flax plant is the domesticated version of the wild Linum bienne, domesticated in the northern part of the fertile crescent no later than 7,000 BC, although wild flax fibers were being used to produce textiles even earlier than that. Consequently the use of linen fibers goes way back. In fact, the oldest known textiles are made from flax, including finds of fibers at Nahal Hemar (7th millennium BC), Çayönü (c. 7000 BC), and Çatalhöyük (c. 6000 BC). Evidence for the cultivation of flax goes back even further, with linseed from Tell Asward in Syria dating to the 8th millennium BC. Flax was being cultivated in Central Europe no later than the second half of the 7th millennium BC.

Flax is a productive little plant that produces two main products: flax seeds, which are used to produce linseed oil, and the bast of the flax plant which is used to make linen. The latter is our focus here so I am not going to go into linseed oil’s uses, but it should be noted that there is an alternative product. That said, my impression is that flax grown for its seeds is generally grown differently (spaced out, rather than packed together) and generally different varieties are used. That said, flax cultivated for one purpose might produce some of the other product (Pliny notes this, NH 19.16-17)

Flax was a cultivated plant (which is to say, it was farmed); fortunately we have discussed quite a bit about farming in general already and so we can really focus in on the peculiarities of the flax plant itself; if you are interested in the activities and social status of farmers, well, we have a post for that. Flax farming by and large seems to have involved mostly the same sorts of farmers as cereal farming; I get no sense in the Greco-Roman agronomists, for instance, that this was done by different folks. Flax farming changed relatively little prior to mechanization; my impression reading on it is that flax was farmed and gathered much the same in 1900 BC as it was in 1900 AD. In terms of soil, flax requires quite a lot of moisture and so grows best in either deep loam or (more commonly used in the ancient world, it seems) alluvial soils; in both cases, it should be loose, unconsolidated “sandy” (that is, small particle-sized) soil. Alluvium is loose, often sandy soil that is the product of erosion (that is to say, it is soil composed of the bits that have been eroded off of larger rocks by the action of water); the most common place to see lots of alluvial soil are in the flood-plains of rivers where it is deposited as the river floods forming what is called an alluvial plain.

Thus Pliny (NH 19.7ff) when listing the best flax-growing regions names places like Tarragona, Spain (with the seasonally flooding Francoli river) or the Po River Basin in Italy (with its large alluvial plain) and of course Egypt (with the regular flooding of the Nile). Pliny notes that linen from Sætabis in Spain was the best in Europe, followed by linens produced in the Po River Valley, though it seems clear that the rider here “made in Europe” in his text is meant to exclude Egypt, which would have otherwise dominated the list – Pliny openly admits that Egyptian flax, while making the least durable kind of linen (see below on harvesting times) was the most valuable (though he also treats Egyptian cotton which, by his time, was being cultivated in limited amounts in the Nile delta, as a form of flax, which obviously it isn’t). Flax is fairly resistant to short bursts of mild freezing temperatures, but prolonged freezes will kill the plants; it seems little accident that most flax production seems to have happened in fairly warm or at least temperate climes.

Flax is (as Pliny notes) a very fast growing plant – indeed, the fastest growing crop he knew of. Modern flax grown for fibers is generally ready for harvesting in roughly 100 days and this accords broadly with what the ancient agronomists suggest; Pliny says that flax is sown in spring and harvested in summer, while the other agronomists, likely reflecting practice further south suggest sowing in late fall and early winter and likewise harvesting relatively quickly. Flax that is going to be harvested for fibers tended to be planted in dense bunches or rows (Columella notes this method but does not endorse it, De Rust. 2.10.17). The reason for this is that when placed close together, the plants compete for sunlight by growing taller and thinner and with fewer flowers, which maximizes the amount of stalk per plant. By contrast, flax planted for linseed oil is more spaced out to maximize the number of flowers (and thus the amount of seed) per plant.

Once the flax was considered ready for harvest, it was pulled up out of the ground (including the root system) in bunches in handfuls rather than as individual plants […] and then hung to dry. Both Pliny and Columella (De Rust. 2.10.17) note that this pulling method tended to tear up the soil and regarded this as very damaging; they are on to something, since none of the flax plant is left to be plowed under, flax cultivation does seem to be fairly tough on the soil (for this reason Columella advises only growing flax in regions with ideal soil for it and where it brings a good profit). The exact time of harvest varies based on the use intended for the flax fibers; harvesting the flax later results in stronger, but rougher, fibers. Late-pulled flax is called “yellow” flax (for the same reason that blond hair is called “flaxen” – it’s yellow!) and was used for more work-a-day fabrics and ropes.

Bret Devereaux, “Collections: Clothing, How Did They Make It? Part I: High Fiber”, A Collection of Unmitigated Pedantry, 2021-03-05.

November 16, 2023

QotD: Infantry soldiers in the age of pike and shot

Filed under: Europe, History, Military, Quotations — Tags: , , , , , — Nicholas @ 01:00

The pike and the musket shifted the center of warfare away from aristocrats on horses towards aristocrats commanding large bodies of non-aristocratic infantry. But, as comes out quite clearly in their writing, those aristocrats were quite confident that the up-jumped peasants in their infantry lacked any in-born courage at all. Instead, they assumed (in their prejudice) that such soldiers would require relentless synchronized drilling in order to render the complex sequence of actions to reload a musket absolutely mechanical. As Lee points out [in Waging War], this training approach wasn’t necessary – other contemporary societies adapted to gunpowder just fine without it – but was a product of the values and prejudices of the European aristocracy of the 1500 and 1600s.

Such soldiers were, in their ideal, to quickly but mechanically reload their weapons, respond to orders and shift formation more or less oblivious to the battle around them. Indeed, uniforms for these soldiers came to favor high, starched collars precisely to limit their field of vision. This is not the man who, in Tyrtaeus’ words (elsewhere in his corpus), “bites on his lip and stands against the foe” but rather a human who, in the perfect form, was so mechanical in motions and habits that their courage or lack thereof, their awareness of the battlefield or lack thereof, didn’t matter at all. But at least, the [Classical] Greek might think, at least such men still ought not quail under fire but instead stood tall in the face of it.

After all, as late as the Second World War, it was thought that good British officers ought not duck or take cover under fire, in order to demonstrate and model good coolness under fire for their soldiers. The impression I get from talking to recent combat veterans (admittedly, American ones rather than British, since I live in the United States) is that an officer who behaved in that same way on today’s battlefield would be thought reckless (or stupid), not brave. Instead, the modern image of courage under fire is the soldier moving fast, staying low, moving to and through cover whenever possible – recklessness is discouraged precisely because it might put a comrade in danger.

Instead, the courage that is valued in many of today’s armies is the courage to stay calm and make cool, rational decisions. It is, to borrow the first line in Rudyard Kipling’s “If-“, “If you can keep your head when all about you/Are losing theirs and blaming it on you.” Which is not at all what was expected of the 17th century infantryman, whose officers trusted him to make nearly no decisions at all! But, as we’ve discussed, the modern system of combat demands that lots of decisions be devolved down further and further in the command hierarchy, with senior officers giving subordinates (often down to NCOs) the freedom to alter plans on the fly at the local level so long as they are following the general mission instructions (a system often referred to by its German term, auftragstaktik).

Bret Devereaux, “Collections: The Universal Warrior, Part IIa: The Many Faces of Battle”, A Collection of Unmitigated Pedantry, 2021-02-05.

November 12, 2023

QotD: Archaeological evidence on the foundation of Rome

The first thing we need to talk about is the physical location of Rome and the peoples directly around it. […] Rome in its earliest history was, essentially, a frontier city, placed at the very northern end of Latium, the region of Italy that was populated by Latin-speakers. Rome’s position on the Tiber River put it at the cultural meeting place of the Etruscan (and Faliscian) cultural zone to the North, Latium to the South and Umbrian-speaking peoples in the Apennine uplands to the North-East. To the West, of course, lay the Sea, which by Rome’s legendary founding date was already beginning to fill with seaborne merchants, particularly Phoenician and Greek ones […] These patterns of settlements and cultural zones are both attested in our literary sources but also show up fairly clearly in the archaeology of the region.

Rome itself, a cluster of hills situated at an important ford over the Tiber (and thus a natural trade and migration route running north-south along Italy’s Western side), was already inhabited by the close of the Neolithic with small settlement clusters on several of its hills. As you might well imagine, excavating pre-historic Rome is difficult, due to the centuries of development piled on top of it and the fact that in many cases pre-historic evidence must exist directly below subsequent ruins which are now cultural heritage sites. Nevertheless, archaeology sheds quite a lot of light. That archaeological evidence allows us to reject the sort of “empty fields” city foundation that Livy implies. Rather than being “founded”, the city of Rome as we know it formed out of the political merger of these communities (the technical term is synoecism from Greek συνοικισμóς, literally “[putting] the houses together”). There is, importantly, no clear evidence of any archaeological discontinuity between the old settlements on the hills and the newly forming city; these seem to have been the same people. The Palatine hill, which is “chosen” by Romulus in the legend and would be the site of the houses of Rome’s most important and affluent citizens during the historical period, seems to have been the most prominent of these settlements even at this early stage.

A key event in this merging comes in the mid-600s, when these hill-communities begin draining the small valley that lay between the Capitoline and Palatine hills; this valley would naturally have been marshy and quite useless but once drained, it formed a vital meeting place at the center of these hill communities – what would become the Forum Romanum. That public works project – credited by the Romans to the semi-legendary king Tarquinius Priscus (Plin. Natural History 36.104ff) – is remarkably telling, both because it signals that there was enough of a political authority in Rome to marshal the resources to see it done (suggesting somewhat more centralized government, perhaps early kings) and because the new forum formed the meeting place and political center for these communities, quite literally binding them together into a single polity. It is at this point that we can really begin speaking of Rome and Romans with confidence.

What does our archaeology tell us about this early community at this point and for the next several centuries?

The clearest element of this early polity is the Latin influence. Linguistically, Rome was of Latium, spoke (and wrote their earliest inscriptions) in Latin and it falls quite easily to reason that the majority of the people in these early hilltop communities around the Tiber ford were culturally and linguistically Latins. But there are also strong signs of Etruscan and Greek influence in the temples. For instance, in the Forum Boarium (between the Tiber and the Palatine), we see evidence for a cult location dating to the seventh century, with a temple constructed there in the early sixth century (and reconstructed again towards the end of that century); votive offerings recovered from the site include Attic ware pottery and a votive ivory figurine of a lion bearing an inscription in Etruscan.

Archaeological evidence for the Sabines is less evident. Distinctive Sabine material culture hasn’t been recovered from Rome as of yet. There are some clear examples of linguistic influence from Sabine to Latin, although the Romans often misidentify them; the name of the Quirinal hill, for instance (thought by the Romans to be where the Sabines settled after joining the city) doesn’t seem to be Sabine in origin. That said, religious institutions associated with the hill in the historical period (particularly the priests known as the Salii Collini) may have some Sabine connections. More notably, a number of key Roman families (gentes in Latin; we might translate this word as “clans”) claimed Sabine descent. Of particular note, several of these are Patrician gentes, meaning that they traced their lineage to families prominent under the kings or very early in the Republic. Among these were the Claudii (a key family in Roman politics from the founding of the Republic to the early Empire; Liv. 2.16), the Tarpeii (recorded as holding a number of consulships in the fifth century), and the Valerii (prominent from the early days of the Republic and well into the empire; Dionysius 2.46.3). There seems little reason to doubt the ethnic origins of these families.

So on the one hand we cannot say with certainty that there were Sabines in Rome in the eighth century as Livy would have it (though nothing rules it out), but there very clearly were by the foundation of the Republic in 509. The Sabine communities outside of Rome (because it is clear they didn’t all move into Rome) were absorbed in 290 and granted citizenship sine suffrago (citizenship without the vote) almost immediately; voting rights came fairly quickly thereafter in 268 BC (Vel. Pat. 1.14.6-7). The speed with which these Sabine communities outside of Rome were admitted to full citizenship speaks, I suspect, to the degree to which the Sabines were already by that point seen as a kindred people (despite the fact that they spoke a language quite different from Latin; Sabine Osco-Umbrian was its own language, albeit in the same language family).

The only group we can say quite clearly that there is no evidence for in early Rome from Livy’s fusion society are the Trojans; there is no trace of Anatolian influence this early (and we might expect the sudden intrusion of meaningful amounts of Anatolian material culture to be really obvious). Which is to say that Aeneas is made up; no great surprise there.

But Livy’s conception of an early Roman community – perhaps at the end of the sixth century rather than in the middle of the eighth – that was already a conglomeration of peoples with different linguistic, ethnic and religious backgrounds is largely confirmed by the evidence. Moreover, layered on top of this are influences that speak to this early Rome’s connectedness to the broader Mediterranean milieu – I’ve mentioned already the presence of Greek cultural products both in Rome and in the area surrounding it. Greek and eastern artistic motifs (the latter likely brought by Phoenician traders) appear with the “Orientalizing” style in the material culture of the area as early as 730 B.C. – no great surprise there either as the Greeks had begun planting colonies in Italy and Sicily by that point and Phoenician traders are clearly active in the region as well. Evidently Carthaginian cultural contacts also existed at an early point; the Romans made a treaty with Carthage in the very first year of the Republic, which almost certainly seems like it must have replaced some older understanding between the Roman king and Carthage (Polybius 3.22.1). Given the trade contacts, it seems likely that there would have been Phoenician merchants in permanent residence in Rome; evidence for such permanently resident Greeks is even stronger.

In short, our evidence suggests that were one to walk the forum of Rome at the dawn of the Republic – the beginning of what we might properly call the historical period for Rome – you might well hear not only Latin, but also Sabine Umbrian, Etruscan and Greek and even Phoenician spoken (to be clear, those are three completely different language families; Umbrian, Latin and Greek are Indo-European languages, Phoenician was a Semitic language and Etruscan is a non-Indo-European language which may be a language isolate – perhaps the modern equivalent might be a street in which English, French, Italian, Chinese and Arabic are all spoken). The objects on sale in the markets might be similarly diverse.

I keep coming back to the languages, by the by, because I want to stress that these really were different people. There is a tendency – we will come back to it next time – for a lot of modern folks to assume that, “Oh well, these are all Italians, right?” But the idea of “Italians” as such didn’t exist yet (and Italy even today isn’t quite so homogeneous as many people outside of it often assume!). And we know that the different languages were mirrored by different religious and cultural practices (although material culture – the “stuff” of daily life, was often shared through trade contacts). Languages thus make a fairly clear and easy marker for a whole range of cultural differences, though – and we will come back to this as well – it is important to remember that people’s identities are often complex; identity is generally a layered, “yes, but also …” affair. I have only glanced over this, but we also see traces of Latin, Etruscan, Greek and Umbrian religious practice in early Roman sanctuaries and our later literary sources; Phoenician influence has also been posited – we know at least that there was a temple to Uni/Astarte in Pyrgi within 30 miles of Rome so Phoenician religious influence could never have been that far away.

We thus have to conclude that Livy is correct on at least one thing: Rome seems to have been a multi-ethnic, diverse place from the beginning with a range of languages, religious practices. Rome was a frontier town at the beginning and it had the wide mix of peoples that one would expect of such a frontier town. It sat at the juncture of Etruria (inhabited by Etruscans) to the north, of Latium (inhabited by Latins) to the South, and of the Apennine mountains (inhabited by Umbrians like the Sabines). At the same time, Rome’s position on the Tiber ford made it the logical place for land-based trade (especially from Greek settlements in Campania, like Cumae, Capua and Neapolis – that is, Naples) to cross the Tiber moving either north or south. Finally, the Tiber River is navigable up to the ford (and the Romans were conscious of the value of this, e.g. Liv 5.54), so Rome was also a natural destination point for seafaring Greek and Phoenician traders looking for a destination to sell their wares. Rome was, in short, far from a homogeneous culture; it was a place where many different peoples meet, even in its very earliest days. Indeed, as we will see, that fact is probably part of what positioned Rome to become the leading city of Italy.

(For those looking to track down some of these archaeological references or get a sense of the source material, though it is now a touch dated, The Cambridge Ancient History, Vol 7.2: The Rise of Rome to 220 B.C., edited by F.W. Walbank, A.E. Astin, M.W. Frederiksen, and R.M. Ogilvie (1990) offers a fairly good overview, particularly the early chapters by Ogilvie, Torelli and Momigliano. For something more suited to regular folks, when I teach this I use M.T. Boatwright, D.J. Gargola, N. Lenski and R.J.A. Talbert, The Romans: From Village To Empire (2012) and it has a decent textbook summary, p. 22-42, covering early Rome with particularly good reference to the archaeology)

Bret Devereaux, “Collections: The Queen’s Latin or Who Were the Romans? Part I: Beginnings and Legends”, A Collection of Unmitigated Pedantry, 2021-06-11.

« Newer PostsOlder Posts »

Powered by WordPress