Quotulatiousness

July 17, 2025

QotD: War elephants in India

… we are going to look at the place war elephants held in society through two lenses: what war elephants meant to the societies that used them and what they often mean in popular culture – as we’ll see, these are connected topics. Previously in this series, we looked at the battlefield advantages and drawbacks of war elephants; now let’s take them off of the battlefield.

This may seem a strange approach to use to end a discussion of war elephants – after all, these are war elephants – but as will soon become apparent, war elephants are almost impossible to fully understand outside of the social and political context in which they are most useful.

First, we are going to look at how elephants fit into the ancient and medieval political systems which used them as weapons of war. I want to stress very strongly here that what I am presenting is essentially the main argument of Trautmann’s Elephants and Kings (2015), not something I dreamed up. For the sake of brevity, I am leaving out a lot of detail here – but you know where to go to find the argument in full.

Last time, we introduced a problem: while awesome, war elephants were very expensive and relatively easy to counter on the battlefield. This answered the question of why the Romans and Chinese mostly ignored the elephant as a weapon-system despite having access to it, but it raised a second question: if the elephant was at best a limited weapon, why did its use persist in India? After all, if the Romans could figure out how to beat these things, surely the Indians could too!

Part of the answer, of course, is that some of the logistical problems that existed for states located at the edges of elephant’s natural range simply don’t apply to states closer to the source. Indian kings could (and did!) deploy elephants in far greater numbers than Seleucid or Roman armies could. In particular, North Indian rulers, rather than relying on long distance trade, could acquire elephants through trade relations with “forest peoples” in their own hinterland. We have reports of armies with not hundreds but thousands of elephants from, for instance, the Nanda or Maurya empires. Nevertheless, while these factors simplified elephant logistics, they hardly made the use of the animals cheap.

What Trautmann instead observes is that the rise of war elephants occurred specifically in the context of kingship in India. Indeed, elephants were associated with kingship through royal elephant hunts and domesticated elephants kept for show even before war elephants were developed. Around 1400 B.C. the chariot arrives in India, bringing with it a military aristocracy where the nobles – and the noblest of all nobles is, of course, the king – rode into battle.

(I keep finding myself recommending it, but I’ll again note – for a good rundown of the value of chariots as royal symbols more than battlefield weapons, check out chapter 2 of Lee, Waging War (2016).)

That was the context the war elephant emerged into. By the fifth century or so, the war elephant seems to be displacing the chariot as the quintessential vehicle of the warrior-aristocrat (and thus the ultimate warrior-aristocrat, the king). Interestingly, the Mahabharata (fourth century B.C., but with components that may date as early as the ninth) preserves some of this shift, with a mix of aristocrats on chariot and aristocrats on elephant. As chariots faded (they were tactically inferior to true cavalry which was arising at this time), elephants progressively became the vehicle for the important warriors.

It’s not hard to see the appeal. For the warrior-aristocrat, battle isn’t just about winning, but is also about social status and position. Put another way: why does anyone put up with warrior-aristocrats, who get to live in luxury and boss everyone around? The implicit reason (sometimes explicit) across cultures is that it is the martial prowess – typically the personal, physical combat skill – that justifies the existence of the military aristocrat. You need Sir-Better-Than-You (to use a European framing) because you need someone who has mastered a difficult combat art (mounted combat) and is very, very good at it.

The warrior-aristocrat needs to be seen being a warrior aristocrat. For this purpose the elephant (much like its chariot forerunner) is perfect. Fighting from the back of an animal is a difficult skill which requires a lot of training the common folk do not have time to do. It also requires being able to afford and maintain a very expensive military asset commoners cannot afford. And not only does it allow the warrior-aristocrat to have an out-sized impact on the battle, but it literally elevates him over his fellow men so he can be seen (and it could not have escaped anyone that this was a physical realization of his actual high status). So long as the elephant remained even moderately militarily valuable, it was a perfect vehicle for a warrior-aristocrat to display his power and prowess.

And even more so for the king. Not only can the king ride his own elephant, but with his vast resources, he can procure elephants for his retainers. What is more impressive than a warrior aristocrat who has his own elephant? A warrior-king who has hundreds or thousands of elephants and his own warrior aristocrats to mount them. The thing is, a king’s actual power derives from the perception of his power – showing off the king’s military might makes him more likely to be obeyed (in ways – like tax collection – which allow him to further enhance his military might). This isn’t just a vanity project for the king (though it is that too) – extravagant displays of royal power are a key component of remaining king (the key big-word idea here is legitimacy).

This pattern in turn becomes self-reinforcing: as kings use elephants to show off (and thus reinforce) their power, elephants become symbols of royal power all on their own. Trautmann (2015) tracks this spread, particularly in South-East Asia – as the Indian model of kingship spreads into that region, war elephants spread with it. Whereas in places where there is plenty of contact, but the institution of Indian-style kingship doesn’t spread, war elephants are used rarely, if at all.

This in turn answers another quandary: why war elephants appealed to Hellenistic (that is, the heirs of Alexander) monarchs. Macedonian monarchy was not a form of Indian kingship – it had grown up in Macedon and been influenced by exposure to the Great Kings of Persia all on its own – but it was very similar in many ways. Compatible, we might say. Macedonian monarchs did not ride elephants (they rode horses), but they did need to be seen demonstrating martial excellence before their armies, just like Indian kings. In that context, the display of wealth and royal power implied by fielding a large elephant corps could be powerful, even if the king himself didn’t ride on an elephant. This is, perhaps most vividly demonstrated with Seleucus I Nicator, who earned himself the nickname “The Elephant King” and even produced coins advertising that fact […] This tie between elephants and kings seems to have been quite strong. Trautmann (2015) notes that even within India, states without kings (oligarchies, independent tribes and cities, etc) only rarely acquired elephants and never in the same sort of numbers as kings. So even when elephants are cheaper – because they are close by – unless you need elephants as physical symbols of the power and legitimacy of the king and his warrior-aristocrats, they are largely not worth the effort to procure.

The one great exception is Carthage – by the time it was using war elephants, Carthage was a mixed republic (much like Rome), and yet employed elephants extensively. Unfortunately, we have no sense of if Carthage – like Rome – would have abandoned elephants given time. The earliest attestation we have of Carthaginian war elephants is 262 B.C. (although they would have encountered them earlier from Pyrrhus of Epirus) and Carthage is completely gone in 146 B.C. It is possible Rome simply caught Carthage in the same “trying them out” phase of elephant use Rome would undergo in the second century B.C. and that Carthage may too have largely abandoned war elephants had it not been destroyed.

Bret Devereaux, “Collections: War Elephants, Part III: Elephant Memories”, A Collection of Unmitigated Pedantry, 2019-08-09.

July 11, 2025

QotD: Pyrrhus arrives in Magna Graecia to support the Tarantines

Filed under: Greece, History, Italy, Military, Quotations — Tags: , , , , — Nicholas @ 01:00

The Roman response to Pyrrhus’ initial arrival was hardly panic. Military operations in Etruria for 280, under the consul Tiberius Corucanius, continued for the year, while the other consul, Publius Valerius Laevinius, went south to fight Pyrrhus and shore up Rome’s position in Southern Italy. We don’t have clear numbers for the size of the armies at Heraclea – Plutarch stresses that they were big (Plut. Pyrrh. 16.3) – but I think it is fair to suppose that Lavinius probably has a regular consular army with two legions and attached socii, roughly 20,000 men. It has sometimes been supposed this might have been a double-strength army (so 40,000 men) on the basis of some of our sources (including Plutarch) suggesting somewhat nebulously that it was of great size.

There are a few reasons I think this is unlikely. First, sources enlarging armies to fit the narrative magnitude of battles is a very common thing. But more to the point, Pyrrhus has crossed to Italy with 28,500 men total and – as Plutarch notes – hasn’t had a chance to link any of his allies up to his army. That may mean he hasn’t even reabsorbed his scouting force of 3,000 and he may well have also had to drop troops off to hold settlements, secure supplies and so on. Pyrrhus’ initial reluctance to engage (reported by Plutarch) is inconsistent with him wildly outnumbering the Romans, but his decision to wait for reinforcements within reach of the Romans is also inconsistent with the Romans wildly outnumbering him. So a battle in which Pyrrhus has perhaps 20-25,000 men and the Romans a standard two-legion, two-alae army of 20,000 give or take, seems the most plausible.1

The two forces met along the River Siris at Heraclea on the coastal edge of Lucania, Laevinius having pushed deep into southern Italy to engage Pyrrhus. As usual for these battles, we have descriptions or partial descriptions from a host of sources (in this case, Plutarch, Dionysius of Halicarnassus, Zonaras, Florus) which don’t always agree, leaving the modern historian in a bit of a pickle. Generally, we assume that a lot of the later Roman narratives of a famous defeat are likely to have been tailored to try and minimize the embarrassment, either by implying the battle was closer than it was or that Pyrrhus was a very impressive foe (or both) or other “face-saving” inventions. Worse yet, all of our sources are writing at substantial chronological distance, the Romans not really having started to record their own history until decades later (though there would have been Greek sources for later historians to work with). Generally, Patrick Kent tends to conclude that – somewhat unusually – Plutarch’s moralizing focus renders him more reliable here: Plutarch feels no need to cover for embarrassing Roman defeats or to embellish battle narratives (which he’d rather keep short, generally) because his focus is on the character of Pyrrhus. Broadly speaking, I think that’s right and so I too am going to generally prefer Plutarch’s narratives here.

A fairly handy map of Pyrrhus’ campaigns (though some of the detail is lost in the big sweeping arrows). What is notable is, apart from Pyrrhus’ lightning raid into Latium in 280, he is almost invariably fighting in “friendly” territory, either in Lucania (Heraclea), Apulia (Asculum) or Samnium (Beneventum), the lands of his allies. Pyrrhus never fights an actual pitched battle on Roman-controlled territory, which I think speaks to his strategic intent: to carve out a kingdom in Greater Greece, not to conquer the whole of Italy.
Wikimedia Commons.

The battle was defined by Pyrrhus’ use of terrain – Pyrrhus thought delay might be wiser (to link up with his allies) but left a blocking force on the river (the Romans being on the other side). The Romans responded by forcing the river – typical Roman aggression – but Plutarch at least thinks it caught Pyrrhus by surprise (he hadn’t fought Romans before) and so it leaves him in a scramble. He charges his cavalry (Plut. Pyrrh. 16.5) to give his main phalanx time to form up for battle resulting in what seems like a cavalry engagement near the river. Pyrrhus nearly gets himself killed in the fighting, but survives and falls back to his main infantry force, which then met the Romans in an infantry clash. The infantry fighting was fierce according to Plutarch and Pyrrhus, still shaken from being almost killed, had to come out and rally his troops. In the end, the Romans are described as hemmed in by Pyrrhus’ infantry and elephants before some of his Greek cavalry – from Thessaly, the best horse-country in Greece – delivers the decisive blow, routing the Roman force.

It is, on the one hand, a good example of the Hellenistic army “kit” using almost all of its tactical elements: an initial – presumably light infantry – screen holding the river, followed by a cavalry screen to enable the phalanx to deploy, then a fierce and even infantry fight, finally decided by what seems to be flanking actions by cavalry and elephants. Plutarch (Pyrrh. 17.4) gives two sets of casualty figures, one from Dionysius and another from Hieronymus; the former says that the Romans lost 15,000 to Pyrrhus’ 13,000 killed, the latter that the Romans lost 7,000 to Pyrrhus’ just a bit less than 4,000 killed. The latter seems almost certainly more accurate. In either case, the Roman losses were heavier, but Pyrrhus’ losses were significant and as Plutarch notes, his losses were among his best troops.

Even in the best case, in victory, Pyrrhus had lost around 15% of his force (~4,000 out of 28,000), a heavy set of losses. Indeed, normally if an army loses 15% of its total number in a battle, we might well assume they lost. Roman losses, as noted, were heavier still, but as we’ve discussed, the Romans have strategic depth (in both geography, political will and military reserves) – Pyrrhus does not. By contrast, Alexander III reportedly wins at Issus (333) with just 150 dead (and another 4,802 wounded or missing; out of c. 37,000) and at Gaugamela (331) with roughly 1,500 losses (out of c. 47,000). The Romans will win at Cynoscephelae (197) with just 700 killed.

This isn’t, I think, a product of Pyrrhus failing at all, but rather a product of the attritional nature of Roman armies: even in defeat they draw blood. Even Hannibal’s great victory at Cannae (216) costs him 5,700 men, according to Polybius (more, according to Livy). But the problem for Pyrrhus is that his relatively fragile Hellenistic army isn’t built to repeatedly take those kinds of hits: Pyrrhus instead really needs big blow-out victories where he takes few losses and destroys or demoralizes his enemy. And the Roman military system does not offer such one-sided battles often.

Nevertheless, Pyrrhus shows that a Hellenistic army, capable handled, could beat a third-century Roman army, albeit not cleanly, and that is well worth noting.

Bret Devereaux, “Collections: Phalanx’s Twilight, Legion’s Triumph, Part IIIb: Pyrrhus”, A Collection of Unmitigated Pedantry, 2024-03-08.


    1. I should note, this is Kent’s assessment as well.

July 5, 2025

QotD: Roman provinces under the Republic

Filed under: Europe, Government, History, Quotations — Tags: , , — Nicholas @ 01:00

When Rome first expands overseas in 264, they opt not to continue replicating the socii-system as they go, but rather through a gradual and ad hoc process, develop a separate system of governance-by-magistrate for these provinciae or “provinces”, though as we’ll see the exact meaning of this word changes over time as well.

While the Romans mostly improvise this system for just a handful of provinces – most of the basic patterns of Roman provincial governance develop in just the first four provinces1 – that system becomes the customary way Roman magistrates [consuls and praetors]and promagistrates [proconsuls and propraetors] handled the overseas provinces they were assigned to. Consequently, it was replicated over and over again through Rome’s steadily expanding empire. By the time the Republic collapses into the Empire, Rome will have not four provinces but fourteen; by the end of the reign of Augustus, as the Roman Empire largely took the borders it would mostly hold for the next four centuries, there were just under thirty provinces. Yet the way Rome will govern these provinces largely continued to hold to model established in the Republic, at least through to the Severan Dynasty (193-225 AD), if not further.

As a result, the improvised system the Romans developed for those first four provinces would end up being how the vast majority of people in the Roman empire would experience Roman governance.

Bret Devereaux, “Collections: How to Roman Republic 101, Addenda: The Provinces”, A Collection of Unmitigated Pedantry, 2023-11-03.


    1. Sicily, Corsica et Sardinia, Nearer Spain and Further Spain, gained in the period from 241 to 197 and Rome’s only provinces until the addition of Macedonia in 147.

June 29, 2025

QotD: People in the past believed their own religion

What I think this show [Game of Thrones] has fallen into is the assumption – almost always made by someone outside a society looking in – that the local religion is so silly that no one of true intelligence (which always seems to mean “the ruling class” – I am amazed how even blue-collar students will swiftly self-identify with knights and nobles over commoners when reading history) could believe it. This is the mistake my students make – they don’t believe medieval Catholicism or Roman paganism, and so they weakly assume that no one (or at least, none of the “really smart” people) at the time really did either. Of course this is wrong: People in the past believed their own religion.

It is an old mistake – Polybius makes it of the Romans, writing in c. 150 BC. Polybius notes that Roman religion was “distinctly superior” for maintaining the cohesion of the Roman state, but that “they (= the ruling class) have adopted this course for the sake of the common people” (Plb. 6.56.6-8), by which he means for the sake of the “fickle multitude” which must be “held in invisible terrors and pageantry”. Polybius, I should note, immediately contradicts himself – one benefit of this religion, he says, is that Roman magistrates (read: elite Senators) do not steal from the public treasury even when they are not watched, which would seem to imply that even many very elite Romans believed their own religion (also, irony alert: those famously incorruptible Romans …). Poor men, after all, are not elected to watch the treasury.

Fortunately, Polybius is not our only source on Roman culture, so it is possible to say – and I want to be clear here – Polybius is wrong (on this point at least). Roman elites took their religion very seriously. There are exceptions, of course – Julius Caesar was an Epicurean philosophically, although this does not seem to have altogether disrupted his performance of the duties of Rome’s highest priesthood (he was Pontifex Maximus). Nevertheless, such philosophically minded men were generally regarded with scorn by even the Roman elites (Cicero heaps such scorn on Cato in the Pro Murena in what were clearly intended as friendly “ribbing” before a mostly elite audience), who nevertheless greatly valued religious character (in contrast, Cicero is quick to accuse his hated enemy Catiline of irreligion before the Senate). Roman leaders vowed temples and sacrifices in duress, and built and gave them in victory. Lavish dedications to the gods from all levels of Roman society confirm the overall picture: the Romans believed their religion. Yes, even the elite Romans (there is, I should note, some complicated in that elites tended to separate what they viewed as religion from common “superstition”, but that’s a topic for another day.) And, if anything, the evidence for elite “buy-in” to Medieval religion is far more voluminous.

Yet none of the main characters of Game of Thrones, apart from perhaps Catelyn Stark, is devoted to their region’s traditional religion in any private sense. I cannot recall at any point any character protesting an action on account of religion – no marches held up by high holy days (a regular occurrence in ancient and medieval literature), no groups, individuals or places religiously exempt from violence. The Faith has certain rules about sexuality, but they’re rules that are not only flouted privately by individuals in the show, but also quite publicly by the last ruling dynasty for centuries.

Moreover, none of the ruling characters seem to have any religious functions – or if they do, they simply ignore them most of the time. Keep in mind, these people believe that their gods – be it the Seven or the Old Gods – are powerful divine beings who are directly and immediately interested in the world and who act on that interest. Keeping such beings happy is vitally important – it is not an afterthought. No society, so far as I can tell, has ever believed there is enough gold or enough armies to save you if you have enraged the gods. There is simply no off-setting an angry divinity (and before the smart guy asks, “what about a favorable divinity?” the Greeks had an answer for that – go read Euripides’ Hippolytus. It ends badly for the mortals).

The lack of religious duties or functions for characters who are kings is particularly surprising. Kingship has three core roles in almost all human cultures where the institution appears: kings are 1) chief judge, 2) chief general, and 3) chief priest. That third role appears more or less prominently in almost all societies. In ancient Egypt and (at times) in Mesopotamia, kings were held to literally be either earthly incarnations of major gods or minor gods in their own right. Roman Emperors held the office of Pontifex Maximus, and took over as the chief priest of the state, before becoming gods on their deaths. The Chinese emperor was the “Son of Heaven” and was tasked with maintaining the right relationship with the divine (the “mandate of heaven”). The emperors of Japan are purported to be direct descendants of the goddess Amaterasu (and they have a family tree to back it up).

The relationship between medieval kings and the Church was more complicated, because of the existence of a clear religious head (the Pope) outside of secular authority, but medieval kingship retained a strong sense of religious purpose. Coronation rituals often involved the clergy and were essentially religious rituals, because kingly power was still thought to be bestowed by God. Kings, in turn, had a special role in keeping their kingdom in right relationship with the divine, both through just rule (which included protecting the Church) and through the performance of religious rituals. Those rituals, of course, worked both ways: both believed to be religious efficacious (as in they helped bring about God’s good favor), but also valuable tools of building royal legitimacy (hat tip to my colleague Elizabeth Hassler’s excellent dissertation on the topic of holy kingship). In some places (England and France most notably), kings were thought holy enough to be able to heal certain diseases miraculously by touch – a king who couldn’t was clearly insufficiently pious.

Bret Devereaux, “New Acquisitions: How It Wasn’t: Game of Thrones and the Middle Ages, Part II”, A Collection of Unmitigated Pedantry, 2019-06-04.

June 23, 2025

QotD: Recruiting and organization under the “Marian reforms”

Filed under: Europe, History, Military, Quotations — Tags: , , , , — Nicholas @ 01:00

This is the most important one, but perhaps a bit less complicated than cohorts: the notion that Marius began the process of taking volunteers and proletarii at that and thus “professionalized” the Roman army. As with the equipment, this is at least something our sources do say … more or less.

Sallust reports that Marius, “after he saw that the spirits of the plebs were aroused, he swiftly loaded ships with supplies, pay, weapons and other requirements; with them he ordered Aulus Manlius, his legate, to set out. Meanwhile himself he enrolled soldiers, not according to the mos maiorum [‘the customs of the ancestors’] from the census classes, but making use of whoever wished to go, mostly the capite censi [‘those counted by heads’ = the propertyless poor or proletarii]” (Sall. Iug. 86.1-2, trans mine). Plutarch repeats this report, that Marius violated custom by enrolling men who didn’t meet the property qualification for military service (Plut. Mar. 9.1).

There are a few oddities here to start, though. First, Sallust quickly notes that this resulted in Marius having an army rather larger than what the Senate had actually authorized (Sall. Iug. 86.4) and that’s actually quite a neat detail that may explain part of what’s going on here because this has, in a way, happened before. In 134, Scipio Aemilianus was elected consul for the second time (illegally, again) with a mandate to end the frustrating Roman war against the Celtiberian stronghold of Numantia in Spain. The Senate, however, denied Scipio authorization to raise fresh troops, to which Scipio responded by enlisting some 4,000 volunteers to replenish his legion; Appian says this was done with the consent of the Senate, but Plutarch’s brief note on it sure implies Scipio Aemilianus is end-running around Senatorial efforts to stifle him (App. Hisp. 84; Plut. Mor. 201A-B). And this too was hardly the first time for this sort of end-run; Scipio Africanus (what is it with Scipiones!?) back in 205 agitated for his invasion of Africa to end the Second Punic War and was given the province of Sicily with authorization to go to Africa if he thought it necessary, but the Senate registered its displeasure by refusing to let him levy troops, at which point – wait for it – Scipio took volunteers, equipping and financing his force through the socii and even building a fleet that way (Liv. 28.45.9-12).

In short, the Senate sometimes tried to trim the sails of generals it was displeased with – and Marius reportedly had gotten elected on a campaign platform of “to hell with the Senate” (Sall. Iug. 84.1) – by limiting the size of their armies or refusing to allow them to conduct a levy. And since 205 (a century before Marius), popular generals had occasionally juked this effort by the Senate by instead calling for volunteers, which the Senate could not stop. Marius is not doing something new in taking volunteers to supplement an army through the levy.

He also doesn’t keep doing it. After Marius wins in Africa with his volunteer-supplemented army (the bulk of which of course were still recruited through the dilectus under Metellus), he returns to Italy to take over the war against the Cimbri and Teutones but he doesn’t keep up the volunteer force, instead taking command of his predecessor Rutilius Rufus’ normally levied army (Front. Strat. 4.2.2). In practice, Marius probably took volunteers in part for that first army because the Senate was diverting available levy manpower towards the early phases of the Cimbric War (or at least that was a convenient excuse to kneecap him) – a series of costly military disasters for Rome which likely soaked up much of the manpower the Senate was willing to raise. Once Marius has access to that “primary” stream of manpower generated through the dilectus, he uses it and seems to stop using volunteers.

But what of recruiting the capite censi? Well, that isn’t quite new either, although it surely wasn’t typical. For one, it wasn’t that the poor absolutely never served; Polybius notes that the capite censi served in the fleet (Polyb. 6.19.2). But we also see non-assidui (assidui being the term for those wealthy enough to be liable for normal conscription) in a range of other emergencies. Livy reports in 329 a “crowd of sellularii [men who work sedentary trades, literally, ‘stoolsmen’], a type least suited for military service, were called into the army” (Livy 8.20.4), though the historicity of this report is questionable given the early date. In 296, Etruscan entrance into the Third Samnite War causes a draft of “not only the freeborn or the iuniores took the oath, but cohorts were made of seniores and centuries of freedmen” (Livy 10.21.4). Gellius (16.10.1) quotes Ennius reporting the proletarii were pulled into the armies in 280, presumably in response to Pyrrhus’ victory at Heraclea. And during the Second Punic War the Romans pulled out all of the stops, recruiting debtors and men convicted of capital crimes (Livy 23.14.3), enrolling slaves into the army (called the volones; you free them first and then draft them, Livy 27.38 and 28.10, Val. Max. 7.6.1) and as noted above, taking volunteers more generally.

As an aside, if you are wondering why the Romans seem in some of these to skip recruiting freeborn capite censi and go straight to freedmen and enslaved people, I think there are two answers here for this period. First, many of the available freeborn poor are probably already in service in the fleet. Second, there probably aren’t that many of them. Recall our chart of Roman social classes – the capite censi in the third century is quite small, almost certainly outnumbered by enslaved persons in Italy. But the population of Italy was rising over the third and especially second century and without adding new farmland, those new freeborn Romans may have swelled the ranks of the capite censi, leading to a much larger propertyless class by the late second century or the first century.1 Consequently, there may have been a lot more capite censi worth recruiting by Marius’ day, when Rome no longer needed to keep a large navy at sea (not facing any naval powers in its wars) and the number of capite censi having risen.

Finally, Marius does not mark the end of the Roman dilectus! Evidently Roman conscription persisted at least to the end of the Roman civil wars, as Suetonius reports Augustus (perhaps when he was still Octavian) inflicting the traditional penalty of being sold into slavery for draft-dodging on a Roman eques who cut the fingers off of his two sons to make them ineligible for military service (Suet. Aug. 24.1). Indeed we have attestations of the dilectus in 55, 52, 50, 49, AD 6 and AD 9.2 Even once the army is fairly clearly primarily a volunteer force, at least notionally the ability to hold a levy when necessary to fill the ranks remained “on the books” and Trajan (r. 98-117 AD) holds at least one levy because he punishes a father for the same reason Augustus had done (Dig. 49.16.4.12). So the traditional dilectus remained a thing Roman leaders could do well into the empire. In practice it seems safe to assume the system by the mid-first century is substantially ad hoc, as the census straight up doesn’t happen from 69 BC to 28 BC, which would make it hard to actually enforce the property requirements. But the process doesn’t stop in 107 and there’s no reason to suppose from 107 to 69, with the census being regularly conducted, that most annual levies were not conducted along traditional property lines.3

So the most we might say is that a one-time crisis expedient in earlier periods slowly becomes a standard way to supplement legions and then the standard way to recruit them, with the old normal method of the dilectus instead becoming the unusual way to supplement in a crisis. It’s unclear exactly when that shift-over point happens, but it sure isn’t in the career of Gaius Marius, who sits clearly in the “volunteers as a crisis response” side of the issue.

And what of the notion that Gaius Marius introduced both citizenship as a reward for service as a regular bonus and also that he instituted the paying of soldiers at the completion of a campaign to render them loyal? Well on the latter point, the Romans had been distributing spoils to the soldiers at the end of a campaign as a lump-sum payment since the beginning. This is exceedingly well reflected in Livy’s accounting of the years from 201 to 167 (where we have a nice continuous burst of Livy), see for instance Livy ::deep breath:: 30.45, 31.20, 33.23, 33.37, 34.46, 34.52, 36.40, 37.59, 39.5, 39.7, 40.34, 40.43, 40.59, 41.7, 41.13, 45.40, 45.43.4 And the idea that Roman victories might seize land which would then be settled as Roman coloniae, creating new land for Roman settlers was also not new (Wikipedia has a convenient list of Roman coloniae). So Marius is simply promising to do a thing Roman commanders regularly did, essentially saying, “serve with me, because I’m going to win and victory will make us rich”. Which is exactly the reason volunteers rushed to serve with Scipio Africanus and Scipio Aemilianus: they anticipated a lucrative victory for such well-regarded commanders.

And by now you may well be asking, “but wait, then when does the system change?” Because after all, I said that by the early empire, we can pretty clearly see an army primarily composed of professional, long-service volunteers who receive substantial retirement bonuses and are permanently stationed on the frontiers. Who is responsible for that? And in response, I give you, this guy.

It’s Augustus. It was always Augustus. Or at least I should say that is my view, given the evidence. Older scholarship – I think here of Keppie (1984) in particular – tended to assume that because most of the big changes happened with Marius (but we’ve seen they don’t) that Octavian/Augustus probably made only minimal changes to the military system he inherited from Julius Caesar. I don’t think that’s correct. I think if we look at the evidence in more detail it becomes clear that Augustus is the “break” (though not a clean break by any means) and that in fact we need to start regarding Augustus as a military reformer of some significant scale rather than merely the codifier of a Caesarian military system (though he probably does that too).

Augustus, after all, institutes regular bonuses for discharge, establishing a treasury funded by a regular tax to meet the expense rather than simply promising that he would win a lot and so soldiers would get rich off of their share of the booty (Res Gestae 17). And it’s not hard to see the problem he’s responding to – the massive military buildup of the Roman civil wars had left Octavian, as the victor, with the red-hot potato of hundreds of thousands of soldiers who were promised the spoils of victory, including large numbers of men who didn’t win but who, if not settled down somehow would disrupt the state (RG 3). Earlier in the civil wars, Octavian had used proscriptions and land confiscations to solve this problem but as emperor, he needed a permanent solution, thus the establishment of the aerarium militare and its discharge bonuses (praemia). Before that, you simply had generals promising to feast their soldiers off of the property of the vanquished; the civil wars had only changed that in that the vanquished were now Romans. It also establishes a standard length of service, creating that professional, long-service army.

There’s a related issue which is the fate of the citizen equites and the velites. Caesar’s armies in Gaul seem to have neither, so the assumption was that the shift to recruiting proletarii meant that these wealth-based distinctions (the richest Romans serve as equites, the poorest as velites) dropped away, leaving a uniform heavy infantry legion. And in a schematic it makes sense: both roles are absorbed by the auxilia and indeed Caesar makes use of a lot of Gallic cavalry auxiliaries. But as François Gauthier recently pointed out,5 it’s not all clear that the velites really did vanish in the late-second/early-first century. Cicero still refers to to them writing in the 40s (Cic. Fam. 9.20; Brut., 271) and their apparent absence in Caesar’s writing may well just be an accident of Caesar’s avoidance of technical language. Caesar doesn’t generally talk about hastati or triarii much either; he prefers milites (“soldiers”). Likewise, it’s clear the citizen cavalry – the equites – survived Marius; as Jeremiah McCall notes, we have good evidence for citizen equites at least as late as the 90s BC and suggests the citizen cavalry probably vanished in the 80s as a result of the Social War and Sulla‘s Civil War.6 It surely did not happen in 107 or 104.

Meanwhile the auxilia as a mature part of the Roman army really only emerge under Augustus, and not even right at the beginning of his reign either. Roman armies needed cavalry and light infantry to function, so once again we may not be looking at a clean break but rather a period of transition as a result of some generals preference for (non-Italian) allied or auxiliary cavalry and light infantry and the formalization of that system not in 107 with Marius but again in 27 with Augustus.

Marius is also sometimes credited with the idea of extending citizenship to non-citizens who served, which is a catastrophic misreading of one episode in his career. For one, this gets read as meaning that Marius extended citizenship to all of the Italians in his army or that he made it standard to do so. Note for instance this line pulled from Wikipedia:

    Finally, Marius granted citizens of the Italian allies (Etruria, Picenum, etc.) full Roman citizenship if they fought for Rome and completed a period of service in the Roman army.

[Dated] 6/25/2023, specified in the hope this page changes to be less wrong.

And that’s very much not right either. We have evidence for only a handful of citizenship extensions by Marius. In particular, of his army he extended citizenship to just two cohorts (c. 1,000 men) from Camerinum (Plut. Mor. 202D, Cic. Pro Balbo 46.). I can only assume this gets misunderstood because some writers don’t know their unit sizes, but Marius had 32,000 men in his army at Vercellae (101 BC), probably something like half of which were socii. These two cohorts were a comparatively tiny fraction. Marius also seems to have selected a very small number of his other socii veterans for citizenship (Cic. Pro Balbo 48), but there was no blanket grant of citizenship. Of course there wasn’t, this issue remained substantially unsolved until the Social War (91-87BC); if Roman levies had been calmly minting new citizens out of thousands of Italians through the 90s, there would hardly have been a cause for the Social War.

Instead, citizenship as a reward for service is an artifact of the imperial period and the auxilia. The Roman use of non-Roman, non-socii troops to supplement their armies was not new, but it emerged as a formalized, permanent part of the Roman army not during the civil wars – where such units where both ad hoc but also not nearly so numerous – but under the reign of Augustus, coming to form about half of the army by the end of his reign (Tac. Ann. 4.5; on the emergence of the auxilia, see I. Haynes, Blood of the Provinces (2013)). Indeed, as Haynes notes (op. cit. 49), it is actually only under Tiberius (r. 14-37) that we get direct evidence of citizenship grants to auxilia and the practice even then seems at least somewhat irregular (though it comes to be regularized).

In short that, the notion that Gaius Marius instituted the pattern of granting citizenship to serving non-citizens on discharge is simply wrong; that’s not in our sources. That doesn’t become consistent until Tiberius well over a century later. Gaius Marius did recruit volunteer capite censi into his army once but didn’t make a habit of it and as such isn’t a major reformer so much as a key step in a slow process of change which reaches its decisive point probably under Augustus, more than half a century after Gaius Marius died. He wasn’t the first to do either thing, whatever our sources say.

Bret Devereaux, “Collections: The Marian Reforms Weren’t a Thing”, A Collection of Unmitigated Pedantry, 2023-06-30.


    1. For more on the dynamics of this, see N. Rosenstein, Rome at War (2004), as this is part of his central argument.

    2. For textual references, see Brunt, Italian Manpower (1971), 636-7.

    3. We do not know how that process would have accounted for the massive expansion of the Roman citizen class due to the Social War. But evidently it did!

    4. These were happily already compiled by Brunt, op. cit., 394.

    5. “Did velites Really Disappear in the Late Roman Republic?” Historia 70 (2021).

    6. J.B. McCall, The Cavalry of the Roman Republic (2002), 100-113..

June 17, 2025

QotD: What is a “tank”?

Filed under: Britain, History, Military, Quotations, Weapons, WW1 — Tags: , , , , — Nicholas @ 01:00

… the tank was a direct response to the battlefield conditions of WWI, in particular the trench stalemate on the Western front. The idea of some kind of armored “land cruiser” (potentially armed with machine guns) had been floated before WWI but never seriously considered and developed on, but serious development only began in 1915 with the formation of the Landship Committee early that year. Famously, they needed a code-name for their planned vehicle and opted first for “water carrier” and then for “tank”, thus giving the tank its peculiar English name.

And we should stop to note that as with any question of definition, this one too is language-sensitive. The exact confines of a term vary from one language to another; kampfpanzer, for instance is not necessarily an exact synonym for “tank”.

In any event, the basic demands of early tanks were dictated by the realities of the Western Front: a tank needed to be able to resist small arms fire (particularly machine guns), deliver direct supporting fire itself, it needed to be able to move on the muddy, artillery-flattened ground and it needed to be able to cross a trench. This last requirement – the need to be able to both climb a parapet (usually c. 4ft) and then cross over an 8ft wide trench – was significant in the design of early tanks.

Those factors in turn dictated a lot of the design of early tanks. The armor demands of resisting small arms fire meant that the vehicle would be heavy (and indeed, as soon as tanks appeared amongst Allied troops, their German opponents began introducing more powerful bullets, like the K bullet and later the 13.2mm anti-tank round fired from the Mauser 1918 T-Gewehr). And here is the first advantage of tracks. The weight of a vehicle is distributed along all of the area of contact it has with the ground; with tires that area is limited to the bottom of the tire so the total area of ground contact is fairly low, which is fine for most vehicles.

But tanks are heavy. Really heavy. Even something like the Renault FT could mass around 7 tons and by later standards that would be classified as a tankette (a “mini-tank” as it were); by WWII, medium tanks often clocked in around 30 tons. If you put a vehicle like that on tires, you are going to create a LOT of pressure on those small points of contact. That might still be OK if you are just going to drive on roads and other firm surfaces which can take the pressure. But remember: tanks were designed for the Western Front, which looks like this.

Fortunately for the landship committee, this wasn’t a new problem: farming tractors were also heavy and also had to operate in churned up (in this case, plowed) soft soil; the heaviest of these vehicles had much the same problem and the solution was continuous tracks or “treads”. When kept properly tensioned – tune in, by the by, to Nicholas “The Chieftain” Moran’s YouTube for more than you ever want to know about track tension – the track distributes the weight of the tank across the entire section of the track touching the ground, which reduces the ground pressure at any given point, allowing a big heavy tank to roll over terrain where even a much lighter wheeled vehicle would get stuck.

This is one of those points where the functionality of a tank (what a tank does) has such a strong influence on design that the design implications of the functionality become part of the definition: a tank has to be heavily armored and has to be able to move off-road and as a result has to be tracked, not wheeled. One might be able to imagine some sort of exotic technology that might make it possible to do all of the things a tank does without tracks, but we don’t have that yet.

The other factor was fire. I’ve mentioned this before, but one of the significant background factors of WWI is that a lot of the belligerents misjudged the kind of artillery they’d need for a general European war. Not to get too deep into the weeds here, but most of the belligerents expected a relatively rapid war of maneuver and so thought that light, direct-fire artillery like the famed French ’75 (the Matériel de 75mm Mle 1897) would be the most useful. Those guns could be moved quickly and could deliver a lot of quick firepower on static or moving formations of enemy infantry in support of friendly infantry.

The problem is that in the conditions of trench warfare, those guns – as they were configured, at least – were far less useful. They were, first off, much shorter in range which meant they had to be brought dangerously far forward to do their direct fire role – often so far forward they could be engaged by enemy rifles and machine guns. This was compounded by the fact that direct fire at range was ineffective against trench works (which are dug down into the earth). But at the same time, the value of rapid firing (because these lighter guns could fire a lot faster than the heavy, indirect fire artillery) direct fire artillery remained high, if only you could get it to the fight.

This was also a problem a tank could solve: as a mobile, armored platform it could move a rapid-firing direct fire gun forward without immediately being knocked out by enemy small arms to support the infantry. There is, I should note, early complexity on this point, with both “male” (heavy direct fire cannon focused) and “female” (machine gun focused) tanks in WWI though in the end “hermaphrodite” designs with both capabilities (but much more focus on the main cannon) triumph, so that’s what we’ll focus on.

And that gets us the fundamental role structure for tanks: enough armor to resist enemy small arms (but with the understanding that some weapons will always be effective against the tank), enough mobility to cross the churned up battlefield and some direct fire capability to support the infantry crossing it at the same time.

Bret Devereaux, “Collections: When is a ‘Tank’ Not a Tank?”, A Collection of Unmitigated Pedantry, 2022-05-06.

June 11, 2025

QotD: “Pike and Shot” in the early gunpowder era

Filed under: Europe, Gaming, History, Military, Quotations, Weapons — Tags: , , , , , — Nicholas @ 01:00

… this is why the pike[-armed infantry] fought in squares: it was assumed the cavalry was mobile enough to strike a group of pikemen from any direction and to whirl around in the empty spaces between pike formations, so a given pike square had to be able to face its weapons out in any direction or, indeed, all directions at once.

Instead, pike and shot were combined into a single unit. The “standard” form of this was the tercio, the Spanish organizational form of pike and shot and one which was imitated by many others. In the early 16th century, the standard organization of a tercio – at least notionally, as these units were almost never at full strength – was 2,400 pikemen and 600 arquebusiers. In battle, the tercio itself was the maneuver unit, moving as a single formation (albeit with changing shape); they were often deployed in threes (thus the name “tercio” meaning “a third”) with two positioned forward and the third behind and between, allowing them to support each other. The normal arrangement for a tercio was a “bastioned square” with a “sleeve of shot”: the pikes formed a square at the center, which was surrounded by a thin “sleeve” of muskets, then at each corner of the sleeve there was an additional, smaller square of shot. Placing those secondary squares (the “bastions” – named after the fortification element) on the corner allowed each one a wide potential range of fire and would mean that any enemy approaching the square would be under fire at minimum from one side of the sleeve and two of the bastions.

That said, if drilled properly, the formation could respond dynamically to changing conditions. Shot might be thrown forward to provide volley-fire if there was no imminent threat of an enemy advance, or it might be moved back to shelter behind the square if there was. If cavalry approached, the square might be hollowed and the shot brought inside to protect it from being overrun by cavalry. In the 1600s, against other pike-and-shot formations, it became more common to arrange the formation linearly, with the pike square in the center with a thin sleeve of shot while most of the shot was deployed in two large blocks to its right and left, firing in “countermarch” (each man firing and moving to the rear to reload) in order to bring the full potential firepower of the formation to bear.

Indeed it is worth expanding on that point: volley fire. The great limitation for firearms (and to a lesser extent crossbows) was the combination of frontage and reloading time: the limited frontage of a unit restricted how many men could shoot at once (but too wide a unit was vulnerable and hard to control) and long reload times meant long gaps between shots. The solution was synchronized volley fire allowing part of a unit to be reloading while another part fired. In China, this seems to have been first used with crossbows, but in Europe it really only catches on with muskets – we see early experiments with volley fire in the late 1500s, with the version that “catches on” being proposed by William Louis of Nassau-Dillenburg (1560-1620) to Maurice of Nassau (1567-1625) in 1594; the “countermarch” as it came to be known ends up associated with Maurice. Initially, the formation was six ranks deep but as reloading speed and drill improved, it could be made thinner without a break in firing, eventually leading to 18th century fire-by-rank drills with three ranks (though by this time these were opposed by drills where the first three ranks – the front kneeling, the back slightly offset – would all fire at once but with different sections of the line firing at different times (“fire-by-platoon”)).

Coming back to Total War, the irony is that while the basic components of pike-and-shot warfare exist in both Empire: Total War and for the Empire faction in Total War: Warhammer, in both games it isn’t really possible to actually do pike-and-shot warfare. Even if an army combines pikes and muskets, the unit sizes make the kind of fine maneuvers required of a pike-and-shot formation impossible and while it is possible to have missile units automatically retreat from contact, it is not possible to have them pointedly retreat into a pike unit (even though in Empire, it was possible to form hollow squares, a formation developed for this very purpose).

Indeed if anything the Total War series has been moving away from the gameplay elements which would be necessary to make representing this kind of synchronized discipline and careful formation fighting possible. While earlier Total War games experimented with synchronized discipline in the form of volley-fire drills (e.g. fire by rank), that feature was essentially abandoned after Total War: Shogun 2‘s Fall of the Samurai DLC in 2012. Instead of firing by rank, musket units in Total War: Warhammer are just permitted to fire through other members of their unit to allow all of the soldiers in a formation – regardless of depth or width – to fire (they cannot fire through other friendly units, however). That’s actually a striking and frustrating simplification: volley fire drills and indeed everything about subsequent linear firearm warfare was focused on efficient ways to allow more men to be actively firing at once; that complexity is simply abandoned in the current generation of Total War games.

Bret Devereaux, “Collection: Total War‘s Missing Infantry-Type”, A Collection of Unmitigated Pedantry, 2022-04-01.

June 5, 2025

QotD: Political institutions of the late western Roman Empire

… last week we noted how the collapse of the Roman Empire in the West did not destroy the Roman cultural sphere so much as accelerate its transformation (albeit into a collection of fragmented fusion cultures which were part “Roman” mixed with other things), it did bring an end to the Roman state in the west (but not the east) and an end to Roman governance. But here too, we have to be careful in defining what that governance meant, because the Roman Empire of August, 378 AD was not the Roman Empire of August, 14 AD. This is a point that is going to come up again and again because how one views the decline of the fifth and sixth centuries depends in part on what the benchmark is: are we comparing it to the empire of Hadrian (r. 117-138) or the empire of Valentinian (r. 364-375)? Because most students are generally more familiar with the former (because it tends to be get focused on in teaching), there is a tendency to compare 476 directly with Rome under the Nervan-Antonines (96-192) without taking into account the events of the third and early fourth century.

Roman rule as effectively codified under the first emperor, Augustus (r. 31BC – 14AD) was relatively limited and indirect, not because the Romans believed in something called “limited government” but because the aims of the Roman state were very limited (secure territory, collect taxes) and the administrative apparatus for doing those things was also very limited. The whole of the central Roman bureaucracy in the first century probably consisted of just a few hundred senatorial and equestrian officials (supported, of course, by the army and also several thousand enslaved workers employed either by the state directly or in the households of those officials) – this for an empire of around 50 million people. Instead, day to day affairs in the provinces – public works, the administration of justice, the regulation of local markets, etc. – were handled by local governments, typically centered in cities (we’ll come back to them in a moment). Where there were no cities, the Romans tended to make new ones for this purpose. Roman officials could then interact with the city elites (they preferred oligarchic city governments because they were easier to control) and so avoid having to interact directly with the populace in a more granular way unless there was a crisis.

By contrast, the Roman governance system that emerges during the reigns of Diocletian (r. 284-305) and Constantine (r. 306-337) was centralized and direct. The process of centralizing governance had been going on for some time, really since the beginning of the empire, albeit slowly. The Constitutio Antoniniana (212), which extended Roman citizenship to all free persons in the empire, in turn had the effect of wiping out all of the local law codes and instead extending Roman law to cover everyone and so doubtless accelerated the process.

During the Crisis of the Third Century (235-284) this trend accelerates substantially; the sources for this period are relatively poor, making it hard to see this process clearly. Nevertheless, the chaotic security situation led Roman generals and usurpers to make much greater demands of whatever local communities were in their reach, while at the same time once in power, emperors sought to draw a clearer distinction between their power and that of their subordinates in an effort to “coup proof” their regimes. That new form of Roman rule was both completed and then codified by Diocletian (r. 284-305): the emperor was set visually apart, ruling from palaces in special regalia and wearing crowns, while at the same time the provinces were reorganized into smaller units that could be ruled much more directly.

Diocletian intervened in the daily life of the empire in a way that emperors before largely had not. When his plan to reform the Roman currency failed, sparking hyper-inflation (whoops!), Diocletian responded with his Edict on Maximum Prices, an effort to fix the prices of many goods empire wide. Now previous emperors were not averse to price fixing, mind you, but such efforts had almost always been restricted to staple goods (mostly wheat) in Rome itself or in Italy (typically in response to food shortages). Diocletian attempted to enforce religious unity by persecuting Christians; his successors by the end of the century would be attempting to enforce religious unity by persecuting non-Christians. Whereas before taxes had been assessed on communities, Diocletian planned a tax system based on assessments of individual landholders based on a regular census; when actually performing a regular census proved difficult, Constantine responded by mandating that coloni – the tenant farmers and sharecroppers of the empire – must stay on the land they had been farming so that their landlords would be able to pay the taxes, casually abrogating a traditional freedom of Roman citizens for millions of farmers out of administrative convenience. Of course all of this centralized direction demanded bureaucrats and the bureaucracy during this period swelled to probably around 35,000 officials (compared to the few hundred under Augustus!).

All of this matters here because it is this kind of government – centralized, bureaucratic, religiously framed and interventionist, which the new rulers of the fifth century break-away kingdoms will attempt to emulate. They will mostly fail, leading to a precipitous decline in state capacity. This process worked differently in different areas: in Britain, the Roman government largely withered away from neglect and was effectively gone before the arrival of the Saxons and Angles, a point made quite well by Robin Flemming in the first chapter of Britain after Rome (2010), while in Spain, Gaul, Italy and even to an extent North Africa, the new “barbarian” rulers attempted to maintain Roman systems of rule.

This is thus an odd point where the “change and continuity” and “decline and fall” camps can both be right at the same time. There is continuity here, as new kings mostly established regimes that used the visual language, court procedure and to a degree legal and bureaucratic frameworks of Late Roman imperial rule. On the other hand, those new kingdoms fairly clearly lacked the resources, even with respect to their smaller territories, to engage in the kind of state activity that the Late Roman state had, for instance, towards the end of the fourth century. Instead, central administration largely failed in the West, with the countryside gradually becoming subject to local rural magnates (who might then be attached to the king) rather than civic or central government.

The problem rulers faced was two-fold: first that the Late Roman system, in contrast to its earlier form, demanded a large, literate bureaucracy, but the economic decline of the fifth century (which we’ll get to next time) came with a marked decline in literacy, which in turn meant that the supply of literate elites to staff those positions was itself shrinking (while at the same time secular rulers found themselves competing with the institutional Church for those very same literate elites). Second – and we’ll deal with this in more depth in just a moment – Roman rule had worked through cities, but all over the Roman Empire (but most especially in the West), cities were in decline and the population was both shrinking and ruralizing.

That decline in state capacity is visible in a number of different contexts. Bryan Ward-Perkins (Rome and the End of Civilization (2005), 148ff) notes for instance a sharp decline in the size of Churches, which for Christian rulers (both the post-Constantine emperors and the new “barbarian” kings) were major state building projects meant to display either royal or local noble wealth and power; Church size really only reaches Late Roman equivalent in the west (an important caveat here, to be sure) in the ninth century. In this kind of context it is hard to say that Visigothic or Merovingian rulers are actually just doing a different form of rulership because they’re fairly clearly not – they just don’t have the resources to throw at expensive building projects, even when you adjust for their smaller realms.

Nor is it merely building projects. Under Constantine, the Romans had maintained a professional army of around 400,000 troops. Much of the success of the Roman Empire had been its ability to provide “public peace” within its borders (at least by the relatively low standards of the ancient world). While the third century had seen quite a lot of civil war and the in the fourth century the Roman frontiers were cracking, for much of the empire the legions continued to do their job: war remained something that happened far away. This was a substantial change from the pre-Roman norm where war was a regular occurrence basically everywhere.

The kingdoms that emerged from the collapse of Roman rule proved incapable of either maintaining a meaningful professional army or provisioning much of that public peace (though of course the Roman state in the west had also proved incapable of doing this during the fifth century). Instead those kingdoms increasingly relied on armies led by (frequently mounted) warrior-aristocrats, composed of a general levy of the landholding population. We’ve actually discussed some of the later forms of this system – the Anglo-Saxon fyrd and the Carolingian levy system – already; those systems are useful reference points because they’re quite a bit better attested in our evidence and reflect many of the general principles of how we suppose earlier armies to have been organized.

The shift to a militia army isn’t necessarily a step backwards – the army of the Middle Roman Republic had also been a landholder’s militia – except that in this case it also marked a substantial decrease in scale. Major Merovingian armies – like the one that fought at Tours in 732 – tended to be around 10,000-20,000 men (mostly amateurs), compared to Late Roman field armies frequently around 40,000 professional soldiers or the astounding mobilizations of the Roman Republic (putting around 225,000 – that is not a typo – citizen-soldiers in the field in 214BC, for instance). Compared to the armies of the Hellenistic Period (323-31BC) or the Roman Empire, the ability of the post-Roman kingdoms to mobilize force was surprisingly limited and the armies they fielded also declined noticeably in sophistication, especially when it came to siege warfare (which of course also required highly trained, often literate engineers and experts).

That said, it cannot be argued that the decline of “public peace” had merely begun in the fifth century. One useful barometer of the civilian sense of security is the construction of city walls well within the empire: for the first two centuries, many Roman cities were left unwalled. But fresh wall construction within the Empire in places like Northern Spain or Southern France begins in earnest in the third century (presumably in response to the Crisis) and then intensifies through the fifth century, suggesting that rather than a sudden collapse of security, there had been a steady but significant decline (though again this would thus place the nadir of security somewhere in the early Middle Ages), partially abated in the fourth century but then resumed with a vengeance in the fifth.

Consequently the political story in the West is one of an effort to maintain some of the institutions of Roman governance which largely fails, leading to the progressive fragmentation and localization of power. Precisely because the late Roman system was so top-heavy and centralized, the collapse of central Roman rule mortally wounded it and left the successor states of Rome with much more limited resources and administration to try to achieve their aims.

Bret Devereaux, “Collections: Rome: Decline and Fall? Part II: Institutions”, A Collection of Unmitigated Pedantry, 2022-01-28.

May 30, 2025

QotD: “Have fun storming the castle!”

… the expected threat is going to shape the calculation of what margin of security is acceptable, which brings us back to our besieger’s playbook. You may recall when we looked at the Assyrian siege toolkit, that many of the most effective techniques assumed a large, well-coordinated army which could dispose of a lot of labor (from the soldiers) on many different projects at once while also having enough troops ready to fight to keep the enemy bottled up and enough logistic support to keep the army in the field for however long all of that took. In short, this is a playbook that strong, well-organized states (with strong, well-organized armies) are going to excel at. But, as we’ve just noted, the castle emerges in the context of fragmentation which produces a lot of little polities (it would be premature to call them states) with generally quite limited administrative and military capacity; the “big army” siege playbook which demands a lot of coordination, labor and expertise is, for the most part, out of reach.

Clifford Rogers has already laid out a pretty lay-person accessible account of the medieval siege playbook (in Soldiers’ Lives Through History: The Middle Ages (2007), 111-143; the book is pricey, so consider your local library), so I won’t re-invent the wheel here but merely note some general features. Rogers distinguishes between hasty assaults using mostly ladders launched as soon as possible as a gamble with a small number of troops to try to avoid a long siege, and deliberate assaults made after considerable preparation, often using towers, sapping, moveable shelters designed to resist arrow fire and possibly even catapults. We’ve already discussed hasty assaults here, so let’s focus on deliberate assaults.

While sapping (tunneling under and collapsing fortifications) remained in use, apart from filling in ditches, the mole-and-ramp style assaults of the ancient world are far less common, precisely because most armies (due to the aforementioned fragmentation combined with the increasing importance in warfare of a fairly small mounted elite) lacked both the organizational capacity and the raw numbers to do them. The nature of these armies as retinues of retinues also made coordination between army elements difficult. The Siege of Antioch (1097-8) [during] the First Crusade is instructive; though the siege lasted nine months, the crusaders struggled to even effectively blockade the city until a shipment of siege materials (lumber, mostly) arrived in March of 1098 (five months after the beginning of the siege). Meanwhile, coordinating so that part of the army guarded the exits of the city (to prevent raids by the garrison) while the other part of the army foraged supplies had proved mostly too difficult, leading to bitter supply shortages among the crusaders. Even with materials delivered to them, the crusaders used them to build a pair of fortified towers blocking exits from the city, rather than the sort of elaborate sapping and ramps; the city was taken not by assault but by treachery – a very common outcome to a siege! – when Bohemond of Taranto bribed a guard within the city to let the crusaders sneak a small force in. All of this despite the fact that the crusader army was uncommonly large by medieval European standards, numbering perhaps 45,000.

Crucially, in both hasty and deliberate assaults, the emphasis for the small army toolkit tends to be on escalade (going over the walls) using ladders or moveable wooden towers, rather than the complex systems of earthworks favored by the “big army” siege system or breaching – a task which medieval (or ancient!) artillery was generally not capable of. The latter, of course, is a much more certain method of assault – give a Roman army a few months and almost any fortress could be taken with near certainty – but it was a much more demanding method in terms of the required labor and coordination. Thwarting escalade is mostly a question of the height of defenses (because a taller wall requires a taller ladder, tower or ramp) and good fields of fire for the defenders (particularly the ability to fire at attackers directly up against the wall, since that’s where the ladders are likely to be).

The other major threat to castle walls (apart from the ever-present threat of sapping) was catapults, but I want to deal with those next time for reasons that I suspect will make sense then. For now it is worth simply noting that catapults, even the mighty trebuchets of the 14th century were generally used to degrade defenses (smashing towers, destroying crenellation, damaging gatehouses) rather than to produce breaches. They could in some cases do that, but only with tremendous effort and a lot of time (and sometimes not even then). Consequently, for most castles the greatest threat remained escalade, followed by treachery or starvation, followed by sapping, followed by artillery.

Bret Devereaux, “Collections: Fortification, Part III: Castling”, A Collection of Unmitigated Pedantry, 2021-12-10.

May 24, 2025

QotD: Comparing living standards and technology between the Roman period and medieval western Europe

The first crucial question here is exactly when in the Middle Ages one means. There is a tendency to essentialize the European Middle Ages, often suggesting that the entire period reflected a regression from antiquity, but the medieval period is very long, stretching about a thousand years (c. 500 to c. 1500 AD). There is also the question of where one means; the trajectory of the eastern Mediterranean is much different than the western Mediterranean. I am going to assume we really mean western Europe.

While I am convinced that the evidence suggests there was a drop in living standards and some loss of technology in the immediate aftermath of the collapse of the Roman Empire in the West, most of that drop was fairly short-lived. But exactly when development in medieval Europe meets and then exceeds the same for antiquity (typically we’re comparing the second century height of the Roman Empire) also depends on exactly what kind of measure is being used.

If the question, for instance, is agricultural productivity on a per capita basis (the most important component of per capita economic production), medieval Europe probably moves ahead of the Roman Empire fairly quickly with the introduction of better types of plow and widespread use of watermills for grinding grain. My understanding is that by c. 1000 AD, watermills show up fairly frequently in things like monastic charters, suggesting they were reasonably widespread (the Romans used watermills too, though their spread was uneven) and by that point, plow technology had also moved forward, mostly through the development of plow types better suited to Europe’s climate. So as best we can tell, the farmer of c. 1000 AD had better tools than his Roman predecessors and probably had such for some time.

If the question is technology and engineering, once again what you see depends on where you look. Some technologies don’t appear to have regressed much, if at all, ironworking being one example where it seems like little to nothing was lost. On the other hand, in western Europe, the retreat in architecture is really marked and it is hard to say when you would judge the new innovations (like flying buttresses) to have equaled some of the lost ones (like concrete); certainly the great 12th/13th century Cathedrals (e.g. Notre Dame, the Duomo di Sienna and I suppose the lesser Duomo di Firenze, if we must include it) seem to me to have matched or exceeded all but perhaps the biggest Roman architectural projects. Though we have to pause here because in many cases the issue was less architectural know-how (though that was a factor) as state capacity: the smaller and more fragmented states of the European Middle Ages didn’t have the resources the Roman Empire did.

If one instead looks for urbanization and population as the measure of development, the Middle Ages looks rather worse. First and Second century Rome is probably unmatched in Europe until the very late 1700s, early 1800s, when first London (c. 1800) and Paris (c. 1835) reach a million. So one looking for matches for the large cities and magnificent municipal infrastructure of the Romans will have rather a long wait. Overall population is much more favorable as a measurement to the Middle Ages. France probably exceeds its highest Roman population (c. 9m) by or shortly after 1000AD, Italy (c. 7.5m) by probably 1200; Spain is the odd one out, with Roman Hispania (est. 7.5m) probably only matched in the early modern period. So for most of the Middle Ages you are looking at a larger population, but also a more rural one. That’s not necessarily bad though; pre-modern cities were hazardous places due to sanitation and disease; such cities had a markedly higher mortality, for instance. On the flip-side, fewer, smaller cities means less economic specialization.

So one’s answer often depends very much on what one values most. For my own part, I’d say by 1000 or 1100 we can very safely say the “recovery” phase of the Middle Ages is clearly over (and I think you could make an argument for setting this point substantially earlier but not meaningfully later), though even this is somewhat deceptive because it implies that no new technological ground was being broken before then, which is not true. But the popular conception that the whole of the Middle Ages reflects a retreat from the standards of antiquity is to be discarded.

Bret Devereaux, Referenda ad Senatum: August 6, 2021: Feelings at the Fall of the Republic, Ancient and Medieval Living Standards, and Zombies!”, A Collection of Unmitigated Pedantry, 2021-08-06.

May 18, 2025

QotD: Processing raw wool to make woollen cloth

Filed under: Europe, Greece, History, Quotations — Tags: , , , , — Nicholas @ 01:00

Now when we left our wool it had just been shorn from our sheep. It is however, raw, oily from being on the sheep, likely still somewhat dirty, of uneven grades and types and also of course contains the other two fibers in the fleece (hair and kemp) which need to be removed before it can be used. The various processes used to get wool ready for spinning (or for sale) were sometimes collectively called “dressing” and involved various methods of sorting, scouring, combing, and washing.

The first step is sorting, dividing the raw wool into grades and types based on any number of factors, including fiber length, color, texture, crimp, strength, ability to take dye and so on. Different parts of the sheep produce wool with somewhat different qualities in this regard, but there are also differences based on the sex of the sheep, their health, age, diet, and for ewes whether they have had lambs. In order to get the best results in spinning (or the best value in selling) it is necessary to separate these grades out, grouping like wool with like. Too much mixing of fiber quality can make the end-product textile patchy in color, texture and its ability to take dye (the last one being quite visible, of course) and is to be avoided. This sorting was generally done by hand.

At this point, with the wool sorted, it could be sold, or further processed. The key question at this point was if the wool was to be washed or scoured (it would be combed or carded in either case, but this decision generally has to be made at this point). Scouring removes the lanolin (an oil secreted by the sheep which effectively waterproofs their wool) and other impurities. Leaving the lanolin in the wool can help with the spinning process and also to preserve the wool, but if the wool is to be dyed before being spun (for instance, if it is to be made into colored yarn rather than dyed as a whole fabric after weaving), it must be washed (or the lanolin will prevent the dye from sticking). Scouring could also be useful for wool that was going to be transported; in some cases the lanolin and other impurities might amount for up to 40% of the total weight of the raw wool (Gleba, op. cit. 98).

Practices in this regard clearly differed. In Greece, wool seems often to have been spun unwashed and women might use an epinetron, a ceramic thigh-protector, to keep the grease of the wool roving off of their clothes. On the flip side, both Varro (Rust 2.2.18) and Columella (De Rust. 11.35) assume that wool is generally to be washed (though they are thinking of wool being sold by large estates for commercial purposes and thus may have dyeing in mind). J.S. Lee notes that in medieval England wools with longer staples (that is, that forms into longer clusters or locks of fibers) were unscoured while short staple wools (which might be used in knitting) were more likely to be scoured. Scouring might be done on a small scale in the home or on a larger scale by either producers (before sale) or by clothiers and other purchasers (before dyeing).

Pre-modern scouring generally meant bathing the wool in a solution of warm water along with some agent that would remove the lanolin and other greases and impurities. The most common scouring agent was urine, something that pre-modern communities had in abundance; the ammonia content of urine allows it to break up and wash away the greases in the wool. Alternately, in the ancient period, the soapwort was sometimes used, as soaking its leaves in water could create a form of soap. By the early modern period, potash might also be used for this purpose, but even in the 1500s, it seems that urine was the most commons scouring agent in England. The process is smelly but generally fairly simple: the wool is allowed to sit in a solution of the scouring agent (again, generally urine) and warm water. Scoured wool would need to be re-oiled after it was dried to lubricate and protect the wool; typically olive oil was used for this purpose (both during the ancient and early modern periods) although J.S. Lee notes (op. cit. 45) that in the earlier parts of the Middle Ages, butter might be used instead in parts of Europe where olive oil was difficult to obtain in quantity.

Next, the wool has to be carded or combed, to remove any unusable or imperfect fibers or dirt, along with separating the strands by length and getting any tangles out before spinning. Let’s treat combing first, as it is the older of the two methods. Wool combs (in the ancient world, these were generally made of wood, bone or horn, but combs from the medieval period onward seem to generally be made with metal teeth projecting through a wooden handle) were used in pairs with the aid of a lubricant (grease, olive oil; these days there are specialty “combing oils”). One comb, the “moving comb” would be worked through the wool while the other comb which held the wool together was kept stationary, sometimes on the comber’s knee; in some cases it would secured to a fixed post (called a “combing stock”). You can see a demonstration of the basic method here.

Carding came later, though I have found no consensus on how much later. Gleba (Textile Production, 98) suggests that carding may have been in use in Italy by the end of the Roman period, while J.S. Lee (op. cit., 45) supposes carding to have been adopted into Europe via borrowing from the Islamic cotton industries of Sicily or Spain in the late 1200s. These suggestions are, of course, not mutually exclusive but I am hesitant to render a verdict between them. In any event, by late Middle Ages, carding is also a reasonably common processing method. Hand carders are generally wider, more paddle-like wooden boards with handles and pierced through by iron teeth; the earliest carders used teasel heads in place of the iron teeth (and the word “card” here actually comes from Latin, carduus, meaning thistle, referring to the use of teasel heads). Like combs they are used in pairs, with the wool placed on one, often held on the thigh, and then the other carder is drawn over the first until the wool is ready for spinning. You can see a demonstration here, and a direct comparison of the two kinds of tools here.

Though obviously quite similar methods (albeit with different tools) the two methods produce importantly different results in a couple of different ways. Both methods will remove remaining hair or kemp along with dirt or other particles that aren’t wool. But combed fibers generally produced stronger yarns (as I understand it, this is partly because it doesn’t straighten them out so much, allowing them to better tangle together during spinning), but combing is also a bit more wasteful in material terms, as shorter fibers are discarded in the process. Consequently, once both processes were available, they might both be used (and still are by practitioners of traditional wool-working today, as the video links above show), with combing more often used for long-fibered wools and carding for short-fibered wools.

Bret Devereaux, “Collections: Clothing, How Did They Make It? Part II: Scouring in the Shire”, A Collection of Unmitigated Pedantry, 2021-03-12.

May 12, 2025

QotD: The Gracchi

Filed under: Europe, Government, History, Law, Quotations — Tags: , , , — Nicholas @ 01:00

Tiberius and Gaius Gracchus’ tribunates – both of which ended with them being killed (I think it is perfectly fair to say “murdered”) – typically occupy a position in survey coverage of the Roman Republic as the inciting incident that begins (if not quite causes) the collapse of the Republic itself, the first outbreak of violence in Roman politics, the first escalation in a spiral that would lead to the repeated outbreak of civil war in the first century. And that is certainly how they were understood in antiquity; both Plutarch and Appian make this claim (App. BCiv. 1.17; Plut. Ti. Gracch. 20.1). And in part because the sources (again, Plutarch and Appian) frame the Gracchi quite positively and in part, to be frank, because their reforms are generally “left-coded” in a university environment that is inherently sympathetic to left-coded things, the Gracchi tend to come across to students as righteous reformers killed by foolish, hidebound and greedy reactionary Roman senators. And that is, to be fair, a potentially valid reading (if employed with some caveats).

But it is also generally the only reading students get and it is not the only valid reading of the evidence we have. So for this week, I want to complicate the Gracchi, presenting some of the details that often get left out of introductory surveys. In particular, we’re going to discuss the problems that Tiberius Gracchus’ key law, the Lex Sempronia Agraria was designed to solve and I am going to argue that Tiberius was attempting to solve a problem that didn’t exist (though he couldn’t have known it), a view which is now quite common in the scholarship but almost entirely absent in how we tend to teach the Gracchi.

But more to the point, I am going to argue that Tiberius and Gaius Gracchus’ behavior did, in fact, violate the norms of the Republic and that it was not entirely unreasonable for the senatorial elite to conclude these men, in their unrestrained and nakedly ambitious approach to politics, represented a real threat to the Roman political order and that they might be aiming for something approaching a “soft coup” in the context of a political order whose features – including the democratic ones – worked through an unwritten constitution of norms (what the Romans called the mos maiorum, “the customs of the ancestors”), which both brothers actively undermined. The claim that the Gracchi threatened to make themselves tyrants was not an empty claim and that is the dark reflection of their role as well intentioned reformers.

In short, then, if the only version of the Gracchi you have encountered is that of the near-saintly, then martyred proto-progressive reformers, that’s not quite the complete picture (and the left-coding of their ideas is decidedly anachronistic). Naturally, in trying to complicate this picture, I am essentially taking the position of prosecutor, so this “take” is going to be far more negative on the Gracchi than how I would, say, teach them in class or, indeed, how I regard them myself.

So the way we’re going to approach this problem is first to discuss the problem that Tiberius Gracchus thought he was addressing (and some of the issues there), before walking through the means he used to push forward the Lex Sempronia Agraria. Then I want to look at some of the wide-ranging laws proposed by Gaius Gracchus to assess the degree to which those laws cohere and ways we might understand his program and actions, potentially rather more negatively.

Bret Devereaux, “Collections: On the Gracchi, Part I: Tiberius Gracchus”, A Collection of Unmitigated Pedantry, 2025-01-17.

May 6, 2025

QotD: World War I shattered the European notion of what “war” actually was

Filed under: Europe, History, Military, Quotations, WW1 — Tags: , , — Nicholas @ 01:00

Echoes of those views continue to appear in western literature until the impersonal carnage of the Western Front seem to finally snuff them out. But it isn’t that the generations and generations before 1914 had never experienced war, but that war had changed.

We’ve actually talked before about just how profoundly our modern view of war and battle (and battlefields) is conditioned by the experience of the first world war and the vast literary production of the generation that went through those trenches. Certainly for English (and German and French, etc.) literature, World War I seems to almost snap the tradition in half, making everything before it feel trite and washing the whole of war literature in grim tones of field grey.

And, of course, that is the point. World War I was a new kind of war that shattered the old certainties born out of the old kinds of war. It is often a mistake to assume those old certainties had been born out of some eternal peace, but while the 1800s had not seen a general European war, they had seen many wars, in the many imperial possessions of European countries, on the edges of what the British or French considered “Europe” and also in the heart of Europe itself (not to mention a few dustups in the Americas). These were not peaceful societies confronting their first war and shocked by the experience, but very bellicose societies encountering for the first time a new sort of war and being stunned at how different it was from what they had expected, from the wars of their (recent!) past.

All of which is to say war, war really does change. And warriors with it.

Bret Devereaux, “Collections: The Universal Warrior, Part IIb: A Soldier’s Lot”, A Collection of Unmitigated Pedantry, 2021-02-05.

April 30, 2025

QotD: The experience of the infantryman through the ages

What about the other common difficulties of soldiering? How universal are those experiences: the bad food, long marches, heavy burdens and difficult labor and toil?

Well, here is where we come back to the note I made earlier about how “warring” and “soldiering” were different verbs with different meanings. After all, while soldiering implies these difficulties, warring doesn’t, necessarily. And it isn’t hard to see why – the warrior classes in these societies, often being aristocrats, generally didn’t do a lot of these things. It is, for instance, noted in the Roman sources when a general chose to eat the same food as his soldiers, because most Roman aristocrats didn’t when they served as generals or military tribunes. The privileges of rank and class applied.

And that’s something we see with medieval aristocrats too. On the one hand, Jean de Bueil talks about the “difficulties and travail” of war, but at the same time, Clifford Rogers notes one (fictional and lavish, but not outrageous) war party “suitable for a baron or banneret” included a chaplain, three heralds, four trumpeters, two drummers, four pages, two varlets (that is, servants for the pages), two cooks, a forager, a farrier, an armorer, twelve more serving men (with horses, presumably both as combatants and as servants), and a majordomo to manage them all – in addition to the one lord, three knights and nine esquires (C. Rogers, Soldiers’ Lives through History: the Middle Ages (2007), 28-9).

Jean le Bel (quoted in Rogers, op. cit.) contrasts the situation of the nobles in Edward III’s army (1327), where “one could see great nobility well served with a great plenty of dishes and sweets – such strange ones that I wouldn’t know to name or describe them. There one could see ladies richly adorned and nobly ornamented” while in the camp proper an open brawl between the regular soldiers from England and Hainault broke out and eventually turned into an open battle in which 316 died, but so segregated was the camp that, “most of the knights and of their masters were then at court, and knew nothing of this” (Rogers, 66-7). Likewise, except in fairly extreme positions, most of the ditch-digging, camp-building duties would fall to the common soldiers (and, as Roel Konijnendijk can quite accurately tell you, ditches are important! When in doubt, dig some ditches – or make others dig ditches for you).

That said, these differences are not merely confined to the high aristocrats. Marching under a heavy load is often given as one example of the quintessential “soldier experience”, but it seems that many Greek hoplites went to war with a personal slave or servant to carry their equipment for them, despite being infantrymen. The Romans carried equipment and supplies something closer to what a modern soldier might (both in terms of weight and also, apart from ammunition, in terms of what was carried), but then non-Roman sources like the Greek writer Polybius (18.18.1-7) or the Jewish writer Josephus (BJ 3.95) appear quite stunned with the amount of tools and equipment the Romans carry (and Polybius, by the by, is writing before Marius’ mules). Evidently the Roman impedementia was quite a bit heavier, though even the Macedonians carried much more than a Greek hoplite army (Note Engels, Alexander the Great and the Logistics of the Macedonian Army, 1978 on this).

Meanwhile, Jonathan Roth is quick to note (in The Logistics of the Roman Army at War (264 B.C. – A.D. 235) (1999)) that despite either bad or insufficient rations being a common complaint of soldiers, such complaints appear absent from Roman sources, even in the context of legionary mutinies. Indeed, the evidence suggests that Roman soldiers ate quite well, with fairly ample rations. In camp the Roman soldier’s diet was not so different from what he might eat in peacetime (especially once we get into the imperial period with legions stationed in semi-permanent bases); on the march they had to make do with bucellatum, a hard biscuit something like hardtack. But for many Italian peasants, the diet doesn’t seem to have been much worse – or much different – from what they ate in peacetime.

By way of sharp contrast to the plodding, heavily loaded but surely very lethal Roman legionary, the impis of the Zulu traveled fast, light and sometimes somewhat hungry. Zulu warriors generally carried only their equipment on the march, while supplies were carried by udibi, boys serving as porters. Even then, such supplies were minimal – the Zulu force that arrived at Rorke’s Drift (1879) had only been out six days, but none of the warriors in it had eaten in two. Such minimally supplied flying columns, moving fast and with considerable stealth (one cannot read anything on the Anglo-Zulu war without noticing how, even with cavalry scouts, Zulu impis seem so often just to appear next to British forces) were the norm for Zulu warfare. And to be clear, this wasn’t some “primitive” or underdeveloped form of war – the light and fast operational movements of the Zulu were intentional (much of it was a product of Shaka’s reforms) and very effective – albeit not so effective as to offset the massive advantages the British possessed in population, economic capacity or military technology. Nevertheless, not even every sort of common soldier was the heavily loaded, slow moving, well-fed ditch-digging sort like the Romans. The “soldier experience” needs to cover the lightly loaded and armed, fast moving, hungry, non-ditch-digging Zulu experience too.

And then of course when we consider nomadic peoples, we find that in many cases their lives on campaign were not that much different from their lives at peacetime, involving many of the same skills and activities.

In short, the experience of the drudgery of war – the bad food, long toil, heavy encumbrance and so on was all still quite contingent (or we might say “dependent”) on the society going to war. Social divisions mattered. Expectations about masculine behavior mattered. Military systems mattered. Yes, modern armies in the European tradition expect their soldiers to do a lot of labor and drudgery, but remember where that military system came from: it was the system of the common soldiers serving under the aristocrats who most certainly did not do those things but who did impose sharp, corporal discipline. Which, to be clear, doesn’t make this system ineffective – it was clearly effective. The point here is that it was socially contingent – a different society would have come up with a different system. And they did! The Early Modern European system is only one way to organize an army and historically speaking not even the most common.

Bret Devereaux, “Collections: The Universal Warrior, Part IIb: A Soldier’s Lot”, A Collection of Unmitigated Pedantry, 2021-02-05.

April 24, 2025

QotD: The Phalanx

… we need to distinguish what sort of phalanx because this is not the older hoplite phalanx in two very important ways: first, it is equipped and fights differently, but second it has a very different place in the overall tactical system: the Macedonian phalanx may be the “backbone” of a Hellenistic army, but it is not the decisive arm of the system.

So let’s start with the equipment, formation and fighting style. The older hoplite phalanx was a shield wall, using the large, c. 90cm diameter aspis and a one-handed thrusting spear, the dory. Only the front rank in a formation like this engaged the enemy, with the rear ranks providing replacements should the front hoplites fall as well as a morale force of cohesion by their presence which allowed the formation to hold up under the intense mental stress of combat. But while hoplites notionally covered each other with their shields, they were mostly engaged in what were basically a series of individual combats. As we noted with our bit on shield walls, the spacing here seems to have been wide enough that while the aspis of your neighbor is protecting you in that it occupies physical space that enemy weapons cannot pass through, you are not necessarily hunkered down shoulder-to-shoulder hiding behind your neighbor’s shield.

The Macedonian or sarisa-phalanx evolves out of this type of combat, but ends up quite different indeed. And this is the point where what should be a sentence or two is going to turn into a long section. The easy version of this section goes like this: the standard Macedonian phalangite (that is, the soldier in the phalanx) carried a sarisa, a two-handed, 5.8m long (about 19ft) pike, along with an aspis, a round shield of c. 75cm carried with an arm and neck strap, a sword as a backup weapon, a helmet and a tube-and-yoke cuirass, probably made out of textile. Officers, who stood in the first rank (the hegemones) wore heavier armor, probably consisting of either a muscle cuirass or a metal reinforced (that is, it has metal scales over parts of it) tube-and-yoke cuirass. I am actually quite confident that sentence is basically right, but I’m going to have to explain every part of it, because in popular treatments, many outdated reconstructions of all of this equipment survive which are wrong. Bear witness, for instance, to the Wikipedia article on the sarisa which gets nearly all of this wrong.

Wikipedia‘s article on the topic as of January, 2024. Let me point out the errors here.
1) The wrong wood, the correct wood is probably ash, not cornel – the one thing Connolly gets wrong on this weapon (but Sekunda, op. cit. gets right).
2) The wrong weight, entirely too heavy. The correct weight should be around 4kg, as Connolly shows.
3) Butt-spikes were not exclusively in bronze. The Vergina/Aigai spike is iron, though the Newcastle butt is bronze (but provenance, ????)
4) They could be anchored in the ground to stop cavalry. This pike is 5.8m long, its balance point (c. 1.6m from the back) held at waist height (c. 1m), so it would be angled up at something like 40 degrees, so anchoring the butt in the ground puts the head of the sarisa some 3.7m (12 feet) in the air – a might bit too high, I may suggest. The point could be brought down substantially if the man was kneeling, which might be workable. More to the point, the only source that suggests this is Lucian, a second century AD satirist (Dial Mort. 27), writing two centuries after this weapon and its formation had ceased to exist; skepticism is advised.
5) We’ll get to shield size, but assuming they all used the 60cm shield is wrong.
6) As noted, I don’t think these weapons were ever used in two parts joined by a tube and also the tube at Vergina/Aigai was in iron. Andronikos is really clear here, it is a talon en fer and a douille en fer. Not sure how that gets messed up.

Sigh. So in detail we must go. Let us begin with the sarisa (or sarissa; Greek uses both spellings). This was the primary weapon of the phalanx, a long pike rather than the hoplite‘s one-handed spear (the dory). And we must discuss its structure, including length, because this is a case where a lot of the information in public-facing work on this is based on outdated scholarship, compounded by the fact that the initial reconstructions of the weapon, done by Minor Markle and Manolis Andronikos, were both entirely unworkable and, I think, quite clearly wrong. The key works to actually read are the articles by Peter Connolly and Nicholas Sekunda.1 If you are seeing things which are not working from Connolly and Sekunda, you may safely discard them.

Let’s start with length; one sees a very wide range of lengths for the sarisa, based in part on the ancient sources. Theophrastus (early third century BC) says it was 12 cubits long, Polybius (mid-second century) says it was 14 cubits, while Asclepiodotus (first century AD) says the shortest were 10 cubits, while Polyaenus (second century AD) says that the length was 16 cubits in the late fourth century.2 Two concerns come up immediately: the first is that the last two sources wrote long after no one was using this weapon and as a result are deeply suspect, whereas Theophrastus and Polybius saw it in use. However, the general progression of 12 to 14 to 16 – even though Polyaenus’ word on this point is almost worthless – has led to the suggestion that the sarisa got longer over time, often paired to notions that the Macedonian phalanx became less flexible. That naturally leads into the second question, “how much is a cubit?” which you will recall from our shield-wall article. Connolly, I think, has this clearly right: Polybius is using a military double-cubit that is arms-length (c. 417mm for a single cubit, 834mm for the double), while Theophrastus is certainly using the Athenian cubit (487mm), which means Theophrastus’ sarisa is 5.8m long and Polybius’ sarisa is … 5.8m long. The sarisa isn’t getting longer, these two fellows have given us the same measurement in slightly different units. This shaft is then tapered, thinner to the tip, thicker to the butt, to handle the weight; Connolly physically reconstructed these, armed a pike troupe with them, and had the weapon perform as described in the sources, which I why I am so definitively confident he is right. The end product is not the horribly heavy 6-8kg reconstructions of older scholars, but a manageable (but still quite heavy) c. 4kg weapon.

Of all of the things, the one thing we know for certain about the sarisa is that it worked.

Next are the metal components. Here the problem is that Manolis Andronikos, the archaeologist who discovered what remains our only complete set of sarisa-components in the Macedonian royal tombs at Vergina/Aigai managed to misidentify almost every single component (and then poor Minor Markle spent ages trying to figure out how to make the weapon work with the wrong bits in the wrong place; poor fellow). The tip of the weapon is actually tiny, an iron tip made with a hollow mid-ridge massing just 100g, because it is at the end of a very long lever and so must be very light, while the butt of the weapon is a large flanged iron butt (0.8-1.1kg) that provides a counter-weight. Finally, Andronikos proposed that a metal sleeve roughly 20cm in length might have been used to join two halves of wood, allowing the sarisa to be broken down for transport or storage; this subsequently gets reported as fact. But no ancient source reports this about the weapon and no ancient artwork shows a sarisa with a metal sleeve in the middle (and we have a decent amount of ancient artwork with sarisae in them), so I think not.3

Polybius is clear how the weapon was used, being held four cubits (c. 1.6m) from the rear (to provide balance), the points of the first five ranks could project beyond the front man, providing a lethal forward hedge of pike-points.4 As Connolly noted in his tests, while raised, you can maneuver quite well with this weapon, but once the tips are leveled down, the formation cannot readily turn, though it can advance. Connolly noted he was able to get a English Civil War re-enactment group, Sir Thomas Glemham’s Regiment of the Sealed Knot Society, not merely to do basic maneuvers but “after advancing in formation they broke into a run and charged”. This is not necessarily a laboriously slow formation – once the sarisae are leveled, it cannot turn, but it can move forward at speed.

The shield used by these formations is a modified form of the old hoplite aspis, a round, somewhat dished shield with a wooden core, generally faced in bronze.5 Whereas the hoplite aspis was around 90cm in diameter, the shield of the sarisa-phalanx was smaller. Greek tends to use two words for round shields, aspis and pelte, the former being bigger and the latter being smaller, but they shift over time in confusing ways, leading to mistakes like the one in the Wikipedia snippet above. In the classical period, the aspis was the large hoplite shield, while the pelte was the smaller shield of light, skirmishing troops (peltastai, “peltast troops”). In the Hellenistic period, it is clear that the shield of the sarisa-phalanx is called an “aspis” – these troops are leukaspides, chalkaspides, argyraspides (“white shields”, “bronze shields”, “silver shields” – note the aspides, pl. of aspis in there). This aspis is modestly smaller than the hoplite aspis, around 75cm or so in diameter; that’s still quite big, but not as big.

Then we have some elite units from this period which get called peltastai but have almost nothing to do with classical period peltastai. Those older peltasts were javelin-equipped light infantry skirmishers. But Hellenistic peltastai seem to be elite units within the phalanx who might carry the sarisa (but perhaps a shorter one) and use a smaller shield which gets called the pelte but is not the pelte of the classical period. Instead, it is built exactly like the Hellenistic aspis – complete with a strap-suspension system suspending it from the shoulder – but is smaller, only around 65cm in diameter. These sarisa-armed peltastai are a bit of a puzzle, though Asclepiodotus (1.2) in describing an ideal Hellenistic army notes that these guys are supposed to be heavier than “light” (psiloi) troops, but lighter than the main phalanx, carrying a smaller shield and a shorter sarisa, so we might understand them as an elite force of infantry perhaps intended to have a bit more mobility than the main body, but still be able to fight in a sarisa-phalanx. They may also have had less body-armor, contributing that the role as elite “medium” infantry with more mobility.6

Finally, our phalangites are armored, though how much and with what becomes really tricky, fast. We have an inscription from Amphipolis7 setting out military regulations for the Antigonid army which notes fines for failure to have the right equipment and requires officers (hegemones, these men would stand in the front rank in fighting formation) to wear either a thorax or a hemithorakion, and for regular soldiers where we might expect body armor, it specifies a kottybos. All of these words have tricky interpretations. A thorax is chest armor (literally just “a chest”), most often somewhat rigid armor like a muscle cuirass in bronze or a linothorax in textile (which we generally think means the tube-and-yoke cuirass), but the word is sometimes used of mail as well.8 A hemithorakion is clearly a half-thorax, but what that means is unclear; we have no ancient evidence for the kind of front-plate without back-plate configuration we get in the Middle Ages, so it probably isn’t that. And we just straight up don’t know what a kottybos is, although the etymology seems to suggest some sort of leather or textile object.9

In practice there are basically two working reconstructions out of that evidence. The “heavy” reconstruction10 assumes that what is meant by kottybos is a tube-and-yoke cuirass, and thus the thorax and hemithorakion must mean a muscle cuirass and a metal-reinforced tube-and-yoke cuirass respectively. So you have a metal-armored front line (but not entirely muscle cuirasses by any means) and a tube-and-yoke armored back set of ranks. I would argue the representational evidence tends to favor this; we most often see phalangites associated with tube-and-yoke cuirasses, rarely with muscle cuirasses (but sometimes!) and not often at all in situations where they have the rest of their battle kit (helmet, shield, sarisa) as required for the regular infantry by the inscription but no armor.

Then there is the “light” reconstruction11 which instead reads this to mean that only the front rank had any body armor at all and the back ranks only had what amounted to thick travel cloaks. Somewhat ironically, it would be really convenient for the arguments I make in scholarly venues if Sekunda was right about this … but I honestly don’t think he is. My judgment rebels against the notion that these formations were almost entirely unarmored and I think our other evidence cuts against it.12

Still, even if we take the “heavy” reconstruction here, when it comes to armor, we’re a touch less well armored compared to that older hoplite phalanx. The textile tube-and-yoke cuirass, as far as we can tell, was the cost-cutting “cheap” armor option for hoplites (as compared to more expensive bell- and later muscle-cuirasses in bronze). That actually dovetails with helmets: Hellenistic helmets are lighter and offer less coverage than Archaic and Classical helmets do as well. Now that’s by no means a light formation; the tube-and-yoke cuirass still offers good protection (though scholars currently differ on how to reconstruct it in terms of materials). But of course all of this makes sense: we don’t need to be as heavily armored, because we have our formation.

Bret Devereaux, “Collections: Phalanx’s Twilight, Legion’s Triumph, Part Ia: Heirs of Alexander”, A Collection of Unmitigated Pedantry, 2024-01-19.


    1. So to be clear, that means the useful is P. Connolly, “Experiments with the sarissa” JRMES 11 (2000) and N. Sekunda, “The Sarissa” Acta Universitatis Lodziensis 23 (2001). The parade of outdated scholarship is Andronikos, “Sarissa” BCH 94 (1970); M. Markle, “The Macedonian sarissa” AJA 81 (1977) and “Macedonian arms and tactics” in Macedonia and Greece in Late Classical and Early Hellenistic Times, (1982), P.A. Manti, “The sarissa of the Macedonian infantry” Ancient World 23.2 (1992) and “The Macedonian sarissa again” Ancient World 25.2 (1994), J.R. Mixter, “The length of the Macedonian sarissa” Ancient World 23.2 (1992). These weren’t, to be clear, bad articles, but they are stages of development in our understanding, which are now past.

    2. Theophrastus HP 3.12.2. Polyb. 18.29.2. Asclepiodotus Tact. 5.1; Polyaenus Strat. 2.29.2. Also Leo Tact. 6.39 and Aelian Tact. 14.2 use Polybius’ figure, probably quoting him.

    3. Also, what very great fool wants his primary weapon, which is – again – a 5.8m long pike that masses around 4kg to be held together in combat entirely by the tension and friction of a c. 20cm metal sleeve?

    4. Christopher Matthew, op. cit., argues that Polybius must be wrong because if the weapon is gripped four cubits from the rear, it will foul the rank behind. I find this objection unconvincing because, as noted above and below, Peter Connolly did field drills with a pike troupe using the weapon and it worked. Also, we should be slow to doubt Polybius who probably saw the weapon and its fighting system first hand.

    5. What follows is drawn from K. Liampi, Makedonische Schild (1998), which is the best sustained study of Hellenistic period shields.

    6. Sekunda reconstructs them this way, without body armor, in Macedonian Armies after Alexander, (2013). I think that’s plausible, but not certain.

    7. Greek text is in Hatzopoulos, op. cit.

    8. Polyb. 30.25.2. Also of scale, Hdt. 9.22, Paus. 1.21.6.

    9. The derivation assumed to be from κοσύμβη or κόσσυμβος, which are a sort of shepherd’s heavy cloak.

    10. Favored by Hatzopoulos, Everson and Connolly.

    11. Favored by Sekunda and older scholarship, as well as E. Borza, In the Shadow of Olympus (1990), 204-5, 298-9.

    12. Representational evidence, but also the report that when Alexander got fresh armor for his army, he burned 25,000 sets of old, worn out armor. Curtius 9.3.21; Diodorus 17.95.4. Alexander does not have 25,000 hegemones, this must be the armor of the general soldiery and if he’s burning it, it must be made of organic materials. I think the correct reading here is that Alexander’s soldiers mostly wore textile tube-and-yoke cuirasses.

« Newer PostsOlder Posts »

Powered by WordPress