Lesson 2 — The Family Tree of Language

The Etymology Course for ZRR0 · Lesson 2 of the series

Where we left off

In Lesson 1 you met English as a mutt — a cross-bred dog of a language, stitched together from Germanic bones, a thick coat of French, and a Latin-and-Greek pedigree it puts on for special occasions. That's your brand thesis in linguistic form: the strongest things are hybrids.

But here's the question that should be itching at you: if English is a mix of all those languages — where did THOSE languages come from?

Pull that thread. It doesn't stop at Latin. It doesn't stop at Greek. It goes all the way back to one tongue, spoken by people who left no writing, on a grassland north of the Black Sea, around 6,000 years ago.

This lesson is about that tongue, and the giant family tree that grew out of it. By the end, you'll see something most people never realize: most of the Western world — and a big chunk of the East — is speaking dialects of one ancient language. English, Spanish, Hindi, Russian, Greek, Persian. Cousins. All of them.

Let's prove it.

Part 1 — The detective story (how we even know this)

Start with a real "aha," because this is one of the great intellectual discoveries in history and almost nobody learns it.

In 1786, a British judge in India named William Jones was studying Sanskrit — the ancient sacred language of India, totally unrelated to Europe as far as anyone assumed. Different continent. Different gods. Different alphabet.

But Jones kept noticing things he couldn't explain. The Sanskrit word for "father" was pitár. Latin's was pater. Greek's was patḗr. The word for "three" in Sanskrit was tráyas. Latin: trēs. Greek: treîs.

These weren't borrowed words. They were core words — father, three, mother, is, two — the words a language never trades away. And they matched across languages separated by thousands of miles that supposedly had nothing to do with each other.

Jones made the leap. He wrote that Sanskrit, Greek, and Latin share

"a stronger affinity... than could possibly have been produced by accident; so strong, indeed, that no philologer could examine them all three, without believing them to have sprung from some common source, which, perhaps, no longer exists."

Read that last line again. A common source which perhaps no longer exists. He's describing a parent language that died so long ago it left no record — and deducing it must have existed purely from the family resemblance of its children.

That ghost language got a name: Proto-Indo-European — "Proto" (first), "Indo" (it reached India), "European" (it reached Europe). We call it PIE for short.

The detective principle: You can prove a parent existed by lining up the children. If five siblings all have the same rare nose, there was a parent with that nose — even if you never met them and no photo survives. Linguists do this with words instead of noses. The method is called the comparative method, and it's as rigorous as anything in science.

Part 2 — The cognate magic

Here's the single most important idea in this whole course. The word is cognate (Latin co- "together" + gnatus "born" — literally born together, siblings).

Cognates are words in different languages that descend from the same ancestor word. Not borrowed — inherited. Same grandparent, scattered to different countries, each one aging differently.

Let's watch one word fan out across the family. Take PIE *ph₂tḗr (don't worry about the symbols — it's roughly "puh-TAIR," meaning father):

Branch	The word for "father"
English (Germanic)	father
German (Germanic)	Vater (the V is said like an F)
Latin (Italic)	pater
Spanish (from Latin)	padre
French (from Latin)	père
Greek (Hellenic)	patḗr
Sanskrit (Indo-Iranian)	pitár
Persian (Indo-Iranian)	pedar

Stop and feel that. A Texan saying "father," a Spaniard saying "padre," an ancient Roman saying "pater," a German saying "Vater," and a Sanskrit priest 3,000 years ago saying "pitár" — they are all saying the same word. It left one mouth in one tribe, and 6,000 years of drift turned it into eight costumes. But it's one word underneath.

That's the magic. Once you can see it, you can't unsee it. Languages stop looking like separate things and start looking like the same river photographed at different bends.

Why the sounds drift but don't break

You might ask: if it's the same word, why isn't it spelled the same? Why father but pater?

Because sound change isn't random — it's regular. When the Germanic branch split off, it ran every single "p" sound through a transformation into "f." Every one. That's why:

Latin pēs → English foot (and "pedal," "pedestrian," borrowed back later from Latin, keep the original p!)
Latin piscis → English fish
Latin pater → English father

This isn't a coincidence; it's a law — Grimm's Law, discovered by Jacob Grimm (yes, the fairy-tale brother). The drift is so consistent that you can predict it. P becomes F, T becomes TH, K becomes H, across the entire Germanic branch.

The aha: "father" and "pater" look different but ARE the same word — and the difference between them isn't sloppiness, it's a rule you can run forwards and backwards like a decoder ring.

Try it yourself: Latin for "foot" is ped-. By Grimm's Law, p→f and the d hardens toward t. What English body-part word does that predict? (Answer: foot. You just reconstructed it.) Now try Latin cornu (k-sound, "horn"): k→h gives you... horn. You're now doing comparative linguistics.

Part 3 — The tree

So PIE had children, and those children had children. Here's the family laid out. Read it top to bottom like a genealogy.

                          PROTO-INDO-EUROPEAN (PIE)
                     ~4500–2500 BC · Pontic-Caspian steppe
                                     │
        ┌──────────────┬────────────┼────────────┬──────────────┐
        │              │            │            │              │
   GERMANIC        ITALIC      HELLENIC      CELTIC        INDO-IRANIAN
        │              │            │            │              │
   ┌────┼────┐         │         (Greek)   ┌─────┴────┐    ┌─────┴─────┐
   │    │    │         │                   │          │    │           │
English German Dutch  LATIN              Irish     Welsh  Sanskrit   Old Persian
Swedish Norse  …       │                 Scots-                │          │
                       │                 Gaelic            Hindi/Urdu   Persian
              ┌────────┼────────┐                          Bengali     (Farsi)
              │   │    │    │    │                          Punjabi      Kurdish
           Spanish French Italian Portuguese Romanian         …
              (the ROMANCE languages)

        … plus more branches: SLAVIC (Russian, Polish, Czech, Ukrainian),
          BALTIC (Lithuanian, Latvian), ARMENIAN, ALBANIAN, and the
          extinct ones (Tocharian, Hittite — the oldest written cousin).

A few things to notice, because each one is its own little revelation:

Latin isn't the root — it's a branch. People talk like Latin is the granddaddy of everything. It's not. Latin is one daughter of PIE (through the Italic branch). It just happened to have famous kids — the Romance languages. "Romance" here has nothing to do with love; it means "descended from Rome" (Latin Romanicus). Spanish, French, Italian, Portuguese, Romanian are all Latin, grown old in five different countries.

English sits in Germanic. Despite all that French vocabulary you learned about in Lesson 1, English's skeleton — its grammar, its commonest words (is, was, have, the, water, house, eat) — is Germanic. The French is a coat. The bones are German. That's why a mutt: Germanic frame, Romance fur.

Greek is an only child. Hellenic is a branch with basically one surviving member — Greek — which is why ancient Greek connects so directly to modern Greek, and why Greek roots feel so "pure" in our science vocabulary.

Sanskrit is your distant Eastern cousin. That Indian sacred language William Jones puzzled over? Same family. When a yoga teacher says namaste or you hear the word guru or karma, those come down the Indo-Iranian branch — the eastern wing of the exact same family that produced English.

Part 4 — Who were these people, really?

This isn't just word-puzzles. There were actual humans who spoke PIE, and in the last decade ancient-DNA science has put a face on them.

The leading answer — the Steppe (Kurgan) hypothesis — places the homeland on the Pontic-Caspian steppe, the vast grasslands north of the Black and Caspian Seas (today's Ukraine and southern Russia), roughly 4500–3500 BC. The strongest candidate culture is called the Yamnaya.

These were not city-builders. They were herders on horseback with wagons — and that's the whole point. The wagon and the domesticated horse gave them range. They spread, generation by generation, in every direction: west into Europe, east toward India and Iran. They didn't have an empire or an army the way Rome did. They had mobility, and they had time.

And everywhere their descendants went, the language went, then split, then split again — like one family scattering across a continent until the great-grandkids can't understand each other on the phone, but still all have grandma's recipe for the same word.

The number to keep: Languages descended from PIE are spoken by roughly 40% of all humans alive today as a first language. Almost half the planet is speaking the great-great-grandchildren of one Bronze Age herder tongue.

Part 5 — The bridge to your trip (Latin → Spanish)

You're heading to South America, so let's make this load-bearing for you specifically.

Spanish is not a foreign mystery. Spanish is Latin that moved to Iberia and grew up. Which means: every Latin root you learn is a down payment on Spanish. You're not learning two things; you're learning one thing twice.

Watch the family resemblance once you know the tree:

PIE / Latin source	English (Germanic side)	Spanish (Latin side)	Notice
Latin pater	father	padre	t softened to d (Latin t between vowels → d in Spanish)
Latin mater	mother	madre	same softening
Latin trēs (PIE *tréyes)	three	tres	nearly untouched
Latin nox / noct-	night	noche	Latin "ct" → Spanish "ch" — a regular rule!
Latin octō	eight	ocho	same ct→ch rule (octopus → ocho)
Latin factum	(fact)	hecho	Latin f- → Spanish h- (silent), ct→ch

See that last column? Those aren't things to memorize one by one. They're rules, the same way Grimm's Law was a rule. Latin ct almost always became Spanish ch. Latin starting f often went silent as h (that's why facere "to do/make" became hacer). Once you learn the handful of sound-laws, you can predict Spanish words from Latin ones — and predict English's borrowed Latin words at the same time.

Try it yourself: English "nocturnal" and "octave" both kept the Latin ct. Apply the ct→ch rule and you get the everyday Spanish words for "night" and "eight." Which are they? (noche, ocho.) You just generated vocabulary you'll use ordering dinner in Buenos Aires — from a sound-law.

This is the superpower of the family tree: it turns three languages (English, Latin, Spanish) into one connected map. Learn a root once, and it pays you in all three.

Part 6 — Tie-back: the mutt, and the name

Two quick connections to lock it in.

The brand thesis. Lesson 1 said English is a mutt. Now you can place that mutt on the tree precisely: English is a Germanic dog wearing a Romance coat — both of which are children of PIE. So even English's "mixed" nature is a mix of siblings. The Germanic and the Latin in English aren't strangers thrown together; they're estranged cousins reunited inside one language. The mutt is a family reunion. That's a sharper version of your ZRR0 thesis: hybridity isn't dilution — it's branches of one root recombining and getting stronger for it.

The name (light touch — we'll go deep later). "Robinson" is pure Germanic-English machinery: Robin + son, "son of Robin," the same -son you see in Johnson, Jackson, Anderson — a Germanic patronymic suffix. And that -son? It's a cousin of Latin's idea of patrīnus / Spanish padre-lineage and even Sanskrit pitár — all of them descend from the PIE family's deep obsession with naming a person by their father. The very structure "son-of" that your surname is built on is a PIE inheritance. You'll see this fully in the name-focused lesson; for now just clock it: your last name is a fossil of the same father-word we tracked across eight languages above.

The one thing to remember

If you forget everything else, keep this:

Most of the West, and much of the East, speaks dialects of one dead language. English, Spanish, Greek, Russian, Hindi, Persian — not neighbors who borrowed from each other, but siblings who inherited from a common parent that died 6,000 years ago on a grassland and left no writing. We reconstructed the parent from the family resemblance of its children. And once you can see the resemblance — father, padre, pater, Vater, pitár — every new language you touch is no longer a wall. It's a cousin you haven't been introduced to yet.

Next lesson, we go inside a single word and take it apart to the atoms — prefixes, roots, suffixes — and you'll start reading words you've never seen before like an x-ray.