Pinyin, Demystified
A friendly self-teach guide to reading and pronouncing Mandarin in Roman letters.
1What romanization actually is
A romanization system — not a different language, not a different alphabet.
Mandarin Chinese is written with characters (, ) — symbols that mostly carry meaning, not sound. Pinyin (, , literally "spell-sound") is a system that uses Roman letters to write the sounds of those characters.
It's a tool, mostly used for three things:
- Teaching kids and foreigners how to pronounce characters.
- Typing — when you type "" on a phone, romanization is what turns it into .
- Spelling out names and places in English text (Beijing, Xi'an, Shanghai).
2The big idea: every syllable = Initial + Final + Tone
Mandarin syllables follow one tidy formula. Once you see it, the language stops feeling random.
Almost every Mandarin syllable is built from three pieces:
- Initial — the consonant sound at the start (like , , ).
- Final — the vowel(s) and any ending consonant (like , , ).
- Tone— the pitch contour you say it with (1 of 4, plus a "neutral" tone).
So = m (initial) + a (final) + tone 1(high flat). Change any one of the three, and you've changed the word.
(mother) · (hemp) · (horse) · (to scold)
There are 21 initials and about 35 finals. Combine them with 4 tones and you cover essentially all of Mandarin. Not all combinations exist (e.g. there's no fe), but the system is finite and learnable in a weekend.
3The four tones (and the sneaky fifth)
Tones aren't optional. They're part of the word, the way vowels are part of 'cat.'
English uses pitch for emotion("Really?" goes up) but not to change a word's meaning. Mandarin uses pitch to distinguish words. The tone marks sit on top of the main vowel.
Hold a high, steady pitch — like singing one note.
Pitch rises sharply, like asking a question.
Drops low and stays there. Textbooks show a "dip up" but in real speech it's mostly just low.
Drops sharply from high to low — emphatic.
The neutral tone
Some syllables are unstressed and have no tone mark — these are called neutral tone. They're short, light, and quick. Common in grammatical particles like (question marker) or the second syllable of words like (mom) and (friend).
Where does the tone mark go?
If you see multiple vowels in a syllable, the tone mark follows this priority:
a > o > e > i > u > ü
So in , the mark goes on the a. In , on the e. The exception: iu and ui — the mark goes on the second letter (, ).
4The 21 initials (consonants)
Most are easy. A handful are the difference between sounding fluent and sounding like a tourist.
Tap any romanization letter to hear it. Rows tinted in rose are the ones English speakers commonly trip over.
Group 1 — Lips & tongue tip (mostly easy)
| Pinyin | How to say it | Like in English |
|---|---|---|
| Like English "p" in spy — lips together, no puff of air. | spy, not boy | |
| Like English "p" in pie — with a strong puff of air. | pie | |
| Just like English "m." | mom | |
| Just like English "f." | fun | |
| Like English "t" in stop — no air puff. | stop, not dog | |
| Like English "t" in top — with air puff. | top | |
| Just like English "n." | no | |
| Just like English "l." | love | |
| Like English "k" in sky — no air puff. | sky, not go | |
| Like English "k" in kite — with air puff. | kite | |
| Like English "h" but a bit raspier — slight friction in the throat. | hat (a little harsher) |
Group 2 — The "j q x" trio (palatals)
| Pinyin | How to say it | Like in English |
|---|---|---|
| Tongue flat, tip behind lower teeth. Like a soft "j" — almost between English "j" and "y". | jeep (lighter, smiley) | |
| Not like English "q." It's like "ch" said with a wide smile, no air through lips. | cheap (smiley, lighter) | |
| Not like English "x." It's like "sh" said with a wide smile. | she (smiley, lighter) |
Group 3 — The "zh ch sh r" set (retroflex)
| Pinyin | How to say it | Like in English |
|---|---|---|
| Curl tongue tip up toward the roof of your mouth. Then say "j." Unaspirated. | jerk (with curled tongue) | |
| Same tongue position as zh, but with a strong puff of air. | church (with curled tongue) | |
| Same tongue position, but a continuous "sh" sound. | shrub (with curled tongue) | |
| Same tongue position; cross between English "r" in red and the "s" in pleasure. No lip rounding. | Roughly azure said with curled tongue |
Group 4 — The "z c s" trio (sharp, in front)
| Pinyin | How to say it | Like in English |
|---|---|---|
| Like the "ds" in kids — said as a single sound. | kids, beds | |
| Not like English "c." It's the "ts" in cats, with a puff of air. | cats, bits | |
| Just like English "s." | sun |
Group 5 — Semi-vowels
| Pinyin | How to say it | Like in English |
|---|---|---|
| Like English "y" in yes. Often used as a placeholder when a syllable starts with the i sound. | yes | |
| Like English "w" in way. Placeholder when a syllable starts with the u sound. | way |
5The finals (vowels & vowel + n/ng)
Once initials are sorted, finals are the other half of every syllable.
Simple finals
| Pinyin | How to say it | Roughly like |
|---|---|---|
| Open "ah." | father | |
| Like "wo" — a slight "w" glide into "o." | wore | |
| Like the "uh" in duh, but pulled back in the throat. | duh, her (no R) | |
| "ee" — usually. But see the warning below. | see | |
| "oo," with rounded lips. | moon | |
| Round your lips like saying "oo," then try to say "ee" without un-rounding. (Same as German ü or French tu.) | No English equivalent |
Compound finals (vowel combos)
| Pinyin | Sounds like |
|---|---|
| eye | |
| hey | |
| cow | |
| oh / so | |
| ya (as in "ya'll") | |
| yeh — note: NOT "ee-ee" | |
| yow | |
| yo — short form of "iou" | |
| wah | |
| waw | |
| way — short form of "uei" | |
| "ü" + "eh" (try saying "you-eh") |
Nasal finals (vowel + n or ng)
The difference between -n and -ng matters for meaning. -n closes the mouth (tongue at the alveolar ridge); -ngstays open (back of tongue, like English "sing").
| Pinyin | Sounds like |
|---|---|
| ahn | |
| ahng (open mouth) | |
| uhn | |
| uhng | |
| een (like "seen") | |
| eeng (like "seeing") | |
| oong (rounded — like "oongh") | |
| yen (NOT "yan"!) | |
| yang | |
| wahn | |
| wahng | |
| wun (short for "uen") |
6The traps: spellings that lie to English readers
If you remember nothing else, remember this list. These are the letters most people get wrong.
| You see | You'd guess | It actually is | Example |
|---|---|---|---|
| "ks" or "z" | "sh" (with a smile) | = "shee-ahn" | |
| "kw" | "ch" (with a smile) | = "chee" (seven) | |
| "k" or "s" | "ts" | = "tsai" (vegetable) | |
| "z" | "dz" | = "dzow" (early) | |
| "zh" like measure | "j" with curled tongue | = "jong" (middle) | |
| English "r" | Curled-tongue voiced buzz | = "rrr" (sun, day) | |
| "eh" | "uh" (when alone) | = "huh" (drink) | |
| "ee" | "uh"-buzz after z/c/s/zh/ch/sh/r | = "shrr" | |
| "ee-an" | "yen" | = "tyen" (sky) | |
| b/p, d/t, g/k | voiced/voiceless | unaspirated/aspirated | sounds closer to "pa" |
7Tone change rules (tone sandhi)
The tones marked on paper aren't always what comes out of your mouth. Three small rules.
Rule 1: Two third tones in a row
When you have two tone-3 syllables back to back, the first one becomes a tone 2 (rising). The spelling doesn't change — only the pronunciation.
+ → spoken as → (hello!)
Rule 2: The word (— "not")
Normally tone 4 (). But before another tone-4 word, it switches to tone 2 (bú).
+ → (wrong)
Rule 3: The word (— "one")
Default is tone 1. But:
- Before a tone-4 syllable, it becomes tone 2: (one piece).
- Before any other tone (1, 2, 3), it becomes tone 4: (one sheet).
8Spelling quirks worth knowing
A handful of conventions that look weird until you know why.
Apostrophes split syllables
(the city) is two syllables: + . Without the apostrophe, "Xian" could be misread as one syllable (). Apostrophes appear when a syllable starting with a/o/e follows another syllable, to avoid ambiguity.
"y" and "w" as placeholders
A syllable can't start with a bare i, u, or ü — romanization spells in a placeholder:
- i → y: i alone is written ; ie alone is .
- u → w: u alone is ; uang alone is .
- ü → yu: ü alone is ; üe alone is .
Contracted spellings
Some finals are spelled shorter than they sound:
- iou → iu (e.g. , but pronounce the "o").
- uei → ui (e.g. , pronounce the "e").
- uen → un (e.g. , pronounce the "e").
Capitalization
Just like English: capitalize at the start of sentences and for proper nouns. Surnames go first in Chinese names: Lǐ Wěi (Lǐ = surname, Wěi = given name).
9Practice with real words
Try saying each one out loud before clicking "Reveal." Mouth memory beats eye memory.
City name:
City name:
City name:
Common word:
Common phrase:
Famous food:
Tea variety:
Tricky one:
10Mini-quiz
Pick the best answer for each. Instant feedback.
1. The romanization sounds most like:
2. Which is true about the four tones?
3. The romanization sounds most like:
4. Pinyin sounds like:
5. The difference between romanization and is:
6. is actually pronounced:
7. Why does have an apostrophe?
8. (sky) sounds most like: