Pinyin
Adapted from Wikipedia · Discoverer experience
Hanyu Pinyin, or simply pinyin, is the most common way to write Standard Chinese using the letters of the Roman alphabet. It helps people all over the world read and say Chinese words correctly. In China and Singapore, pinyin is used to teach children how to speak and write Chinese characters. It is also used on computers to help type Chinese words and to organize Chinese dictionaries.
Pinyin breaks down each Chinese sound into two parts: an initial consonant and a final vowel sound. Special marks above the letters show the different tones used in Chinese, which change the meaning of words. These marks are often left out when writing Chinese names in English.
Hanyu Pinyin was created in the 1950s by a group of Chinese language experts, led by Zhou Youguang, who is known as the “father of pinyin.” It became the official system in China in 1958 and was later adopted by the United Nations and made into an international standard. Today, pinyin is used by learners and teachers of Chinese around the world.
History
Matteo Ricci, a Jesuit missionary in China, wrote the first book using the Latin alphabet to write Chinese in 1605. Later, other missionaries also created books to help Westerners understand Chinese sounds.
In the 1950s, Chinese linguists created Hanyu Pinyin as a way to write Chinese sounds using the Latin alphabet. They combined ideas from older systems to make it easier for everyone to learn and use. Hanyu Pinyin was officially adopted in 1958 and became widely used in schools, helping people learn to read and speak Standard Chinese. Today, it is the most common way to write Chinese using the Latin alphabet.
Syllables
Chinese sounds are described using two parts: initials and finals. Initials are usually single consonant sounds, while finals can be more complex, sometimes including extra sounds called medials. For example, the sound in the word for "clothes" (衣) might change slightly depending on how it is used in a sentence.
One special sound in Chinese is the "ü" sound, which looks like a "u" with an umlaut (like in German). This is used to show a different pronunciation, such as in the word for "donkey" (驴). However, because it can be hard to type or show this symbol, sometimes "v" is used instead. This can cause confusion, especially with names, but rules have been made to help with this in important documents like passports.
| b p m f | d t n l | g k h | j q x | zh ch sh r | z c s |
| Pinyin | IPA | Description |
|---|---|---|
| b | [p] | Unaspirated p, like in English spark. |
| p | [pʰ] | Strongly aspirated p, like in English pay. |
| m | [m] | Like the m in English may. |
| f | [f] | Like the f in English fair. |
| d | [t] | Unaspirated t, like in English stop. |
| t | [tʰ] | Strongly aspirated t, like in English take. |
| n | [n] | Like the n in English nay. |
| l | [l]~[ɾ] | Varies between the l in English lay and tt in American English better. |
| g | [k] | Unaspirated k, like in English skill. |
| k | [kʰ] | Strongly aspirated k, like in English kiss. |
| h | [x]~[h] | Varies between the h in English hat, and the ch in Scottish English loch. |
| j | [tɕ] | Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the ch in English churchyard. |
| q | [tɕʰ] | Alveolo-palatal, aspirated. No direct equivalent in English, but similar to the ch in English punchy. |
| x | [ɕ] | Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the sh in English push. |
| zh | [ʈʂ]~[d͡ʒ] | Retroflex, unaspirated. Like j in English jack. |
| ch | [ʈʂʰ]~[ʃ] | Retroflex, aspirated. Varies between the ch in English church and sh in English bushy. |
| sh | [ʂ]~[ɹ̠̊˔] | Retroflex, unaspirated. Like sh in shirt. |
| r | [ɻ~ʐ]~[ɹ] | Retroflex. No direct equivalent in English, but varies between the r in English reduce and the s in English measure. |
| z | [ts] | Unaspirated. Like the zz in English pizza. |
| c | [tsʰ] | Aspirated. Like the ts in English bats. |
| s | [s] | Like the s in English say. |
| w | [w] | Like the w in English water. |
| y | [j] | Like the y in English yes. |
| yu | [ɥ] | Like the hu in French huit, see below. |
Rime Medial | ∅ | -e / -o / -ê | -a | -ei | -ai | -ou | -ao | -en | -an | -eng | -ang | er | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ∅ | [ɨ] -i | [ɤ] e -e | [ɛ] ê -ê | [a] a -a | [ei̯] ei -ei | [ai̯] ai -ai | [ou̯] ou -ou | [au̯] ao -ao | [ən] en -en | [an] an -an | [əŋ] eng -eng | [aŋ] ang -ang | [ɚ] er |
y- -i- | [i] yi -i | [je] ye -ie | [ja] ya -ia | [jou̯] ([iu̯]) you -iu | [jau̯] yao -iao | [in] yin -in | [jɛn] yan -ian | [iŋ] ying -ing | [jaŋ] yang -iang | ||||
w- -u- | [u] wu -u | [wo] wo -uo | [wa] wa -ua | [wei̯] ([ui̯]) wei -ui | [wai̯] wai -uai | [wən] ([un]) wen -un | [wan] wan -uan | [wəŋ~ʊŋ] weng -ong | [waŋ] wang -uang | ||||
yu- -ü- | [y] yu -ü | [ɥe] yue -üe | [yn] yun -ün | [ɥɛn] yuan -üan | [jʊŋ] yong -iong | ||||||||
| Pinyin | IPA | Form with zero initial | Explanation |
|---|---|---|---|
| -i | [ɹ̩~z̩], [ɻ̩~ʐ̩] | (N/A) | -i is a buzzed continuation of the consonant following z-, c-, s-, zh-, ch-, sh- or r-. In all other cases, -i has the sound of bee. |
| a | [a] | a | like English father, but a bit more fronted |
| e | [ɤ] ⓘ, [ə] | e | a back, unrounded vowel (similar to English duh, but not as open). Pronounced as a sequence [ɰɤ]. |
| ai | [ai̯] | ai | like English eye, but a bit lighter |
| ei | [ei̯] | ei | as in hey |
| ao | [au̯] | ao | approximately as in cow; the a is much more audible than the o |
| ou | [ou̯] | ou | as in North American English so |
| an | [an] | an | like British English ban, but more central |
| en | [ən] | en | as in taken |
| ang | [aŋ] | ang | as in German Angst. (Starts with the vowel sound in father and ends in the velar nasal; like song in some dialects of American English) |
| eng | [əŋ] | eng | like e in en above but with ng appended |
| ong | [ʊŋ]~[o̞ʊŋ] | (weng) | starts with the vowel sound in book and ends with the velar nasal sound in sing. Varies between [oŋ] and [uŋ] depending on the speaker. |
| er | [aɚ̯]~[əɹ] | er | Similar to the sound in bar in English. Can also be pronounced [ɚ] depending on the speaker. |
| Finals beginning with i- (y-) | |||
| i | [i] | yi | like English bee |
| ia | [ja] | ya | as i + a; like English yard |
| ie | [je] | ye | as i + ê where the e (compare with the ê interjection) is pronounced shorter and lighter |
| iao | [jau̯] | yao | as i + ao |
| iu | [jou̯] | you | as i + ou |
| ian | [jɛn] | yan | as i + an; like English yen. Varies between [jen] and [jan] depending on the speaker. |
| in | [in] | yin | as i + n |
| iang | [jaŋ] | yang | as i + ang |
| ing | [iŋ] | ying | as i + ng |
| iong | [jʊŋ] | yong | as i + ong. Varies between [joŋ] and [juŋ] depending on the speaker. |
| Finals beginning with u- (w-) | |||
| u | [u] | wu | like English oo |
| ua | [wa] | wa | as u + a |
| uo/o | [wo] | wo | as u + o where the o (compare with the o interjection) is pronounced shorter and lighter (spelled as o after b, p, m or f) |
| uai | [wai̯] | wai | as u + ai, as in English why |
| ui | [wei̯] | wei | as u + ei, as in English way |
| uan | [wan] | wan | as u + an |
| un | [wən] | wen | as u + en; as in English won |
| uang | [waŋ] | wang | as u + ang |
| (ong) | [wəŋ] | weng | as u + eng |
| Finals beginning with ü- (yu-) | |||
| ü | [y] ⓘ | yu | as in German über or French lune (pronounced as English ee with rounded lips; spelled as u after j, q or x) |
| üe | [ɥe] | yue | as ü + ê where the e (compare with the ê interjection) is pronounced shorter and lighter (spelled as ue after j, q or x) |
| üan | [ɥɛn] | yuan | as ü + an. Varies between [ɥen] and [ɥan] depending on the speaker (spelled as uan after j, q or x) |
| ün | [yn] | yun | as ü + n (spelled as un after j, q or x) |
| Interjections | |||
| ê | [ɛ] | ê | as in bet |
| o | [ɔ] | o | approximately as in British English office; the lips are much more rounded |
| io | [jɔ] | yo | as i + o |
Tones
The pinyin system uses special marks called diacritics to show the tones of Mandarin Chinese. These marks help indicate how a word is pronounced. There are four main tones, each shown with a different symbol: a flat line for the first tone, an acute accent for the second, a curved hat for the third, and a grave accent for the fourth. The fifth tone, called the neutral tone, has no special mark.
Before computers, people sometimes used numbers instead of these special marks to show tones. For example, the word "tóng" could be written as "tong2". Each number corresponds to one of the five tones.
| Tone | Examples | IPA |
|---|---|---|
| 1 | mā (macron) ma1 | ma˥ |
| 2 | má (acute accent) ma2 | ma˧˥ |
| 3 | mǎ (caron) ma3 | ma˨˩˦ |
| 4 | mà (grave accent) ma4 | ma˥˩ |
| Neutral | ma |
| -a | -e | -o | -i | -u | |
|---|---|---|---|---|---|
| a- | ào | ài | |||
| e- | èi | ||||
| o- | òu | ||||
| i- | ià, iào | iè | iò(ng) | iù | |
| u- | uà, uài | uò | uì | ||
| ü- | üà(n), uà(n) | üè, uè | |||
Spacing, capitalization, and punctuation
Standard Chinese has many words made of several syllables. In pinyin, spacing is meant to follow word boundaries, much like in other languages that use the Latin alphabet. Official rules for these spellings were set in 1988.
In real usage today, pinyin is often written syllable by syllable, a change that began after the government body overseeing language reforms was reorganized in 1985. Some people preferred this method because it kept pinyin from becoming an alternative way to write Chinese characters.
Comparison with other orthographies
Pinyin replaced older systems like Wade–Giles and postal romanization, becoming the official way to write Chinese sounds using the Latin alphabet. In 1982, it became the international standard, and many countries and organizations, including the United Nations, adopted it.
Pinyin helps English speakers understand Chinese sounds better than older systems, though some letters are used differently than in English. Chinese characters are still needed because they give meaning to words that sound the same in modern Chinese. Pinyin is mainly used for Standard Chinese based on Beijing Mandarin and is not used for other Chinese languages like Cantonese or Hokkien.
| IPA | i | je | jou | jɛn | in | iŋ | jʊŋ | u | wo | wei | wən | wəŋ | y | ɥe | ɥɛn | yn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | yi | ye | you | yan | yin | ying | yong | wu | wo/o | wei | wen | weng | yu | yue | yuan | yun |
| Tongyong Pinyin | wun | wong | ||||||||||||||
| Wade–Giles | i/yi | yeh | yu | yen | yung | wên | wêng | yü | yüeh | yüan | yün | |||||
| Bopomofo | ㄧ | ㄧㄝ | ㄧㄡ | ㄧㄢ | ㄧㄣ | ㄧㄥ | ㄩㄥ | ㄨ | ㄨㄛ/ㄛ | ㄨㄟ | ㄨㄣ | ㄨㄥ | ㄩ | ㄩㄝ | ㄩㄢ | ㄩㄣ |
| example | 一 | 也 | 又 | 言 | 音 | 英 | 用 | 五 | 我 | 位 | 文 | 翁 | 玉 | 月 | 元 | 雲 |
| IPA | p | pʰ | m | fəŋ | tjou | twei | twən | tʰɤ | ny | ly | kɤ | kʰɤ | xɤ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | b | p | m | feng | diu | dui | dun | te | nü | lü | ge | ke | he |
| Tongyong Pinyin | fong | diou | duei | nyu | lyu | ||||||||
| Wade–Giles | p | pʻ | fêng | tiu | tui | tun | tʻê | nü | lü | ko | kʻo | ho | |
| Bopomofo | ㄅ | ㄆ | ㄇ | ㄈㄥ | ㄉㄧㄡ | ㄉㄨㄟ | ㄉㄨㄣ | ㄊㄜ | ㄋㄩ | ㄌㄩ | ㄍㄜ | ㄎㄜ | ㄏㄜ |
| example | 玻 | 婆 | 末 | 封 | 丟 | 兌 | 頓 | 特 | 女 | 旅 | 歌 | 可 | 何 |
| IPA | tɕjɛn | tɕjʊŋ | tɕʰin | ɕɥɛn | ʈʂɤ | ʈʂɨ | ʈʂʰɤ | ʈʂʰɨ | ʂɤ | ʂɨ | ɻɤ | ɻɨ | tsɤ | tswo | tsɨ | tsʰɤ | tsʰwo | tsʰɨ | sɤ | swo | sɨ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | jian | jiong | qin | xuan | zhe | zhi | che | chi | she | shi | re | ri | ze | zuo | zi | ce | cuo | ci | se | suo | si |
| Tongyong Pinyin | jyong | cin | syuan | jhe | jhih | chih | shih | rih | zih | cih | sih | ||||||||||
| Wade–Giles | chien | chiung | chʻin | hsüan | chê | chih | chʻê | chʻih | shê | shih | jê | jih | tsê | tso | tzŭ | tsʻê | tsʻo | tzʻŭ | sê | so | ssŭ |
| Bopomofo | ㄐㄧㄢ | ㄐㄩㄥ | ㄑㄧㄣ | ㄒㄩㄢ | ㄓㄜ | ㄓ | ㄔㄜ | ㄔ | ㄕㄜ | ㄕ | ㄖㄜ | ㄖ | ㄗㄜ | ㄗㄨㄛ | ㄗ | ㄘㄜ | ㄘㄨㄛ | ㄘ | ㄙㄜ | ㄙㄨㄛ | ㄙ |
| example | 件 | 囧 | 秦 | 宣 | 哲 | 之 | 扯 | 赤 | 社 | 是 | 惹 | 日 | 仄 | 左 | 字 | 策 | 撮 | 次 | 色 | 索 | 斯 |
| IPA | ma˥ | ma˧˥ | ma˨˩˦ | ma˥˩ | ma |
|---|---|---|---|---|---|
| Pinyin | mā | má | mǎ | mà | ma |
| Tongyong Pinyin | ma | må | |||
| Wade–Giles | ma1 | ma2 | ma3 | ma4 | ma |
| Bopomofo | ㄇㄚ | ㄇㄚˊ | ㄇㄚˇ | ㄇㄚˋ | ˙ㄇㄚ |
| example (Chinese characters) | 媽 | 麻 | 馬 | 罵 | 嗎 |
Typography and encoding
Pinyin uses special symbols to show the tones of Chinese sounds. These symbols come from a set called Combining Diacritical Marks, instead of another set called Spacing Modifier Letters. Important Chinese character sets, like GB 2312 and JIS X 0212, include these special letters, so they can be shown correctly on computers using Unicode.
Some special characters, like ⟨ḿ⟩ and ⟨ǹ⟩, can cause confusion because different systems, such as GBK and GB 18030, place them in different parts of the encoding. This means that some fonts and input methods might show these characters differently.
Generally, it is suggested that pinyin use a certain style of letters, like a single-story 'a' and 'g', and a thicker second-tone mark, but there is no strict rule about this.
| Letter | First tone | Second tone | Third tone | Fourth tone | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Combining Diacritical Marks | ◌̄ (U+0304) | ◌́ (U+0301) | ◌̌ (U+030C) | ◌̀ (U+0300) | ||||||||||||
| Common letters | ||||||||||||||||
| Uppercase | A | Ā (U+0100) | Á (U+00C1) | Ǎ (U+01CD) | À (U+00C0) | |||||||||||
| E | Ē (U+0112) | É (U+00C9) | Ě (U+011A) | È (U+00C8) | ||||||||||||
| I | Ī (U+012A) | Í (U+00CD) | Ǐ (U+01CF) | Ì (U+00CC) | ||||||||||||
| O | Ō (U+014C) | Ó (U+00D3) | Ǒ (U+01D1) | Ò (U+00D2) | ||||||||||||
| U | Ū (U+016A) | Ú (U+00DA) | Ǔ (U+01D3) | Ù (U+00D9) | ||||||||||||
| Ü (U+00DC) | Ǖ (U+01D5) | Ǘ (U+01D7) | Ǚ (U+01D9) | Ǜ (U+01DB) | ||||||||||||
| Lowercase | a | ā (U+0101) | á (U+00E1) | ǎ (U+01CE) | à (U+00E0) | |||||||||||
| e | ē (U+0113) | é (U+00E9) | ě (U+011B) | è (U+00E8) | ||||||||||||
| i | ī (U+012B) | í (U+00ED) | ǐ (U+01D0) | ì (U+00EC) | ||||||||||||
| o | ō (U+014D) | ó (U+00F3) | ǒ (U+01D2) | ò (U+00F2) | ||||||||||||
| u | ū (U+016B) | ú (U+00FA) | ǔ (U+01D4) | ù (U+00F9) | ||||||||||||
| ü (U+00FC) | ǖ (U+01D6) | ǘ (U+01D8) | ǚ (U+01DA) | ǜ (U+01DC) | ||||||||||||
| Rare letters | ||||||||||||||||
| Uppercase | Ê (U+00CA) | Ê̄ (U+00CA U+0304) | Ế (U+1EBE) | Ê̌ (U+00CA U+030C) | Ề (U+1EC0) | |||||||||||
| M | M̄ (U+004D U+0304) | Ḿ (U+1E3E) | M̌ (U+004D U+030C) | M̀ (U+004D U+0300) | ||||||||||||
| N | N̄ (U+004E U+0304) | Ń (U+0143) | Ň (U+0147) | Ǹ (U+01F8) | ||||||||||||
| Lowercase | ê (U+00EA) | ê̄ (U+00EA U+0304) | ế (U+1EBF) | ê̌ (U+00EA U+030C) | ề (U+1EC1) | |||||||||||
| m | m̄ (U+006D U+0304) | ḿ (U+1E3F) | m̌ (U+006D U+030C) | m̀ (U+006D U+0300) | ||||||||||||
| n | n̄ (U+006E U+0304) | ń (U+0144) | ň (U+0148) | ǹ (U+01F9) | ||||||||||||
| Notes a.^ Yellow cells indicate that there are no single Unicode character for that letter; the character shown here uses Combining Diacritical Mark characters to display the letter. b.^ Grey cells indicate that Xiandai Hanyu Cidian does not include pinyin with that specific letter. | ||||||||||||||||
| Uppercase | Lowercase | Note | Example |
|---|---|---|---|
| Ĉ (U+0108) | ĉ (U+0109) | Abbreviation of ch | 长; 長 can be spelled as ĉáŋ |
| Ŝ (U+015C) | ŝ (U+015D) | Abbreviation of sh | 伤; 傷 can be spelled as ŝāŋ |
| Ẑ (U+1E90) | ẑ (U+1E91) | Abbreviation of zh | 张; 張 can be spelled as Ẑāŋ |
| Ŋ (U+014A) | ŋ (U+014B) | Abbreviation of ng | 让; 讓 can be spelled as ràŋ 嗯 can be spelled as ŋ̀ |
| Chinese | Pinyin | Usage | Example |
|---|---|---|---|
| U+3002 。 IDEOGRAPHIC FULL STOP | U+002E . FULL STOP | End of sentence | 你好。 Nǐ hǎo. |
U+FF0C , FULLWIDTH COMMA U+3001 、 IDEOGRAPHIC COMMA | U+002C , COMMA | Connecting clauses | 你,好吗? Nǐ, hǎo ma? |
| U+2014 — EM DASH (×2) | U+2014 — EM DASH | Division of clauses mid-sentence | 枢纽部分——中央大厅 shūniǔ bùfèn — zhōngyāng dàtīng |
| U+2026 … HORIZONTAL ELLIPSIS (×2) | U+2026 … HORIZONTAL ELLIPSIS | Redaction of part of a passage | 我…… Wǒ… |
| —N/a | U+00B7 · MIDDLE DOT | Neutral tone marker placed before the syllable | 吗 ·ma |
| U+002D - HYPHEN-MINUS | Hyphenation of abbreviated compounds | 公关 gōng-guān | |
| U+0027 ' APOSTROPHE | Syllable segmentation | 西安 - Xī'ān (compared to 先 - xiān) |
Usage
The spelling of Chinese geographical or personal names in pinyin has become the most common way to write them in English. Pinyin is the main way to type Chinese words on computers in mainland China, unlike in Taiwan where another system called bopomofo is more common.
Families who speak Mandarin use pinyin to help children connect written characters with the words they already know. Children usually start learning it in kindergarten, but it is not used in textbooks after primary school. Families who speak a different language also use pinyin to teach children how to say Mandarin words correctly when they are learning new vocabulary in elementary school.
Since 1958, pinyin has also been used in adult education. It helps people who cannot read Chinese characters to learn to read and write through pinyin.
Pinyin is also helpful for people who are learning Mandarin as a new language. It is used to explain pronunciation and grammar, along with Chinese characters. Books for learners often include both the characters and pinyin. The way pinyin helps learners is somewhat like how extra letters are used in Japanese books to show pronunciation.
Computer input
Simple computer systems that could only use basic letters and numbers made it easier to use pinyin without special marks. Today, most computers can show Chinese characters and let people type them using a regular keyboard with special programs. Some tablets also let people write characters by hand, and the computer tries to recognize what was written.
Pinyin with special marks can be typed using special keyboard setups or other tools.
Sorting techniques
Chinese text can be sorted using pinyin, which helps find words when you know how they sound but not how they are written. Characters can be sorted by their pinyin spelling. When syllables are the same, they are sorted by tone, with neutral tones coming last.
Words with more than one character can be sorted in two ways: by each character, as in the Xiandai Hanyu Cidian, or by the whole word’s string, then by tone. This method is used in the ABC Chinese–English Dictionary.
By region
Taiwan
See also: Chinese language romanization in Taiwan and Tongyong Pinyin
Between October 2002 and January 2009, Taiwan used Tongyong Pinyin, a version of Hanyu Pinyin made for local use. After that, Taiwan started to use Hanyu Pinyin instead. Tongyong Pinyin was made to write not just Standard Chinese, but also other varieties spoken on the island. The ruling Kuomintang party preferred the system used in mainland China and internationally. This led to disagreements about which system to use, with each side connected to different political groups.
Today, many street signs in Taiwan use Tongyong Pinyin or other older systems. Efforts to make Hanyu Pinyin the standard in Taiwan have had mixed results, with most place names still not changed. People in Taiwan can choose how their personal names are written on passports, using different systems. Using Hanyu Pinyin on signs for a metro line in 2017 caused some protests, even though officials said it was to help foreigners.
Singapore
See also: Chinese language romanization in Singapore
Singapore adopted Hanyu Pinyin as the official system for Mandarin in public use starting in the 1980s, as part of the Speak Mandarin Campaign. Schools also use Hanyu Pinyin to teach Mandarin. While it has been widely accepted in government use and for place names and newer businesses, some people still prefer older ways of writing personal names and words from other Chinese languages commonly spoken in Singapore, like Hokkien, Teochew, and Cantonese.
Special names
See also: SASM/GNC romanization, Tibetan pinyin, and Guangdong Romanization
Place names from non-Han languages such as Mongolian, Uyghur, and Tibetan are also written using pinyin. This is done according to rules set in 1976. The pinyin letters are used to try to match the sounds of these languages as closely as possible, which can make the spellings different from both the original name and how it is written in Chinese.
Images
This article is a child-friendly adaptation of the Wikipedia article on Pinyin, available under CC BY-SA 4.0.
Images from Wikimedia Commons. Tap any image to view credits and license.
Safekipedia