Pinyin
Adapted from Wikipedia · Adventurer experience
Hanyu Pinyin
Hanyu Pinyin, or simply pinyin, is the most common way to write Standard Chinese using the letters of the Roman alphabet. It helps people all over the world read and say Chinese words correctly. In China and Singapore, pinyin is used to teach children how to speak and write Chinese characters. It is also used on computers to help type Chinese words and to organize Chinese dictionaries.
Pinyin breaks down each Chinese sound into two parts: an initial consonant and a final vowel sound. Special marks above the letters show the different tones used in Chinese. These marks are often left out when writing Chinese names in English.
Hanyu Pinyin was created in the 1950s by a group of Chinese language experts, led by Zhou Youguang, who is known as the “father of pinyin.” It became the official system in China in 1958 and was later adopted by the United Nations. Today, pinyin is used by learners and teachers of Chinese around the world.
History
Matteo Ricci, a Jesuit missionary in China, wrote the first book using the Latin alphabet to write Chinese in 1605. Later, other missionaries also created books to help Westerners understand Chinese sounds.
In the 1950s, Chinese language experts created Hanyu Pinyin. They used the Latin alphabet to write Chinese sounds. They mixed ideas from older systems to make it easier for everyone. Hanyu Pinyin became official in 1958. It is now used in schools to help people learn to read and speak Standard Chinese. Today, it is the most common way to write Chinese with the Latin alphabet.
Syllables
Chinese sounds are split into two parts: initials and finals. Initials are single consonant sounds. Finals are more complex and can include extra sounds.
For example, the sound in the word for "clothes" (衣) can change a little depending on how it is used in a sentence.
One special sound in Chinese is the "ü" sound. It looks like a "u" with an umlaut, like in German. This shows a different way to say a word, like in "donkey" (驴). But because this symbol can be hard to type, sometimes "v" is used instead. This can be confusing, especially with names. But there are rules to help with this in important places like passports.
| b p m f | d t n l | g k h | j q x | zh ch sh r | z c s |
| Pinyin | IPA | Description |
|---|---|---|
| b | [p] | Unaspirated p, like in English spark. |
| p | [pʰ] | Strongly aspirated p, like in English pay. |
| m | [m] | Like the m in English may. |
| f | [f] | Like the f in English fair. |
| d | [t] | Unaspirated t, like in English stop. |
| t | [tʰ] | Strongly aspirated t, like in English take. |
| n | [n] | Like the n in English nay. |
| l | [l]~[ɾ] | Varies between the l in English lay and tt in American English better. |
| g | [k] | Unaspirated k, like in English skill. |
| k | [kʰ] | Strongly aspirated k, like in English kiss. |
| h | [x]~[h] | Varies between the h in English hat, and the ch in Scottish English loch. |
| j | [tɕ] | Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the ch in English churchyard. |
| q | [tɕʰ] | Alveolo-palatal, aspirated. No direct equivalent in English, but similar to the ch in English punchy. |
| x | [ɕ] | Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the sh in English push. |
| zh | [ʈʂ]~[d͡ʒ] | Retroflex, unaspirated. Like j in English jack. |
| ch | [ʈʂʰ]~[ʃ] | Retroflex, aspirated. Varies between the ch in English church and sh in English bushy. |
| sh | [ʂ]~[ɹ̠̊˔] | Retroflex, unaspirated. Like sh in shirt. |
| r | [ɻ~ʐ]~[ɹ] | Retroflex. No direct equivalent in English, but varies between the r in English reduce and the s in English measure. |
| z | [ts] | Unaspirated. Like the zz in English pizza. |
| c | [tsʰ] | Aspirated. Like the ts in English bats. |
| s | [s] | Like the s in English say. |
| w | [w] | Like the w in English water. |
| y | [j] | Like the y in English yes. |
| yu | [ɥ] | Like the hu in French huit, see below. |
Rime Medial | ∅ | -e / -o / -ê | -a | -ei | -ai | -ou | -ao | -en | -an | -eng | -ang | er | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ∅ | [ɨ] -i | [ɤ] e -e | [ɛ] ê -ê | [a] a -a | [ei̯] ei -ei | [ai̯] ai -ai | [ou̯] ou -ou | [au̯] ao -ao | [ən] en -en | [an] an -an | [əŋ] eng -eng | [aŋ] ang -ang | [ɚ] er |
y- -i- | [i] yi -i | [je] ye -ie | [ja] ya -ia | [jou̯] ([iu̯]) you -iu | [jau̯] yao -iao | [in] yin -in | [jɛn] yan -ian | [iŋ] ying -ing | [jaŋ] yang -iang | ||||
w- -u- | [u] wu -u | [wo] wo -uo | [wa] wa -ua | [wei̯] ([ui̯]) wei -ui | [wai̯] wai -uai | [wən] ([un]) wen -un | [wan] wan -uan | [wəŋ~ʊŋ] weng -ong | [waŋ] wang -uang | ||||
yu- -ü- | [y] yu -ü | [ɥe] yue -üe | [yn] yun -ün | [ɥɛn] yuan -üan | [jʊŋ] yong -iong | ||||||||
| Pinyin | IPA | Form with zero initial | Explanation |
|---|---|---|---|
| -i | [ɹ̩~z̩], [ɻ̩~ʐ̩] | (N/A) | -i is a buzzed continuation of the consonant following z-, c-, s-, zh-, ch-, sh- or r-. In all other cases, -i has the sound of bee. |
| a | [a] | a | like English father, but a bit more fronted |
| e | [ɤ] ⓘ, [ə] | e | a back, unrounded vowel (similar to English duh, but not as open). Pronounced as a sequence [ɰɤ]. |
| ai | [ai̯] | ai | like English eye, but a bit lighter |
| ei | [ei̯] | ei | as in hey |
| ao | [au̯] | ao | approximately as in cow; the a is much more audible than the o |
| ou | [ou̯] | ou | as in North American English so |
| an | [an] | an | like British English ban, but more central |
| en | [ən] | en | as in taken |
| ang | [aŋ] | ang | as in German Angst. (Starts with the vowel sound in father and ends in the velar nasal; like song in some dialects of American English) |
| eng | [əŋ] | eng | like e in en above but with ng appended |
| ong | [ʊŋ]~[o̞ʊŋ] | (weng) | starts with the vowel sound in book and ends with the velar nasal sound in sing. Varies between [oŋ] and [uŋ] depending on the speaker. |
| er | [aɚ̯]~[əɹ] | er | Similar to the sound in bar in English. Can also be pronounced [ɚ] depending on the speaker. |
| Finals beginning with i- (y-) | |||
| i | [i] | yi | like English bee |
| ia | [ja] | ya | as i + a; like English yard |
| ie | [je] | ye | as i + ê where the e (compare with the ê interjection) is pronounced shorter and lighter |
| iao | [jau̯] | yao | as i + ao |
| iu | [jou̯] | you | as i + ou |
| ian | [jɛn] | yan | as i + an; like English yen. Varies between [jen] and [jan] depending on the speaker. |
| in | [in] | yin | as i + n |
| iang | [jaŋ] | yang | as i + ang |
| ing | [iŋ] | ying | as i + ng |
| iong | [jʊŋ] | yong | as i + ong. Varies between [joŋ] and [juŋ] depending on the speaker. |
| Finals beginning with u- (w-) | |||
| u | [u] | wu | like English oo |
| ua | [wa] | wa | as u + a |
| uo/o | [wo] | wo | as u + o where the o (compare with the o interjection) is pronounced shorter and lighter (spelled as o after b, p, m or f) |
| uai | [wai̯] | wai | as u + ai, as in English why |
| ui | [wei̯] | wei | as u + ei, as in English way |
| uan | [wan] | wan | as u + an |
| un | [wən] | wen | as u + en; as in English won |
| uang | [waŋ] | wang | as u + ang |
| (ong) | [wəŋ] | weng | as u + eng |
| Finals beginning with ü- (yu-) | |||
| ü | [y] ⓘ | yu | as in German über or French lune (pronounced as English ee with rounded lips; spelled as u after j, q or x) |
| üe | [ɥe] | yue | as ü + ê where the e (compare with the ê interjection) is pronounced shorter and lighter (spelled as ue after j, q or x) |
| üan | [ɥɛn] | yuan | as ü + an. Varies between [ɥen] and [ɥan] depending on the speaker (spelled as uan after j, q or x) |
| ün | [yn] | yun | as ü + n (spelled as un after j, q or x) |
| Interjections | |||
| ê | [ɛ] | ê | as in bet |
| o | [ɔ] | o | approximately as in British English office; the lips are much more rounded |
| io | [jɔ] | yo | as i + o |
Tones
The pinyin system uses special marks to show the tones of Mandarin Chinese. These marks help us know how to say a word. There are four main tones, each with its own symbol: a flat line for the first tone, an acute accent for the second, a curved hat for the third, and a grave accent for the fourth. The fifth tone, called the neutral tone, has no special mark.
Before computers, people sometimes used numbers instead of these marks to show tones. For example, the word "tóng" could be written as "tong2". Each number matches one of the five tones.
| Tone | Examples | IPA |
|---|---|---|
| 1 | mā (macron) ma1 | ma˥ |
| 2 | má (acute accent) ma2 | ma˧˥ |
| 3 | mǎ (caron) ma3 | ma˨˩˦ |
| 4 | mà (grave accent) ma4 | ma˥˩ |
| Neutral | ma |
| -a | -e | -o | -i | -u | |
|---|---|---|---|---|---|
| a- | ào | ài | |||
| e- | èi | ||||
| o- | òu | ||||
| i- | ià, iào | iè | iò(ng) | iù | |
| u- | uà, uài | uò | uì | ||
| ü- | üà(n), uà(n) | üè, uè | |||
Spacing, capitalization, and punctuation
Standard Chinese has many words made of several syllables. In pinyin, spaces are used to show where words begin and end, similar to other languages that use the Latin alphabet. Official rules for these spellings were set in 1988.
Today, pinyin is often written syllable by syllable. This change started after the government body overseeing language reforms was reorganized in 1985. Some people liked this method because it kept pinyin from replacing Chinese characters.
Comparison with other orthographies
Pinyin replaced older systems like Wade–Giles and postal romanization. It is now the official way to write Chinese sounds using the Latin alphabet. In 1982, it became the international standard. Many countries and organizations, including the United Nations, adopted it.
Pinyin helps English speakers understand Chinese sounds better than older systems. Some letters are used differently than in English. Chinese characters are still needed because they give meaning to words that sound the same in modern Chinese. Pinyin is mainly used for Standard Chinese based on Beijing Mandarin and is not used for other Chinese languages like Cantonese or Hokkien.
| IPA | i | je | jou | jɛn | in | iŋ | jʊŋ | u | wo | wei | wən | wəŋ | y | ɥe | ɥɛn | yn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | yi | ye | you | yan | yin | ying | yong | wu | wo/o | wei | wen | weng | yu | yue | yuan | yun |
| Tongyong Pinyin | wun | wong | ||||||||||||||
| Wade–Giles | i/yi | yeh | yu | yen | yung | wên | wêng | yü | yüeh | yüan | yün | |||||
| Bopomofo | ㄧ | ㄧㄝ | ㄧㄡ | ㄧㄢ | ㄧㄣ | ㄧㄥ | ㄩㄥ | ㄨ | ㄨㄛ/ㄛ | ㄨㄟ | ㄨㄣ | ㄨㄥ | ㄩ | ㄩㄝ | ㄩㄢ | ㄩㄣ |
| example | 一 | 也 | 又 | 言 | 音 | 英 | 用 | 五 | 我 | 位 | 文 | 翁 | 玉 | 月 | 元 | 雲 |
| IPA | p | pʰ | m | fəŋ | tjou | twei | twən | tʰɤ | ny | ly | kɤ | kʰɤ | xɤ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | b | p | m | feng | diu | dui | dun | te | nü | lü | ge | ke | he |
| Tongyong Pinyin | fong | diou | duei | nyu | lyu | ||||||||
| Wade–Giles | p | pʻ | fêng | tiu | tui | tun | tʻê | nü | lü | ko | kʻo | ho | |
| Bopomofo | ㄅ | ㄆ | ㄇ | ㄈㄥ | ㄉㄧㄡ | ㄉㄨㄟ | ㄉㄨㄣ | ㄊㄜ | ㄋㄩ | ㄌㄩ | ㄍㄜ | ㄎㄜ | ㄏㄜ |
| example | 玻 | 婆 | 末 | 封 | 丟 | 兌 | 頓 | 特 | 女 | 旅 | 歌 | 可 | 何 |
| IPA | tɕjɛn | tɕjʊŋ | tɕʰin | ɕɥɛn | ʈʂɤ | ʈʂɨ | ʈʂʰɤ | ʈʂʰɨ | ʂɤ | ʂɨ | ɻɤ | ɻɨ | tsɤ | tswo | tsɨ | tsʰɤ | tsʰwo | tsʰɨ | sɤ | swo | sɨ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pinyin | jian | jiong | qin | xuan | zhe | zhi | che | chi | she | shi | re | ri | ze | zuo | zi | ce | cuo | ci | se | suo | si |
| Tongyong Pinyin | jyong | cin | syuan | jhe | jhih | chih | shih | rih | zih | cih | sih | ||||||||||
| Wade–Giles | chien | chiung | chʻin | hsüan | chê | chih | chʻê | chʻih | shê | shih | jê | jih | tsê | tso | tzŭ | tsʻê | tsʻo | tzʻŭ | sê | so | ssŭ |
| Bopomofo | ㄐㄧㄢ | ㄐㄩㄥ | ㄑㄧㄣ | ㄒㄩㄢ | ㄓㄜ | ㄓ | ㄔㄜ | ㄔ | ㄕㄜ | ㄕ | ㄖㄜ | ㄖ | ㄗㄜ | ㄗㄨㄛ | ㄗ | ㄘㄜ | ㄘㄨㄛ | ㄘ | ㄙㄜ | ㄙㄨㄛ | ㄙ |
| example | 件 | 囧 | 秦 | 宣 | 哲 | 之 | 扯 | 赤 | 社 | 是 | 惹 | 日 | 仄 | 左 | 字 | 策 | 撮 | 次 | 色 | 索 | 斯 |
| IPA | ma˥ | ma˧˥ | ma˨˩˦ | ma˥˩ | ma |
|---|---|---|---|---|---|
| Pinyin | mā | má | mǎ | mà | ma |
| Tongyong Pinyin | ma | må | |||
| Wade–Giles | ma1 | ma2 | ma3 | ma4 | ma |
| Bopomofo | ㄇㄚ | ㄇㄚˊ | ㄇㄚˇ | ㄇㄚˋ | ˙ㄇㄚ |
| example (Chinese characters) | 媽 | 麻 | 馬 | 罵 | 嗎 |
Typography and encoding
Pinyin uses special symbols to show the tones of Chinese sounds. These symbols come from a set called Combining Diacritical Marks. Important Chinese character sets, like GB 2312 and JIS X 0212, include these special letters. This helps them show up correctly on computers using Unicode.
Some special characters, like ⟨ḿ⟩ and ⟨ǹ⟩, can be confusing. Different systems, such as GBK and GB 18030, place them in different parts of the encoding. This means that some fonts and input methods might show these characters differently.
Generally, it is suggested that pinyin use a certain style of letters, like a single-story 'a' and 'g', and a thicker second-tone mark, but there is no strict rule about this.
| Letter | First tone | Second tone | Third tone | Fourth tone | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Combining Diacritical Marks | ◌̄ (U+0304) | ◌́ (U+0301) | ◌̌ (U+030C) | ◌̀ (U+0300) | ||||||||||||
| Common letters | ||||||||||||||||
| Uppercase | A | Ā (U+0100) | Á (U+00C1) | Ǎ (U+01CD) | À (U+00C0) | |||||||||||
| E | Ē (U+0112) | É (U+00C9) | Ě (U+011A) | È (U+00C8) | ||||||||||||
| I | Ī (U+012A) | Í (U+00CD) | Ǐ (U+01CF) | Ì (U+00CC) | ||||||||||||
| O | Ō (U+014C) | Ó (U+00D3) | Ǒ (U+01D1) | Ò (U+00D2) | ||||||||||||
| U | Ū (U+016A) | Ú (U+00DA) | Ǔ (U+01D3) | Ù (U+00D9) | ||||||||||||
| Ü (U+00DC) | Ǖ (U+01D5) | Ǘ (U+01D7) | Ǚ (U+01D9) | Ǜ (U+01DB) | ||||||||||||
| Lowercase | a | ā (U+0101) | á (U+00E1) | ǎ (U+01CE) | à (U+00E0) | |||||||||||
| e | ē (U+0113) | é (U+00E9) | ě (U+011B) | è (U+00E8) | ||||||||||||
| i | ī (U+012B) | í (U+00ED) | ǐ (U+01D0) | ì (U+00EC) | ||||||||||||
| o | ō (U+014D) | ó (U+00F3) | ǒ (U+01D2) | ò (U+00F2) | ||||||||||||
| u | ū (U+016B) | ú (U+00FA) | ǔ (U+01D4) | ù (U+00F9) | ||||||||||||
| ü (U+00FC) | ǖ (U+01D6) | ǘ (U+01D8) | ǚ (U+01DA) | ǜ (U+01DC) | ||||||||||||
| Rare letters | ||||||||||||||||
| Uppercase | Ê (U+00CA) | Ê̄ (U+00CA U+0304) | Ế (U+1EBE) | Ê̌ (U+00CA U+030C) | Ề (U+1EC0) | |||||||||||
| M | M̄ (U+004D U+0304) | Ḿ (U+1E3E) | M̌ (U+004D U+030C) | M̀ (U+004D U+0300) | ||||||||||||
| N | N̄ (U+004E U+0304) | Ń (U+0143) | Ň (U+0147) | Ǹ (U+01F8) | ||||||||||||
| Lowercase | ê (U+00EA) | ê̄ (U+00EA U+0304) | ế (U+1EBF) | ê̌ (U+00EA U+030C) | ề (U+1EC1) | |||||||||||
| m | m̄ (U+006D U+0304) | ḿ (U+1E3F) | m̌ (U+006D U+030C) | m̀ (U+006D U+0300) | ||||||||||||
| n | n̄ (U+006E U+0304) | ń (U+0144) | ň (U+0148) | ǹ (U+01F9) | ||||||||||||
| Notes a.^ Yellow cells indicate that there are no single Unicode character for that letter; the character shown here uses Combining Diacritical Mark characters to display the letter. b.^ Grey cells indicate that Xiandai Hanyu Cidian does not include pinyin with that specific letter. | ||||||||||||||||
| Uppercase | Lowercase | Note | Example |
|---|---|---|---|
| Ĉ (U+0108) | ĉ (U+0109) | Abbreviation of ch | 长; 長 can be spelled as ĉáŋ |
| Ŝ (U+015C) | ŝ (U+015D) | Abbreviation of sh | 伤; 傷 can be spelled as ŝāŋ |
| Ẑ (U+1E90) | ẑ (U+1E91) | Abbreviation of zh | 张; 張 can be spelled as Ẑāŋ |
| Ŋ (U+014A) | ŋ (U+014B) | Abbreviation of ng | 让; 讓 can be spelled as ràŋ 嗯 can be spelled as ŋ̀ |
| Chinese | Pinyin | Usage | Example |
|---|---|---|---|
| U+3002 。 IDEOGRAPHIC FULL STOP | U+002E . FULL STOP | End of sentence | 你好。 Nǐ hǎo. |
U+FF0C , FULLWIDTH COMMA U+3001 、 IDEOGRAPHIC COMMA | U+002C , COMMA | Connecting clauses | 你,好吗? Nǐ, hǎo ma? |
| U+2014 — EM DASH (×2) | U+2014 — EM DASH | Division of clauses mid-sentence | 枢纽部分——中央大厅 shūniǔ bùfèn — zhōngyāng dàtīng |
| U+2026 … HORIZONTAL ELLIPSIS (×2) | U+2026 … HORIZONTAL ELLIPSIS | Redaction of part of a passage | 我…… Wǒ… |
| —N/a | U+00B7 · MIDDLE DOT | Neutral tone marker placed before the syllable | 吗 ·ma |
| U+002D - HYPHEN-MINUS | Hyphenation of abbreviated compounds | 公关 gōng-guān | |
| U+0027 ' APOSTROPHE | Syllable segmentation | 西安 - Xī'ān (compared to 先 - xiān) |
Usage
The spelling of Chinese place or people names in pinyin is now the most common way to write them in English. Pinyin is the main way to type Chinese words on computers in mainland China. In Taiwan, another system called bopomofo is more common.
Families who speak Mandarin use pinyin to help children connect written characters with the words they know. Children usually start learning it in kindergarten, but it is not used in textbooks after primary school. Families who speak a different language also use pinyin to teach children how to say Mandarin words correctly when they are learning new vocabulary in elementary school.
Since 1958, pinyin has also been used in adult education. It helps people who cannot read Chinese characters to learn to read and write through pinyin.
Pinyin is also helpful for people who are learning Mandarin as a new language. It is used to explain pronunciation and grammar, along with Chinese characters. Books for learners often include both the characters and pinyin.
Computer input
Simple computer systems that could only use basic letters and numbers made it easier to use pinyin without special marks. Today, most computers can show Chinese characters and let people type them using a regular keyboard with special programs. Some tablets also let people write characters by hand, and the computer tries to recognize what was written.
Pinyin with special marks can be typed using special keyboard setups or other tools.
Sorting techniques
Chinese text can be sorted using pinyin, which helps find words when you know how they sound but not how they are written. Characters can be sorted by their pinyin spelling. When syllables are the same, they are sorted by tone, with neutral tones coming last.
Words with more than one character can be sorted in two ways: by each character, as in the Xiandai Hanyu Cidian, or by the whole word’s string, then by tone. This method is used in the ABC Chinese–English Dictionary.
By region
Taiwan
See also: Chinese language romanization in Taiwan and Tongyong Pinyin
Between October 2002 and January 2009, Taiwan used Tongyong Pinyin, a version of Hanyu Pinyin made for local use. After that, Taiwan started to use Hanyu Pinyin instead. Tongyong Pinyin was made to write not just Standard Chinese, but also other varieties spoken on the island. The ruling Kuomintang party preferred the system used in mainland China and internationally.
Today, many street signs in Taiwan use Tongyong Pinyin or other older systems. People in Taiwan can choose how their personal names are written on passports, using different systems.
Singapore
See also: Chinese language romanization in Singapore
Singapore adopted Hanyu Pinyin as the official system for Mandarin in public use starting in the 1980s, as part of the Speak Mandarin Campaign. Schools also use Hanyu Pinyin to teach Mandarin. While it has been widely accepted in government use and for place names and newer businesses, some people still prefer older ways of writing personal names and words from other Chinese languages commonly spoken in Singapore, like Hokkien, Teochew, and Cantonese.
Special names
See also: SASM/GNC romanization, Tibetan pinyin, and Guangdong Romanization
Place names from non-Han languages such as Mongolian, Uyghur, and Tibetan are also written using pinyin. This is done according to rules set in 1976. The pinyin letters are used to try to match the sounds of these languages as closely as possible, which can make the spellings different from both the original name and how it is written in Chinese.
Images
This article is a child-friendly adaptation of the Wikipedia article on Pinyin, available under CC BY-SA 4.0.
Images from Wikimedia Commons. Tap any image to view credits and license.
Safekipedia