Safekipedia
1958 introductionsChinese words and phrasesISO standardsPhonetic alphabets

Pinyin

Adapted from Wikipedia · Adventurer experience

A kindergarten in Shanghai with a sign encouraging everyone to speak Standard Mandarin.

Hanyu Pinyin

Hanyu Pinyin, or simply pinyin, is the most common way to write Standard Chinese using the letters of the Roman alphabet. It helps people all over the world read and say Chinese words correctly. In China and Singapore, pinyin is used to teach children how to speak and write Chinese characters. It is also used on computers to help type Chinese words and to organize Chinese dictionaries.

Pinyin breaks down each Chinese sound into two parts: an initial consonant and a final vowel sound. Special marks above the letters show the different tones used in Chinese. These marks are often left out when writing Chinese names in English.

Hanyu Pinyin was created in the 1950s by a group of Chinese language experts, led by Zhou Youguang, who is known as the “father of pinyin.” It became the official system in China in 1958 and was later adopted by the United Nations. Today, pinyin is used by learners and teachers of Chinese around the world.

History

A facade of a kindergarten in Zhengzhou, Henan, with writing using simplified characters and pinyin

Matteo Ricci, a Jesuit missionary in China, wrote the first book using the Latin alphabet to write Chinese in 1605. Later, other missionaries also created books to help Westerners understand Chinese sounds.

In the 1950s, Chinese language experts created Hanyu Pinyin. They used the Latin alphabet to write Chinese sounds. They mixed ideas from older systems to make it easier for everyone. Hanyu Pinyin became official in 1958. It is now used in schools to help people learn to read and speak Standard Chinese. Today, it is the most common way to write Chinese with the Latin alphabet.

Syllables

Chinese sounds are split into two parts: initials and finals. Initials are single consonant sounds. Finals are more complex and can include extra sounds.

For example, the sound in the word for "clothes" (衣) can change a little depending on how it is used in a sentence.

One special sound in Chinese is the "ü" sound. It looks like a "u" with an umlaut, like in German. This shows a different way to say a word, like in "donkey" (驴). But because this symbol can be hard to type, sometimes "v" is used instead. This can be confusing, especially with names. But there are rules to help with this in important places like passports.

 b  p  m  f  d  t  n  l  g  k  h  j  q  x  zh  ch  sh  r  z  c  s 
PinyinIPADescription
b[p]Unaspirated p, like in English spark.
p[]Strongly aspirated p, like in English pay.
m[m]Like the m in English may.
f[f]Like the f in English fair.
d[t]Unaspirated t, like in English stop.
t[]Strongly aspirated t, like in English take.
n[n]Like the n in English nay.
l[l]~[ɾ]Varies between the l in English lay and tt in American English better.
g[k]Unaspirated k, like in English skill.
k[]Strongly aspirated k, like in English kiss.
h[x]~[h]Varies between the h in English hat, and the ch in Scottish English loch.
j[]Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the ch in English churchyard.
q[tɕʰ]Alveolo-palatal, aspirated. No direct equivalent in English, but similar to the ch in English punchy.
x[ɕ]Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the sh in English push.
zh[ʈʂ]~[d͡ʒ]Retroflex, unaspirated. Like j in English jack.
ch[ʈʂʰ]~[ʃ]Retroflex, aspirated. Varies between the ch in English church and sh in English bushy.
sh[ʂ]~[ɹ̠̊˔]Retroflex, unaspirated. Like sh in shirt.
r[ɻ~ʐ]~[ɹ]Retroflex. No direct equivalent in English, but varies between the r in English reduce and the s in English measure.
z[ts]Unaspirated. Like the zz in English pizza.
c[tsʰ]Aspirated. Like the ts in English bats.
s[s]Like the s in English say.
w[w]Like the w in English water.
y[j]Like the y in English yes.
yu[ɥ]Like the hu in French huit, see below.
Rime
Medial
-⁠e / -⁠o / -⁠ê-⁠a-⁠ei-⁠ai-⁠ou-⁠ao-⁠en-⁠an-⁠eng-⁠anger
[ɨ]
-⁠i
[ɤ]
e
-⁠e
[ɛ]
ê
-⁠ê
[a]
a
-⁠a
[ei̯]
ei
-⁠ei
[ai̯]
ai
-⁠ai
[ou̯]
ou
-⁠ou
[au̯]
ao
-⁠ao
[ən]
en
-⁠en
[an]
an
-⁠an
[əŋ]
eng
-⁠eng
[aŋ]
ang
-⁠ang
[ɚ]
er
y⁠-
-⁠i⁠-
[i]
yi
-⁠i
[je]
ye
-⁠ie
[ja]
ya
-⁠ia
[jou̯] ([iu̯])
you
-⁠iu
[jau̯]
yao
-⁠iao
[in]
yin
-⁠in
[jɛn]
yan
-⁠ian
[iŋ]
ying
-⁠ing
[jaŋ]
yang
-⁠iang
w⁠-
-⁠u⁠-
[u]
wu
-⁠u
[wo]
wo
-⁠uo
[wa]
wa
-⁠ua
[wei̯] ([ui̯])
wei
-⁠ui
[wai̯]
wai
-⁠uai
[wən] ([un])
wen
-⁠un
[wan]
wan
-⁠uan
[wəŋ~ʊŋ]
weng
-⁠ong
[waŋ]
wang
-⁠uang
yu⁠-
-⁠ü⁠-
[y]
yu
-⁠ü
[ɥe]
yue
-⁠üe
[yn]
yun
-⁠ün
[ɥɛn]
yuan
-⁠üan
[jʊŋ]
yong
-⁠iong
PinyinIPAForm with zero initialExplanation
-i[ɹ̩~], [ɻ̩~ʐ̩](N/A)-i is a buzzed continuation of the consonant following z-, c-, s-, zh-, ch-, sh- or r-. In all other cases, -i has the sound of bee.
a[a]alike English father, but a bit more fronted
e[ɤ] , [ə]ea back, unrounded vowel (similar to English duh, but not as open). Pronounced as a sequence [ɰɤ].
ai[ai̯]ailike English eye, but a bit lighter
ei[ei̯]eias in hey
ao[au̯]aoapproximately as in cow; the a is much more audible than the o
ou[ou̯]ouas in North American English so
an[an]anlike British English ban, but more central
en[ən]enas in taken
ang[aŋ]angas in German Angst.
(Starts with the vowel sound in father and ends in the velar nasal; like song in some dialects of American English)
eng[əŋ]englike e in en above but with ng appended
ong[ʊŋ]~[o̞ʊŋ](weng)starts with the vowel sound in book and ends with the velar nasal sound in sing. Varies between [oŋ] and [uŋ] depending on the speaker.
er[aɚ̯]~[əɹ]erSimilar to the sound in bar in English. Can also be pronounced [ɚ] depending on the speaker.
Finals beginning with i- (y-)
i[i]yilike English bee
ia[ja]yaas i + a; like English yard
ie[je]yeas i + ê where the e (compare with the ê interjection) is pronounced shorter and lighter
iao[jau̯]yaoas i + ao
iu[jou̯]youas i + ou
ian[jɛn]yanas i + an; like English yen. Varies between [jen] and [jan] depending on the speaker.
in[in]yinas i + n
iang[jaŋ]yangas i + ang
ing[iŋ]yingas i + ng
iong[jʊŋ]yongas i + ong. Varies between [joŋ] and [juŋ] depending on the speaker.
Finals beginning with u- (w-)
u[u]wulike English oo
ua[wa]waas u + a
uo/o[wo]woas u + o where the o (compare with the o interjection) is pronounced shorter and lighter (spelled as o after b, p, m or f)
uai[wai̯]waias u + ai, as in English why
ui[wei̯]weias u + ei, as in English way
uan[wan]wanas u + an
un[wən]wenas u + en; as in English won
uang[waŋ]wangas u + ang
(ong)[wəŋ]wengas u + eng
Finals beginning with ü- (yu-)
ü[y] yuas in German über or French lune (pronounced as English ee with rounded lips; spelled as u after j, q or x)
üe[ɥe]yueas ü + ê where the e (compare with the ê interjection) is pronounced shorter and lighter (spelled as ue after j, q or x)
üan[ɥɛn]yuanas ü + an. Varies between [ɥen] and [ɥan] depending on the speaker (spelled as uan after j, q or x)
ün[yn]yunas ü + n (spelled as un after j, q or x)
Interjections
ê[ɛ]êas in bet
o[ɔ]oapproximately as in British English office; the lips are much more rounded
io[jɔ]yoas i + o

Tones

The pinyin system uses special marks to show the tones of Mandarin Chinese. These marks help us know how to say a word. There are four main tones, each with its own symbol: a flat line for the first tone, an acute accent for the second, a curved hat for the third, and a grave accent for the fourth. The fifth tone, called the neutral tone, has no special mark.

Before computers, people sometimes used numbers instead of these marks to show tones. For example, the word "tóng" could be written as "tong2". Each number matches one of the five tones.

ToneExamplesIPA
1
 (macron)
ma1
ma˥
2
ma2
ma˧˥
3
 (caron)
ma3
ma˨˩˦
4
ma4
ma˥˩
Neutral
ma
·ma (middle dot)
ma0
ma5
ma
Placement of the pinyin tone mark
-a-e-o-i-u
a-àoài
e-èi
o-òu
i-, iàoiò(ng)
u-, uài
ü-üà(n), uà(n)üè, uè

Spacing, capitalization, and punctuation

Standard Chinese has many words made of several syllables. In pinyin, spaces are used to show where words begin and end, similar to other languages that use the Latin alphabet. Official rules for these spellings were set in 1988.

Today, pinyin is often written syllable by syllable. This change started after the government body overseeing language reforms was reorganized in 1985. Some people liked this method because it kept pinyin from replacing Chinese characters.

Comparison with other orthographies

Pinyin replaced older systems like Wade–Giles and postal romanization. It is now the official way to write Chinese sounds using the Latin alphabet. In 1982, it became the international standard. Many countries and organizations, including the United Nations, adopted it.

Pinyin helps English speakers understand Chinese sounds better than older systems. Some letters are used differently than in English. Chinese characters are still needed because they give meaning to words that sound the same in modern Chinese. Pinyin is mainly used for Standard Chinese based on Beijing Mandarin and is not used for other Chinese languages like Cantonese or Hokkien.

Vowels i, u, y
IPAijejoujɛninjʊŋuwoweiwənwəŋyɥeɥɛnyn
Pinyinyiyeyouyanyinyingyongwuwo/oweiwenwengyuyueyuanyun
Tongyong Pinyinwunwong
Wade–Gilesi/yiyehyuyenyungwênwêngyüehyüanyün
Bopomofoㄧㄝㄧㄡㄧㄢㄧㄣㄧㄥㄩㄥㄨㄛ/ㄛㄨㄟㄨㄣㄨㄥㄩㄝㄩㄢㄩㄣ
example
Non-sibilant consonants
IPApmfəŋtjoutweitwəntʰɤnylykʰɤ
Pinyinbpmfengdiuduiduntegekehe
Tongyong Pinyinfongdioudueinyulyu
Wade–Gilespfêngtiutuituntʻêkokʻoho
Bopomofoㄈㄥㄉㄧㄡㄉㄨㄟㄉㄨㄣㄊㄜㄋㄩㄌㄩㄍㄜㄎㄜㄏㄜ
example
Sibilant consonants
IPAtɕjɛntɕjʊŋtɕʰinɕɥɛnʈʂɤʈʂɨʈʂʰɤʈʂʰɨʂɤʂɨɻɤɻɨtsɤtswotsɨtsʰɤtsʰwotsʰɨswo
Pinyinjianjiongqinxuanzhezhichechisheshirerizezuozicecuocisesuosi
Tongyong Pinyinjyongcinsyuanjhejhihchihshihrihzihcihsih
Wade–Gileschienchiungchʻinhsüanchêchihchʻêchʻihshêshihjihtsêtsotzŭtsʻêtsʻotzʻŭsossŭ
Bopomofoㄐㄧㄢㄐㄩㄥㄑㄧㄣㄒㄩㄢㄓㄜㄔㄜㄕㄜㄖㄜㄗㄜㄗㄨㄛㄘㄜㄘㄨㄛㄙㄜㄙㄨㄛ
example
Tones
IPAma˥ma˧˥ma˨˩˦ma˥˩ma
Pinyinma
Tongyong Pinyinma
Wade–Gilesma1ma2ma3ma4ma
Bopomofoㄇㄚㄇㄚˊㄇㄚˇㄇㄚˋ˙ㄇㄚ
example (Chinese characters)

Typography and encoding

Pinyin uses special symbols to show the tones of Chinese sounds. These symbols come from a set called Combining Diacritical Marks. Important Chinese character sets, like GB 2312 and JIS X 0212, include these special letters. This helps them show up correctly on computers using Unicode.

Some special characters, like ⟨ḿ⟩ and ⟨ǹ⟩, can be confusing. Different systems, such as GBK and GB 18030, place them in different parts of the encoding. This means that some fonts and input methods might show these characters differently.

Generally, it is suggested that pinyin use a certain style of letters, like a single-story 'a' and 'g', and a thicker second-tone mark, but there is no strict rule about this.

Accent alphabets in Hanyu Pinyin[a][b]
LetterFirst toneSecond toneThird toneFourth tone
Combining Diacritical Marks◌̄ (U+0304)◌́ (U+0301)◌̌ (U+030C)◌̀ (U+0300)
Common letters
UppercaseAĀ (U+0100)Á (U+00C1)Ǎ (U+01CD)À (U+00C0)
EĒ (U+0112)É (U+00C9)Ě (U+011A)È (U+00C8)
IĪ (U+012A)Í (U+00CD)Ǐ (U+01CF)Ì (U+00CC)
OŌ (U+014C)Ó (U+00D3)Ǒ (U+01D1)Ò (U+00D2)
UŪ (U+016A)Ú (U+00DA)Ǔ (U+01D3)Ù (U+00D9)
Ü (U+00DC)Ǖ (U+01D5)Ǘ (U+01D7)Ǚ (U+01D9)Ǜ (U+01DB)
Lowercaseaā (U+0101)á (U+00E1)ǎ (U+01CE)à (U+00E0)
eē (U+0113)é (U+00E9)ě (U+011B)è (U+00E8)
iī (U+012B)í (U+00ED)ǐ (U+01D0)ì (U+00EC)
oō (U+014D)ó (U+00F3)ǒ (U+01D2)ò (U+00F2)
uū (U+016B)ú (U+00FA)ǔ (U+01D4)ù (U+00F9)
ü (U+00FC)ǖ (U+01D6)ǘ (U+01D8)ǚ (U+01DA)ǜ (U+01DC)
Rare letters
UppercaseÊ (U+00CA)Ê̄ (U+00CA U+0304)Ế (U+1EBE)Ê̌ (U+00CA U+030C)Ề (U+1EC0)
MM̄ (U+004D U+0304)Ḿ (U+1E3E)M̌ (U+004D U+030C)M̀ (U+004D U+0300)
NN̄ (U+004E U+0304)Ń (U+0143)Ň (U+0147)Ǹ (U+01F8)
Lowercaseê (U+00EA)ê̄ (U+00EA U+0304)ế (U+1EBF)ê̌ (U+00EA U+030C)ề (U+1EC1)
mm̄ (U+006D U+0304)ḿ (U+1E3F)m̌ (U+006D U+030C)m̀ (U+006D U+0300)
nn̄ (U+006E U+0304)ń (U+0144)ň (U+0148)ǹ (U+01F9)
Notes
a.^ Yellow cells indicate that there are no single Unicode character for that letter; the character shown here uses Combining Diacritical Mark characters to display the letter.
b.^ Grey cells indicate that Xiandai Hanyu Cidian does not include pinyin with that specific letter.
Shorthand pinyin letters
UppercaseLowercaseNoteExample
Ĉ (U+0108)ĉ (U+0109)Abbreviation of ch长; 長 can be spelled as ĉáŋ
Ŝ (U+015C)ŝ (U+015D)Abbreviation of sh伤; 傷 can be spelled as ŝāŋ
Ẑ (U+1E90)ẑ (U+1E91)Abbreviation of zh张; 張 can be spelled as Ẑāŋ
Ŋ (U+014A)ŋ (U+014B)Abbreviation of ng
让; 讓 can be spelled as ràŋ
嗯 can be spelled as ŋ̀
Symbol comparison
ChinesePinyinUsageExample
U+3002 。 IDEOGRAPHIC FULL STOPU+002E . FULL STOPEnd of sentence你好。 Nǐ hǎo.
U+FF0C , FULLWIDTH COMMA
U+3001 、 IDEOGRAPHIC COMMA
U+002C , COMMAConnecting clauses你,好吗? Nǐ, hǎo ma?
U+2014 — EM DASH (×2)U+2014 — EM DASHDivision of clauses mid-sentence枢纽部分——中央大厅 shūniǔ bùfèn — zhōngyāng dàtīng
U+2026 … HORIZONTAL ELLIPSIS (×2)U+2026 … HORIZONTAL ELLIPSISRedaction of part of a passage我…… Wǒ…
—N/aU+00B7 · MIDDLE DOTNeutral tone marker placed before the syllable·ma
U+002D - HYPHEN-MINUSHyphenation of abbreviated compounds公关 gōng-guān
U+0027 ' APOSTROPHESyllable segmentation西安 - Xī'ān (compared to 先 - xiān)

Usage

The spelling of Chinese place or people names in pinyin is now the most common way to write them in English. Pinyin is the main way to type Chinese words on computers in mainland China. In Taiwan, another system called bopomofo is more common.

Families who speak Mandarin use pinyin to help children connect written characters with the words they know. Children usually start learning it in kindergarten, but it is not used in textbooks after primary school. Families who speak a different language also use pinyin to teach children how to say Mandarin words correctly when they are learning new vocabulary in elementary school.

Since 1958, pinyin has also been used in adult education. It helps people who cannot read Chinese characters to learn to read and write through pinyin.

Pinyin is also helpful for people who are learning Mandarin as a new language. It is used to explain pronunciation and grammar, along with Chinese characters. Books for learners often include both the characters and pinyin.

Computer input

Simple computer systems that could only use basic letters and numbers made it easier to use pinyin without special marks. Today, most computers can show Chinese characters and let people type them using a regular keyboard with special programs. Some tablets also let people write characters by hand, and the computer tries to recognize what was written.

Pinyin with special marks can be typed using special keyboard setups or other tools.

Sorting techniques

Chinese text can be sorted using pinyin, which helps find words when you know how they sound but not how they are written. Characters can be sorted by their pinyin spelling. When syllables are the same, they are sorted by tone, with neutral tones coming last.

A slogan written on a school wall featuring pinyin annotations without tonal marks or word grouping

Words with more than one character can be sorted in two ways: by each character, as in the Xiandai Hanyu Cidian, or by the whole word’s string, then by tone. This method is used in the ABC Chinese–English Dictionary.

By region

Taiwan

See also: Chinese language romanization in Taiwan and Tongyong Pinyin

Between October 2002 and January 2009, Taiwan used Tongyong Pinyin, a version of Hanyu Pinyin made for local use. After that, Taiwan started to use Hanyu Pinyin instead. Tongyong Pinyin was made to write not just Standard Chinese, but also other varieties spoken on the island. The ruling Kuomintang party preferred the system used in mainland China and internationally.

Today, many street signs in Taiwan use Tongyong Pinyin or other older systems. People in Taiwan can choose how their personal names are written on passports, using different systems.

Singapore

See also: Chinese language romanization in Singapore

Singapore adopted Hanyu Pinyin as the official system for Mandarin in public use starting in the 1980s, as part of the Speak Mandarin Campaign. Schools also use Hanyu Pinyin to teach Mandarin. While it has been widely accepted in government use and for place names and newer businesses, some people still prefer older ways of writing personal names and words from other Chinese languages commonly spoken in Singapore, like Hokkien, Teochew, and Cantonese.

Special names

See also: SASM/GNC romanization, Tibetan pinyin, and Guangdong Romanization

Place names from non-Han languages such as Mongolian, Uyghur, and Tibetan are also written using pinyin. This is done according to rules set in 1976. The pinyin letters are used to try to match the sounds of these languages as closely as possible, which can make the spellings different from both the original name and how it is written in Chinese.

CustomaryOfficial pinyinCharacters
ShigatseXigazê日喀则; 日喀則; Rìkāzé
UrumchiÜrümqi乌鲁木齐; 烏魯木齊; Wūlǔmùqí
LhasaLhasa拉萨; 拉薩; Lāsà
HohhotHohhot呼和浩特; Hūhéhàotè
GolmudGolmud格尔木; 格爾木; Gé'ěrmù
QiqiharQiqihar齐齐哈尔; 齊齊哈爾; Qíqíhā'ěr

Images

Station sign for Taichung Port Station in Taiwan.

This article is a child-friendly adaptation of the Wikipedia article on Pinyin, available under CC BY-SA 4.0.

Images from Wikimedia Commons. Tap any image to view credits and license.