Safekipedia
1958 introductionsChinese words and phrasesISO standardsPhonetic alphabets

Pinyin

Adapted from Wikipedia · Discoverer experience

The entrance to No. 6 Kindergarten in Jinshui District, Zhengzhou.

Hanyu Pinyin, or simply pinyin, is the most common way to write Standard Chinese using the letters of the Roman alphabet. It helps people all over the world read and say Chinese words correctly. In China and Singapore, pinyin is used to teach children how to speak and write Chinese characters. It is also used on computers to help type Chinese words and to organize Chinese dictionaries.

Pinyin breaks down each Chinese sound into two parts: an initial consonant and a final vowel sound. Special marks above the letters show the different tones used in Chinese, which change the meaning of words. These marks are often left out when writing Chinese names in English.

Hanyu Pinyin was created in the 1950s by a group of Chinese language experts, led by Zhou Youguang, who is known as the “father of pinyin.” It became the official system in China in 1958 and was later adopted by the United Nations and made into an international standard. Today, pinyin is used by learners and teachers of Chinese around the world.

History

A facade of a kindergarten in Zhengzhou, Henan, with writing using simplified characters and pinyin

Matteo Ricci, a Jesuit missionary in China, wrote the first book using the Latin alphabet to write Chinese in 1605. Later, other missionaries also created books to help Westerners understand Chinese sounds.

In the 1950s, Chinese linguists created Hanyu Pinyin as a way to write Chinese sounds using the Latin alphabet. They combined ideas from older systems to make it easier for everyone to learn and use. Hanyu Pinyin was officially adopted in 1958 and became widely used in schools, helping people learn to read and speak Standard Chinese. Today, it is the most common way to write Chinese using the Latin alphabet.

Syllables

Chinese sounds are described using two parts: initials and finals. Initials are usually single consonant sounds, while finals can be more complex, sometimes including extra sounds called medials. For example, the sound in the word for "clothes" (衣) might change slightly depending on how it is used in a sentence.

One special sound in Chinese is the "ü" sound, which looks like a "u" with an umlaut (like in German). This is used to show a different pronunciation, such as in the word for "donkey" (驴). However, because it can be hard to type or show this symbol, sometimes "v" is used instead. This can cause confusion, especially with names, but rules have been made to help with this in important documents like passports.

 b  p  m  f  d  t  n  l  g  k  h  j  q  x  zh  ch  sh  r  z  c  s 
PinyinIPADescription
b[p]Unaspirated p, like in English spark.
p[]Strongly aspirated p, like in English pay.
m[m]Like the m in English may.
f[f]Like the f in English fair.
d[t]Unaspirated t, like in English stop.
t[]Strongly aspirated t, like in English take.
n[n]Like the n in English nay.
l[l]~[ɾ]Varies between the l in English lay and tt in American English better.
g[k]Unaspirated k, like in English skill.
k[]Strongly aspirated k, like in English kiss.
h[x]~[h]Varies between the h in English hat, and the ch in Scottish English loch.
j[]Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the ch in English churchyard.
q[tɕʰ]Alveolo-palatal, aspirated. No direct equivalent in English, but similar to the ch in English punchy.
x[ɕ]Alveolo-palatal, unaspirated. No direct equivalent in English, but similar to the sh in English push.
zh[ʈʂ]~[d͡ʒ]Retroflex, unaspirated. Like j in English jack.
ch[ʈʂʰ]~[ʃ]Retroflex, aspirated. Varies between the ch in English church and sh in English bushy.
sh[ʂ]~[ɹ̠̊˔]Retroflex, unaspirated. Like sh in shirt.
r[ɻ~ʐ]~[ɹ]Retroflex. No direct equivalent in English, but varies between the r in English reduce and the s in English measure.
z[ts]Unaspirated. Like the zz in English pizza.
c[tsʰ]Aspirated. Like the ts in English bats.
s[s]Like the s in English say.
w[w]Like the w in English water.
y[j]Like the y in English yes.
yu[ɥ]Like the hu in French huit, see below.
Rime
Medial
-⁠e / -⁠o / -⁠ê-⁠a-⁠ei-⁠ai-⁠ou-⁠ao-⁠en-⁠an-⁠eng-⁠anger
[ɨ]
-⁠i
[ɤ]
e
-⁠e
[ɛ]
ê
-⁠ê
[a]
a
-⁠a
[ei̯]
ei
-⁠ei
[ai̯]
ai
-⁠ai
[ou̯]
ou
-⁠ou
[au̯]
ao
-⁠ao
[ən]
en
-⁠en
[an]
an
-⁠an
[əŋ]
eng
-⁠eng
[aŋ]
ang
-⁠ang
[ɚ]
er
y⁠-
-⁠i⁠-
[i]
yi
-⁠i
[je]
ye
-⁠ie
[ja]
ya
-⁠ia
[jou̯] ([iu̯])
you
-⁠iu
[jau̯]
yao
-⁠iao
[in]
yin
-⁠in
[jɛn]
yan
-⁠ian
[iŋ]
ying
-⁠ing
[jaŋ]
yang
-⁠iang
w⁠-
-⁠u⁠-
[u]
wu
-⁠u
[wo]
wo
-⁠uo
[wa]
wa
-⁠ua
[wei̯] ([ui̯])
wei
-⁠ui
[wai̯]
wai
-⁠uai
[wən] ([un])
wen
-⁠un
[wan]
wan
-⁠uan
[wəŋ~ʊŋ]
weng
-⁠ong
[waŋ]
wang
-⁠uang
yu⁠-
-⁠ü⁠-
[y]
yu
-⁠ü
[ɥe]
yue
-⁠üe
[yn]
yun
-⁠ün
[ɥɛn]
yuan
-⁠üan
[jʊŋ]
yong
-⁠iong
PinyinIPAForm with zero initialExplanation
-i[ɹ̩~], [ɻ̩~ʐ̩](N/A)-i is a buzzed continuation of the consonant following z-, c-, s-, zh-, ch-, sh- or r-. In all other cases, -i has the sound of bee.
a[a]alike English father, but a bit more fronted
e[ɤ] , [ə]ea back, unrounded vowel (similar to English duh, but not as open). Pronounced as a sequence [ɰɤ].
ai[ai̯]ailike English eye, but a bit lighter
ei[ei̯]eias in hey
ao[au̯]aoapproximately as in cow; the a is much more audible than the o
ou[ou̯]ouas in North American English so
an[an]anlike British English ban, but more central
en[ən]enas in taken
ang[aŋ]angas in German Angst.
(Starts with the vowel sound in father and ends in the velar nasal; like song in some dialects of American English)
eng[əŋ]englike e in en above but with ng appended
ong[ʊŋ]~[o̞ʊŋ](weng)starts with the vowel sound in book and ends with the velar nasal sound in sing. Varies between [oŋ] and [uŋ] depending on the speaker.
er[aɚ̯]~[əɹ]erSimilar to the sound in bar in English. Can also be pronounced [ɚ] depending on the speaker.
Finals beginning with i- (y-)
i[i]yilike English bee
ia[ja]yaas i + a; like English yard
ie[je]yeas i + ê where the e (compare with the ê interjection) is pronounced shorter and lighter
iao[jau̯]yaoas i + ao
iu[jou̯]youas i + ou
ian[jɛn]yanas i + an; like English yen. Varies between [jen] and [jan] depending on the speaker.
in[in]yinas i + n
iang[jaŋ]yangas i + ang
ing[iŋ]yingas i + ng
iong[jʊŋ]yongas i + ong. Varies between [joŋ] and [juŋ] depending on the speaker.
Finals beginning with u- (w-)
u[u]wulike English oo
ua[wa]waas u + a
uo/o[wo]woas u + o where the o (compare with the o interjection) is pronounced shorter and lighter (spelled as o after b, p, m or f)
uai[wai̯]waias u + ai, as in English why
ui[wei̯]weias u + ei, as in English way
uan[wan]wanas u + an
un[wən]wenas u + en; as in English won
uang[waŋ]wangas u + ang
(ong)[wəŋ]wengas u + eng
Finals beginning with ü- (yu-)
ü[y] yuas in German über or French lune (pronounced as English ee with rounded lips; spelled as u after j, q or x)
üe[ɥe]yueas ü + ê where the e (compare with the ê interjection) is pronounced shorter and lighter (spelled as ue after j, q or x)
üan[ɥɛn]yuanas ü + an. Varies between [ɥen] and [ɥan] depending on the speaker (spelled as uan after j, q or x)
ün[yn]yunas ü + n (spelled as un after j, q or x)
Interjections
ê[ɛ]êas in bet
o[ɔ]oapproximately as in British English office; the lips are much more rounded
io[jɔ]yoas i + o

Tones

The pinyin system uses special marks called diacritics to show the tones of Mandarin Chinese. These marks help indicate how a word is pronounced. There are four main tones, each shown with a different symbol: a flat line for the first tone, an acute accent for the second, a curved hat for the third, and a grave accent for the fourth. The fifth tone, called the neutral tone, has no special mark.

Before computers, people sometimes used numbers instead of these special marks to show tones. For example, the word "tóng" could be written as "tong2". Each number corresponds to one of the five tones.

ToneExamplesIPA
1
 (macron)
ma1
ma˥
2
ma2
ma˧˥
3
 (caron)
ma3
ma˨˩˦
4
ma4
ma˥˩
Neutral
ma
·ma (middle dot)
ma0
ma5
ma
Placement of the pinyin tone mark
-a-e-o-i-u
a-àoài
e-èi
o-òu
i-, iàoiò(ng)
u-, uài
ü-üà(n), uà(n)üè, uè

Spacing, capitalization, and punctuation

Standard Chinese has many words made of several syllables. In pinyin, spacing is meant to follow word boundaries, much like in other languages that use the Latin alphabet. Official rules for these spellings were set in 1988.

In real usage today, pinyin is often written syllable by syllable, a change that began after the government body overseeing language reforms was reorganized in 1985. Some people preferred this method because it kept pinyin from becoming an alternative way to write Chinese characters.

Comparison with other orthographies

Pinyin replaced older systems like Wade–Giles and postal romanization, becoming the official way to write Chinese sounds using the Latin alphabet. In 1982, it became the international standard, and many countries and organizations, including the United Nations, adopted it.

Pinyin helps English speakers understand Chinese sounds better than older systems, though some letters are used differently than in English. Chinese characters are still needed because they give meaning to words that sound the same in modern Chinese. Pinyin is mainly used for Standard Chinese based on Beijing Mandarin and is not used for other Chinese languages like Cantonese or Hokkien.

Vowels i, u, y
IPAijejoujɛninjʊŋuwoweiwənwəŋyɥeɥɛnyn
Pinyinyiyeyouyanyinyingyongwuwo/oweiwenwengyuyueyuanyun
Tongyong Pinyinwunwong
Wade–Gilesi/yiyehyuyenyungwênwêngyüehyüanyün
Bopomofoㄧㄝㄧㄡㄧㄢㄧㄣㄧㄥㄩㄥㄨㄛ/ㄛㄨㄟㄨㄣㄨㄥㄩㄝㄩㄢㄩㄣ
example
Non-sibilant consonants
IPApmfəŋtjoutweitwəntʰɤnylykʰɤ
Pinyinbpmfengdiuduiduntegekehe
Tongyong Pinyinfongdioudueinyulyu
Wade–Gilespfêngtiutuituntʻêkokʻoho
Bopomofoㄈㄥㄉㄧㄡㄉㄨㄟㄉㄨㄣㄊㄜㄋㄩㄌㄩㄍㄜㄎㄜㄏㄜ
example
Sibilant consonants
IPAtɕjɛntɕjʊŋtɕʰinɕɥɛnʈʂɤʈʂɨʈʂʰɤʈʂʰɨʂɤʂɨɻɤɻɨtsɤtswotsɨtsʰɤtsʰwotsʰɨswo
Pinyinjianjiongqinxuanzhezhichechisheshirerizezuozicecuocisesuosi
Tongyong Pinyinjyongcinsyuanjhejhihchihshihrihzihcihsih
Wade–Gileschienchiungchʻinhsüanchêchihchʻêchʻihshêshihjihtsêtsotzŭtsʻêtsʻotzʻŭsossŭ
Bopomofoㄐㄧㄢㄐㄩㄥㄑㄧㄣㄒㄩㄢㄓㄜㄔㄜㄕㄜㄖㄜㄗㄜㄗㄨㄛㄘㄜㄘㄨㄛㄙㄜㄙㄨㄛ
example
Tones
IPAma˥ma˧˥ma˨˩˦ma˥˩ma
Pinyinma
Tongyong Pinyinma
Wade–Gilesma1ma2ma3ma4ma
Bopomofoㄇㄚㄇㄚˊㄇㄚˇㄇㄚˋ˙ㄇㄚ
example (Chinese characters)

Typography and encoding

Pinyin uses special symbols to show the tones of Chinese sounds. These symbols come from a set called Combining Diacritical Marks, instead of another set called Spacing Modifier Letters. Important Chinese character sets, like GB 2312 and JIS X 0212, include these special letters, so they can be shown correctly on computers using Unicode.

Some special characters, like ⟨ḿ⟩ and ⟨ǹ⟩, can cause confusion because different systems, such as GBK and GB 18030, place them in different parts of the encoding. This means that some fonts and input methods might show these characters differently.

Generally, it is suggested that pinyin use a certain style of letters, like a single-story 'a' and 'g', and a thicker second-tone mark, but there is no strict rule about this.

Accent alphabets in Hanyu Pinyin[a][b]
LetterFirst toneSecond toneThird toneFourth tone
Combining Diacritical Marks◌̄ (U+0304)◌́ (U+0301)◌̌ (U+030C)◌̀ (U+0300)
Common letters
UppercaseAĀ (U+0100)Á (U+00C1)Ǎ (U+01CD)À (U+00C0)
EĒ (U+0112)É (U+00C9)Ě (U+011A)È (U+00C8)
IĪ (U+012A)Í (U+00CD)Ǐ (U+01CF)Ì (U+00CC)
OŌ (U+014C)Ó (U+00D3)Ǒ (U+01D1)Ò (U+00D2)
UŪ (U+016A)Ú (U+00DA)Ǔ (U+01D3)Ù (U+00D9)
Ü (U+00DC)Ǖ (U+01D5)Ǘ (U+01D7)Ǚ (U+01D9)Ǜ (U+01DB)
Lowercaseaā (U+0101)á (U+00E1)ǎ (U+01CE)à (U+00E0)
eē (U+0113)é (U+00E9)ě (U+011B)è (U+00E8)
iī (U+012B)í (U+00ED)ǐ (U+01D0)ì (U+00EC)
oō (U+014D)ó (U+00F3)ǒ (U+01D2)ò (U+00F2)
uū (U+016B)ú (U+00FA)ǔ (U+01D4)ù (U+00F9)
ü (U+00FC)ǖ (U+01D6)ǘ (U+01D8)ǚ (U+01DA)ǜ (U+01DC)
Rare letters
UppercaseÊ (U+00CA)Ê̄ (U+00CA U+0304)Ế (U+1EBE)Ê̌ (U+00CA U+030C)Ề (U+1EC0)
MM̄ (U+004D U+0304)Ḿ (U+1E3E)M̌ (U+004D U+030C)M̀ (U+004D U+0300)
NN̄ (U+004E U+0304)Ń (U+0143)Ň (U+0147)Ǹ (U+01F8)
Lowercaseê (U+00EA)ê̄ (U+00EA U+0304)ế (U+1EBF)ê̌ (U+00EA U+030C)ề (U+1EC1)
mm̄ (U+006D U+0304)ḿ (U+1E3F)m̌ (U+006D U+030C)m̀ (U+006D U+0300)
nn̄ (U+006E U+0304)ń (U+0144)ň (U+0148)ǹ (U+01F9)
Notes
a.^ Yellow cells indicate that there are no single Unicode character for that letter; the character shown here uses Combining Diacritical Mark characters to display the letter.
b.^ Grey cells indicate that Xiandai Hanyu Cidian does not include pinyin with that specific letter.
Shorthand pinyin letters
UppercaseLowercaseNoteExample
Ĉ (U+0108)ĉ (U+0109)Abbreviation of ch长; 長 can be spelled as ĉáŋ
Ŝ (U+015C)ŝ (U+015D)Abbreviation of sh伤; 傷 can be spelled as ŝāŋ
Ẑ (U+1E90)ẑ (U+1E91)Abbreviation of zh张; 張 can be spelled as Ẑāŋ
Ŋ (U+014A)ŋ (U+014B)Abbreviation of ng
让; 讓 can be spelled as ràŋ
嗯 can be spelled as ŋ̀
Symbol comparison
ChinesePinyinUsageExample
U+3002 。 IDEOGRAPHIC FULL STOPU+002E . FULL STOPEnd of sentence你好。 Nǐ hǎo.
U+FF0C , FULLWIDTH COMMA
U+3001 、 IDEOGRAPHIC COMMA
U+002C , COMMAConnecting clauses你,好吗? Nǐ, hǎo ma?
U+2014 — EM DASH (×2)U+2014 — EM DASHDivision of clauses mid-sentence枢纽部分——中央大厅 shūniǔ bùfèn — zhōngyāng dàtīng
U+2026 … HORIZONTAL ELLIPSIS (×2)U+2026 … HORIZONTAL ELLIPSISRedaction of part of a passage我…… Wǒ…
—N/aU+00B7 · MIDDLE DOTNeutral tone marker placed before the syllable·ma
U+002D - HYPHEN-MINUSHyphenation of abbreviated compounds公关 gōng-guān
U+0027 ' APOSTROPHESyllable segmentation西安 - Xī'ān (compared to 先 - xiān)

Usage

The spelling of Chinese geographical or personal names in pinyin has become the most common way to write them in English. Pinyin is the main way to type Chinese words on computers in mainland China, unlike in Taiwan where another system called bopomofo is more common.

Families who speak Mandarin use pinyin to help children connect written characters with the words they already know. Children usually start learning it in kindergarten, but it is not used in textbooks after primary school. Families who speak a different language also use pinyin to teach children how to say Mandarin words correctly when they are learning new vocabulary in elementary school.

Since 1958, pinyin has also been used in adult education. It helps people who cannot read Chinese characters to learn to read and write through pinyin.

Pinyin is also helpful for people who are learning Mandarin as a new language. It is used to explain pronunciation and grammar, along with Chinese characters. Books for learners often include both the characters and pinyin. The way pinyin helps learners is somewhat like how extra letters are used in Japanese books to show pronunciation.

Computer input

Simple computer systems that could only use basic letters and numbers made it easier to use pinyin without special marks. Today, most computers can show Chinese characters and let people type them using a regular keyboard with special programs. Some tablets also let people write characters by hand, and the computer tries to recognize what was written.

Pinyin with special marks can be typed using special keyboard setups or other tools.

Sorting techniques

Chinese text can be sorted using pinyin, which helps find words when you know how they sound but not how they are written. Characters can be sorted by their pinyin spelling. When syllables are the same, they are sorted by tone, with neutral tones coming last.

A slogan written on a school wall featuring pinyin annotations without tonal marks or word grouping

Words with more than one character can be sorted in two ways: by each character, as in the Xiandai Hanyu Cidian, or by the whole word’s string, then by tone. This method is used in the ABC Chinese–English Dictionary.

By region

Taiwan

See also: Chinese language romanization in Taiwan and Tongyong Pinyin

Between October 2002 and January 2009, Taiwan used Tongyong Pinyin, a version of Hanyu Pinyin made for local use. After that, Taiwan started to use Hanyu Pinyin instead. Tongyong Pinyin was made to write not just Standard Chinese, but also other varieties spoken on the island. The ruling Kuomintang party preferred the system used in mainland China and internationally. This led to disagreements about which system to use, with each side connected to different political groups.

Today, many street signs in Taiwan use Tongyong Pinyin or other older systems. Efforts to make Hanyu Pinyin the standard in Taiwan have had mixed results, with most place names still not changed. People in Taiwan can choose how their personal names are written on passports, using different systems. Using Hanyu Pinyin on signs for a metro line in 2017 caused some protests, even though officials said it was to help foreigners.

Singapore

See also: Chinese language romanization in Singapore

Singapore adopted Hanyu Pinyin as the official system for Mandarin in public use starting in the 1980s, as part of the Speak Mandarin Campaign. Schools also use Hanyu Pinyin to teach Mandarin. While it has been widely accepted in government use and for place names and newer businesses, some people still prefer older ways of writing personal names and words from other Chinese languages commonly spoken in Singapore, like Hokkien, Teochew, and Cantonese.

Special names

See also: SASM/GNC romanization, Tibetan pinyin, and Guangdong Romanization

Place names from non-Han languages such as Mongolian, Uyghur, and Tibetan are also written using pinyin. This is done according to rules set in 1976. The pinyin letters are used to try to match the sounds of these languages as closely as possible, which can make the spellings different from both the original name and how it is written in Chinese.

CustomaryOfficial pinyinCharacters
ShigatseXigazê日喀则; 日喀則; Rìkāzé
UrumchiÜrümqi乌鲁木齐; 烏魯木齊; Wūlǔmùqí
LhasaLhasa拉萨; 拉薩; Lāsà
HohhotHohhot呼和浩特; Hūhéhàotè
GolmudGolmud格尔木; 格爾木; Gé'ěrmù
QiqiharQiqihar齐齐哈尔; 齊齊哈爾; Qíqíhā'ěr

Images

Station sign for Taichung Port Station in Taiwan.

This article is a child-friendly adaptation of the Wikipedia article on Pinyin, available under CC BY-SA 4.0.

Images from Wikimedia Commons. Tap any image to view credits and license.