This article is about the IPA in general. For guides to pronouncing IPA transcriptions of English and
foreign words, see Help:IPA English pronunciation key and Help:IPA pronunciation
key.
| International Phonetic
Alphabet |
| Type |
Alphabet |
| Languages |
Reserved for phonetic and
phonemic transcription of any language |
| Time period |
1888 to the present |
| Parent systems |
Romic
alphabet
→ Phonotypic alphabet
→ International Phonetic
Alphabet |
 |
| Note: This page may contain
IPA phonetic symbols in Unicode. |
The International Phonetic Alphabet (IPA)[I] is a system of phonetic notation based on
the Latin alphabet, devised by the International Phonetic Association as a standardized representation of the sounds of
spoken language.[1] The IPA is used by
linguists, speech pathologists and
therapists, foreign language teachers, singers,
actors, lexicographers, and translators.[2][3]
The IPA is designed to represent only those qualities of speech that are distinctive in spoken language: phonemes, intonation, and the separation of words and
syllables.[1] To represent additional qualities of speech such as tooth-gnashing, lisping, and sounds made with a cleft palate, an extended set of
symbols called the Extended IPA is commonly used.[2]
As of 2007, there are 107 distinct letters and 56 diacritics and suprasegmentals in the IPA proper. Occasionally symbols are added, removed, or modified by the
International Phonetic Association.
History
-
In 1886, a group of French and British language
teachers, led by the French linguist Paul Passy, formed what would come to be known (from
1897 onwards) as the International Phonetic Association (in French,
l’Association
phonétique internationale).[4]
The original alphabet was based on a spelling reform for English known as the
Romic alphabet, but in order to make it usable for other languages, the values of the
symbols were allowed to vary from language to language.[5]
For example, the sound /ʃ/ (sh in shoe) was originally
represented with the letter <c> in English, but with the letter <x> in French.[4] However, in 1888, the alphabet was revised so as to be uniform across
languages, thus providing the base for all future revisions.[4][6]
Since its creation, the organization of vowels and consonants in the IPA has remained largely the same. However, the alphabet
itself has undergone a few revisions. The IPA Kiel Convention in 1989 made many
changes to the earlier 1932 version. A minor revision took place in 1993, with the
addition of the mid-central vowel[2] and the removal of symbols for voiceless implosives,[7] and the alphabet was last revised in May 2005, when a symbol for the
labiodental flap was added.[8] Apart from the addition and removal of symbols, changes to the IPA have consisted largely in renaming
symbols and categories, and modifying typefaces.[2]
Extensions of the alphabet are relatively recent; the Extended IPA was created in 1990
and officially adopted by the International
Clinical Phonetics and Linguistics Association in 1994.[9] Also, the VoQS (Voice Quality Symbols) were proposed in 1995 to
provide a system for more detailed transcription of voice production.[10]
Description
A diagram illustrating the International Phonetic Alphabet.
The general principle of the IPA is to provide one symbol for each distinctive sound (or speech segment).[11] This
means that the IPA does not use letter combinations unless the sound being represented can be regarded as a sequence of two or
more sounds.[III] The IPA also
does not usually have separate letters for two sounds if no known language makes a
distinction between them (a property known as "selectiveness"[2]),[IV] and it does not use letters that represent multiple sounds, the way <x>
represents the consonant cluster [ks] in English. Additionally, in the IPA
no letters have sound values that are context-dependent, such as <c> in English (and most other
European languages).
The symbols of the IPA are 107 letters for consonants and vowels, 31 diacritics which
further specify those sounds, and 19 suprasegmentals, which indicate such
qualities as length, tone,
stress, and intonation.[II]
Letterforms
The symbols chosen for the IPA are meant to harmonize with the Latin
alphabet.[V] For this
reason, most symbols are either Latin or Greek
letters, or modifications thereof. However, there are symbols that are neither: for example, the symbol denoting the
glottal stop [ʔ] has the form of a
"gelded" question mark, and was originally an apostrophe.[VI] Indeed, some symbols, such as that of the voiced pharyngeal fricative [ʕ], though modified to
look more Latin, were inspired by glyphs in other writing systems (in this case, the
Arabic letter <ﻉ>,
`ain).[7]
Despite its preference for letters that harmonize with the Latin alphabet, the International Phonetic Association has
occasionally admitted symbols that do not have this property. For example, prior to 1989, the IPA symbols for click consonants were [ʘ], [ʇ], [ʗ], and [ʖ], all of which are clearly derived from Latin and Greek letters, as well as punctuation
marks. However, except for [ʘ], none of these symbols was reflective of
contemporary practice among Khoisanists (the main users of symbols for click
consonants). As a result, they were replaced by the less Latin-like but more widespread symbols [ʘ], [ǀ], [ǃ], [ǂ], and [ǁ] at the IPA Kiel Convention in 1989.[12]
Symbols and sounds
The International Phonetic Alphabet is based on the Latin alphabet, using as few non-Latin forms as possible.[4] The Association created the IPA so that the
sound values of most consonants taken from the Latin alphabet would correspond to
“international usage”.[4] Hence, the
letters [b], [d],
[f], (hard) [ɡ],
(hard) [h], [k],
[l], [m],
[n], [p], (voiceless)
[s], [t],
[v], [w], and
[z] have the values used in English; and the vowels from the Latin alphabet ([a], [e], [i], [o], [u]) correspond to the sound
values of Latin: [i] is like the vowel in machine,
[u] is as in rule, etc. Other letters may differ from
English, but are used with these values in other European languages, such as [j], [r], and [y].
This inventory was extended by using capital or cursive forms, diacritics, and rotation. There are also several derived from
the Greek alphabet ([β], [ɣ], [ɛ], [θ], [ɸ], [χ], and [ʋ]), though the sound values may differ.
[ʋ], for example, is a vowel in Greek, but an only indirectly related
consonant in the IPA.
The sound values of modified Latin letters can often be derived from those of the original letters.[13] For example, letters with a rightward-facing hook at the bottom represent
retroflex consonants; and small capital letters usually represent uvular consonants. Apart from the fact that certain kinds of modification to the shape of a letter
generally correspond to certain kinds of modification to the sound represented, there is no way to deduce the sound represented
by a symbol from the shape of the symbol (unlike, for example, in Visible Speech).
Beyond the letters themselves, there are a variety of secondary symbols which aid in transcription. Diacritic marks can be combined with IPA letters to transcribe modified phonetic values or secondary articulations. There are also
special symbols for suprasegmental features such as stress and tone that are often employed.
Usage
- Further information: Phonetic transcription
A transcription of the French word
ébauche ("sketch, unfinished work").
Although the IPA offers over a hundred symbols for transcribing speech, it is not necessary to use all relevant symbols at the
same time; it is possible to transcribe speech with various levels of accuracy. The most accurate kind of phonetic transcription,
in which sounds are described in as much detail as the system allows, without any regard for the linguistic significance of the
distinctions thus made, is known as narrow transcription. Anything else is termed broad transcription, though
"broad" is obviously a relative term. Both kinds of transcriptions are generally enclosed in brackets,[14] but broad
transcriptions are sometimes enclosed in slashes instead of brackets.
Two phonetic transcriptions of the word "international," demonstrating two distinctly different pronunciations.
Broad transcription only distinguishes sounds which are considered different by speakers of a language. Sounds that may be
pronounced differently between styles and dialects or depending on neighbouring sounds can be considered the "same" sound in the
sense that they are allophones of the same phoneme. When a
word is written as phonemes, it is usually enclosed in slashes. For example, the
American pronunciation of the English word "little" may be transcribed broadly using the IPA as /lɪtl/. This broad transcription merely identifies the separate phonetically relevant
components of the word, and does not indicate the variety of corresponding sounds. On the other hand, the narrow transcription
(placed between square brackets) specifies the way each sound is pronounced. A more narrow transcription of "little" would be
different depending on the way it is said: [lɪɾɫ], [lɪʔɫ], or [lɪːɫ] are just a few
possibilities.
Use in dictionaries
Many British English dictionaries, such as the Oxford Advanced
Learner's Dictionary and the Cambridge Advanced Learner's Dictionary, now use
the International Phonetic Alphabet to represent the pronunciation
of words.[15] However, most American (and some British)
volumes use their own conventions supposed to be more intuitive for
readers unfamiliar with the IPA. For example, the pronunciation-representation systems in many American dictionaries (such as
Merriam-Webster) use "y" for IPA [j] and "sh" for IPA [ʃ], reflecting common representations of
those sounds in written English.[16] (In IPA,
[y] represents the sound of the French u (as in tu), and
[sh] represents the pair of sounds in grass hut.)
One of the benefits of using an alternative to the IPA is the ability to use a single symbol for a sound pronounced
differently in different dialects. For example, the American Heritage Dictionary uses ŏ for the vowel in cot
(kŏt) but ô for the one in
caught (kôt).[17] American regional dialects without the caught-cot merger generally pronounce cŏt like IPA [kʰɑt] (with an open central unrounded
vowel) and côt like IPA [kʰɔt] (with an open back rounded vowel), whereas those with the merger pronounce the vowels ŏ and ô the same
way (for example, like IPA [ɒ] in the Boston dialect). Using one symbol for the vowel in cot (instead of having different symbols for
different pronunciations of the o) enables the dictionary to provide meaningful pronunciations for speakers of most
dialects of English.
The IPA is also not universal among dictionaries in other countries and languages. Mass-market Czech multilingual
dictionaries, for instance, tend to use the IPA only for sounds not found in the Czech
language.[18]
Educational initiative
There is some interest in using native speakers to produce sound and video files of sufficient breadth to completely
demonstrate all the speech sounds covered by the IPA. Such a project would encompass a large subset of the world's languages.
This would aid linguistic and anthropologic research, as well as help teach language learning. Specifically, the development of a
reference standard using the IPA (mirroring the idea of the Rosetta Stone) could be used
in order to preserve intact examples of the sounds of human language. For education, the IPA can help standardize resources which
prepare students and very young children (ages 6-36 months) for universal language acquisition through familiarization and
subsequent imitation of the breadth of human speech sounds.[19] Research by Flege, Mackay and Piske (2002) and Sebastián-Gallés, Echeverría and Bosch (2005) have
shown that early exposure to extra phonetic sounds and uses improves later comprehension and pronunciation (accent).
Use in orthographies and capital variants
- See also: Latin characters in
Unicode
IPA symbols have been incorporated into the standard orthographies of various languages, notably in Subsaharan Africa but in other regions as well. These include for example: Hausa; Fula; Akan;
Gbe languages; and Manding languages.
An example of capital letter forms for IPA symbols is Kabiyé of northern
Togo, which has Ɔ Ɛ Ŋ Ɣ Ʃ (capital ɔ ɛ ŋ ɣ ʃ). Other IPA-paired capitals include Ɓ/Ƃ Ƈ Ɗ/Ƌ Ə/Ǝ Ɠ Ħ Ɯ Ɲ Ɵ Ʈ Ʊ Ʋ
Ʒ.
The abovementioned and other capital forms are supported by Unicode, but appear in Latin
ranges other than the IPA extensions.
Letters
The International Phonetic Alphabet divides its letter symbols into three categories: pulmonic consonants, non-pulmonic consonants, and vowels.[20][21]
Each character is assigned a number, to prevent confusion between similar letters (such as ɵ and θ), for example in printing manuscripts. Different
categories of sounds are assigned different ranges of numbers.
Consonants (pulmonic)
-
A pulmonic consonant is a consonant made by obstructing the glottis (the space between the vocal cords) or oral cavity (the mouth)
and either simultaneously or subsequently letting out air from the lungs. Pulmonic consonants make up the majority of consonants
in the IPA, as well as in human language. All consonants in the English language fall into this category.[22]
The pulmonic consonant table, which includes most consonants, is arranged in rows that designate manner of articulation, meaning how the consonant is produced, and columns that designate
place of articulation, meaning where in the vocal tract the consonant is produced.
The main chart includes only consonants with a single place of articulation.
- Notes
- Asterisks (*) mark reported sounds that do not (yet) have official IPA symbols. See the respective articles for ad hoc
symbols found in the literature.
- Daggers (†) mark IPA symbols that do not yet have official Unicode support. Since May 2005,
this is the case of the labiodental flap, symbolized by a right-hook v:
.[23] In the meantime the similarly shaped
izhitsa (ѵ) is used here.
- In rows where some symbols appear in pairs (the obstruents), the symbol to the
right represents a voiced consonant (except breathy-voiced [ɦ]). However, [ʔ] cannot be voiced. In the other rows (the sonorants), the single symbol represents a voiced consonant.
- Although there is a single symbol for the coronal places of articulation for all consonants but fricatives, when dealing with
a particular language, the symbols are treated as specifically alveolar, post-alveolar, etc., as appropriate for that
language.
- Shaded areas indicate articulations judged to be impossible.
- The symbols [ʁ, ʕ, ʢ] represent either voiced fricatives or
approximants.
- It is primarily the shape of the tongue rather than its position that distinguishes the fricatives [ʃ ʒ], [ɕ ʑ], and [ʂ ʐ].
Coarticulation
Coarticulated consonants are sounds that involve two simultaneous
places of articulation (are pronounced using two parts of the vocal tract). In English, the [w] in "went" is a coarticulated consonant, because it is pronounced by
rounding the lips and raising the back of the tongue. Other languages, such as French
and Swedish, have different coarticulated consonants.
| View this table as an image |
| ʍ |
Voiceless labialized velar approximant |
| w |
Voiced labialized velar approximant |
| ɥ |
Voiced labialized palatal approximant |
| ɕ |
Voiceless palatalized postalveolar (alveolo-palatal) fricative |
| ʑ |
Voiced palatalized postalveolar (alveolo-palatal) fricative |
| ɧ |
Voiceless "palatal-velar" fricative |
- Note
Affricates and double articulation
Affricates and doubly
articulated stops are represented by two symbols joined by a tie bar, either above or below the symbols. The six most
common affricates are optionally represented by ligatures, though this is no longer official IPA usage,[14] because a great number of ligatures would be
required to represent all affricates this way. Alternatively, a superscript notation for a consonant release is sometimes used to
transcribe affricates, for example tˢ for t͡s, paralleling kˣ ~
k͡x. The symbols for the palatal plosives, <c ɟ>, are often used as a convenience for [t͡ʃ d͡ʒ] or similar affricates, even in official IPA publications, so they must be interpreted with
care.
| View this
table as an image. |
| Tie bar |
Ligature |
Description |
| t͡s |
ʦ |
voiceless alveolar affricate |
| d͡z |
ʣ |
voiced alveolar affricate |
| t͡ʃ |
ʧ |
voiceless postalveolar affricate |
| d͡ʒ |
ʤ |
voiced postalveolar affricate |
| t͡ɕ |
ʨ |
voiceless alveolo-palatal affricate |
| d͡ʑ |
ʥ |
voiced alveolo-palatal affricate |
| t͡ɬ |
– |
voiceless alveolar lateral affricate |
| k͡p |
– |
voiceless labial-velar plosive |
| ɡ͡b |
– |
voiced labial-velar plosive |
| ŋ͡m |
– |
labial-velar nasal stop |
- Note
- If your browser uses Arial Unicode MS to display IPA characters, the
following incorrectly formed sequences may look better due to a bug in that font: ts͡, tʃ͡, tɕ͡, dz͡, dʒ͡, dʑ͡, tɬ͡, kp͡, ɡb͡, ŋm͡.
Consonants (non-pulmonic)
Non-pulmonic consonants are sounds which are made without the lungs. These include clicks (found in the Khoisan languages of Africa) and implosives (found in
languages such as Swahili).
- Notes
- Clicks are double articulated and have traditionally been described as having a forward 'release' and a rear 'accompaniment',
with the click letters representing the release. Therefore all clicks would require two letters for proper notation: [k͡ǂ, ɡ͡ǂ, ŋ͡ǂ, q͡ǂ, ɢ͡ǂ, ɴ͡ǂ] etc., or [ǂ͡k, ǂ͡ɡ, ǂ͡ŋ, ǂ͡q, ǂ͡ɢ, ǂ͡ɴ]. When the dorsal articulation is omitted, a [k] may usually be assumed. However, recent research disputes the concept of 'accompaniment'.[25] In such approaches, the click letter represents both
articulations, there is no velar-uvular distinction, and the accompanying letter represents the manner of the click: [ǂ, ɡǂ, ŋǂ] etc.
- Symbols for the voiceless implosives [ƥ, ƭ, ƈ, ƙ, ʠ] are no longer supported by the IPA, though they remain in Unicode. Instead, the IPA uses the voiced
equivalent with a voiceless diacritic: [ɓ̥, ʛ̥], etc.
- Although not confirmed from any language, and therefore not explicitly recognized by the IPA, a retroflex implosive, [ᶑ], is
supported in the Unicode Phonetic Extensions Supplement, added in version 4.1 of the Unicode Standard, or can be created as a
composite [ɗ̢].
- The ejective symbol is often substituted for a superscript glottal stop in glottalized but pulmonic sonorants, such as [mˀ], [lˀ], [wˀ], [aˀ]. These may also be transcribed as creaky [m̰], [l̰], [w̰], [a̰].
Vowels
-
An X-Ray shows the sounds
[i, u, a, ɑ]
Tongue positions of
cardinal front vowels with highest point indicated. The position of
the highest point is used to determine vowel height and backness
The IPA defines a vowel as a sound which occurs at a syllable center.[26] Below is a chart depicting the vowels of the IPA. The IPA maps the vowels according to the position
of the tongue.
The vertical axis of the chart is mapped by vowel height. Vowels pronounced with the
tongue lowered are at the bottom, and vowels pronounced with the tongue raised are at the top. For example, [ɑ] (said as the "a" in "palm") is at the bottom because the tongue is lowered in this
position. However, [i] (said as the vowel in "meet") is at the top because
the sound is said with the tongue raised to the roof of the mouth.
In a similar fashion, the horizontal axis of the chart is determined by vowel
backness. Vowels with the tongue moved towards the front of the mouth (such as [ɛ], the vowel in "met") are to the left in the chart, while those in which it is moved to the back (such as
[ʌ], the vowel in "but") are placed to the right in the chart.
In places where vowels are paired, the right represents a rounded vowel (in which the
lips are rounded) while the left is its unrounded counterpart.
- Notes
- [a] officially represents a front vowel, but there is little
distinction between front and central open vowels, and [a] is frequently
used for an open central vowel.
- [ʊ] and [ɪ] were
written as <ɷ> and <ɩ> respectively in older versions of the IPA.
Diacritics
Diacritics are small markings which are placed around the IPA letter in order to show a certain alteration or more specific
description in the letter's pronunciation.[27]
Sub-diacritics (markings normally placed below a letter or symbol) may be placed above a symbol having a descender (informally
called a tail), e.g. ŋ̊.[27]
The dotless i, <ı>, is used when the dot would interfere with the diacritic. Other IPA
symbols may appear as diacritics to represent phonetic detail: tˢ
(fricative release), bʱ (breathy voice), ˀa (glottal onset), ᵊ (epenthetic schwa), oʊ (diphthongization). More advanced diacritics were developed in the
Extended IPA for more specific pronunciation encoding.
- Notes
- a^ With
aspirated voiced consonants, the aspiration is also voiced. Many linguists prefer one of the diacritics dedicated to breathy
voice.
- b^ Some
linguists restrict this breathy-voice diacritic to sonorants, and transcribe obstruents as
bʱ.
The state of the glottis can be finely transcribed with diacritics. A series of alveolar
plosives ranging from an open to a closed glottis phonation are:
Suprasegmentals
These symbols describe the features of a language above the level of individual consonants and vowels, such as
prosody, tone, length, and stress, which often operate on syllables,
words, or phrases: that is, elements such as the intensity, pitch, and gemination of the sounds
of a language, as well as the rhythm and intonation of speech.[28] Although most of these symbols indicate distinctions that are phonemic at the word level,
symbols also exist for intonation on a level greater than that of the
word.[28]
Obsolete symbols and nonstandard symbols
-
The IPA inherited alternate symbols from various traditions, but eventually settled on one for each sound. The other symbols
are now considered obsolete. An example is ɷ which has been standardised to
ʊ. Several symbols indicating secondary articulation have been dropped
altogether, with the idea that such things should be indicated with diacritics: ƍ for zʷ is one. In addition, the rare voiceless implosive
series ƥ ƭ ƈ ƙ ʠ has been dropped; they are now written ɓ̥ ɗ̥ ʄ̊ ɠ̊ ʛ̥ respectively.
There are also unsupported or ad hoc symbols from local traditions that find their way
into publications that otherwise use the standard IPA. This is especially common with affricates such as ƛ for t͡ɬ (the "tl" in
"Nahuatl").
IPA extensions
-
The Extended IPA, also often abbreviated as extIPA, is a group of symbols whose original purpose was to accurately transcribe
disordered speech. At the IPA Kiel
Convention in 1989, a group of linguists drew up the initial set of symbols for the Extended IPA.[29] The Extended IPA was first published in 1990, and modified over the next
few years before its official publication in the Journal of the International Phonetic Association in 1994 allowed it to
be officially adopted by the ICPLA.[30] While its original purpose was to transcribe disordered
speech, linguists have used it to designate a number of unique sounds within standard communication, such as hushing, gnashing
teeth, and smacking lips. The Extended IPA has also been used to record certain peculiarities in an individual's voice, such as
nasalized voicing.[2]
Aside from the extIPA, another set of symbols is used for voice quality (VoQS), such as whispering.
Sounds that have no symbols in the IPA
The remaining blank cells on the IPA chart can be filled without too much difficulty if the need arises. Some
ad hoc symbols have appeared in the literature, for example for the retroflex lateral flap
and the voiceless lateral fricative series, the epiglottal trill, and the labiodental plosives. (See the grey symbols in the PDF
chart.) Diacritics can supply much of the remainder, which would indeed be appropriate if the sounds were allophones.[31]
Consonants without letters
Consonant sounds are created by adding diacritics to letters with similar sound values. The Spanish bilabial approximant is
commonly written as a lowered fricative, [β̞]. Similarly, voiced lateral
fricatives would be written as raised lateral approximants, [ɭ˔ ʎ̝ ʟ̝]. A
few languages such as Banda have a bilabial flap as the preferred allophone of what is
elsewhere a labiodental flap. Similarly, a labiodental trill would be written [ʙ̪] (bilabial trill and the dental sign). Other taps can be written as extra-short plosives or laterals, e.g.
[ɟ̆ ɢ̆ ʟ̆], though in some cases the diacritic would need to be written
below the letter. A retroflex trill can be written as a retracted [r̠],
just as retroflex fricatives sometimes are. The remaining consonants, the uvular laterals and the palatal trill, while not
strictly impossible, are very difficult to pronounce and are unlikely to occur even as allophones in the world's languages.
Vowels without letters
The vowels are similarly manageable by using diacritics for raising, lowering, fronting, backing, centering, and
mid-centering.[32] For example, the unrounded equivalent
of [ʊ] can be transcribed as mid-centered [ɯ̽], and the rounded equivalent of [æ] as raised [ɶ̝]. True mid vowels are lowered [e̞ ø̞ ɘ̞ ɵ̞ ɤ̞ o̞], while centered [ɪ̈
ʊ̈] and [ä] are near-close and open central vowels, respectively.
The only known vowels that cannot be represented in this scheme are the compressed vowels,
which would require a dedicated diacritic, such as [ʏ̫].
Symbol names
-