www.unifon.org/number-of-phonemes.html short version
.......
On the Number of Phonemes in the English language
Counting the number of  phonemes is like counting the number of colors in a rainbow. When we try to break up a continuous spectrum into discrete units, we move to the realm of fuzzy logic1

by Steve Bett

References Bett- How many phonemes?  23  Brown-JSSS 27/2000/1
Draft of an article for JSSS 30/2001/2
[print version - word document] [final draft]

After reading the eloquent arguments of linguist, Adam Brown [ JSSS 27/2000/1] , I came away almost convinced that a dictionary pronunciation guide was impossible. 

Brown's intention was to explain why it was impossible to specify the exact number of phonemes in English.  He expanded on the argument I had published in the Simpl Spel Newsletter [SSN 4/2000] . I suggested that, while 46 [21v 25c] was a good estimate of the number of phonemes in English speech, the only number that people would be likely to agree on would be the number of uncombined phonemes [36] [14v 22c].1   I thought there could be agreement on 14 pure vowels but not on the exact number of combined vowels. 

Brown doubted the possibility of limited agreement.  He added that if the exact number of vowels cannot be established, then speech sounds cannot be visualized and the alphabetical principle becomes an unrealistic ideal. 
 
The International Phonetic Alphabet isolates 14 pure vowels and 7 combinations
This chart shows the ipa symbol for 14 phonemes and 4 ascii representations
Above are the 21vowels isolated by D. Jones and used by most scholars [e.g., Wijk and Wells] and many dictionary pronunciation guides [e.g., Longman's Dictionary of American English].   14 of these vowels [the ones shown on the right] can be considered to be uncombined or pure vowels.  Well's SAMPA notation, a machine readable  ASCII-IPA, is listed in the second column.  The IPA special symbols are listed [column 1] when they differ from SAMPA, .  SAMPA uses upper case letter for the short vowels, O for awe is the one exception.  Unifon uses short letters for the short vowels and upper case letters for the long vowels.  Englik and Spanglish use double letters [digraphs] for the extended vowels.

Brown's presentation reminded me of the following definition of an alphabet:

...meaningless marks arbitrarily associated with meaningless sounds.

After such a definition, one might conclude that written communication is impossible. 

By defining the goal as  a strict one-to-one correspondence to an exact number of phonemes, Brown is able to build a similar strong case against the possibility of a phonemic transcription. 

Almost every argument that Brown brings up is valid.  A strict one-to-one correspondence requires an exact number of phonemes. If we cannot agree on the exact number of  phonemes then we cannot have a corresponding alphabet or phonemic transcription. 

If we set the level of precision high enough, a one to one correspondence between graphemes and phonemes can  be shown to be impossible to attain. 

In a [04June2000] letter written to the saundspel phonology forum, Michael Avinor  put it this way, "Speech is an analog signal and writing is a digital signal.  To talk about a phoneme we have to cut up continuous speech into discrete units.  Digitizing speech can preserve only a limited part of the speech information." 

Even more information is lost when speech is visualized or represented graphically.  Nevertheless, the fragment of the original that remains can be enough to accurately convey information. 

Sounds are not the only perceptions with fuzzy boundaries

Sounds are not the only things in our perceptual world with fuzzy boundaries. Has anyone ever claimed that a name for a discrete segment of the sound spectrum was any more exact than a color name for a desecrate segment of the color spectrum?  The flaw in the Brown's argument is the implied insistence on a high level of precision.  If we raise the bar of precision high enough, then most ideals can be characterized as unobtainable and unrealistic. 

We cannot see a particular color, e.g., blue, any more than we can hear a particular phoneme.  This does not mean we cannot discriminate or sort blue and yellow objects.  We can be presented with  an instance and then asked to judge whether or not it a member of a category or class of 'blue' things.  As we get near the boundaries [e.g., Is a green ball sorted into the pile of blue objects or yellow objects?], the judgments become more uncertain.  However, there are some modal or mid range instances that nearly everyone will agree is a particular primary color.  There are even instances that nearly everyone will agree is blue-green.  When the blends become more complex, however, agreement becomes harder to achieve. 

The Phoneme - fuzzy by definition

The same is true for the abstract sounds we call phonemes.  Phonemes have been defined as the smallest unit of sound capable of changing the meaning of a word.   The substituion of b for p  in [pit] changes the meaning. Therefore [b] is a phoneme and the p:b distinction is phonemic. [pit : bit] are called minimal pairs.  The substitution of @ for i in exact changes the meaning. 

A phonemes is not really a unit in the sense of a single sound.  It is a range of related sounds treated as a unit -- treated as if they are the same sound.  What we hear are phones. 

The same person will pronounce the same vowel in acoustically different ways in association with different consonants.  People from different speech communities will rarely pronounce the same vowel the same way. 

Three people will have three different concepts as to what phones [physical sounds] should be accepted or rejected as members of a given phoneme.  However, there will be an area where the three concepts overlap - an area of agreement.
ipa alphabet for English 38 phonograms 38 - 40 IPA Phonograms

There are 36 pure phonemes in spoken English. A phonemic system will have 36 symbols [phonograms] for each of these sound segments.
  This chart includes the consonants /tS/ traditionally represented as Ch and /dZ/  represented with a J

  This chart provides two ways to write <herder>. h3d@ and h3'd@' represent the non-rhotic and rhotic dialects of English.

  The main disagreements are with how to apply these symbols, e.g.,  baeth or bath 

SAMPLE: Th@ hard nozd h3rd@r h3rd h3r scim, "bl^ddy m3rd@r!" fram tha ^pp@r cheimb@r @v th@ maensh@n. The two problems with the IPA extended alphabet solution are [1] The new symbols are not supported making It difficult to type and [2] It is not easy to read without a key. The first problem was solved by creating an ascii-ipa such as SAMPA. The ascii-ipa that looks the most like traditional English is Saxon Spanglish. Spanglish uses digraphs instead of new symbols.

100% agreement is possible for 14 clear instances 
of uncombined [primary or pure] vowel phonemes

Any time you try to break up a continuum or spectrum into discrete units, there will be problems at the boundaries.  Not everyone will slice the continuum at precisely the same point. 

However, just as it is possible to achieve nearly 100% agreement on instances of the primary colors, it is possible to get nearly 100% agreement on instances of the primary or uncombined phonemes. 

Words are not pronounced the same in different dialects of English which means that a phonemic representation of one dialect may not correspond to the the speech in a different region.  A phonemic script always presumes a base dialect and such alphabetical writing systems will not always be a reliable guide to the pronunciation of another dialect. 

Thus some people will have to learn a spelling dialect in addition to their local dialect.  This situation is also true in Spain and Italy.  The base dialect, Castilian, for instance, does not always correspond to the local dialect.

Not everyone will agree where i sound stops and e sound begins.  The disagreements will increase when the vowel is unstressed. 

The last phoneme in <vegetable> can be transcribed as vejt@bl, vejt@b@l, or vejt@bUl.   The first phoneme in <because> can be transcribed as becoz, bicoz, or bcoz. The less stress or smaller the discrete unit, the less agreement there will be.

Breaking up the vowel sound continuum into discrete units is analogous to breaking up the color spectrum into discrete colors.  Adding gray into to a color mix could be an analog of removing stress in speech.  Gray [or the lost of brightness and contrast] reduces the ability to discriminate colors just as the loss of stress reduces the ability to discriminate vowels. 

Few people can detect the difference between three pronunciations of [pencil], e.g., pensil, pens@l, and pensl.  Yet most will be annoyed by the written substitution of  pens&l,  pencl,  pensl,  or  pensal  for [pencil].  The objection to the loss of  familiar "eye rhymes," however, is another issue.

When it comes to blends, one may have to make some arbitrary decisions.  Sweet and Jones, for instance, decided not to include triphthongs [or trigraphs] in their list of 46 phonemes.  Many phonemic systems, such as Unifon and Truespel,  arbitrarily limit the number of phonograms to 40. 
 

Alternative Phonemic Transcriptions
our are or -er - in four notations 
the buttons are active, click if you want more detail on a notation
Notation & number of phonograms  - - - - - - - - - - - - - - -  Sample transcription
Unifon keyboard
qr kOrvet iz mI fAvcrit kor.
40
Unifon2 keyboard
aur korvet iz mY fAvxrit kqr
40
  42
Our corvett iz mai feivrit caar.
46
aur cr'vet iz mai 'feivrit caar
  40
our corvvet iz mie faeverit caar

Linguists, such as Daniel Jones, broke the sound spectrum down into three parallel segments [see chart].  The parallel segments have been called tense, lax, and rounded.  The same sound may have three different expressions depending on muscle tension and rounding.

The other dimension of speech sound goes by several names.  One reference is to the jaw position:  open, half open, or closed.  Tongue position during vocalization [front and back] are also used.   Typically four levels of openness are isolated

Finally, there are glided blends of two sounds [diphthongs] that start in one position or cell of the chart below and end up in another. ai, for instance, is a combination of @ or a: and i or i:  [@i] or [ai = aah+ee].

One phono-graphic representation cannot cover all the dialects of English

This is a different issue than the one just discussed.  It is not a question of not being able to recognize clear instances of a phoneme.  Rather it is a case of which phonemes should be used with particular words. 

If the symbol represents a speech sound then when that sound changes the symbol has to change.  This is true unless there are some symbols that merge two sounds.  [o] for instance, could represent both the short /aa/ and the short awe. If one wanted to be specific about the sound, then another symbol coud be employed, e.g. au or Q,  as in  "ai baut aul our aarmy cots."  This would be an easy convention to add to the truespel notation [www.truespel.com] since the o is not used alone.

How do you spell a word that has three different pronunciations depending on the dialect of English someone speaks.  Phonemic notations require a base pronunciation. 

The following tables contrast two regional dialects of English.

From the page at  www.unifon.org/1600spelling.html 
A comparison of the southern and northern dialects of British English
Regional 
Pronunciations
words in list pronounced the same different pronunciations different 
pronunciations
Northern Pronunciation short u
as in hook
short u hwk
as in hook
u:
as in hoop
bush 
butcher
could 
courier 
cushion 
pudding 
put 
wolf
mud 
putt 
rush 
thudding 
touch 
wonder
hook 
book 
cook 
rook 
shook 
took
Southern Pronunciation short u
as in hook
^ L short u 
as in up
short u   hwk
as in hook
Those speaking the northern dialect would pronounce put and putt the same.  But they would not rhyme put and hook.  Words spelled oo are pronounced as in hoop.   In the northern british dialect, kook and cook rhyme.
 
Regional Dialect of English
Narrow transcription of speech
northern british  "I wwnder hau yu cuuk mwd paiz."
southern british  "Ai wunnda hau yu cwk mudd paiz."
general american [GA]  "Ai wunnder hau yu cwk mudd paiz."
truespel  [GA]  "Ie wunder hou yue cook mud piez."
unifon 2  [o=Q/q] [oo=u/v]  [cr=c/cr]  "I wondcr hau U cook mcd pIz"

ASCII Notations from SSN June 2001

Traditional Spelling American
click & listen
British
Spanglish
Englik IPA ASCII-IPA Unifon Merriam
Webster
SAMPA
Father faather faatha faadhx fa:d' faD@ fxDc faD& faD@
Bother baather botha bodhx bod' boD@ boDc ba:D& bAD@
Farther faarther faatha faadhx fa:d' faD@ fxDc faD& faD@
Brother brvther brotha bradhx brLd' brVD@ bruDc brVD& broD@
ASCII IPA notationby Evan Kirshenbaum for sci.lang newsgroup
http://www.hpl.hp.com/personal/Evan_Kirshenbaum/IPA/english.html
 

Not all digraphs are true diphthongs

Not all digraphs are blends or glides. ae, ei, ou and aw are not necessarily blends as would be suggested by their common representation as digraphs in some notations.

One can produce  sounds which will be interpreted as members of these phonemes without moving one's mouth or position of one's tongue. [ae, ei, ou and aw] pronounced either as  pure vowels or as diphthongs will be understood as allophones of the phonemes  represented by these digraphs. 

Were there  an abundance of vowel shapes or graphemes, these four sounds typically referenced by a digraph would have been represented with one symbol - as they are in the IPA.  However, in the case of /ou/ there would be dialects where /@u/ would be a more accurate narrow transcription of regional speech.

As it happens, the pure vowel ei is almost indistinguishable from the blend of e [eh] + i [ee]. The pure vowel aw [as in law] is pronounced in various ways in different dialects.  The diphthongal pronunciation [ aa+U] can be understood in context. 

The pure vowel O or Spanish O can be distinguished from the dipthong /ou/ [awe+oo] but both can be readily understood.  The combination, @U, would be closer to the big O sound but this is not a common representation in TS. 

ou [awe-oo]  for O is not as close or mid range as ei [eh-ee]  for the pure E sound.  That is, it would be easier to distinguish the pure O sound from @U or ou than it would be to distinguish the monothongic or uncombined pronununciation of the phoneme referenced by ei from its alleged component sounds. ei is closer to a mid range pure A sound  than @U is to a mid range pure oh sound.

[ai, oi, iu/yu and au] are true glides or  blends of two-sound [diphthongs].  These three sounds cannot be produced without moving ones mouth.


Vowel Diagram approximating D.Jones' Quadralateralsee graphic above

tung - jaw
-
high-close 
mid-close 
mid-open 
low-open
High - Low Tongue:   Closed - Open Jaw
FRONT- tense      mid lax       rounded- BACK
i:   EEL y          i
iu  yu  USE
U-w                  u:
  é  APE  ei         e
L   -up
au-æu  OUT   O-ou
    e       3:her
@ - -ago
oi  oil          o  awe
    ae       ai I
aa  - car
o cot       o 
red = unstressed, .blue= anglo forms [see Unifon]

The primary phonemes that are obscured in the traditional orthography can be problematic. [ ^/@,  3:r/@r, dh/th, ng/n-g ] .  Some transcription systems see no need to reveal obscured phonemes.  As Brown observes, these obscured phonemes  usually carry a low functional load. 

Notations and Dialects

notation  Tradtional GA SBS SBS
example  herder   hurrder  haada* or  hudda   / 'h3:d /
example  murder  murrder  maada  or  mudda  / 'm3:d /
example  upper   upper  aapa    or   uppa  /'Lp /
example  hurry   hurry  haary   or   hurry  /'h3:ri/
example  bother  bother / baather  botha or  bawhtha  /'bod /
example  bury   tire  berry   tair  burry      taa  /'beri / /'bLri /
example  airy ferry  erry  ferry  erry/eiry  ferry  /'eri /  /'feri / 
*Spanglish has to merge 3: and aa to represent non-rhotic  RP or SBS - airy can be pronounced as in marry 

With a phonetic notation, it is relatively easy to represent a dialect.  The goal of a script such as Spanglish, however, is to ignore dialect differences.  There are also several related ways to spell a pronunciation.  hurrder shows stress, herder does not, hudda corresponds to IPA
/'hLd / and is close to /'h3:d /

The basic difference in the minimal pairs [^/@] ,  [3:r/@r]  is stress.  If the orthography has  another mechanism for representing primary stress [as with Truespel], the need to distinguish these vowels with different letter shapes would be less urgent. 

The minimal pair in the/thug or thy/thigh can be easily distinguished but there are only a handful of words whose meaning would change if these consonants were mispronounced, e.g.,  voiced instead of left unvoiced. [see voicing]

Minor differences in pronunciation

Brown makes much of regional differences in the pronunciation of the vowel in some words. The pronunciation may be evenly split between two slightly different sounds.  [dog], for instance,  can be dawg or daag.  [bath] can be baath or baeth.  These are real problems but they need not lead to paralysis. 

In Spanglish, dog would be written [dog] and pronounced [dawg].  The o is very short and closer to [awe] than [ah].  This is close enough for speech recognition and close enough for a phonemic orthography.  To express the pronunciation /da:g/, Spanglish uses daag or dogg.  To transcribed southern drawl, dog would be written <dawg>.
 
FIRE TOWER [IPA] fai'  tau'       fair taur [slurvian faa taa    fa:r ta:r
Traditional Orthography              British                General. American                       British            GA
OUR CAR WASH [IPA] au' ca: wa:S   aur ca:r wa:S aa kaa waash     aar caar wosh

Finer distinctions between dialects are always possible

IPA limits the number of vowels to 21  and lists  26 consonants.  Romik arbitrarily limits the number of phonemes to 25 vowels and 25 consonants.  It is almost always possible to make finer distinctions even when allowable distinctions are limited to minimal pairs.

The generic saundspel egroup notation, Romik,  includes a purposely ambiguous a, o, and 'r.  These can be specified for particular regional dialects such as [GA] but they can also be left broad and vague in cases where the pronunciation dictionary gives two nearly equally common pronunciations or when the pronunciation dictionaries for [GA] and [RP] disagree.  It is not unusual to have two correct pronunciations of one spelling and both can be understood. 

The goal is an orthography almost as good as existing pronunciation guides

Although Brown overstates his case,  he makes some good observations.  There is no point in striving for a transcription system that is more precise than existing pronunciation guides.  Pronunciation guides will not always be based on the same number of phonemes and will not always be in 100% agreement.  A pronunciation guide for British English will be slightly different from a pronunciation guide for General American.  The differences are addressed in Kelley's vowel chart for Englik and have been discussed on the saundspel phonology forum. 

However, all  pronunciation guides will be more alphabetic and phonemic than the traditional writing system.  The goal is to get closer to the ideal, not necessarily to achieve it.  The goal in many minds is simply a writing system that is as consistent and alphabetic as Italian or Spanish.  This goal clearly falls short of a "strict one to one correspondence."  Most attempts to regularize spelling will usually bring the writing system closer to the ideal. 

There are two types of goals, one is a simple expression of the desired direction the other involves the expectation that the goal will be achieved.  Brown argues that since the goal of a perfect onetime correspondence between phonemes and graphemes is unobtainable the goal should be abandoned.

If there are those who think that we can actually count the number of colors in a rainbow, then the point is well taken.  However, the majority of orthographies are simply trying to find a better way to do what has already to been done in other languages. 

A majority would be satisfied if English could be written in as transparent a way as Italian. 
Agreement on 36 uncombined phonemes is possible
14 yellow vowels - 22 teal green consonants
  Some orthographies will have a unique phonogram for you
  and  haw.   They are clearly combinations.  White indicates 
  an unstressed vowel: Y, ER, W,RLMN are all in white.
lei is listed as a pure vowel rather than a diphthong as in some charts.  vein  [Venn]  their [three]  they. *y when not followed by a vowel is exclusively used as schwa, an unstressed /i:/ e.g.,  very

semivowels [last column]  include  r l m n y w v when these letters are not followed by a vowel. The first 4 are always unstressed which is why Jones invented the shwer phonogram [ 3:].  Spanglish doubles the R to represent the stressed 3: This suggests that the nonrhotic [3] is identical to v /L/ which is not exactly true but close enough given the fact that many pedagogical charts ignore these obscure vowels.. 

urr [3'] as in murrder is a pure vowel missing from many lists of English vowels.  The non-rhotic pronunciation is very close to aa and u.. Henry Sweet reprsented this stressed mid lax vowel as a double schwa [@@] instead of /3/.

Are there 48 sounds in English as the chart indicates?  Certainly there are clear instances of each sound category but this does not mean that they all have to be unique phonograms.  Any number greater than 36 is somewhat arbitrary although a good case can be made for including the combined phonemes  ch, j, as unique phonograms.  Most systems include the diphthongs: ai, ou, and oi.  Many refer to /ai/ as a long vowel as if could be produced without moving one's tongue.  40 phonograms is the most popular number.  Note that the two vowels in [herder] are listed as pure vowels.  The basic difference between the two is stress. M-W represents this sound as ['&] a stressed mid lax vowel.  [er/urr] is written as if it were an r-combination but it is a single sound /3:/.  The other r-combinations cannot be articulated as a single continuous sound. 

According to the chart and the two letter representation, [ei] looks like a diphthong.  However, [ei] is not a diphthong in the same sense as ai, oi and au.  [ei] can be pronounced as a single unobstructed and unaltered tone. It can be a pure vowel.  Phonemically, however,  it would not be distinguished from the e+i blend in the word vein.  [ow] is another case of a possible pure vowel being represented as a diphthong. A majority of English speakers, however, probably pronounce [ow] as a diphthong. If the orthographer's goal is to minimize the number of symbols, he might treat both [ei] and [ow] as diphthongs. IPA does not.

In GA, the letter o in most words is pronounced the same as [aa].  Pronunciation guides for American English will often use the same symbol for [o] and [aa].  As shown above, <bother> is pronounced /baather/ in GA.

The pan-dialect solution is to include all the phonemes an use a hybrid base dialect.  bother would have a British flavor in the first syllable and a rhotic American flavor in the second.

The pan-dialect solution is to keep the short o but allow two interpretations.  In the writing system, o would be a little ambiguous.  In the pronunciation guide another set of symbols [e.g. q for the turned a and Q for the British short o] would be used. 

In Spanglish o=awe and and oCC = aa.  doter = daughter and otter =aater

Conclusion

We cannot count the number of vocalic phonemes in English speech any better than we can count the number of colors in a rainbow.  However, just as we can identify the primary colors in the rainbow, we can identify 36 clear instances of the primary or uncombined phonemes in speech [14 pure vowels, 22 pure consonants].

As we try to make finer distinctions, unanimity of opinion declines.  There will never be much agreement on the exact number of combined vowels.  Most people will want unique phonograms for [ch-tsh], [j-dzh], and [ai] bringing the total to 39. [oi], [au] and a few r- combinations should also be singled out.  [46] is a good number of pure and combined phonemes.  This would be enough to represent the sounds in both British and American dialects. [IPA-40 chart]

As the chart above shows, we can get by with 42. The exact identity of unstressed vowels will continue to be subject to different interpretations.  People will continue to argue about such things as [1] what would be the best way to represent unstressed sounds  [to schwa or not to schwa...]. [2] should the redundant kqx be in the alphabet, [3] should both the voiced and unvoiced [th] be included, etc.

There will be words that continue to be pronounced uniquely in a particular dialect.  Thus a transcription system based on General American with allowances for Southern British may not always accurately represent the pronunciation of some words in these dialects.  It is not that these dialects have any more phonemes [although this is a possibility], they just apply them differently in a few words.  [Review dialect transcription]

The goal is to develop a workable writing system for English that is as good as the Italian writing system is for the Italian language.  Unlike the goal of perfect one-to-one correspondence between graphemes and phonemes, this goal is attainable.  What prevents its realization is not the elusive goal but the fact that any consistent system will respell 60% of the words in English and that most of these respellings look odd to those adept in the traditional writing system.  Some respellings will "offend the eye."  For those who have acquired a high level of word pattern recognition, respelling seems to obliterate certain distinctions isolated by heterographic homophones [know, no]  [dough, doe].

One symbol per sound can remain the stated goal without the expectation that it is the kind of ideal that can ever be fully attained. 

There is certainly no need to abandon this idealistic goal at this point.  It clearly defines the correct direction.  When the English writing system becomes as good as one of the systems used as a pronunciation guide in a dictionary, then the quest can be abandoned. 

Pronunciation guide spelling that is near 100% predictable is about as close as one can expect to get.  Beyond this we quickly reach a point of diminishing returns.  The goal is not to be better than the dictionary pronunciation guide but to approximate it with a practical everyday writing system devoid of ukknsukpported special characters and complicated diacritics. 

We are trying to come up with the best possible visual representation of the abstract phonemes that people have in their heads.  We are trying to achieve a system or representation that is nearly isomorphic with the phonological structure of English speech. We will never reach this goal.  Fortunately, a system that is less than ideal will be  "good enough".  A writing system for English that is as good as the writing systems for Italian and Spanish will be fine.

There is no perfect perfect graphic representation of speech sounds.  Since the writing system is not used to capture subtle differences between different dialects, the system does not have to be as detailed as IPA.  As good as Spanish is quite adequate for English. 

In  linguistics we are working in a realm of fuzzy logic not aristotelian logic where everything is either black or white . . . true or false. 

Brown points out all of the limitations of phonemic spelling and then concludes that since the goals is elusive it should be abandoned.  In building a better system for a broad transcription of English, there is a point of diminishing returns.  This point will be reached long before we have to become concerned about the precision of phonemes or the suitability of a particular base dialect. 

One symbol and only one per sound should remain the goal.

Definitions

fuzzy logic - in classical logic everything was black or white, true or false.  fuzzy logic recognizes a middle ground, e.g., usually true. Fuzzy logic is a superset of conventional (Boolean) logic that has been extended to handle the concept of partial truth -- truth values between "completely true" and "completely false".  It was introduced by Dr. Lotfi Zadeh of UC/Berkeley in the 1960's as a means to model the uncertainty of natural language. [more]
The Sony PalmTop apparently uses a fuzzy logic decision tree algorithm to perform
handwritten (i.e., computer lightpen) Kanji character recognition.

phoneme - a difference in sound that makes a difference in meaning - a range of sounds treated as the  same sound.  A phoneme is abstract concept or category - you cannot see, touch, or hear a phoneme but  you can point to instances.  A phoneme is not one sound but a family of sounds, especially when more than one speaker is involved.  A phoneme is an area.  All instances in that area are referred to as allophones or diaphones.

Phonemes are language specific.  Where English speakers distinguishes two  phonemes
[ lid/rid ], speakers of other languages may hear only one.  R is not distinct from L in Japanese. 

Phonemes are called the smallest unit of meaningful sound in a language. An orthography where the  letters represent sounds is called phonemic. 


Q U I Z
How many phonemes are there in the word brought

To answer the question, look it up in a pronunciation guide and count the phonograms.
b-r--t    Unifon brxt    Spanglish brot    Truespel  braut    Spanglish and Truesepl are not unigraphic and this can distort the count - it is not a simple count of the letters.  Answer: 4

How many  phonemes in the word thorough

The dictionary says  qer'u.  Looks like 5.  This is a little tougher question because it is uncertain if ['u] is one pure vowel or a blend of two phonemes.   The two sounds are close enough to be allophones.  Unifon TerO   Spanglish thero   Truespel thheroe.   Xenglik TerO.  Unifon's one sound per symbol design suggests that it is probably the best transcription system for easy counting.



Notes: 

Writing in 1891 E.V. Graff presented a phonetic alphabet for 37 elementary sounds.  This is the same as the one above except for the addition of hw as in when and where. 

References

Bett, Steve. 2000.  Can we pin down the number of phonemes in English: Simpl Speling Newsletter, April, 2000, p. 7. [phon-inv]  [old version at fortunecity]

Bett, Steve. 2002. On the number of phonemes in English. JSSS, spring, 2002

Brown, Adam.  2000.  The number of phonemes in English: not a simple answer to a simple question.  JSSS 27/2000/1 pp. 11-13

Brown, Roger. 1989. Words and Things. Glencoe Free Press.
 
 

Wells, John.  SAMPA- speech assessment methods phonetic alphabet, 1987  http://www.phon.ucl.ac.uk/home/sampa/home.htm

How many phonemes?   2 3 long version of the published article.



Comments

LESS STALK STRINE  [VY]

Two wimin ar tauking"
  A. "Carmen F T Withers. We reving frenzova. Yeckered
         calm strife from work."

  B."Theng Saula Syme butter monnner diet.  I fed a
        bitifer gairstrick stummick lightly.
        Spin plier nuppagennerbit. Arlga mauve rafter.
        Oliver bye tweet first."

Can Spanglish cope with that?

[SB] I don't think that any phonemic notation would have
any trouble with strine.  Such consistent notations
would be much better suited for narrow transcriptions
than tradspel.

However, Ozspeak would be as problematic for 70%
solutions as it is for TO [a 40% solution].  [40%
consistent and phonemic]

schism - skizm
genuine - jenuin
less stalk strine - lets tok [o]strayn
o-'strAl-y@n, ”-, @-
Louisiana - luuzyanna [local]
            Louisiana - lweezy-anna [ozspeak]

We are talking about spelling citation
[over-] pronunciation rather than capturing
the nuances of regional speech.

If we spelled as we speak, most regional dialects
would not be understandable to outsiders.

I challenged the group
to transcribe an IPA transcription and most had
problems because the words were sometimes run together
and syllables were dropped and vowels were shifted
toward a mid lax pronunciation.

REF: http://www.unifon.org/36fonemes.html#ortho
         www.unifon.org/writing-samples.html



[NS]  I think its not necessary to indicate the most frequent stress pattern [stress onthe first syllable in English] but from a strickly fonological point of view, it would be justified to always indicate stress because stress is phonemic in English.

Of course, first syllable stress is so dominant that it would be to mcuh to ask, I think, and remember that spanish spelling never indicates stress on the penultimate syllable [although there are some rules].

REF:  Is Stress Phonemic?

http://www.unifon.org/phon-inv-art.html
 


Expanded References:
Aaron, P.G.  and R.M. Joshi. 1989. Reading and Writing in Different Orthographic Systems.  Kluwer Academic Publishers in Cooperation with NATO Scientific Affairs Division.
Albrow, K.H. 1972. The English Writing System.  London: Longmans
Barbe, Walter et al. [eds.] 1982 Spelling-Basic Skills for Effective Communication. Columbus, OH: Zaner Bloser
Brown, ROger. 1988. Words and Things. Glencoe Free Press
Carney, Edward.  1994. A Survey of English Spelling.  London: Routledge
Coulmas, Florian. 1989. The Writing Systems of the World. Oxford: Basic 
Coulmas, Florian. 1996. Encyclopedia of Writing Systems. Oxford: Blackwell. 
Crystal, David. 1995. Cambridge Encyclopedia of the English Language
Dewey, Godfrey. 1971. English spelling: Roadblock to reading. NY: Teachers College Press.
Dewey, Godfrey. 1970. Relative Frequency of English Spelling. NY: Teachers College Press. 
Mazuiklewics, Albert. 1970. in Spelling
Haas, W. 1970. Phonographic Translation. Manchester: Manchester University Press [See also Alphabets for English]
Hanna, Paul R., Hanna, J.S. Hodges, R.E. & Rudorf, E.H. (1966). Phoneme 
   Grapheme Correspondences as Cues to Spelling Improvement, 
   Doc.OE-32008, Washington, D.C.: U.S. Dept. of Health, Ed., & Welfare. USGPO
Hanna, Paul (1992) Spelling: Structure & Strategies. University Press of American
Jones, Daniel. 1950. The Pronunciation of English. Cambridge: Cambridge 
   University Press 
Lindgren, Harry. 1969. Spelling Reform: A New Approach. Alpha Books
McGuinness, Diane 1997. Why Our Children Cant Read. Free Press
Martin, J. H. & Friedberg, A. 1986, Writing to Read. NY, Warner Books 
Pitman, James & St. John. 1970. Alphabets and Reading. London: Pitman
Scragg, D.G. 1974. A History of English Spelling.  Manchester: Manchester 
   University Press


14 pages
© 2000 BETA Information Design