Chapter 29. English phone lists

Relating phonemes to sounds is not obvious as people think. Even when one is familar with phone sets its easy to make mistakes when reading lists of phones alone. This is particularly true in reading diphone nonsense words. The table provided here are intended for both the experienced and inexperienced reader of phones, to help you decide on the pronunciation.

These tables are not supposed to be a substitute for a good phonetics course, they are intended to give people a basic idea of the pronunciation of the phone sets used in the particaulr examples in this document. Many simplifying assumptions have been made, and often aren't even mentioned. To the phoneticians out there I apologise, as much as the assumptions are wrong we are here listing atomic discrete phones which we have found useful in building practical systems, even though better sets probably exist.

29.1. US phoneset

Inspite of everyone telling you that there is one and only one US phoneset, when it comes to actually using one you quickly discover there are actually many standard one used by lots of different pieces of software, often the difference betwen them is trivial (e.g. case folding) but computers being fundamentally dumb can't take these trivial differences into account. Here we list the radio phoneset which is used by standard US voices in festival. The definition is in festival/lib/radio_phones.scm. This list was based on those phones that appear in the Boston University FM radio corpus with minor modifications. The list here is exactly those phones which appear in the diphone nonses words as used in the example explained in Chapter 19.

aa

fAther, wAshington

ae

fAt, bAd

ah

bUt, hUsh

ao

lAWn, dOOr, mAll

aw

hOW, sOUth, brOWser

ax

About, cAnoe

ay

hIde, bIble

eh

gEt, fEAther

el

tabLE, usabLE

em

systEM, communisM

en

beatEN

er

fERtile, sEARch, makER

ey

gAte, Ate

ih

bIt, shIp

iy

bEAt, shEEp

ow

lOne, nOse

oy

tOY, OYster

uh

fUll, wOOd

uw

fOOl, fOOd

b

Book, aBrupt

ch

CHart, larCH

d

Done, baD

dh

THat, faTHer

f

Fat, lauGH

g

Good, biGGer

hh

Hello, loopHole

jh

diGit, Jack

k

Camera, jaCK, Kill

l

Late, fuLL

m

Man, gaMe

n

maN, New

ng

baNG, sittiNG

p

Pat, camPer

r

Reason, caR,

s

Sit, maSS

sh

SHip, claSH

t

Tap, baT

th

THeatre, baTH

v

Various, haVe

w

Water, cobWeb

y

Yellow, Yacht

z

Zero, quiZ, boyS

zh

viSion, caSual

pau

short silence

In addition to the phone sthemselves the nonsense word generated by the diphone schema also have some other notations to denote different type of phone.

The use of - (hyphen) in the nonsense word itself is used to denot an explicit syllable boundary. Thus pau t aa n - k aa pau is used to state that the word should be pronounced as tan ka rather than tank ah. Where no explicit syllable boundary is given the pronunciation should be pronounce naturally without any boundary (which is probabaly too underspecified in some cases).

The use of _ (underscore) in phone names is used to denote consonant clusters. That is t_-_r is the /tr/ as found in trip not that in cat run.