Posted: 29th Oct 2002 00:04
You mean make the sounds at run-time to minimise sound file useage? That's really quite hard, but it can be done using memblocks. I mean it's hard to write the arrays to store the sounds in.
http://www.codeproject.com/audio/
That site contains some c source code which does text to speech. Here are some of the sounds you should be using, taken from the site:
// Mouth Mapping Array (from Microsoft's TTSApp Example)
const int g_iMapVisemeToImage[22] = {
0, // SP_VISEME_0 = 0, // Silence
11, // SP_VISEME_1, // AE, AX, AH
11, // SP_VISEME_2, // AA
11, // SP_VISEME_3, // AO
10, // SP_VISEME_4, // EY, EH, UH
11, // SP_VISEME_5, // ER
9, // SP_VISEME_6, // y, IY, IH, IX
2, // SP_VISEME_7, // w, UW
13, // SP_VISEME_8, // OW
9, // SP_VISEME_9, // AW
12, // SP_VISEME_10, // OY
11, // SP_VISEME_11, // AY
9, // SP_VISEME_12, // h
3, // SP_VISEME_13, // r
6, // SP_VISEME_14, // l
7, // SP_VISEME_15, // s, z
8, // SP_VISEME_16, // SH, CH, JH, ZH
5, // SP_VISEME_17, // TH, DH
4, // SP_VISEME_18, // f, v
7, // SP_VISEME_19, // d, t, n
9, // SP_VISEME_20, // k, g, NG
1 // SP_VISEME_21, // p, b, m
This list compares mouth positions against sounds.
Look at this site for phonetic notation:
http://www.phon.ucl.ac.uk/home/sampa/english.htm
Hope that's some help.