

The latter had been emphasized throughout this work, which in turn contributed Segments chosen should capture all the transient and transitional information.

In addition toīeing part of computationally manageable inventory of items, the synthesis The number of segments and the time required toĬover the language increase steadily from word to the phoneme. In the case of shorter units, such as the This involves a tradeoffīetween longer and shorter units. Units or segments that result in smooth concatenation. The design of any concatenative TTS system is to select the most appropriate Childrenįind learning more fun when their typed inputs are mapped to vocalized outputs. Whether it is used as a language learning aid or as a vocal aid. Any TTS should be appealing to the child user IntroductionĪre various critical factors to be considered while designing a TTS system that The quality of the synthesized speech was evaluated using the mean opinion score (MOS). This can be used not only as an interesting language learning aid for the normal child but it also serves as a speech aid to the vocally disabled child. This inexpensive TTS system was implemented in MATLAB, with the synthesis presented by means of a graphical user interface (GUI), thus making it child friendly. Sufficient degree of customization and generalization catering to the needs of the child user had been included through the provision for vocabulary and voice selection to suit the requisites of the child. Linguistic analysis was used to reduce the algorithmic complexity instead of signal processing techniques. Here reduced memory is achieved by the concatenation of phonemes and by replacing phonetic wave files with their LPC coefficients. Most existing TTS systems are unit-selection based, which use standard speech databases available in neutral adult voices. Direct waveform concatenation and linear prediction coding (LPC) are used. This paper discusses the implementation details of a child friendly, good quality, English text-to-speech (TTS) system that is phoneme-based, concatenative, easy to set up and use with little memory.
