Prosody Generation for Child Oriented Speech Synthesis (Prosody Generation)
This project [joint with Alan Black at Carnegie Mellon University and Richard Sproat at AT&T Research] focuses on innovative algorithms for generating highly expressive synthetic speech. Generating expressive speech involves three hard research problems. (i) Computation of abstract tags that specify, e.g., which words need emphasis, and phrasing (e.g., where to pause). (ii) Based on these tags, the system has to compute a fundamental frequency contour. (iii) Severe modification of the stored speech fragments ("acoustic units") to obtain these contours. The central goal of the project is to address these research problems, and create a TTS system that will make the next generation of TTS based language remediation systems viable.