SpeechPlay is an intuitive system for creating expressive synthetic voices in a fun and interactive manner. Control of prosody information in synthesized speech output is based on the visual appearance of the text, which can be manipulated with touch gestures. Users could create/modify contents using their mobile phone (SpeechPlay Mobile Application) and publish/share their work on a large screen (SpeechPlay Surface).


  • Kian Peen Yeo, Suranga Nanayakkara. 2013. SpeechPlay: Composing and Sharing Expressive Speech Through Visually Augmented Text. In Proceedings of the 24th conference of the computer-human interaction special interest group of Australia on Computer-human interaction(OZCHI ’13). ACM, New York, NY, USA. [PDF]