The expressive database includes recordings of 145 sentences (about 20 minutes), each with three expression states: neutral, smile while speaking, and smile after speaking. Along with the original spoken text, we offer three types of data for each sentence:
As an example, the data for the sentence "Upton saw a disaster coming." is listed below. For quick review, the image size of avi is only 288 x 360 @ 50 fps.
expression mode | audio wav file | video yuv file | mpeg avi file |
---|---|---|---|
neutral | s001_1.wav | s001_1.yuv | s001_1.avi |
smile while speaking | s001_2.wav | s001_2.yuv | s001_2.avi |
smile after speaking | s001_3.wav | s001_3.yuv | s001_3.avi |
A detailed description of the database can be found in the readme-file and the text corpus, which we offer as a separate download here:
By Download you agree to the following terms of use:
The article to be cited:
"Realistic Facial Expression Synthesis for an Image-based Talking Head" by Kang Liu and Joern Ostermann, IEEE Conference on Multimedia and Expo, ICME2011 , p. 6, Barcelona, Spain, July 2011
The provided data has been recorded in 2011 by:
Kang Liu
Institut für Informationsverarbeitung
Leibniz Universität Hannover
Appelstr. 9A, 30167 Hannover
Germany