Spanish Synthesis Database UPC ESMA


This database contains the recordings and annotations of read text material in neutral style. The database was recorded by one professional female Spanish speaker.

The database was recorded in a noise-reduced room. The signal was recorded at 32 kHz with 16-bit resolution and decimated to 16 kHz. A second channel contains the laryngograph signal. The text material, read in neutral style, consists of 506 phonetically balanced sentences, 208 phonetically balanced short paragraphs and 62 long paragraphs, for a total of 1 h 45 min of recorded speech. The database includes the phonetic transcription, phonetic segmentation and epoch labels.
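
As an illustration of the two-channel layout described above, the following minimal sketch separates the speech and laryngograph signals of one recording. It assumes the audio is stored as 16-bit PCM WAV files with the speech in the first channel; the description does not state the file format, so the container and the function name split_channels are assumptions rather than part of the database.

```python
import array
import sys
import wave


def split_channels(path):
    """Return (sample_rate, speech, laryngograph) from a 2-channel 16-bit WAV file.

    Assumption: channel 0 holds the speech signal and channel 1 the
    laryngograph signal, following the "second channel" wording above.
    """
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 2 or wav.getsampwidth() != 2:
            raise ValueError("expected a 2-channel, 16-bit file")
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())

    # Samples are interleaved: speech, laryngograph, speech, laryngograph, ...
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian

    speech = samples[0::2]
    laryngograph = samples[1::2]
    return rate, speech, laryngograph
```

Under these assumptions, the returned rate would be 16000 for the decimated signal, and len(speech) / rate gives the duration of the recording in seconds.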

