Swiss-French SpeechDat(II) FDB-3000
Base de données SpeechDat(II) du suisse-romand FDB-3000
ID: ELRA-S0104
The Swiss-French SpeechDat(II) FDB-3000 comprises 3000 Swiss-French speakers (1500 males, 1500 females) recorded over the Swiss fixed telephone network. The database is partitioned into 6 CDs, each of which contains 500 speaker sessions. The speech databases made within the SpeechDat(II) project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat format and content specifications.
Speech samples are stored as sequences of 8-bit A-law values sampled at 8 kHz. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
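As an illustration of the storage format, the following minimal sketch expands 8-bit A-law codes to 16-bit linear PCM using the standard ITU-T G.711 expansion. It assumes the signal bytes are raw A-law samples with no file header, which this description does not state. The function names are illustrative and are not part of the database distribution.

```python
SAMPLE_RATE_HZ = 8000  # 8 kHz, as stated in the description


def decode_alaw_byte(code: int) -> int:
    """Expand one 8-bit A-law code to a 16-bit linear PCM value (ITU-T G.711)."""
    code ^= 0x55                     # undo the bit inversion applied by the encoder
    magnitude = (code & 0x0F) << 4   # 4-bit mantissa
    segment = (code & 0x70) >> 4     # 3-bit segment number
    if segment == 0:
        magnitude += 8
    else:
        magnitude = (magnitude + 0x108) << (segment - 1)
    # In A-law the sign bit is set for positive samples.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(raw: bytes) -> list[int]:
    """Decode a sequence of A-law bytes (assumed headerless) to PCM values."""
    return [decode_alaw_byte(b) for b in raw]


def duration_seconds(raw: bytes) -> float:
    """Duration of an utterance: one byte per sample at 8000 samples per second."""
    return len(raw) / SAMPLE_RATE_HZ
```

Because each sample occupies exactly one byte, a file of 8000 bytes corresponds to one second of speech under these assumptions.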
The following items were recorded:
- 5 application words
- 1 sequence of 6 isolated digits including the hash (#) and the star (*)
- 3 connected digit strings: 1 sheet number, 1 telephone number, 1 credit card number (16 digits)
- 2 dates: 1 spontaneous date, e.g. birthday, 1 prompted date, word style
- 3 spelled words from a list of names and titles
- 2 currency money amounts
- 2 numbers: 1 natural number, 1 quantity number (prompted)
- 1 place (province of longest residence)
- 7 optional items: 1 name (spelling table), 1 city name, 1 mother tongue of speaker (spontaneous), 1 education level of speaker (out of 3 choices), 1 type of telephone used, 1 query to telephone directory
- 1 free comment on session
- 1 yes/no question
- 10 phonetically rich sentences
- 1 time phrase (word style)
The following age distribution has been obtained: 69 speakers are below 16 years old, 1006 speakers are between 16 and 30, 944 speakers are between 31 and 45, 629 speakers are between 46 and 60, 311 speakers are over 60, and the age of 41 speakers is unknown.
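The age figures can be cross-checked against the stated total of 3000 speakers (6 CDs of 500 sessions each). The short sketch below does this arithmetic and derives each group's percentage share. The group labels and variable names are illustrative; only the counts come from this description.

```python
# Speaker counts per age group, as listed in the description.
age_groups = {
    "under 16": 69,
    "16-30": 1006,
    "31-45": 944,
    "46-60": 629,
    "over 60": 311,
    "unknown": 41,
}

total = sum(age_groups.values())

# Share of each group, in percent of all speakers, rounded to one decimal.
shares = {group: round(100 * count / total, 1) for group, count in age_groups.items()}

for group, count in age_groups.items():
    print(f"{group:>8}: {count:4d} speakers ({shares[group]}%)")
print(f"   total: {total} speakers")
```

The counts sum to 3000, which matches both the stated number of speakers and the 6 x 500 sessions on the CDs.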
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.