The Transcrigal DB database contains 31 hours of broadcast news programs recorded from Televisión de Galicia (TVG),
the Galician public television broadcaster. The speech recordings vary in topic, speaker, acoustic channel, speaking mode, etc.
The whole corpus has been segmented, labelled and transcribed manually using "Transcriber", a tool developed by DGA (Délégation Générale pour l'Armement, France)
and LDC (Linguistic Data Consortium, USA), with conventions similar to those adopted by LDC for the DARPA HUB-4 corpora.
Transcriptions include speaker turns, topics, and channel information.