Home
Register
Login
Browse Resources
Community
Statistics
Help
About
META-SHARE Members
META-SHARE Repositories
META-SHARE Managing Nodes
LR Sharing
Licensing LRs
Notice and Takedown Policy
Privacy
Data Protection
Data Protection Statement
10
Last view: 2024-02-01
Hungarian Read Speech Precisely Labelled Parallel Speech Corpus Collection
ParallelSpeech-hu
ID:
212
Phonetically balanced sentence set read by 10 speakers.
« Back
Download
You don’t have the permission to edit this resource.
Edit Resource
Distribution
Availability
Available - Restricted Use
Start date:
07/15/2012
Licence
MS - C - No ReD - FF
Restrictions:
Academic - Non Commercial Use
Fee:
4,000 EURO pro speaker
Distribution Access/Medium:
DVD - R
Licensors:
Henk Tamás
http://www.tmit.bme.hu
Budapest University of Technology and Economics
BME
Head of Department
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-4188
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://www.tmit.bme.hu
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-4188
Fax: +36-1-463-3107
Distribution rights holders:
Géza Németh
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
professor
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
IPR Holder
Gábor Olaszy
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
professor
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
Contact Person
Géza Németh
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
professor
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3883
Fax: +36-1-463-3107
text
audio
Monolingual text corpus
Languages
Hungarian (19,658 Sentences)
Linguality
Linguality type:
Monolingual
Size
19,658 Sentences
Character encoding
ISO - 8859 - 2 (1,089,595 Bytes)
Creation
Creation mode:
Manual
Original Sources
research
Monolingual audio corpus
Languages
Hungarian (91,924 Seconds)
Linguality
Linguality type:
Monolingual
Size
91,924 Seconds
Effective speech duration
91,924 Seconds
Audio duration
91,924 Seconds
Annotation
Segmentation
Tagset:
http://praat.org
Annotated elements:
Other
Segmentation level:
Phoneme
Format:
Praat TextGrid
Annotation Mode:
Mixed (TTS for the text to sound conversion, and manual alignment of sound boundaries)
Annotation Tools:
Profivox development tool
Start date:
08/01/2009
End date:
09/30/2012
Size:
831,941 Phonemes
Annotators:
Klára Laczkó
http://www.tmit.bme.hu
Budapest University of Technology and Economics
BME
staff member
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Content
Speech items:
Phonetically Balanced Sentences
Noise Level:
Low
Setting
Naturality:
Read Speech
Conversational type:
Monologue
Scenario:
Other
Audience:
No
Interactivity:
Non Interactive
Audio Formats
wave/audio (91,924 Seconds)
Compression:
False
Recording quality:
High
Quantization:
16
Number of tracks:
1
Sampling rate:
44100
Signal encoding:
LinearPCM
Recording
Recorders
Mátyás Bartalis
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
research fellow
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Recording platform software:
soundforge_sony
Recording environment:
Studio
Recording device type details:
RME Fireface800
Recording device type:
Other
Capture
Capturing device type details:
AKG C-414 B-ULS
Capturing device type:
Studio Equipment
Person SourceSet
Origin of persons:
Native
Age of persons:
Adult
Sex of persons:
Mixed
Number of persons:
10
Dialect accent of persons:
no dialect
Age range end:
60
Hearing impairment of persons:
No
Number of trained speakers:
2
Age range start:
26
Speaking impairment of persons:
No
Creation
Original Sources
corpora
Resource Creation
Resource Creator
Csaba Zainkó
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
Ph.D. lecturer
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3512
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3512
Fax: +36-1-463-3107
Tamás Bőhm
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
Ph.D. lecturer
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Creation lasted:
07/01/2009 - 09/30/2012
Metadata
Created:
07/09/2012
Last Updated:
07/09/2012
Source:
METANET4U
Revision:
2012-07-09
Metadata Creator
Mátyás Bartalis
http://speechlab.tmi...
Budapest University of Technology and Economics
BME
research fellow
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3437
Fax: +36-1-463-3107
Version
Version:
1.0
Revision:
Waves, texts, sound boundaries
Last Updated:
07/09/2012
Validation
Validated
Type of Validation:
Content
Validation Mode:
Manual
Mode Details:
Manually checked annotation and labeling of the sound and word boundaries in the corpora
Validator
Tamás Gábor Csapó
http://www.tmit.bme.hu
Budapest University of Technology and Economics
BME
Ph.D. candidate
[javascript protected email address]
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
Tel.: +36-1-463-3512
Fax: +36-1-463-3107
Department of Telecommunications and Media Informatics
http://speechlab.tmi...
BME
Magyar tudósok körútja 2.
H-1117 Budapest
Hungary
[javascript protected email address]
Tel.: +36-1-463-3512
Fax: +36-1-463-3107
Usage
Foreseen Use
Nlp Applications
Use NLP Specific:
Knowledge Discovery, Linguistic Research, Speech Analysis, Speech Synthesis
Human Use
Actual Use - Nlp Applications
Use NLP Specific:
Speech Analysis, Speech Synthesis
Details:
This speech database contains 2000 sentences. Each speaker read this sentence set. This parallel speech database is used to train HMM based TTS and for unit selection TTS.
Derived Resources
First hungarian precisely labelled parallel speech database collection
Documentation
http://speechlab.tmi...
http://speechlab.tmi...
People who looked at this resource also viewed the following:
Hungarian Parliamentary Speech and Aligned Text Selection Database
Hungarian National Corpus
hunmorph
Italian_text_autocues_scripts_subtitles_RAI
Resources from the same creators