The Longitudinal Corpus of Finnish Spoken in Helsinki (2010s Text Corpus)
View resource name in all available languages
Helsingin puhekielen pitkittäiskorpus (2010-luvun tekstimuotoinen ainesto)
This corpus belongs to The Longitudinal Corpus of Finnish Spoken in Helsinki, which contains interviews with people of different ages born in Helsinki. The Longitudinal Corpus of Finnish Spoken in Helsinki (2010s text corpus) is the anonymized transcript of the 2010s interviews. The corpus can be used for educational and research purposes.
The 2010s material consists of about one hour long audio recordings of individual interviews. Most of the interview questions are about how the interviewees perceive Helsinki, living and traveling in the city, as well as the languages and language forms spoken in Helsinki. The interviews also touch upon such topics as school, work and hobbies related issues of the interviewees.
A part of the corpus has been transcribed and thematically coded. The Longitudinal Corpus of Finnish Spoken in Helsinki (2010s text corpus) contains also parts in which the audio material and the transcript are aligned.
Work on the transcription, alignment and thematic coding of the corpus is planned to continue in the future.
The corpus should be referred to in the following way:
The Longitudinal Corpus of Finnish Spoken in Helsinki (2010s text corpus), informant’s code (if applicable).
The informant’s code should be marked if concrete text examples of the corpus are given.
The corpus will be published in LAT (http://lat.csc.fi).
For detailed information on the license of the resource see https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaEngACANcReD.
View resource description in all available languages