Skolt Sami Corpus (2016)
Koltansaamen korpus (2016)
The corpus consists of Skolt Sami recordings and annotations containing their linguistic analysis. The annotations include a transcription and a Finnish translation and, for a large part of the recordings, also an English translation, morphemic glossing and part-of-speech analysis. The material was produced in ELAN, in which the audio track and the analysis are combined and time-synchronized. Estimated size of the corpus: 25.5 hours.
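As a rough illustration of what time-synchronized ELAN annotations look like, the sketch below reads a tiny ELAN-style (.eaf) XML document and lists each annotation with its start and end time in milliseconds. The element and attribute names follow the general ELAN annotation format; the tier names (transcription, translation-fi), the annotation texts and the function name read_aligned_annotations are hypothetical placeholders and are not taken from this corpus, whose actual tier layout may differ.

```python
import xml.etree.ElementTree as ET

# Toy ELAN-style document. Tier names and annotation values are
# hypothetical placeholders, not data from the Skolt Sami Corpus.
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="2500"/>
  </TIME_ORDER>
  <TIER TIER_ID="transcription">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
                            TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>placeholder transcription</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="translation-fi" PARENT_REF="transcription">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE>placeholder Finnish translation</ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def read_aligned_annotations(eaf_xml):
    """Return (tier_id, start_ms, end_ms, text) for every annotation."""
    root = ET.fromstring(eaf_xml)

    # Time slots map an ID to a position in the recording (milliseconds).
    # Slots without a TIME_VALUE are unaligned and are skipped here.
    slots = {
        ts.get("TIME_SLOT_ID"): int(ts.get("TIME_VALUE"))
        for ts in root.iter("TIME_SLOT")
        if ts.get("TIME_VALUE") is not None
    }

    # Time-aligned annotations point directly at two time slots.
    spans = {
        ann.get("ANNOTATION_ID"): (
            slots.get(ann.get("TIME_SLOT_REF1")),
            slots.get(ann.get("TIME_SLOT_REF2")),
        )
        for ann in root.iter("ALIGNABLE_ANNOTATION")
    }

    # Dependent annotations (e.g. translations) point at a parent annotation.
    parent_of = {
        ann.get("ANNOTATION_ID"): ann.get("ANNOTATION_REF")
        for ann in root.iter("REF_ANNOTATION")
    }

    def resolve(ann_id):
        # Follow parent references until a time-aligned annotation is found.
        seen = set()
        while ann_id not in spans and ann_id in parent_of and ann_id not in seen:
            seen.add(ann_id)
            ann_id = parent_of[ann_id]
        return spans.get(ann_id, (None, None))

    rows = []
    for tier in root.iter("TIER"):
        for holder in tier.findall("ANNOTATION"):
            for ann in holder:
                start, end = resolve(ann.get("ANNOTATION_ID"))
                text = ann.findtext("ANNOTATION_VALUE", default="")
                rows.append((tier.get("TIER_ID"), start, end, text))
    return rows


if __name__ == "__main__":
    for row in read_aligned_annotations(SAMPLE_EAF):
        print(row)
```

In this toy document the translation carries no time values of its own; it inherits its time span from the transcription it refers to, which is one common way for dependent tiers to stay synchronized with the audio.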
Some of the recordings and their transcriptions are also available in the book Säämǩiõll, ä'rbbǩiõll, edited by Eino Koponen, Jouni Moshnikoff and Satu Moshnikoff; see http://scripta.kotus.fi/www/verkkojulkaisut/julk14/. The transcriptions in the book differ somewhat from those in this corpus.
The corpus will be published at http://lat.csc.fi and at https://korp.csc.fi/.
The intended use of the resource must be outlined in a research plan.
Important: due to the nature of the material, the resource must be handled with care to protect the privacy of the people whose personal data it contains. If samples of the data are published, they must be anonymized according to best practices.
For detailed information on the license of the resource, see https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaEngResPd.