PANACEA Spanish V-SUBCAT Gold Standard lexicon ENV domain
Spanish V-SUBCAT GS lexicon ENV domain
This is a domain-specific gold standard for Spanish subcategorization frames (SCFs), in this case for the environment (ENV) domain. It was developed manually from a set of 30 verbs, with 200 sentences for each verb. For each sentence, the SCFs present for the verb under study were manually annotated.
The sentences were selected from crawled web pages that were automatically identified as being in Spanish and automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011.
This gold standard was created in the context of PANACEA (http://www.panacea-lr.eu), an EU FP7-funded project under Grant Agreement 248064.