GENIA Event Corpus with meta-knowledge annotation
The corpus consists of 1000 MEDLINE abstracts. It is a subset of the original GENIA POS & term corpus, which was selected using the three MeSH terms human, blood cells and transcription factors. In each sentence, three types of information are annotated 1) biomedical terms are identified and assigned categories from the GENIA term ontology. 2) event structures are identified and assigned categories from the GENIA event ontology. 3) Thirdly, detailed information is annotated about how the event should be interpreted, according to its textual context. We call this information meta-knowledge.
People who looked at this resource also viewed the following: