PennBioIE | | The PennBioIE corpus (0.9) covers two domains of biomedical knowledge. One is the inhibition of the cytochrome P450 family of enzymes (CYP450 or CYP for short) , and the other domain is the molecular genetics of dance (oncology or onco for short). | 23.8 K | 2023-11-26 | Released | |
PIR-corpus1 | | The Protein Information Resource (PIR) is not biased towards any particular biomedical domain, and is expected to provide more diverse protein names in a given sample size.
Annotation category: protein, compound-protein, acronym. | 4.44 K | 2023-11-27 | Released | |
PIR-corpus2 | | The protein tag was used to tag proteins, or protein-associated or -related objects, such as domains, pathways, expression of gene.
Annotation guideline: http://pir.georgetown.edu/pirwww/about/doc/manietal.pdf | 5.52 K | 2023-11-29 | Released | |
SCAI-Test | | A small corpus for the evaluation of dictionaries containing chemical entities.
Publication: http://www.scai.fraunhofer.de/fileadmin/images/bio/data_mining/paper/kolarik2008.pdf
Original source: https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/corpora-for-chemical-entity-recognition.html | 1.21 K | 2023-11-28 | Released | |
TEST0 | | | 3.37 M | 2023-11-24 | | |