DisGeNET5_variant_disease | | The file contains variant-disease associations obtained by text mining MEDLINE abstracts using the BeFree system, including the variant and disease offsets. | 144 K | 2023-11-24 | Released | |
PIR-corpus1 | | The Protein Information Resource (PIR) is not biased towards any particular biomedical domain, and is expected to provide more diverse protein names in a given sample size.
Annotation categories: protein, compound-protein, acronym. | 4.44 K | 2023-11-27 | Released | |
PennBioIE | | The PennBioIE corpus (0.9) covers two domains of biomedical knowledge. One is the inhibition of the cytochrome P450 family of enzymes (CYP450, or CYP for short), and the other is the molecular genetics of cancer (oncology, or onco for short). | 23.8 K | 2023-11-26 | Released | |
DisGeNET5_gene_disease | | The file contains gene-disease associations obtained by text mining MEDLINE abstracts using the BeFree system, including the gene and disease offsets. | 2.04 M | 2023-11-24 | Released | |
bionlp-st-cg-2013-training | | The training dataset from the cancer genetics task in the BioNLP Shared Task 2013.
Composed of anatomical and molecular entities. | 10.9 K | 2023-11-28 | Released | |