ENG_NER_NEL_Diana | | | 461 | | dpavot | 2023-11-29 | Uploading | |
PGR-NEG | | Identification of Negative Relations
| 23 | Diana Sousa | dpavot | 2023-11-28 | Developing | |
PT_NER_NEL_Diana | | | 318 | | dpavot | 2023-11-24 | Developing | |
test01 | | | 0 | | Erika Asamizu | 2015-09-11 | Testing | |
Erin_test | | @ Yonsei University | 0 | Erin | ErinHJ_Kim | 2023-11-29 | Testing | |
bionlp-st-2016-SeeDev-training | | Entities and event annotations from the training set of the BioNLP-ST 2016 SeeDev task.
SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology.
GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events.
For more information, please refer to the task website
All annotations :
Train set
Development set
Test set (without events)
| 35 | | EstelleChaix | 2023-11-28 | Released | |
bionlp-st-2016-SeeDev-dev | | Entities and event annotations from the development set of the BioNLP-ST 2016 SeeDev task.
SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology.
GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events.
For more information, please refer to the task website
All annotations :
Train set
Development set
Test set (without events)
| 61 | | EstelleChaix | 2023-11-29 | Released | |
bionlp-st-2016-SeeDev-test | | Entities annotations from the test set of the BioNLP-ST 2016 SeeDev task.
SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology.
GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events.
For more information, please refer to the task website
All annotations :
Train set
Development set
Test set (without events)
| 184 | | EstelleChaix | 2023-11-29 | Released | |
SPECIES800 | | SPECIES 800 (S800): an abstract-based manually annotated corpus. S800 comprises 800 PubMed abstracts in which organism mentions were identified and mapped to the corresponding NCBI Taxonomy identifiers.
Described in:
The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text.
Pafilis E, Frankild SP, Fanini L, Faulwetter S, Pavloudi C, et al. (2013). PLoS ONE, 2013, 8(6): e65390. doi:10.1371/journal.pone.0065390 | 3.71 K | Evangelos Pafilis, Sune P. Frankild, Lucia Fanini, Sarah Faulwetter, Christina Pavloudi, Aikaterini Vasileiadou, Christos Arvanitidis, Lars Juhl Jensen | evangelos | 2023-11-28 | Released | |
disease_ontology_term_microbe | | | 5 | | evangelos | 2023-11-29 | Developing | |
disease_gene_microbe_small | | Small version (48 abstract that mention both Crohns and S. aureus) for development purposes
Abbreviation: dgm Content: annotated abstracts on Crohn’s disease or on on Staphylococcus aureus (according to the jensenlab.org indexing resources) Entity types: (three for a start, organisms (NCBI Taxonomy taxa), disease (Disease Ontology terms), human genes (ENSEMBL proteins) Aim: Explore indirect associations of diseases to microbial species in this corpus via gene co-mentions | 536 | | evangelos | 2023-11-27 | Testing | |
testing | | testing | 0 | | ewha-bio | 2023-11-29 | Testing | |
Genomics_Informatics | | Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization.
Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining. | 35.3 K | Hyun-Seok Park | ewha-bio | 2023-11-29 | Beta | |
EwhaLecture2020 | | testing | 0 | | | 2023-11-29 | Testing | |
Oryza-OGER | | | 462 K | | fabiorinaldi | 2023-11-29 | | |
2015-BEL-Sample | | An attempt to upload 295 BEL statements, i.e. the sample set used for the 2015 BioCreative challenge.
| 58 | Fabio Rinaldi | Fabio Rinaldi | 2023-11-29 | Testing | |
EDAN70 | | NLP tagging of articles concerning covid19. | 0 | | fettmedknaoz | 2023-11-29 | | |
test5 | | | 0 | | glennq | 2016-02-06 | | |
Staphylococcus | | | 7.46 K | haruo | haruo | 2023-11-29 | Testing | |
CoGe_Citation_Annotations | | Annotated PMC abstracts+full articles, that cite the "CoGe" papers (PMID: 18952863, 18269575).
Total Num Citations: 165
Total Num Unique Citations: 141
Total Num Abstracts: 165
Total Num Whole Articles: 165 | 0 | Heather Lent | hclent | 2023-11-29 | Uploading | |