> top > projects

Projects

NameTDescription# Ann.AuthorMaintainer Updated_atStatus

501-520 / 590 show all
EDAN70 NLP tagging of articles concerning covid19.0fettmedknaoz2023-11-29
2015-BEL-Sample An attempt to upload 295 BEL statements, i.e. the sample set used for the 2015 BioCreative challenge. 58Fabio RinaldiFabio Rinaldi2023-11-29Testing
Oryza-OGER 462 Kfabiorinaldi2023-11-29
testing testing0ewha-bio2023-11-29Testing
Genomics_Informatics Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization. Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining.35.3 KHyun-Seok Parkewha-bio2023-11-29Beta
EwhaLecture2020 testing02023-11-29Testing
disease_gene_microbe_small Small version (48 abstract that mention both Crohns and S. aureus) for development purposes Abbreviation: dgm Content: annotated abstracts on Crohn’s disease or on on Staphylococcus aureus (according to the jensenlab.org indexing resources) Entity types: (three for a start, organisms (NCBI Taxonomy taxa), disease (Disease Ontology terms), human genes (ENSEMBL proteins) Aim: Explore indirect associations of diseases to microbial species in this corpus via gene co-mentions536evangelos2023-11-27Testing
SPECIES800 SPECIES 800 (S800): an abstract-based manually annotated corpus. S800 comprises 800 PubMed abstracts in which organism mentions were identified and mapped to the corresponding NCBI Taxonomy identifiers. Described in: The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text. Pafilis E, Frankild SP, Fanini L, Faulwetter S, Pavloudi C, et al. (2013). PLoS ONE, 2013, 8(6): e65390. doi:10.1371/journal.pone.00653903.71 KEvangelos Pafilis, Sune P. Frankild, Lucia Fanini, Sarah Faulwetter, Christina Pavloudi, Aikaterini Vasileiadou, Christos Arvanitidis, Lars Juhl Jensenevangelos2023-11-28Released
disease_ontology_term_microbe 5evangelos2023-11-29Developing
SPECIES800_autotagged This project comprises the SPECIES800 corpus documents automatically annotated by the Jensenlab tagger. Annotated entity types are: Genes/proteins from the mentioned organisms (and any human ones) PubChem Compound identifiers NCBI Taxonomy entries Gene Ontology cellular component terms BRENDA Tissue Ontology terms Disease Ontology terms Environment Ontology terms The SPECIES 800 (S800) comprises 800 PubMed abstracts. In its original form species mentions were manually identified and mapped to the corresponding NCBI Taxonomy identifiers. Described in: The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text. Pafilis E, Frankild SP, Fanini L, Faulwetter S, Pavloudi C, et al. (2013). PLoS ONE, 2013, 8(6): e65390. doi:10.1371/journal.pone.0065390. The manually annotated corpus is also available as a PubAnnotation project (see here). 0Evangelos Pafilis, Sampo Pyysalo, Lars Juhl Jensenevangelos2015-11-20Testing
bionlp-st-2016-SeeDev-training Entities and event annotations from the training set of the BioNLP-ST 2016 SeeDev task. SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology. GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events. For more information, please refer to the task website All annotations : Train set Development set Test set (without events) 35EstelleChaix2023-11-28Released
bionlp-st-2016-SeeDev-dev Entities and event annotations from the development set of the BioNLP-ST 2016 SeeDev task. SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology. GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events. For more information, please refer to the task website All annotations : Train set Development set Test set (without events) 61EstelleChaix2023-11-29Released
bionlp-st-2016-SeeDev-test Entities annotations from the test set of the BioNLP-ST 2016 SeeDev task. SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology. GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events. For more information, please refer to the task website All annotations : Train set Development set Test set (without events) 184EstelleChaix2023-11-29Released
Erin_test @ Yonsei University0ErinErinHJ_Kim2023-11-29Testing
test01 0Erika Asamizu2015-09-11Testing
TEST-DiseaseOrPhenotypicFeature Annotated by Mesh_All_FN795Eisuke Dohi2023-11-29Released
AxD_symptoms Symptoms of AxD from available case report and case series401Eisuke Dohi2023-11-29Developing
LitCoin-MONDO_bioort2019 DiseaeseOrPhenotypicFeature3.72 KEisuke Dohi2023-11-29Testing
LitEisuke 6.12 KEisuke Dohi2023-11-29Developing
testing_230112 2eatfish2023-11-29Testing
NameT# Ann.AuthorMaintainer Updated_atStatus

501-520 / 590 show all
EDAN70 0fettmedknaoz2023-11-29
2015-BEL-Sample 58Fabio RinaldiFabio Rinaldi2023-11-29Testing
Oryza-OGER 462 Kfabiorinaldi2023-11-29
testing 0ewha-bio2023-11-29Testing
Genomics_Informatics 35.3 KHyun-Seok Parkewha-bio2023-11-29Beta
EwhaLecture2020 02023-11-29Testing
disease_gene_microbe_small 536evangelos2023-11-27Testing
SPECIES800 3.71 KEvangelos Pafilis, Sune P. Frankild, Lucia Fanini, Sarah Faulwetter, Christina Pavloudi, Aikaterini Vasileiadou, Christos Arvanitidis, Lars Juhl Jensenevangelos2023-11-28Released
disease_ontology_term_microbe 5evangelos2023-11-29Developing
SPECIES800_autotagged 0Evangelos Pafilis, Sampo Pyysalo, Lars Juhl Jensenevangelos2015-11-20Testing
bionlp-st-2016-SeeDev-training 35EstelleChaix2023-11-28Released
bionlp-st-2016-SeeDev-dev 61EstelleChaix2023-11-29Released
bionlp-st-2016-SeeDev-test 184EstelleChaix2023-11-29Released
Erin_test 0ErinErinHJ_Kim2023-11-29Testing
test01 0Erika Asamizu2015-09-11Testing
TEST-DiseaseOrPhenotypicFeature 795Eisuke Dohi2023-11-29Released
AxD_symptoms 401Eisuke Dohi2023-11-29Developing
LitCoin-MONDO_bioort2019 3.72 KEisuke Dohi2023-11-29Testing
LitEisuke 6.12 KEisuke Dohi2023-11-29Developing
testing_230112 2eatfish2023-11-29Testing