> top > projects

Projects

NameTDescription # Ann.AuthorMaintainerUpdated_atStatus

481-500 / 590 show all
bionlp-st-bb3-2016-training Entity (bacteria, habitats and geographical places) annotation to the training dataset of the BioNLP-ST 2016 BB task. For more information, please refer to bionlp-st-bb3-2016-development and bionlp-st-bb3-2016-test. Bacteria Bacteria entities are annotated as contiguous spans of text that contains a full unambiguous prokaryote taxon name, the type label is Bacteria. The Bacteria type is a taxon, at any taxonomic level from phylum (Eubacteria) to strain. The category that the text entities have to be assigned to is the most specific and unique category of the NCBI taxonomy resource. In case a given strain, or a group of strains is not referenced by NCBI, it is assigned with the closest taxid in the taxonomy. Habitat Habitat entities are annotated as spans of text that contains a complete mention of a potential habitat for bacteria, the type label is Habitat. Habitat entities are assigned one or several concepts from the habitat subpart of the OntoBiotope ontology. The assigned concepts are as specific as possible. OntoBiotope defines most relevant microorganism habitats from all areas considered by microbial ecology (hosts, natural environment, anthropized environments, food, medical, etc.). Habitat entities are rarely referential entities, they are usually noun phrases including properties and modifiers. There are rare cases of habitats referred with adjectives or verbs. The spans are generally contiguous but some of them are discontinuous in order to cope with conjunctions. Geographical Geographical entities are geographical and organization places denoted by official names.1.28 KINRAYue Wang2023-11-29Released
bionlp-st-ge-2016-reference It is the benchmark reference data set of the BioNLP-ST 2016 GE task. It includes Genia-style event annotations to 20 full paper articles which are about NFκB proteins. The task is to develop an automatic annotation system which can produce annotation similar to the annotation in this data set as much as possible. For evaluation of the performance of a participating system, the system needs to produce annotations to the documents in the benchmark test data set (bionlp-st-ge-2016-test). GE 2016 benchmark data set is provided as multi-layer annotations which include: bionlp-st-ge-2016-reference: benchmark reference data set (this project) bionlp-st-ge-2016-test: benchmark test data set (annotations are blined) bionlp-st-ge-2016-test-proteins: protein annotation to the benchmark test data set Following is supporting resources: bionlp-st-ge-2016-coref: coreference annotation bionlp-st-ge-2016-uniprot: Protein annotation with UniProt IDs. pmc-enju-pas: dependency parsing result produced by Enju UBERON-AE: annotation for anatomical entities as defined in UBERON ICD10: annotation for disease names as defined in ICD10 GO-BP: annotation for biological process names as defined in GO GO-CC: annotation for cellular component names as defined in GO A SPARQL-driven search interface is provided at http://bionlp.dbcls.jp/sparql.14.4 KDBCLSJin-Dong Kim2023-11-29Released
bionlp-st-ge-2016-test It is the benchmark test data set of the BioNLP-ST 2016 GE task. It includes Genia-style event annotations to 14 full paper articles which are about NFκB proteins. For testing purpose, however, annotations are all blinded, which means users cannot see the annotations in this project. Instead, annotations in any other project can be compared to the hidden annotations in this project, then the annotations in the project will be automatically evaluated based on the comparison. A participant of GE task can get the evaluation of his/her result of automatic annotation, through following process: Create a new project. Import documents from the project, bionlp-st-2016-test-proteins to your project. Import annotations from the project, bionlp-st-2016-test-proteins to your project. At this point, you may want to compare you project to this project, the benchmark data set. It will show that protein annotations in your project is 100% correct, but other annotations, e.g., events, are 0%. Produce event annotations, using your system, upon the protein annotations. Upload your event annotations to your project. Compare your project to this project, to get evaluation. GE 2016 benchmark data set is provided as multi-layer annotations which include: bionlp-st-ge-2016-reference: benchmark reference data set bionlp-st-ge-2016-test: benchmark test data set (this project) bionlp-st-ge-2016-test-proteins: protein annotation to the benchmark test data set Following is supporting resources: bionlp-st-ge-2016-coref: coreference annotation bionlp-st-ge-2016-uniprot: Protein annotation with UniProt IDs. pmc-enju-pas: dependency parsing result produced by Enju UBERON-AE: annotation for anatomical entities as defined in UBERON ICD10: annotation for disease names as defined in ICD10 GO-BP: annotation for biological process names as defined in GO GO-CC: annotation for cellular component names as defined in GO A SPARQL-driven search interface is provided at http://bionlp.dbcls.jp/sparql.7.99 KDBCLSJin-Dong Kim2023-11-29Released
PMA_MER PMAs annotated using MERpy.58.9 KStefano Rensitherightstef2023-11-29Developing
pmc-enju-pas Predicate-argument structure annotation produced by Enju. This data set is initially produced as a supporting resource for BioNLP-ST 2016 GE task. As so, it currently includes the 34 full paper articles that are in the benchmark data sets of GE 2016 task, reference data set (bionlp-st-ge-2016-reference) and test data set (bionlp-st-ge-2016-test), but will be extended to include more papers from the PubMed Central Open Access subset (PMCOA). 205 KDBCLSJin-Dong Kim2023-11-28Developing
bionlp-st-ge-2016-test-proteins Protein annotations to the benchmark test data set of the BioNLP-ST 2016 GE task. A participant of the GE task may import the documents and annotations of this project to his/her own project, to begin with producing event annotations. For more details, please refer to the benchmark test data set (bionlp-st-ge-2016-test). 4.34 KDBCLSJin-Dong Kim2023-11-27Released
LitCovid-PD-FMA-UBERON-v1 PubDictionaries annotation for anatomy terms - updated at 2020-04-20 Disease term annotation based on FMA and Uberon. Version 2020-04-20. The terms in FMA and Uberon are loaded in PubDictionaries (FMA and Uberon), with which the annotations in this project are produced. The parameter configuration used for this project is here for FMA and there for Uberon. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.4.3 KJin-Dong Kim2023-11-27Released
LitCovid-PD-MONDO-v1 PubDictionaries annotation for disease terms - updated at 2020-04-20 It is based on MONDO Version 2020-04-20. The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.13.4 KJin-Dong Kim2023-11-29Released
CORD-19-PD-HP PubDictionaries annotation for HP terms - updated at 2020-04-30 Disease term annotation based on HP. Version 2020-04-20. The terms in HP are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.1.15 MJin-Dong Kim2023-11-29Released
LitCovid-PD-HP-v1 PubDictionaries annotation for human phenotype terms - updated at 2020-04-20 Disease term annotation based on HP. Version 2020-04-20. The terms in HP are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.3.03 KJin-Dong Kim2023-11-29Released
CORD-19-PD-MONDO PubDictionaries annotation for MONDO terms - updated at 2020-04-30 It is disease term annotation based on MONDO. Version 2020-04-20. The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.6.32 MJin-Dong Kim2023-11-27Released
LitCovid-sample-PD-UBERON PubDictionaries annotation for UBERON terms - updated at 2020-04-30 It is annotation for anatomical entities based on Uberon. The terms in Uberon are uploaded in PubDictionaries (Uberon), with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved. 310Jin-Dong Kim2023-11-28Beta
CORD-19-PD-UBERON PubDictionaries annotation for UBERON terms - updated at 2020-04-30 It is disease term annotation based on Uberon. The terms in Uberon are uploaded in PubDictionaries (Uberon), with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.1.42 MJin-Dong Kim2023-11-24Released
LitCovid-PAS-Enju Predicate-argument structure annotation produced by the Enju parser.125 KJin-Dong Kim2023-11-28Beta
SMAFIRA_Methods Predictions for methods for the SMAFIRA project.0zebet2023-11-28Developing
PubMed_ArguminSci Predictions for PubMed automatically extracted with the ArguminSci tool (https://github.com/anlausch/ArguminSci).777 Kzebet2023-11-24Released
UseCases_ArguminSci_Discourse Predictions from ArguminSci(http://lelystad.informatik.uni-mannheim.de/) for the seven datasets and for discourse categories7.12 Kzebet2023-11-29Developing
UseCases_PubTatorCentral Predictions from PubTator Central (https://www.ncbi.nlm.nih.gov/research/pubtator/) for the seven datasets and for four entity types (disease,chemical,species,cellline)0zebet2023-11-29Developing
AlvisNLP-Test Project for testing AlviNLP PubAnnotation server during BLAH3.17Bibliome2023-11-29Testing
Trait curation Project for trait curation in PGDBj479Sachiko ShirasawaSachiko Shirasawa2023-11-24Testing
NameT# Ann.AuthorMaintainerUpdated_atStatus

481-500 / 590 show all
bionlp-st-bb3-2016-training 1.28 KINRAYue Wang2023-11-29Released
bionlp-st-ge-2016-reference 14.4 KDBCLSJin-Dong Kim2023-11-29Released
bionlp-st-ge-2016-test 7.99 KDBCLSJin-Dong Kim2023-11-29Released
PMA_MER 58.9 KStefano Rensitherightstef2023-11-29Developing
pmc-enju-pas 205 KDBCLSJin-Dong Kim2023-11-28Developing
bionlp-st-ge-2016-test-proteins 4.34 KDBCLSJin-Dong Kim2023-11-27Released
LitCovid-PD-FMA-UBERON-v1 4.3 KJin-Dong Kim2023-11-27Released
LitCovid-PD-MONDO-v1 13.4 KJin-Dong Kim2023-11-29Released
CORD-19-PD-HP 1.15 MJin-Dong Kim2023-11-29Released
LitCovid-PD-HP-v1 3.03 KJin-Dong Kim2023-11-29Released
CORD-19-PD-MONDO 6.32 MJin-Dong Kim2023-11-27Released
LitCovid-sample-PD-UBERON 310Jin-Dong Kim2023-11-28Beta
CORD-19-PD-UBERON 1.42 MJin-Dong Kim2023-11-24Released
LitCovid-PAS-Enju 125 KJin-Dong Kim2023-11-28Beta
SMAFIRA_Methods 0zebet2023-11-28Developing
PubMed_ArguminSci 777 Kzebet2023-11-24Released
UseCases_ArguminSci_Discourse 7.12 Kzebet2023-11-29Developing
UseCases_PubTatorCentral 0zebet2023-11-29Developing
AlvisNLP-Test 17Bibliome2023-11-29Testing
Trait curation 479Sachiko ShirasawaSachiko Shirasawa2023-11-24Testing