> top > projects

Projects

NameTDescription# Ann.AuthorMaintainerUpdated_at Status

321-340 / 590 show all
AnEM_full-texts 250 documents selected randomly from full-text papers Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_abstracts, it is probably the largest manually annotated corpus on anatomical entities.687NaCTeMYue Wang2023-11-29Uploading
AxD_symptoms Symptoms of AxD from available case report and case series401Eisuke Dohi2023-11-29Developing
BLAH2015_Annotations_Adderall 0nestoralvaronestoralvaro2023-11-29Testing
BioLarkPubmedHPO 228 abstracts manually annotated with Human Phenotype Ontology (HPO) concepts and harmonized by three curators, which can be used as a reference standard for free text annotation of human phenotypes. For more info, please see Groza et al. "Automatic concept recognition using the human phenotype ontology reference and test suite corpora", 2015.7.16 KTudor Grozasimon2023-11-29Released
Briefings 0Sophie Nam2023-11-29
CORD-19_All_docs All the documents in the whole CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2023-11-29Released
CORD-19_Commercial_use_subset The Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2023-11-29Released
CORD-19_HIRAKI HIRAKI Annotation for CORD-192.98 KAikoHIRAKI2023-11-29Testing
CORD-19_Non-commercial_use_subset The Non commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2023-11-29Released
CORD-19_bioRxiv_medRxiv_subset The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. 0Jin-Dong Kim2023-11-29Released
Test-GeneOrGeneProduct 1.17 KJin-Dong Kim2023-11-29
GlycoBiology-FMA FMA ontology-based annotation to GlycoBiology abstracts96.3 KJin-Dong Kim2023-11-29Testing
bionlp-st-ge-2016-reference-tees NER and event extraction produced by TEES (with the default GE11 model) for the 20 full papers used in the BioNLP 2016 GE task reference corpus.14.6 KNico Colic Nico Colic2023-11-29Released
semrep-sample Sample annotation of SemRep, produced by Rindflesch, et al. Rindflesch, T.C. and Fiszman, M. (2003). The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 36(6):462-477.11.1 KRindflesch et al.Jin-Dong Kim2023-11-29Testing
CORD-19-sample-CHEBI 16Jin-Dong Kim2023-11-29Developing
bionlp-st-ge-2016-test It is the benchmark test data set of the BioNLP-ST 2016 GE task. It includes Genia-style event annotations to 14 full paper articles which are about NFκB proteins. For testing purpose, however, annotations are all blinded, which means users cannot see the annotations in this project. Instead, annotations in any other project can be compared to the hidden annotations in this project, then the annotations in the project will be automatically evaluated based on the comparison. A participant of GE task can get the evaluation of his/her result of automatic annotation, through following process: Create a new project. Import documents from the project, bionlp-st-2016-test-proteins to your project. Import annotations from the project, bionlp-st-2016-test-proteins to your project. At this point, you may want to compare you project to this project, the benchmark data set. It will show that protein annotations in your project is 100% correct, but other annotations, e.g., events, are 0%. Produce event annotations, using your system, upon the protein annotations. Upload your event annotations to your project. Compare your project to this project, to get evaluation. GE 2016 benchmark data set is provided as multi-layer annotations which include: bionlp-st-ge-2016-reference: benchmark reference data set bionlp-st-ge-2016-test: benchmark test data set (this project) bionlp-st-ge-2016-test-proteins: protein annotation to the benchmark test data set Following is supporting resources: bionlp-st-ge-2016-coref: coreference annotation bionlp-st-ge-2016-uniprot: Protein annotation with UniProt IDs. pmc-enju-pas: dependency parsing result produced by Enju UBERON-AE: annotation for anatomical entities as defined in UBERON ICD10: annotation for disease names as defined in ICD10 GO-BP: annotation for biological process names as defined in GO GO-CC: annotation for cellular component names as defined in GO A SPARQL-driven search interface is provided at http://bionlp.dbcls.jp/sparql.7.99 KDBCLSJin-Dong Kim2023-11-29Released
bionlp-st-ge-2016-test-tees NER and event extraction produced by TEES (with the default GE11 model) for the 14 full papers used in the BioNLP 2016 GE task test corpus.9.17 KNico ColicNico Colic2023-11-29Released
craft-ca-core-dev Development data for CRAFT CA shared task, core concepts only. This project contains the development (training) annotations for the Concept Annotation task of the CRAFT Shared Task 2019. This particular set of concept annotations is the "core" set. See the task description for details, but this set contains only annotations to concepts that appear in the original 10 Open Biomedical Ontologies used for annotation. (That is to say, it does not contain any annotations to extension classes).59.8 KUniversity of Colorado Anschutz Medical Campuscraft-st2023-11-29Released
BioNLP16_DUT 6.5 KMessiy2023-11-29Testing
HZAU_wangshuguang_Just-for-fun 4wangshuguangwangshuguang2023-11-29Testing
NameT# Ann.AuthorMaintainerUpdated_at Status

321-340 / 590 show all
AnEM_full-texts 687NaCTeMYue Wang2023-11-29Uploading
AxD_symptoms 401Eisuke Dohi2023-11-29Developing
BLAH2015_Annotations_Adderall 0nestoralvaronestoralvaro2023-11-29Testing
BioLarkPubmedHPO 7.16 KTudor Grozasimon2023-11-29Released
Briefings 0Sophie Nam2023-11-29
CORD-19_All_docs 0Jin-Dong Kim2023-11-29Released
CORD-19_Commercial_use_subset 0Jin-Dong Kim2023-11-29Released
CORD-19_HIRAKI 2.98 KAikoHIRAKI2023-11-29Testing
CORD-19_Non-commercial_use_subset 0Jin-Dong Kim2023-11-29Released
CORD-19_bioRxiv_medRxiv_subset 0Jin-Dong Kim2023-11-29Released
Test-GeneOrGeneProduct 1.17 KJin-Dong Kim2023-11-29
GlycoBiology-FMA 96.3 KJin-Dong Kim2023-11-29Testing
bionlp-st-ge-2016-reference-tees 14.6 KNico Colic Nico Colic2023-11-29Released
semrep-sample 11.1 KRindflesch et al.Jin-Dong Kim2023-11-29Testing
CORD-19-sample-CHEBI 16Jin-Dong Kim2023-11-29Developing
bionlp-st-ge-2016-test 7.99 KDBCLSJin-Dong Kim2023-11-29Released
bionlp-st-ge-2016-test-tees 9.17 KNico ColicNico Colic2023-11-29Released
craft-ca-core-dev 59.8 KUniversity of Colorado Anschutz Medical Campuscraft-st2023-11-29Released
BioNLP16_DUT 6.5 KMessiy2023-11-29Testing
HZAU_wangshuguang_Just-for-fun 4wangshuguangwangshuguang2023-11-29Testing