> top > projects

Projects

NameTDescription# Ann.AuthorMaintainerUpdated_atStatus

1-20 / 114 show all
craft-sa-dev Development data for CRAFT SA shared task. This project contains the development (training) annotations for the Structural Annotation task of the CRAFT Shared Task 2019. This particular set contains token and sentence annotations with tokens linked via dependency relations. These dependency relations were automatically generated using the manually curated CRAFT constituency treebank files as input.490 KUniversity of Colorado Anschutz Medical Campuscraft-st2020-09-22Released
bionlp-st-ge-2016-test-tees NER and event extraction produced by TEES (with the default GE11 model) for the 14 full papers used in the BioNLP 2016 GE task test corpus.9.17 KNico ColicNico Colic2020-09-18Released
bionlp-st-ge-2016-spacy-parsed Dependency parses produced by spaCy parser, and part-of-speech tags produced by Stanford tagger (with the wsj-0-18-left3words-nodistsim model). The exact procedure is described here. Data set contains the 34 full paper articles used in the BioNLP 2016 GE task. 225 KNico ColicNico Colic2020-09-18Released
bionlp-st-ge-2016-reference-tees NER and event extraction produced by TEES (with the default GE11 model) for the 20 full papers used in the BioNLP 2016 GE task reference corpus.14.6 KNico Colic Nico Colic2020-09-13Released
LitCovid-OGER-BB Using OGER (www.ontogene.com) and Biobert to obtain annotations for 10 different vocabularies.308 KFabio RinaldiNico Colic2020-06-04Released
LitCovid-PD-HP PubDictionaries annotation for human phenotype terms - updated at 2020-04-20 Disease term annotation based on HP. Version 2020-04-20. The terms in HP are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.3.03 KJin-Dong Kim2020-05-25Released
CORD-19-PD-HP PubDictionaries annotation for HP terms - updated at 2020-04-30 Disease term annotation based on HP. Version 2020-04-20. The terms in HP are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.1.15 MJin-Dong Kim2020-05-12Released
LitCovid-PD-MONDO PubDictionaries annotation for disease terms - updated at 2020-04-20 It is based on MONDO Version 2020-04-20. The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.13.4 KJin-Dong Kim2020-05-10Released
LitCovid-PD-FMA-UBERON PubDictionaries annotation for anatomy terms - updated at 2020-04-20 Disease term annotation based on FMA and Uberon. Version 2020-04-20. The terms in FMA and Uberon are loaded in PubDictionaries (FMA and Uberon), with which the annotations in this project are produced. The parameter configuration used for this project is here for FMA and there for Uberon. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.4.3 KJin-Dong Kim2020-05-10Released
CORD-19-PD-UBERON PubDictionaries annotation for UBERON terms - updated at 2020-04-30 It is disease term annotation based on Uberon. The terms in Uberon are uploaded in PubDictionaries (Uberon), with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.1.42 MJin-Dong Kim2020-04-30Released
CORD-19-PD-MONDO PubDictionaries annotation for MONDO terms - updated at 2020-04-30 It is disease term annotation based on MONDO. Version 2020-04-20. The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here. Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved.6.32 MJin-Dong Kim2020-04-30Released
LitCovid-sentences Sentence segmentation of all the texts in the LitCovid literature. The segmentation is automatically obtained using the TextSentencer annotation service developed and maintained by DBCLS.16.5 KJin-Dong Kim2020-04-14Released
CORD-19_Custom_license_subset The Custom license subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.5.08 MJin-Dong Kim2020-04-10Released
LitCovid-OGER Using OGER (http://www.ontogene.org/resources/oger) to detect entities from 10 different vocabularies9.31 KFabio RinaldiNico Colic2020-04-02Released
LitCovid-PubTatorCentral Named-entities for the documents in the LitCovid dataset. Annotations were automatically predicted by the PubTatorCentral tool (https://www.ncbi.nlm.nih.gov/research/pubtator/)4.64 Kzebet2020-04-01Released
PubMed_ArguminSci Predictions for PubMed automatically extracted with the ArguminSci tool (https://github.com/anlausch/ArguminSci).777 Kzebet2020-03-31Released
CORD-19_Non-commercial_use_subset The Non commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23Released
CORD-19_Commercial_use_subset The Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23Released
CORD-19_bioRxiv_medRxiv_subset The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. 0Jin-Dong Kim2020-03-23Released
CORD-19_All_docs All the documents in the whole CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23Released
NameT# Ann.AuthorMaintainerUpdated_atStatus

1-20 / 114 show all
craft-sa-dev 490 KUniversity of Colorado Anschutz Medical Campuscraft-st2020-09-22Released
bionlp-st-ge-2016-test-tees 9.17 KNico ColicNico Colic2020-09-18Released
bionlp-st-ge-2016-spacy-parsed 225 KNico ColicNico Colic2020-09-18Released
bionlp-st-ge-2016-reference-tees 14.6 KNico Colic Nico Colic2020-09-13Released
LitCovid-OGER-BB 308 KFabio RinaldiNico Colic2020-06-04Released
LitCovid-PD-HP 3.03 KJin-Dong Kim2020-05-25Released
CORD-19-PD-HP 1.15 MJin-Dong Kim2020-05-12Released
LitCovid-PD-MONDO 13.4 KJin-Dong Kim2020-05-10Released
LitCovid-PD-FMA-UBERON 4.3 KJin-Dong Kim2020-05-10Released
CORD-19-PD-UBERON 1.42 MJin-Dong Kim2020-04-30Released
CORD-19-PD-MONDO 6.32 MJin-Dong Kim2020-04-30Released
LitCovid-sentences 16.5 KJin-Dong Kim2020-04-14Released
CORD-19_Custom_license_subset 5.08 MJin-Dong Kim2020-04-10Released
LitCovid-OGER 9.31 KFabio RinaldiNico Colic2020-04-02Released
LitCovid-PubTatorCentral 4.64 Kzebet2020-04-01Released
PubMed_ArguminSci 777 Kzebet2020-03-31Released
CORD-19_Non-commercial_use_subset 0Jin-Dong Kim2020-03-23Released
CORD-19_Commercial_use_subset 0Jin-Dong Kim2020-03-23Released
CORD-19_bioRxiv_medRxiv_subset 0Jin-Dong Kim2020-03-23Released
CORD-19_All_docs 0Jin-Dong Kim2020-03-23Released