CORD-19_Custom_license_subset | | The Custom license subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 5.08 M | | Jin-Dong Kim | 2023-11-24 | Released | |
CORD-19_HIRAKI | | HIRAKI Annotation for CORD-19 | 2.98 K | | AikoHIRAKI | 2023-11-29 | Testing | |
CORD-19_Non-commercial_use_subset | | The Non commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19-PD-HP | | PubDictionaries annotation for HP terms - updated at 2020-04-30
Disease term annotation based on HP.
Version 2020-04-20.
The terms in HP are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here.
Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved. | 1.15 M | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19-PD-MONDO | | PubDictionaries annotation for MONDO terms - updated at 2020-04-30
It is disease term annotation based on MONDO.
Version 2020-04-20.
The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here.
Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved. | 6.32 M | | Jin-Dong Kim | 2023-11-27 | Released | |
CORD-19-PD-UBERON | | PubDictionaries annotation for UBERON terms - updated at 2020-04-30
It is disease term annotation based on Uberon.
The terms in Uberon are uploaded in PubDictionaries
(Uberon), with which the annotations in this project are produced.
The parameter configuration used for this project is
here.
Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved. | 1.42 M | | Jin-Dong Kim | 2023-11-24 | Released | |
CORD-19-sample-CHEBI | | | 16 | | Jin-Dong Kim | 2023-11-29 | Developing | |
CORD-19-sample-FMA-UBERON | | | 61 | | Jin-Dong Kim | 2023-11-29 | Developing | |
CORD-19-sample-HP | | | 39 | | Jin-Dong Kim | 2023-11-27 | Developing | |
CORD-19-sample-IDO | | | 76 | | Jin-Dong Kim | 2023-11-29 | Developing | |
CORD-19-sample-MONDO | | | 113 | | Jin-Dong Kim | 2023-11-29 | Developing | |
CORD-19-sample-paragraphs | | | 28 | | Jin-Dong Kim | 2023-11-29 | Developing | |
CORD-19-sample-sentences | | | 161 | | Jin-Dong Kim | 2023-11-27 | Developing | |
CORD-19-sample-UBERON | | | 54 | | Jin-Dong Kim | 2023-11-26 | Developing | |
CORD-19-SciBite-sentences | | | 11.2 K | | Jin-Dong Kim | 2023-11-26 | Testing | |
CORD-19-Sentences | | | 13.4 M | | Jin-Dong Kim | 2023-11-24 | Testing | |
CORD-PICO | | Automatic annotation of the CORD-19 dataset with PICO categories. The corpus was automatically labeled with an LSTM-CRF model trained on human-annotated PubMed abstracts from https://github.com/bepnye/EBM-NLP. Currently, titles and abstracts only are annotated using Population, Intervention and Outcome labels, as well as more fine-grained labels such as Age, Drug, Mortality and others. | 69.6 K | Simon Suster | ssuster | 2023-11-27 | Developing | |
Covid19_manual_annotation | | | 5.1 K | | AikoHIRAKI | 2023-11-29 | Developing | |
Covid19_manual_annotation_v2 | | | 4.58 K | | AikoHIRAKI | 2023-11-24 | Developing | |
craft-ca-core-dev | | Development data for CRAFT CA shared task, core concepts only. This project contains the development (training) annotations for the Concept Annotation task of the CRAFT Shared Task 2019. This particular set of concept annotations is the "core" set. See the task description for details, but this set contains only annotations to concepts that appear in the original 10 Open Biomedical Ontologies used for annotation. (That is to say, it does not contain any annotations to extension classes). | 59.8 K | University of Colorado Anschutz Medical Campus | craft-st | 2023-11-29 | Released | |