> top > collections > CORD-19
CORD-19
Description

CORD-19 (COVID-19 Open Research Dataset) is a free, open resource for the global research community provided by the Allen Institute for AI: https://pages.semanticscholar.org/coronavirus-research.

As of 2020-03-20, it contains over 29,000 full text articles. This CORD-19 collection at PubAnnotation is prepared for the purpose of collecting annotations to the texts, so that they can be easily accessed and utilized.

If you want to contribute with your annotation,

  1. take the documents in the CORD-19_All_docs project,
  2. produce your annotation to the texts using your annotation system, and
  3. contribute the annotation back to PubAnnotation (HowTo).

All the contributed annotations will become publicly available. Please note that, during uploading your annotation data, you do not need to be worried about slight changes in the text: PubAnnotation will automatically catch them and adjust the positions appropriately.

Once you have uploaded your annotation, please notify it to admin@pubannotation.org admin@pubannotation.org, so that it can be included in this collection, which will make your annotation much easily findable.

Note that as the CORD-19 dataset grows, the documents in this collection also will be updated.

IMPORTANT: CORD-19 License agreement requires that the dataset must be used for text and data mining only.


Maintainer Jin-Dong Kim
Projects
Name TDescription# Ann.MaintainerUpdated_atRDFized_atStatus

1-8 / 8
CORD-19_All_docsAll the documents in the whole CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23-Released
CORD-19_bioRxiv_medRxiv_subsetThe bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. 0Jin-Dong Kim2020-03-23-Released
CORD-19_Commercial_use_subsetThe Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23-Released
CORD-19_Custom_license_subsetThe Custom license subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23-Released
CORD-19-Diseases-PubDictionariesAnnotation for disease names based on the MONDO dictionary at PubDictionaries. Now it is at a test phase with 50 randomly sampled articles.2.06 MJin-Dong Kim2020-04-052020-04-06Testing
CORD-19_Non-commercial_use_subsetThe Non commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.0Jin-Dong Kim2020-03-23-Released
CORD-19-SciBite-sentences11.2 KJin-Dong Kim2020-03-27-Testing
CORD-19-Sentences5.89 MJin-Dong Kim2020-03-272020-03-27Testing