CORD-19_Custom_license_subset | | The Custom license subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 5.08 M | 2023-11-24 | Released | |
CORD-19_Non-commercial_use_subset | | The Non commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | 2023-11-29 | Released | |
bionlp-st-ge-2016-uniprot | | UniProt protein annotation to the benchmark data set of BioNLP-ST 2016 GE task: reference data set (bionlp-st-ge-2016-reference) and test data set (bionlp-st-ge-2016-test).
The annotations are produced based on a dictionary which is semi-automatically compiled for the 34 full paper articles included in the benchmark data set (20 in the reference data set + 14 in the test data set).
For detailed information about BioNLP-ST GE 2016 task data sets, please refer to the benchmark reference data set (bionlp-st-ge-2016-reference) and benchmark test data set (bionlp-st-ge-2016-test).
| 16.2 K | 2023-11-29 | Beta | |
metamap-sample | | Sample annotation of MetaMep, produced by Aronson, et al.
An overview of MetaMap: historical perspective and recent advances, JAMIA 2010 | 10.9 K | 2023-11-27 | Testing | |
pubtator-sample | | Sample annotation of PubTator produced by Zhiyong Lu et al. | 28 | 2023-11-27 | Testing | |
semrep-sample | | Sample annotation of SemRep, produced by Rindflesch, et al.
Rindflesch, T.C. and Fiszman, M. (2003). The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 36(6):462-477. | 11.1 K | 2023-11-29 | Testing | |
sentences | | Sentence segmentation annotation.
Automatic annotation by TextSentencer. | 6.96 M | 2023-11-24 | Developing | |
LitCovid-sentences-v1 | | Sentence segmentation of all the texts in the LitCovid literature. The segmentation is automatically obtained using the TextSentencer annotation service developed and maintained by DBCLS. | 16.5 K | 2023-11-27 | Released | |
LitCovid-PD-GO-BP | | Terms for biological prosesses, as defined in GO | 374 K | 2023-11-29 | Developing | |
GlycoConjugate-collection | | The PubMed entries (titles and abstracts) from the journal of GlycoConjugate | 0 | 2023-11-28 | Developing | |