LitCoin-PubTator-for-Tuning | | A set of randomly selected PubMed articles with PubTator annotation.
The labels of PubTator annotations are converted to corresponding labels for LitCoin as follows:
'Gene' -> 'GeneOrGeneProduct',
'Disease' -> 'DiseaseOrPhenotypicFeature',
'Chemical' -> 'ChemicalEntity'
'Species' -> 'OrganismTaxon'
'Mutation' -> 'SequenceVariant'
'CellLine' -> 'CellLine' | 14.2 K | 2023-11-29 | | |
GlyCosmos600-GlycanStructure | | | 97 | 2023-11-29 | Testing | |
CORD-19_All_docs | | All the documents in the whole CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | 2023-11-29 | Released | |
CORD-19_Commercial_use_subset | | The Commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | 2023-11-29 | Released | |
CORD-19_Non-commercial_use_subset | | The Non commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | 2023-11-29 | Released | |
CORD-19_bioRxiv_medRxiv_subset | | The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT.
| 0 | 2023-11-29 | Released | |
Test-GeneOrGeneProduct | | | 1.17 K | 2023-11-29 | | |
GlycoBiology-FMA | | FMA ontology-based annotation to GlycoBiology abstracts | 96.3 K | 2023-11-29 | Testing | |
semrep-sample | | Sample annotation of SemRep, produced by Rindflesch, et al.
Rindflesch, T.C. and Fiszman, M. (2003). The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 36(6):462-477. | 11.1 K | 2023-11-29 | Testing | |
CORD-19-sample-CHEBI | | | 16 | 2023-11-29 | Developing | |