LitCovid-PAS-Enju | | Predicate-argument structure annotation produced by the Enju parser. | 125 K | | Jin-Dong Kim | 2023-11-28 | Beta | |
GlyCosmos600-MAT | | | 863 | | Jin-Dong Kim | 2023-11-29 | Testing | |
DisGeNet-2017-sample | | | 2.93 K | | Jin-Dong Kim | 2023-11-29 | Testing | |
CORD-19-sample-HP | | | 39 | | Jin-Dong Kim | 2023-11-27 | Developing | |
LitCovid-sample-sentences | | | 2.3 K | | Jin-Dong Kim | 2023-11-29 | Beta | |
LitCovid-sample-MedDRA | | | 185 | | Jin-Dong Kim | 2023-11-27 | Testing | |
GlycoBiology-FMA | | FMA ontology-based annotation to GlycoBiology abstracts | 96.3 K | | Jin-Dong Kim | 2023-11-29 | Testing | |
LitCovid-sample-PD-MONDO | | | 1.21 K | | Jin-Dong Kim | 2023-11-27 | Developing | |
CORD-19_All_docs | | All the documents in the whole CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19-sample-CHEBI | | | 16 | | Jin-Dong Kim | 2023-11-29 | Developing | |
hydroxychloroquine | | | 2.59 K | | Jin-Dong Kim | 2023-11-29 | Developing | |
LitCovid-sample-CHEBI | | | 1.44 K | | Jin-Dong Kim | 2023-11-29 | Testing | |
PubMed-2000 | | abstracts published in 2000. | 0 | | Jin-Dong Kim | 2023-11-29 | Developing | |
LitCoin-PubTator-for-Tuning | | A set of randomly selected PubMed articles with PubTator annotation.
The labels of PubTator annotations are converted to corresponding labels for LitCoin as follows:
'Gene' -> 'GeneOrGeneProduct',
'Disease' -> 'DiseaseOrPhenotypicFeature',
'Chemical' -> 'ChemicalEntity'
'Species' -> 'OrganismTaxon'
'Mutation' -> 'SequenceVariant'
'CellLine' -> 'CellLine' | 14.2 K | | Jin-Dong Kim | 2023-11-29 | | |
LitCovid-PD-MONDO-v1 | | PubDictionaries annotation for disease terms - updated at 2020-04-20
It is based on MONDO Version 2020-04-20.
The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here.
Note that it is an automatically generated dictionary-based annotation. It will be updated periodically, as the documents are increased, and the dictionary is improved. | 13.4 K | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19_bioRxiv_medRxiv_subset | | The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT.
| 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
GlycoGenes | | annotation for glyco-genes based on GGDB | 1.01 K | | Jin-Dong Kim | 2023-11-29 | Developing | |
GlyCosmos6-docs | | | 0 | | Jin-Dong Kim | 2023-11-29 | Developing | |
BioASQ-sample | | collection of PubMed articles which appear in the BioASQ sample data set. | 0 | BioASQ | Jin-Dong Kim | 2023-11-28 | Testing | |
CORD-19_Commercial_use_subset | | The Commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |