LitCovid-PD-MONDO-v1 | | PubDictionaries annotation for disease terms - updated on 2020-04-20
It is based on MONDO Version 2020-04-20.
The terms in MONDO are loaded in PubDictionaries, with which the annotations in this project are produced. The parameter configuration used for this project is here.
Note that this is an automatically generated, dictionary-based annotation. It will be updated periodically as more documents are added and the dictionary is improved. | 13.4 K | | Jin-Dong Kim | 2023-11-29 | Released | |
LitCovid-docs-s | | | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
GlyCosmos600-docs | | A random collection of 600 PubMed abstracts from 6 glycobiology-related journals: Glycobiology, Glycoconjugate journal, The Journal of biological chemistry, Journal of proteome research, Journal of proteomics, and Carbohydrate research. The full set of PMIDs for these journals was collected on June 11, 2019, and 100 PMIDs were randomly sampled from each journal. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19_All_docs | | All the documents in the whole CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19_Commercial_use_subset | | The Commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19_Non-commercial_use_subset | | The Non-commercial use subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
CORD-19_bioRxiv_medRxiv_subset | | The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released | |
bionlp-st-ge-2016-reference-tees | | NER and event extraction produced by TEES (with the default GE11 model) for the 20 full papers used in the BioNLP 2016 GE task reference corpus. | 14.6 K | Nico Colic | Nico Colic | 2023-11-29 | Released | |
bionlp-st-ge-2016-test-tees | | NER and event extraction produced by TEES (with the default GE11 model) for the 14 full papers used in the BioNLP 2016 GE task test corpus. | 9.17 K | Nico Colic | Nico Colic | 2023-11-29 | Released | |
LitCovid-OGER | | Using OGER (http://www.ontogene.org/resources/oger) to detect entities from 10 different vocabularies | 9.31 K | Fabio Rinaldi | Nico Colic | 2023-11-29 | Released | |
PubmedHPO | | Human phenotype annotation of PubMed abstracts, based on the Human Phenotype Ontology (HPO) | 12.4 M | Tudor Groza | tudor | 2023-11-24 | Beta | |
LitCovid-PubTator | | | 5.88 M | | Jin-Dong Kim | 2023-11-24 | Beta | |
PubCasesHPO | | HPO annotation in PubCases | 3.18 M | | Toyofumi Fujiwara | 2023-11-24 | Beta | |
DisGeNET | | Disease-Gene association annotation. | 3.12 M | Nuria Queralt | Jin-Dong Kim | 2023-11-24 | Beta | |
NEUROSES | | This corpus is composed of PubMed articles containing mentions of cognitive enhancer and antidepressant drugs. The selected sentences are automatically annotated using the NCBO Annotator with the Chemical Entities of Biological Interest (CHEBI) and Phenotypic Quality Ontology (PATO) ontologies. Additional annotations were produced using the PhenoMiner ontology via a dictionary-based tagger. | 2.14 M | | nestoralvaro | 2023-11-24 | Beta | |
PubCasesORDO | | ORDO annotation in PubCases | 865 K | | Toyofumi Fujiwara | 2023-11-24 | Beta | |
LitCovid-PD-HP | | | 922 K | | Jin-Dong Kim | 2023-11-28 | Beta | |
LitCovid-sample-PD-IDO | | | 1.27 K | | Jin-Dong Kim | 2023-11-28 | Beta | |
LitCovid-PAS-Enju | | Predicate-argument structure annotation produced by the Enju parser. | 125 K | | Jin-Dong Kim | 2023-11-28 | Beta | |
LitCovid-sample-PD-FMA | | | 1.93 K | | Jin-Dong Kim | 2023-11-28 | Beta | |