Projects

Showing 1-20 of 115 projects.

| Name | Description | # Ann. | Author / Maintainer | Updated_at | Status |
|---|---|---|---|---|---|
| CORD-19_All_docs | All the documents in the whole CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | Jin-Dong Kim | 2020-03-23 | Released |
| CORD-19_Commercial_use_subset | The Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | Jin-Dong Kim | 2020-03-23 | Released |
| CORD-19_Non-commercial_use_subset | The Non-commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | Jin-Dong Kim | 2020-03-23 | Released |
| GlyCosmos600-docs | A random collection of 600 PubMed abstracts from 6 glycobiology-related journals: Glycobiology, Glycoconjugate Journal, The Journal of Biological Chemistry, Journal of Proteome Research, Journal of Proteomics, and Carbohydrate Research. The full set of PMIDs was collected on June 11, 2019. From each journal, 100 PMIDs were randomly sampled. | 0 | Jin-Dong Kim | 2019-06-11 | Released |
| UseCases_PubTatorCentral | Predictions from PubTator Central (https://www.ncbi.nlm.nih.gov/research/pubtator/) for the seven datasets and for four entity types (disease, chemical, species, cell line). | 0 | zebet | 2019-11-01 | Developing |
| PubCasesCollection | Abstracts in PubCases. | 0 | Jin-Dong Kim | 2017-09-08 | |
| CORD-19_bioRxiv_medRxiv_subset | The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | Jin-Dong Kim | 2020-03-23 | Released |
| Grays_part1_test | | 0 | Jin-Dong Kim | 2019-09-03 | Testing |
| GlycoConjugate-collection | The PubMed entries (titles and abstracts) from the Glycoconjugate Journal. | 0 | Jin-Dong Kim | 2018-02-09 | Developing |
| pubmed-2016 | Abstracts published in 2016. | 0 | Jin-Dong Kim | 2017-09-03 | |
| SPECIES800_autotagged | This project comprises the SPECIES800 corpus documents automatically annotated by the Jensenlab tagger. Annotated entity types are: genes/proteins from the mentioned organisms (and any human ones); PubChem Compound identifiers; NCBI Taxonomy entries; Gene Ontology cellular component terms; BRENDA Tissue Ontology terms; Disease Ontology terms; Environment Ontology terms. The SPECIES 800 (S800) corpus comprises 800 PubMed abstracts. In its original form, species mentions were manually identified and mapped to the corresponding NCBI Taxonomy identifiers. Described in: The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text. Pafilis E, Frankild SP, Fanini L, Faulwetter S, Pavloudi C, et al. (2013). PLoS ONE, 2013, 8(6): e65390. doi:10.1371/journal.pone.0065390. The manually annotated corpus is also available as a PubAnnotation project. | 0 | Evangelos Pafilis, Sampo Pyysalo, Lars Juhl Jensen / evangelos | 2015-11-20 | Testing |
| IMDB-NLP | Annotations for chunking and semantic role labeling based on in-memory databases. | 0 | | 2016-05-06 | Uploading |
| BLAH2015_Annotations_Adderall | | 0 | nestoralvaro / nestoralvaro | 2015-03-15 | Testing |
| GlycoBiology-GO | GO-based annotation of GlycoBiology abstracts. | 0 | Jin-Dong Kim | 2016-06-11 | Testing |
| BioASQ-sample | Collection of PubMed articles that appear in the BioASQ sample data set. | 0 | BioASQ / Jin-Dong Kim | 2015-10-13 | Testing |
| PubMed-2000 | Abstracts published in 2000. | 0 | Jin-Dong Kim | 2017-09-01 | Developing |
| AlvisNLP-Async-Test | Test for the asynchronous AlvisNLP/ML annotator family. | 0 | Robert Bossy / rbossy | 2017-04-07 | Testing |
| PubMed-2017 | Abstracts published in 2017. | 0 | Jin-Dong Kim | 2017-09-01 | Developing |
| AGCA_Sue | Active Gene Annotation Corpus for the Application in Drug Repurposing Discovery. | 0 | Jingbo Xia, Xuan Qin, Kaiyin Zhou | 2017-11-13 | Developing |
| Test_economics | test | 1 | YongHwanKim / kimyonghwan | 2017-07-13 | Testing |