PubAnnotation

> top > users > Jin-Dong Kim

Jin-Dong Kim

User info

Collections

Name		Description	Updated at

« 1 2 11-12 / 12 show all
CORD-19		CORD-19 (COVID-19 Open Research Dataset) is a free, open resource for the global research community provided by the Allen Institute for AI: https://pages.semanticscholar.org/coronavirus-research. As of 2020-03-20, it contains over 29,000 full text articles. This CORD-19 collection at PubAnnotation is prepared for the purpose of collecting annotations to the texts, so that they can be easily accessed and utilized. If you want to contribute with your annotation, take the documents in the CORD-19_All_docs project, produce your annotation to the texts using your annotation system, and contribute the annotation back to PubAnnotation (HowTo). All the contributed annotations will become publicly available. Please note that, during uploading your annotation data, you do not need to be worried about slight changes in the text: PubAnnotation will automatically catch them and adjust the positions appropriately. Once you have uploaded your annotation, please notify it to admin@pubannotation.org admin@pubannotation.org, so that it can be included in this collection, which will make your annotation much easily findable. Note that as the CORD-19 dataset grows, the documents in this collection also will be updated. IMPORTANT: CORD-19 License agreement requires that the dataset must be used for text and data mining only.	2020-04-14
bionlp-st-ge-2016		The 2016 edition of the Genia event extraction (GE) task organized within BioNLP-ST 2016	2019-03-11

Projects

Name	T	Description	# Ann.	Updated at	Status

« 1 2 ... 6 7 8 9 10 11 12 13 14 15 16 » 91-100 / 158 show all
CORD-19_All_docs		All the documents in the whole CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.	0	2023-11-29	Released
CORD-19_Commercial_use_subset		The Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.	0	2023-11-29	Released
CORD-19_Non-commercial_use_subset		The Non commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.	0	2023-11-29	Released
CORD-19_bioRxiv_medRxiv_subset		The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT.	0	2023-11-29	Released
Test-GeneOrGeneProduct			1.17 K	2023-11-29
GlycoBiology-FMA		FMA ontology-based annotation to GlycoBiology abstracts	96.3 K	2023-11-29	Testing
semrep-sample		Sample annotation of SemRep, produced by Rindflesch, et al. Rindflesch, T.C. and Fiszman, M. (2003). The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 36(6):462-477.	11.1 K	2023-11-29	Testing
CORD-19-sample-CHEBI			16	2023-11-29	Developing
bionlp-st-ge-2016-test		It is the benchmark test data set of the BioNLP-ST 2016 GE task. It includes Genia-style event annotations to 14 full paper articles which are about NFκB proteins. For testing purpose, however, annotations are all blinded, which means users cannot see the annotations in this project. Instead, annotations in any other project can be compared to the hidden annotations in this project, then the annotations in the project will be automatically evaluated based on the comparison. A participant of GE task can get the evaluation of his/her result of automatic annotation, through following process: Create a new project. Import documents from the project, bionlp-st-2016-test-proteins to your project. Import annotations from the project, bionlp-st-2016-test-proteins to your project. At this point, you may want to compare you project to this project, the benchmark data set. It will show that protein annotations in your project is 100% correct, but other annotations, e.g., events, are 0%. Produce event annotations, using your system, upon the protein annotations. Upload your event annotations to your project. Compare your project to this project, to get evaluation. GE 2016 benchmark data set is provided as multi-layer annotations which include: bionlp-st-ge-2016-reference: benchmark reference data set bionlp-st-ge-2016-test: benchmark test data set (this project) bionlp-st-ge-2016-test-proteins: protein annotation to the benchmark test data set Following is supporting resources: bionlp-st-ge-2016-coref: coreference annotation bionlp-st-ge-2016-uniprot: Protein annotation with UniProt IDs. pmc-enju-pas: dependency parsing result produced by Enju UBERON-AE: annotation for anatomical entities as defined in UBERON ICD10: annotation for disease names as defined in ICD10 GO-BP: annotation for biological process names as defined in GO GO-CC: annotation for cellular component names as defined in GO A SPARQL-driven search interface is provided at http://bionlp.dbcls.jp/sparql.	7.99 K	2023-11-29	Released
LitCovid-PD-GlycoEpitope			999	2023-11-29	Developing

Automatic annotators

Name	Description

1 2 3 4 » 1-10 / 39 show all
PD-HP
PD-HP-B
PD-GlycoGenes20190927-B
PD-GlycoProteins-B
PD-Preeclampsia-B
PD-NCBITaxon-B
PD-CHEBI-B
PD-GlycoGenes-B
PD-GlycanStructures-B
PD-CLO-B

Editors

Name	Description

1-1 / 1
TextAE	The official stable version of TextAE.