> top > projects

Projects

NameTDescription # Ann.AuthorMaintainerUpdated_atStatus

181-200 / 316 show all
BioLarkPubmedHPO 228 abstracts manually annotated with Human Phenotype Ontology (HPO) concepts and harmonized by three curators, which can be used as a reference standard for free text annotation of human phenotypes. For more info, please see Groza et al. "Automatic concept recognition using the human phenotype ontology reference and test suite corpora", 2015.7.16 KTudor Grozasimon2023-11-29Released
FA_Top100Plus-Disease 2/2 FirstAuthor Top100+7 for diseases MONDO & HPO246AikoHIRAKI2023-11-29Testing
AnEM_abstracts 250 documents selected randomly from citation abstracts Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_full-texts, it is probably the largest manually annotated corpus on anatomical entities.1.91 KNaCTeMYue Wang2023-11-29Released
AnEM_full-texts 250 documents selected randomly from full-text papers Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_abstracts, it is probably the largest manually annotated corpus on anatomical entities.687NaCTeMYue Wang2023-11-29Uploading
Virus300 300 abstracts from virology journals annotated with viral proteins and species0http://aclweb.org/anthology/W/W17/W17-2311.pdfhelencook2017-08-07Released
guideline annotations 5 guideline annotations with custom vocab0Tiffany Leung2015-11-07Developing
pubmed-sentences-benchmark A benchmark data for text segmentation into sentences. The source of annotation is the GENIA treebank v1.0. Following is the process taken. began with the GENIA treebank v1.0. sentence annotations were extracted and converted to PubAnnotation JSON. uploaded. 12 abstracts met alignment failure. among the 12 failure cases, 4 had a dot('.') character where there should be colon (':'). They were manually fixed then successfully uploaded: 7903907, 8053950, 8508358, 9415639. among the 12 failed abstracts, 8 were "250 word truncation" cases. They were manually fixed and successfully uploaded. During the fixing, manual annotations were added for the missing pieces of text. 30 abstracts had extra text in the end, indicating copyright statement, e.g., "Copyright 1998 Academic Press." They were annotated as a sentence in GTB. However, the text did not exist anymore in PubMed. Therefore, the extra texts were removed, together with the sentence annotation to them. 18.4 KGENIA projectJin-Dong Kim2023-11-28Released
RELISH-DB Abstracts contained in the data of the RELISH-DB (https://relishdb.ict.griffith.edu.au) made available for download here. Data was downloaded from here: https://figshare.com/projects/RELISH-DB/60095 Related publication: https://academic.oup.com/database/article/doi/10.1093/database/baz085/5608006#20072202302023-11-29Released
MeasurableQuantitativeAnnotation A collection and annotation the measurable quantity information from 3202 pubmed article, which can be used for the task of extracting measurable quantity information. Annotation category: entity, num, unit.2.84 KWenjieNie2023-11-29Testing
OryzaGP1 A dataset for Named Entity Recognition for rice gene0Huy Do. Pierre Larmande2019-01-31Uploading
EC-Neurodegenerative Alternative methods to animal experiments for neurodegenerative diseases.0zebet2023-11-30Developing
Minna_de_Honkoku An annotation project for Minna de Honkoku, a crowdsourced transcription project for historical Japanese documents..204Yuta Hashimotoyhashimoto2023-11-28Developing
2015-BEL-Sample An attempt to upload 295 BEL statements, i.e. the sample set used for the 2015 BioCreative challenge. 58Fabio RinaldiFabio Rinaldi2023-11-29Testing
CoGe_Citation_Annotations Annotated PMC abstracts+full articles, that cite the "CoGe" papers (PMID: 18952863, 18269575). Total Num Citations: 165 Total Num Unique Citations: 141 Total Num Abstracts: 165 Total Num Whole Articles: 165 0Heather Lenthclent2023-11-29Uploading
GO-BP Annotation for biological processes as defined in the "Biological Process" subset of Gene Ontology35.4 KDBCLSJin-Dong Kim2023-11-29Developing
GO-CC Annotation for cellular components as defined in the "Cellular Component" subtree of Gene Ontology17.6 KDBCLSJin-Dong Kim2023-11-30Developing
GO-MF Annotation for molecular functions as defined in the "Molecular Function" subtree of Gene Ontology19.7 KDBCLSJin-Dong Kim2023-12-04Testing
BioMedLAT Annotation of 643 questions from BioASQ with the Lexical Answer Type (LAT) and headword.02016-09-23Developing
dailymed_spl Annotation of indications from DailyMed structured product labels0micheldumontier2023-11-29Developing
DocumentLevelAnnotationSample A sample project for document level annotation47Jin-Dong Kim2023-11-29Testing
NameT# Ann.AuthorMaintainerUpdated_atStatus

181-200 / 316 show all
BioLarkPubmedHPO 7.16 KTudor Grozasimon2023-11-29Released
FA_Top100Plus-Disease 246AikoHIRAKI2023-11-29Testing
AnEM_abstracts 1.91 KNaCTeMYue Wang2023-11-29Released
AnEM_full-texts 687NaCTeMYue Wang2023-11-29Uploading
Virus300 0http://aclweb.org/anthology/W/W17/W17-2311.pdfhelencook2017-08-07Released
guideline annotations 0Tiffany Leung2015-11-07Developing
pubmed-sentences-benchmark 18.4 KGENIA projectJin-Dong Kim2023-11-28Released
RELISH-DB 02023-11-29Released
MeasurableQuantitativeAnnotation 2.84 KWenjieNie2023-11-29Testing
OryzaGP1 0Huy Do. Pierre Larmande2019-01-31Uploading
EC-Neurodegenerative 0zebet2023-11-30Developing
Minna_de_Honkoku 204Yuta Hashimotoyhashimoto2023-11-28Developing
2015-BEL-Sample 58Fabio RinaldiFabio Rinaldi2023-11-29Testing
CoGe_Citation_Annotations 0Heather Lenthclent2023-11-29Uploading
GO-BP 35.4 KDBCLSJin-Dong Kim2023-11-29Developing
GO-CC 17.6 KDBCLSJin-Dong Kim2023-11-30Developing
GO-MF 19.7 KDBCLSJin-Dong Kim2023-12-04Testing
BioMedLAT 02016-09-23Developing
dailymed_spl 0micheldumontier2023-11-29Developing
DocumentLevelAnnotationSample 47Jin-Dong Kim2023-11-29Testing