
Projects

81-100 / 556

| Name | Description | # Ann. | Author / Maintainer | Updated_at | Status |
|---|---|---|---|---|---|
| AGCA_Sue | Active Gene Annotation Corpus for the Application in Drug Repurposing Discovery | 0 | Jingbo Xia, Xuan Qin, Kaiyin Zhou | 2023-11-29 | Developing |
| uniprot-human | Uniprot proteins for human | 21.8 K | Jin-Dong Kim / Jin-Dong Kim | 2023-11-29 | Testing |
| glycoprotein | glycoprotein annotation | 54 | issaku yamada / ISSAKU YAMADA | 2023-11-29 | Testing |
| bionlp-st-bb3-2016-training | Entity (bacteria, habitats and geographical places) annotation to the training dataset of the BioNLP-ST 2016 BB task. For more information, please refer to bionlp-st-bb3-2016-development and bionlp-st-bb3-2016-test. Bacteria: Bacteria entities are annotated as contiguous spans of text that contains a full unambiguous prokaryote taxon name, the type label is Bacteria. The Bacteria type is a taxon, at any taxonomic level from phylum (Eubacteria) to strain. The category that the text entities have to be assigned to is the most specific and unique category of the NCBI taxonomy resource. In case a given strain, or a group of strains is not referenced by NCBI, it is assigned with the closest taxid in the taxonomy. Habitat: Habitat entities are annotated as spans of text that contains a complete mention of a potential habitat for bacteria, the type label is Habitat. Habitat entities are assigned one or several concepts from the habitat subpart of the OntoBiotope ontology. The assigned concepts are as specific as possible. OntoBiotope defines most relevant microorganism habitats from all areas considered by microbial ecology (hosts, natural environment, anthropized environments, food, medical, etc.). Habitat entities are rarely referential entities, they are usually noun phrases including properties and modifiers. There are rare cases of habitats referred with adjectives or verbs. The spans are generally contiguous but some of them are discontinuous in order to cope with conjunctions. Geographical: Geographical entities are geographical and organization places denoted by official names. | 1.28 K | INRA / Yue Wang | 2023-11-29 | Released |
| Test_Project | | 0 | Ingenerf / ingenerf | 2023-11-29 | Testing |
| ichiharatest_150825 | test | 0 | ichihara_hisako / Hisako Ichihara | 2023-11-29 | Testing |
| traitCurationTest_ichihara | testProject150806 | 4 | ichihara_hisako / Hisako Ichihara | 2023-11-29 | Testing |
| PGDBj_disease_curation1 | disease curation test | 348 | ichihara_hisako / ichihara_hisako | 2023-12-03 | Testing |
| ichiharatest_150825_3 | test | 0 | ichihara_hisako / Hisako Ichihara | 2023-11-26 | Testing |
| ichiharatest_150825_2 | test | 0 | ichihara_hisako / Hisako Ichihara | 2015-09-11 | Testing |
| falsetest_150825 | test | 0 | ichihara_hisako / Hisako Ichihara | 2015-09-11 | Testing |
| DisGeNET5_gene_disease | The file contains gene-disease associations obtained by text mining MEDLINE abstracts using the BeFree system including the gene and disease off sets. | 2.04 M | IBI Group / Yue Wang | 2023-11-24 | Released |
| DisGeNET5_variant_disease | The file contains variant-disease associations obtained by text mining MEDLINE abstracts using the BeFree system, including the variant and disease off sets. | 144 K | IBI Group / Yue Wang | 2023-11-24 | Released |
| Genomics_Informatics | Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization. Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining. | 35.3 K | Hyun-Seok Park / ewha-bio | 2023-11-29 | Beta |
| Testing | Testing | 241 | Hyun-Seok Park / hsp20 | 2023-11-28 | Testing |
| OryzaGP1 | A dataset for Named Entity Recognition for rice gene | 0 | Huy Do. Pierre Larmande | 2019-01-31 | Uploading |
| OryzaGP | A dataset for Named Entity Recognition for rice gene | 29.1 K | Huy Do and Pierre Larmande / Yue Wang | 2023-11-24 | Uploading |
| Virus300 | 300 abstracts from virology journals annotated with viral proteins and species | 0 | http://aclweb.org/anthology/W/W17/W17-2311.pdf / helencook | 2017-08-07 | Released |
| test1 | test1 | 21 | H. S. Park / Sophie Nam | 2023-11-26 | Testing |
| CoGe_Citation_Annotations | Annotated PMC abstracts+full articles, that cite the "CoGe" papers (PMID: 18952863, 18269575). Total Num Citations: 165 Total Num Unique Citations: 141 Total Num Abstracts: 165 Total Num Whole Articles: 165 | 0 | Heather Lent / hclent | 2023-11-29 | Uploading |