> top > projects

Projects

NameTDescription# Ann.Author MaintainerUpdated_atStatus

461-480 / 556 show all
OryzaGP1 A dataset for Named Entity Recognition for rice gene0Huy Do. Pierre Larmande2019-01-31Uploading
Genomics_Informatics Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization. Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining.35.3 KHyun-Seok Parkewha-bio2023-11-29Beta
Testing Testing241Hyun-Seok Parkhsp202023-11-28Testing
DisGeNET5_variant_disease The file contains variant-disease associations obtained by text mining MEDLINE abstracts using the BeFree system, including the variant and disease off sets. 144 KIBI GroupYue Wang2023-11-24Released
DisGeNET5_gene_disease The file contains gene-disease associations obtained by text mining MEDLINE abstracts using the BeFree system including the gene and disease off sets.2.04 MIBI GroupYue Wang2023-11-24Released
traitCurationTest_ichihara testProject1508064ichihara_hisakoHisako Ichihara2023-11-29Testing
falsetest_150825 test0ichihara_hisakoHisako Ichihara2015-09-11Testing
ichiharatest_150825_2 test0ichihara_hisakoHisako Ichihara2015-09-11Testing
PGDBj_disease_curation1 disease curation test348ichihara_hisakoichihara_hisako2023-12-03Testing
ichiharatest_150825 test0ichihara_hisakoHisako Ichihara2023-11-29Testing
ichiharatest_150825_3 test0ichihara_hisakoHisako Ichihara2023-11-26Testing
Test_Project 0Ingenerfingenerf2023-11-29Testing
bionlp-st-bb3-2016-training Entity (bacteria, habitats and geographical places) annotation to the training dataset of the BioNLP-ST 2016 BB task. For more information, please refer to bionlp-st-bb3-2016-development and bionlp-st-bb3-2016-test. Bacteria Bacteria entities are annotated as contiguous spans of text that contains a full unambiguous prokaryote taxon name, the type label is Bacteria. The Bacteria type is a taxon, at any taxonomic level from phylum (Eubacteria) to strain. The category that the text entities have to be assigned to is the most specific and unique category of the NCBI taxonomy resource. In case a given strain, or a group of strains is not referenced by NCBI, it is assigned with the closest taxid in the taxonomy. Habitat Habitat entities are annotated as spans of text that contains a complete mention of a potential habitat for bacteria, the type label is Habitat. Habitat entities are assigned one or several concepts from the habitat subpart of the OntoBiotope ontology. The assigned concepts are as specific as possible. OntoBiotope defines most relevant microorganism habitats from all areas considered by microbial ecology (hosts, natural environment, anthropized environments, food, medical, etc.). Habitat entities are rarely referential entities, they are usually noun phrases including properties and modifiers. There are rare cases of habitats referred with adjectives or verbs. The spans are generally contiguous but some of them are discontinuous in order to cope with conjunctions. Geographical Geographical entities are geographical and organization places denoted by official names.1.28 KINRAYue Wang2023-11-29Released
glycoprotein glycoprotein annotation54issaku yamadaISSAKU YAMADA2023-11-29Testing
uniprot-human Uniprot proteins for human21.8 KJin-Dong KimJin-Dong Kim2023-11-29Testing
AGCA_Sue Active Gene Annotation Corpus for the Application in Drug Repurposing Discovery0Jingbo Xia, Xuan Qin, Kaiyin Zhou2023-11-29Developing
JF-test A test corpus for exploring this service9Johan Fridjohanf2023-12-03Testing
causal0001 test0ju-hyuck hanJu-Hyuck Han2024-01-17
bionlp-st-gro-2013-training The training data set of the BioNLP-ST 2013 GRO task, including 150 MEDLINE abstracts that are annotated with concepts and relations of the Gene Regulation Ontology (GRO; http://www.ebi.ac.uk/Rebholz-srv/GRO/GRO.html)8.02 KJung-jae KimJung-jae Kim2023-11-29Testing
bionlp-st-gro-2013-development The development data set of the BioNLP-ST 2013 GRO task, including 50 MEDLINE abstracts that are annotated with concepts and relations of the Gene Regulation Ontology (GRO; http://www.ebi.ac.uk/Rebholz-srv/GRO/GRO.html)2.66 KJung-jae KimJung-jae Kim2023-11-29Testing
NameT# Ann.Author MaintainerUpdated_atStatus

461-480 / 556 show all
OryzaGP1 0Huy Do. Pierre Larmande2019-01-31Uploading
Genomics_Informatics 35.3 KHyun-Seok Parkewha-bio2023-11-29Beta
Testing 241Hyun-Seok Parkhsp202023-11-28Testing
DisGeNET5_variant_disease 144 KIBI GroupYue Wang2023-11-24Released
DisGeNET5_gene_disease 2.04 MIBI GroupYue Wang2023-11-24Released
traitCurationTest_ichihara 4ichihara_hisakoHisako Ichihara2023-11-29Testing
falsetest_150825 0ichihara_hisakoHisako Ichihara2015-09-11Testing
ichiharatest_150825_2 0ichihara_hisakoHisako Ichihara2015-09-11Testing
PGDBj_disease_curation1 348ichihara_hisakoichihara_hisako2023-12-03Testing
ichiharatest_150825 0ichihara_hisakoHisako Ichihara2023-11-29Testing
ichiharatest_150825_3 0ichihara_hisakoHisako Ichihara2023-11-26Testing
Test_Project 0Ingenerfingenerf2023-11-29Testing
bionlp-st-bb3-2016-training 1.28 KINRAYue Wang2023-11-29Released
glycoprotein 54issaku yamadaISSAKU YAMADA2023-11-29Testing
uniprot-human 21.8 KJin-Dong KimJin-Dong Kim2023-11-29Testing
AGCA_Sue 0Jingbo Xia, Xuan Qin, Kaiyin Zhou2023-11-29Developing
JF-test 9Johan Fridjohanf2023-12-03Testing
causal0001 0ju-hyuck hanJu-Hyuck Han2024-01-17
bionlp-st-gro-2013-training 8.02 KJung-jae KimJung-jae Kim2023-11-29Testing
bionlp-st-gro-2013-development 2.66 KJung-jae KimJung-jae Kim2023-11-29Testing