> top > projects

Projects

NameT Description# Ann.AuthorMaintainerUpdated_atStatus

381-400 / 556 show all
korean_corpus_dep 0donghwan kim2019-04-23
pubmed-sentences-benchmark A benchmark data for text segmentation into sentences. The source of annotation is the GENIA treebank v1.0. Following is the process taken. began with the GENIA treebank v1.0. sentence annotations were extracted and converted to PubAnnotation JSON. uploaded. 12 abstracts met alignment failure. among the 12 failure cases, 4 had a dot('.') character where there should be colon (':'). They were manually fixed then successfully uploaded: 7903907, 8053950, 8508358, 9415639. among the 12 failed abstracts, 8 were "250 word truncation" cases. They were manually fixed and successfully uploaded. During the fixing, manual annotations were added for the missing pieces of text. 30 abstracts had extra text in the end, indicating copyright statement, e.g., "Copyright 1998 Academic Press." They were annotated as a sentence in GTB. However, the text did not exist anymore in PubMed. Therefore, the extra texts were removed, together with the sentence annotation to them. 18.4 KGENIA projectJin-Dong Kim2023-11-28Released
SMAFIRA_Feedback_Labels 0zebet2021-01-21Developing
Glycobiology-GlycanName 946Toshihide Shikanaishikanai2023-11-27Testing
Training_Data_Japanese_ja_en 0wmtbio2023-11-27Developing
korean_corpus_pos 0donghwan kim2023-11-27
proj_h_1 6.7 K2023-11-24
bionlp-ost-19-BB-kb-ner-test 125ldeleger2023-11-28Developing
bionlp-ost-19-SeeDev-bin-dev 2.58 Kldeleger2023-11-28Developing
NER-microbes 10Shuichi Kawashima2023-11-29Developing
Test_PubTator 62Chih-Hsuan Wei2023-11-29Testing
bionlp-st-id-2011-training The training dataset from the infectious diseases (ID) task in the BioNLP Shared Task 2011. Entity types: - Genes and gene products: gene, RNA, and protein name mentions. - Two-component systems: mentions of the names of two-component regulatory systems, frequently embedding the names of the two Proteins forming the system.- Chemicals: mentions of chemical compounds such as "NaCL".- Organisms: mentions of organism names or organism specification through specific properties (e.g. "graRS mutant").- Regulons/Operons: mentions of names of specific regulons and operons.5.61 KUniversity of Tokyo Tsujii Laboratory, NaCTeM and Biocomplexity Institute of Virginia TechYue Wang2023-11-28Released
LitCovid-PD-UBERON 540 KJin-Dong Kim2023-11-29
acggdb_ggdb 0nfujita2023-11-29
AnEM_full-texts 250 documents selected randomly from full-text papers Entity types: organism subdivision, anatomical system, organ, multi-tissue structure, tissue, cell, developing anatomical structure, cellular component, organism substance, immaterial anatomical entity and pathological formation Together with AnEM_abstracts, it is probably the largest manually annotated corpus on anatomical entities.687NaCTeMYue Wang2023-11-29Uploading
FirstAuthor_s_Plants For only Plants4.3 KAikoHIRAKI2023-11-29Testing
bionlp-ost-19-BB-rel-dev 1.97 Kldeleger2023-11-29Developing
Training_Data_English_pt_en 0wmtbio2023-11-29Developing
SNPPhenoExt 3behrouz bokharaeianbokharaeian2023-11-29Developing
korean_corpus validation korean90donghwan kim2023-11-29
NameT # Ann.AuthorMaintainerUpdated_atStatus

381-400 / 556 show all
korean_corpus_dep 0donghwan kim2019-04-23
pubmed-sentences-benchmark 18.4 KGENIA projectJin-Dong Kim2023-11-28Released
SMAFIRA_Feedback_Labels 0zebet2021-01-21Developing
Glycobiology-GlycanName 946Toshihide Shikanaishikanai2023-11-27Testing
Training_Data_Japanese_ja_en 0wmtbio2023-11-27Developing
korean_corpus_pos 0donghwan kim2023-11-27
proj_h_1 6.7 K2023-11-24
bionlp-ost-19-BB-kb-ner-test 125ldeleger2023-11-28Developing
bionlp-ost-19-SeeDev-bin-dev 2.58 Kldeleger2023-11-28Developing
NER-microbes 10Shuichi Kawashima2023-11-29Developing
Test_PubTator 62Chih-Hsuan Wei2023-11-29Testing
bionlp-st-id-2011-training 5.61 KUniversity of Tokyo Tsujii Laboratory, NaCTeM and Biocomplexity Institute of Virginia TechYue Wang2023-11-28Released
LitCovid-PD-UBERON 540 KJin-Dong Kim2023-11-29
acggdb_ggdb 0nfujita2023-11-29
AnEM_full-texts 687NaCTeMYue Wang2023-11-29Uploading
FirstAuthor_s_Plants 4.3 KAikoHIRAKI2023-11-29Testing
bionlp-ost-19-BB-rel-dev 1.97 Kldeleger2023-11-29Developing
Training_Data_English_pt_en 0wmtbio2023-11-29Developing
SNPPhenoExt 3behrouz bokharaeianbokharaeian2023-11-29Developing
korean_corpus 90donghwan kim2023-11-29