Genomics_Informatics | | Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization.
Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining. | 35.3 K | Hyun-Seok Park | ewha-bio | 2023-11-29 | Beta | |
bionlp-ost-19-BB-rel-ner-dev | | | 1.98 K | | ldeleger | 2023-11-29 | Developing | |
Training_Data_English_pt_en | | | 0 | | wmtbio | 2023-11-29 | Developing | |
nakashima | | | 0 | | nakashima | 2023-11-29 | Testing | |
ngly1-sample10 | | | 4 | | Nuria | 2023-11-29 | | |
ngly1-sample3 | | | 10 | | Nuria | 2023-11-29 | | |
korean_corpus | | validation korean | 90 | | donghwan kim | 2023-11-29 | | |
Tester | | Test for cancer | 93 | Han | | 2023-11-30 | Testing | |
youworks-test | | this is test annotation. | 0 | | Hisato Terada | 2023-11-29 | Testing | |
GlyCosmos-GlycanStructure-c | | | 0 | | Jin-Dong Kim | 2023-11-29 | Testing | |
Grays_part1 | | Embryology | 1.44 K | | okubo | 2023-11-30 | Testing | |
新着論文レビュー | | 新着論文レビューに関するアノテーション。 | 0 | Database Center for Life Science | Yasunori Yamamoto | 2019-02-04 | Developing | |
namedentityrecognition | | | 0 | | white | 2016-05-13 | Testing | |
zhou_test | | | 0 | | | 2019-07-13 | Testing | |
Annotation-Euglena-Enzymes | | | 0 | | Shuichi Kawashima | 2016-06-13 | Developing | |
ichiharatest_150825_2 | | test | 0 | ichihara_hisako | Hisako Ichihara | 2015-09-11 | Testing | |
bionlp-ost-19-BB-norm-ner-dev | | | 1.33 K | | ldeleger | 2023-11-27 | Developing | |
NAKLEE | |
| 0 | | Nakyolee | 2017-07-13 | | |
bionlp-st-pc-2013-training | | The training dataset from the pathway curation (PC) task in the BioNLP Shared Task 2013.
The entity types defined in the PC task are simple chemical, gene or gene product, complex and cellular component. | 7.86 K | NaCTeM and KISTI | Yue Wang | 2023-11-27 | Released | |
ENG_NER_NEL_pruas | | | 582 | Pedro Ruas | pruas_18 | 2023-11-30 | Developing | |