Genomics_Informatics
Project info
Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization. A text corpus of this journal, annotated with various levels of linguistic information, would be a valuable resource, because information extraction requires syntactic, semantic, and higher-level natural language processing. In this study, we publish a new corpus, GNI Corpus version 1.0, extracted and annotated from the full texts of Genomics & Informatics with an NLTK (Natural Language Toolkit)-based text mining script. This preliminary version of the corpus can serve as a training and test set for systems that perform a variety of biomedical text mining tasks.
Updated at
2023-11-29 06:32:05 UTC
Status
Beta
Maintainer
ewha-bio
Author
Hyun-Seok Park
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Documents (138)
@ewha-bio: 138
Annotations (35,306)
Download
Not prepared yet for this project.
Contact the manager of this project.
Evaluations (0)