PubmedHPO | | Human phenotype annotation to PubMed abstracts, based on the HPO ontology | 12.4 M | Tudor Groza | tudor | 2023-11-24 | Beta | |
relna | | https://www.tagtog.net/jmcejuela/relna | 2.77 K | Ashish Baghudana and Juan Miguel Cejuela | tagtog | 2023-11-29 | Developing | |
LitCoin-GeneOrGeneProduct-v0 | | https://pubdictionaries.org/text_annotation.json?dictionary=NCBIGene-NER&threshold=0.85&abbreviation=true | 15.8 K | | Jin-Dong Kim | 2023-11-29 | | |
PubCasesHPO | | HPO annotation in PubCases | 3.18 M | | Toyofumi Fujiwara | 2023-11-24 | Beta | |
CORD-19_HIRAKI | | HIRAKI Annotation for CORD-19 | 2.98 K | | AikoHIRAKI | 2023-11-29 | Testing | |
hahm_test | | hahm_test | 0 | hahm | kaist_nlp | 2019-04-02 | Testing | |
PathNERGold | | Gold standard used for training PathNER | 0 | Chengkin Wu | nikolamilosevic | 2023-11-29 | Developing | |
GlycoBiology-GO | | GO-based annotation to GlycoBiology abstracts | 0 | | Jin-Dong Kim | 2023-11-29 | Testing | |
GlyCosmos600-GlycoProteins | | GlycoProtein annotations were made using the glycoprotein-name dictionary on PubDictionaries:
http://pubannotation.org/projects/GlyCosmos600-docs
The documents were imported from the GlyCosmos600-docs project:
http://pubannotation.org/projects/GlyCosmos600-docs | 3.68 K | | Jin-Dong Kim | 2023-11-27 | Testing | |
glycoprotein | | glycoprotein annotation | 54 | issaku yamada | ISSAKU YAMADA | 2023-11-29 | Testing | |
GlycoBiology-Epitope | | GlycoEpitope-based annotation to GlycoBiology abstracts | 3.18 K | Shujiro Okuda | shuo50 | 2023-11-29 | Testing | |
Glycan-Abbreviation | | Glycan-Abbreviation - GlycoNAVI Project | 0 | | ISSAKU YAMADA | 2023-11-29 | Testing | |
PMID_GLOBAL | | Global sentencer tagging of public PMID abstracts.
Open and publicly available to the global community. | 2.24 M | | alo33 | 2023-11-24 | Developing | |
Genomics_Informatics | | Genomics & Informatics (NLM title abbreviation: Genomics Inform) is the official journal of the Korea Genome Organization.
Text corpus for this journal annotated with various levels of linguistic information would be a valuable resource as the process of information extraction requires syntactic, semantic, and higher levels of natural language processing. In this study, we publish our new corpus called GNI Corpus version 1.0, extracted and annotated from full texts of Genomics & Informatics, with NLTK (Natural Language ToolKit)-based text mining script. The preliminary version of the corpus could be used as a training and testing set of a system that serves a variety of functions for future biomedical text mining. | 35.3 K | Hyun-Seok Park | ewha-bio | 2023-11-29 | Beta | |
LitCoin-GeneOrGeneProduct-v3 | | GeneOrGeneProduct
after false positive control | 4.67 K | | Jin-Dong Kim | 2023-11-24 | | |
GlycoBiology-GDGDB | | GDGDB-based annotation to GlycoBiology abstracts | 2.46 K | Toshihide Shikanai | shikanai | 2023-11-29 | Testing | |
SMAFIRA-Case-Studies-19735549 | | From: https://github.com/SMAFIRA/c_corpus/blob/master/SMAFIRAc_0.4_Annotations.csv | 0 | | zebet | 2023-11-30 | Developing | |
SMAFIRA-Case-Studies-16850029 | | From: https://github.com/SMAFIRA/c_corpus/blob/master/SMAFIRAc_0.4_Annotations.csv | 0 | | zebet | 2023-11-30 | Developing | |
SMAFIRA-Case-Studies-24204323 | | From: https://github.com/SMAFIRA/c_corpus/blob/master/SMAFIRAc_0.4_Annotations.csv | 0 | | zebet | 2023-11-29 | Developing | |
SMAFIRA-Case-Studies-21494637 | | From: https://github.com/SMAFIRA/c_corpus/blob/master/SMAFIRAc_0.4_Annotations.csv | 0 | | zebet | 2023-11-26 | Developing | |