LitCoin-entities-OrganismTaxon-PD | | | 949 | | Jin-Dong Kim | 2023-11-27 | Testing | |
bionlp-st-ge-2016-test-proteins | | Protein annotations to the benchmark test data set of the BioNLP-ST 2016 GE task.
A participant of the GE task may import the documents and annotations of this project to his/her own project, to begin with producing event annotations.
For more details, please refer to the benchmark test data set (bionlp-st-ge-2016-test).
| 4.34 K | DBCLS | Jin-Dong Kim | 2023-11-27 | Released | |
Zoonoses_partialAnnotation | | This is a part of Zoonoses project used by PanZoora. But Zoonoses project provides whole manual annotated data but this is partial ones. | 266 | | AikoHIRAKI | 2023-11-27 | Released | |
metamap-sample | | Sample annotation of MetaMep, produced by Aronson, et al.
An overview of MetaMap: historical perspective and recent advances, JAMIA 2010 | 10.9 K | Alan R Aronson | Jin-Dong Kim | 2023-11-27 | Testing | |
LappsTest | | Project to test posting annotations directly from the Language Applications Grid | 2.67 K | Keith Suderman | ksuderman | 2023-11-27 | Developing | |
Training_Data_English_de_en | | | 0 | | wmtbio | 2023-11-27 | Developing | |
LitCovid-sentences-v1 | | Sentence segmentation of all the texts in the LitCovid literature. The segmentation is automatically obtained using the TextSentencer annotation service developed and maintained by DBCLS. | 16.5 K | | Jin-Dong Kim | 2023-11-27 | Released | |
LitCovid-sample-PD-GlycoEpitope | | | 1 | | Jin-Dong Kim | 2023-11-27 | Developing | |
Training_Data_Japanese_ja_en | | | 0 | | wmtbio | 2023-11-27 | Developing | |
GlyCosmos600-GlycoProteins | | GlycoProtein annotations were made using the glycoprotein-name dictionary on PubDictionaries:
http://pubannotation.org/projects/GlyCosmos600-docs
The documents were imported from the GlyCosmos600-docs project:
http://pubannotation.org/projects/GlyCosmos600-docs | 3.68 K | | Jin-Dong Kim | 2023-11-27 | Testing | |
AIMed | | The AIMed corpus is one of the most widely used corpora for protein-protein interaction extraction. The protein annotations are either parts of the protein interaction annotations, or are uninvolved in any protein interaction annotation.
Publication: http://www.cs.utexas.edu/~ml/papers/bionlp-aimed-04.pdf | 4.04 K | The University of Texas at Austin | Yue Wang | 2023-11-27 | Testing | |
PIR-corpus1 | | The Protein Information Resource (PIR) is not biased towards any particular biomedical domain, and is expected to provide more diverse protein names in a given sample size.
Annotation category: protein, compound-protein, acronym. | 4.44 K | University of Delaware and Georgetown University Medical Center | Yue Wang | 2023-11-27 | Released | |
Training_Data_French_fr_en | | | 0 | | wmtbio | 2023-11-27 | Developing | |
GlyCosmos600-GlycoEpitope | | | 277 | | Jin-Dong Kim | 2023-11-27 | Testing | |
korean_corpus_pos | | | 0 | | donghwan kim | 2023-11-27 | | |
example-dialog | | | 0 | | Jin-Dong Kim | 2023-11-27 | Testing | |
Glycobiology-GlycanName | | | 946 | Toshihide Shikanai | shikanai | 2023-11-27 | Testing | |
disease_gene_microbe_small | | Small version (48 abstract that mention both Crohns and S. aureus) for development purposes
Abbreviation: dgm Content: annotated abstracts on Crohn’s disease or on on Staphylococcus aureus (according to the jensenlab.org indexing resources) Entity types: (three for a start, organisms (NCBI Taxonomy taxa), disease (Disease Ontology terms), human genes (ENSEMBL proteins) Aim: Explore indirect associations of diseases to microbial species in this corpus via gene co-mentions | 536 | | evangelos | 2023-11-27 | Testing | |
performance-test | | a project for performance test | 480 K | | Jin-Dong Kim | 2023-11-27 | Testing | |
123123123 | | 123123123 | 150 | | yaoxinzhi | 2023-11-27 | Released | |