| CORD-PICO | | Automatic annotation of the CORD-19 dataset with PICO categories. The corpus was automatically labeled with an LSTM-CRF model trained on human-annotated PubMed abstracts from https://github.com/bepnye/EBM-NLP. Currently, only titles and abstracts are annotated, using Population, Intervention and Outcome labels as well as more fine-grained labels such as Age, Drug and Mortality. | 69.6 K | Simon Suster | ssuster | 2023-11-27 | Developing | |
| CORD-19-sample-HP | | | 39 | | Jin-Dong Kim | 2023-11-27 | Developing | |
| CORD-19-sample-sentences | | | 161 | | Jin-Dong Kim | 2023-11-27 | Developing | |
| LitCovid-sample-PD-MONDO | | | 1.21 K | | Jin-Dong Kim | 2023-11-27 | Developing | |
| bionlp-ost-19-BB-norm-ner-dev | | | 1.33 K | | ldeleger | 2023-11-27 | Developing | |
| PMC-KEGG | | Documents from PMC including the word KEGG, with names of software tools and databases marked. | 27 | | yucca | 2023-11-28 | Developing | |
| GlycoConjugate-collection | | PubMed entries (titles and abstracts) from the Glycoconjugate Journal. | 0 | | Jin-Dong Kim | 2023-11-28 | Developing | |
| KAIST_NLP_Annotation9 | | | 6.32 K | | kaist_nlp | 2023-11-28 | Developing | |
| bionlp-ost-19-BB-kb-ner-test | | | 125 | | ldeleger | 2023-11-28 | Developing | |
| OryzaGP_2021 | | Updating OryzaGP | 1.08 M | Pierre Larmande | larmande | 2023-11-28 | Developing | |
| bionlp-ost-19-BB-kb-ner-train | | | 3.56 K | | ldeleger | 2023-11-28 | Developing | |
| ENG_NER_NEL_CONSENSUS | | | 607 | | dpavot | 2023-11-28 | Developing | |
| bionlp-ost-19-BB-rel-train | | | 3.52 K | | ldeleger | 2023-11-28 | Developing | |
| Minna_de_Honkoku | | An annotation project for Minna de Honkoku, a crowdsourced transcription project for historical Japanese documents. | 204 | Yuta Hashimoto | yhashimoto | 2023-11-28 | Developing | |
| PGR-NEG | | Identification of Negative Relations | 23 | Diana Sousa | dpavot | 2023-11-28 | Developing | |
| pmc-enju-pas | | Predicate-argument structure annotation produced by Enju. This data set was initially produced as a supporting resource for the BioNLP-ST 2016 GE task. As such, it currently includes the 34 full-paper articles in the benchmark data sets of the GE 2016 task, namely the reference data set (bionlp-st-ge-2016-reference) and the test data set (bionlp-st-ge-2016-test), but will be extended to include more papers from the PubMed Central Open Access subset (PMCOA). | 205 K | DBCLS | Jin-Dong Kim | 2023-11-28 | Developing | |
| bionlp-ost-19-BB-rel-ner-train | | | 3.62 K | | ldeleger | 2023-11-28 | Developing | |
| SMAFIRA_OGER_TEXT | | | 116 | | zebet | 2023-11-28 | Developing | |
| SMAFIRA_Methods | | Predictions for methods for the SMAFIRA project. | 0 | | zebet | 2023-11-28 | Developing | |
| Glycosmos6-GlycoEpitope | | Automatic annotation by PD-GlycoEpitope. | 19.9 K | | Jin-Dong Kim | 2023-11-28 | Developing | |