| PMID_GLOBAL | | Global sentencer tagging of public PMID abstracts. Open and publicly available to the global community. | 2.24 M | | alo33 | 2023-11-24 | Developing | |
| LitCovid-PD-CHEBI | | | 1.43 M | | Jin-Dong Kim | 2023-11-24 | Developing | |
| Epistemic_Statements | | The goal of this work is to identify epistemic statements in the scientific literature. An epistemic statement is a statement of unknowns, hypotheses, speculations, or uncertainties, including statements of claims, hypotheses, questions, explanations, future opportunities, surprises, issues, or concerns within a sentence. The unit of an epistemic statement is an automatically parsed sentence. The classification is binary: epistemic statement or not. We label epistemic statements only, so any statement that is not labeled can be assumed not to be an epistemic statement. The classifier is a CRF, trained on gold-standard annotations of epistemic statements; the annotation effort is ongoing. We report an F-measure of 0.91 after 5-fold cross-validation on a test set with 914 statements and an F-measure of 0.9 on a held-out document with 130 statements. This project is still under development and is submitted for use in the CovidLit project and associated Hackathon. Please contact Mayla if you have any questions. | 1.42 M | | mboguslav | 2023-11-24 | Developing | |
| Biotea | | NCBO annotation on full text for PMC articles. Currently including only a small set of 2811 articles corresponding to those supporting curated disease-protein annotation from UniProt and with machine-processable full text. | 894 K | L. Garcia | | 2023-11-24 | Developing | |
| GlyCosmosP-Glycan-Motif | | | 8 | | Jin-Dong Kim | 2023-11-24 | Developing | |
| tees-test | | Random PMC document used for testing during the development of a RESTful TEES parsing web service. | 3.39 K | Nico Colic | Nico Colic | 2023-11-24 | Developing | |
| Covid19_manual_annotation_v2 | | | 4.58 K | | AikoHIRAKI | 2023-11-24 | Developing | |
| PubMed-German-test | | A collection of PubMed abstracts which are written in German | 0 | | Jin-Dong Kim | 2023-11-24 | Developing | |
| PubMed-2017 | | Abstracts published in 2017. | 0 | | Jin-Dong Kim | 2023-11-24 | Developing | |
| ENG_NER_NEL | | Annotations in COVID-19 related PubMed abstracts from the following ontologies: Disease Ontology ("do"), Gene Ontology ("go"), Human Phenotype Ontology ("hpo"), ChEBI ontology ("chebi"), MeSH | 493 | LASIGE-DeST | pruas_18 | 2023-11-26 | Developing | |
| Zoonoses | | A main data set of the Zoonoses project, used by PanZoora. | 10.3 K | | AikoHIRAKI | 2023-11-26 | Developing | |
| CORD-19-sample-UBERON | | | 54 | | Jin-Dong Kim | 2023-11-26 | Developing | |
| demo4TogoSite | | | 323 | | AikoHIRAKI | 2023-11-26 | Developing | |
| LappsTest | | Project to test posting annotations directly from the Language Applications Grid | 2.67 K | Keith Suderman | ksuderman | 2023-11-27 | Developing | |
| LitCovid-sample-PD-GlycoEpitope | | | 1 | | Jin-Dong Kim | 2023-11-27 | Developing | |
| CORD-PICO | | Automatic annotation of the CORD-19 dataset with PICO categories. The corpus was automatically labeled with an LSTM-CRF model trained on human-annotated PubMed abstracts from https://github.com/bepnye/EBM-NLP. Currently, only titles and abstracts are annotated, using Population, Intervention and Outcome labels as well as more fine-grained labels such as Age, Drug, Mortality and others. | 69.6 K | Simon Suster | ssuster | 2023-11-27 | Developing | |
| CORD-19-sample-HP | | | 39 | | Jin-Dong Kim | 2023-11-27 | Developing | |
| CORD-19-sample-sentences | | | 161 | | Jin-Dong Kim | 2023-11-27 | Developing | |
| LitCovid-sample-PD-MONDO | | | 1.21 K | | Jin-Dong Kim | 2023-11-27 | Developing | |
| GlycoConjugate-collection | | The PubMed entries (titles and abstracts) from the journal of GlycoConjugate | 0 | | Jin-Dong Kim | 2023-11-28 | Developing | |