AGAC-COVID-19 | | | 14 | | xiajingbo | 2023-11-29 | | |
SMAFIRA_Feedback_Research_Goal | | | 15 | | zebet | 2023-11-28 | Released | |
ngly1-sample2 | | | 15 | | Nuria | 2023-11-29 | | |
CORD-19-sample-CHEBI | | | 16 | | Jin-Dong Kim | 2023-11-29 | Developing | |
AlvisNLP-Test | | Project for testing AlviNLP PubAnnotation server during BLAH3. | 17 | | Bibliome | 2023-11-29 | Testing | |
ngly1-sample4 | | | 18 | | Nuria | 2023-11-29 | | |
LitCovid-docs | | Updated at 2021-01-12
A comprehensive literature resource on the subject of Covid-19 is collected by NCBI:
https://www.ncbi.nlm.nih.gov/research/coronavirus/
The LitCovid project@PubAnnotation is a collection of the titles and abstracts of the LitCovid dataset, for the people who want to perform text mining analysis. Please note that if you produce some annotation to the documents in this project, and contribute the annotation back to PubAnnotation, it will become publicly available together with contribution from other people.
If you want to contribute your annotation to PubAnnotation, please refer to the documentation page:
http://www.pubannotation.org/docs/submit-annotation/
The list of the PMID is sourced from here
The 6 entries of the following PMIDs could not be included because they were not available from PubMed:32161394,
32104909,
32090470,
32076224,
32161394
32188956,
32238946.
Below is a notice from the original LitCovid dataset:
PUBLIC DOMAIN NOTICE
National Center for Biotechnology Information
This software/database is a "United States Government Work" under the
terms of the United States Copyright Act. It was written as part of
the author's official duties as a United States Government employee and
thus cannot be copyrighted. This software/database is freely available
to the public for use. The National Library of Medicine and the U.S.
Government have not placed any restriction on its use or reproduction.
Although all reasonable efforts have been taken to ensure the accuracy
and reliability of the software and data, the NLM and the U.S.
Government do not and cannot warrant the performance or results that
may be obtained by using this software or data. The NLM and the U.S.
Government disclaim all warranties, express or implied, including
warranties of performance, merchantability or fitness for any particular
purpose.
Please cite the authors in any work or product based on this material :
Chen Q, Allot A, & Lu Z. (2020) Keep up with the latest coronavirus research, Nature 579:193
| 18 | | Jin-Dong Kim | 2023-11-28 | Testing | |
RDoCTask1SampleData | | Each annotation file contains an annotated abstract with an RDoC category. Each title span in these sample data is annotated with the corresponding related RDoC construct, although the RDoC category would apply for the entire abstract. The annotation data are formatted as json files. Please refer to the following page for a more detailed description of the json format http://www.pubannotation.org/docs/annotation-format/. | 20 | | mmanani1s | 2023-11-29 | Released | |
test1 | | test1 | 21 | H. S. Park | Sophie Nam | 2023-11-26 | Testing | |
PGR-NEG | | Identification of Negative Relations
| 23 | Diana Sousa | dpavot | 2023-11-28 | Developing | |
GlycosmosP-GlycoEpitope | | | 24 | | Jin-Dong Kim | 2023-11-29 | Testing | |
excludesZoonoses | | | 25 | | AikoHIRAKI | 2023-11-29 | Developing | |
ngly1-sample1 | | | 25 | | Nuria | 2023-11-27 | | |
glycobiology-test | | | 27 | | Jin-Dong Kim | 2023-11-29 | Developing | |
PMC-KEGG | | Documents from PMC including the word KEGG, with names of software tools and databases marked. | 27 | | yucca | 2023-11-28 | Developing | |
pubtator-sample | | Sample annotation of PubTator produced by Zhiyong Lu et al. | 28 | Zhiyong Lu | Jin-Dong Kim | 2023-11-27 | Testing | |
CORD-19-sample-paragraphs | | | 28 | | Jin-Dong Kim | 2023-11-29 | Developing | |
ngly1-sample5 | | | 29 | | Nuria | 2023-11-29 | | |
bionlp-st-2016-SeeDev-training | | Entities and event annotations from the training set of the BioNLP-ST 2016 SeeDev task.
SeeDev task focuses on seed storage and reserve accumulation on the model organism, Arabidopsis thaliana. The SeeDev task is based on the knowledge model Gene Regulation Network for Arabidopsis (GRNA) that meets the needs of text-mining (i.e. manual annotation of texts and automatic information extraction), experimental data indexing and retrieval and reuse in other plant systems. It is also expected to meet the requirements of the integration of the text knowledge with knowledge derived from experimental data in view of modeling in systems biology.
GRNA model defines 16 different types of entities, and 22 types of event (in five sets of event types) that may be combined in complex events.
For more information, please refer to the task website
All annotations :
Train set
Development set
Test set (without events)
| 35 | | EstelleChaix | 2023-11-28 | Released | |
EBM_test | | | 35 | | Suexuan | 2024-08-23 | | |