
Projects

501-520 / 590

Name | Description | # Ann. | Author | Maintainer | Updated_at | Status
LappsTest | Project to test posting annotations directly from the Language Applications Grid | 2.67 K | Keith Suderman | ksuderman | 2023-11-27 | Developing
uniprot-mouse | Protein annotation based on UniProt | 11.5 K | | Jin-Dong Kim | 2023-11-28 | Developing
LitCovid-docs | Updated at 2021-01-12. A comprehensive literature resource on the subject of Covid-19 is collected by NCBI: https://www.ncbi.nlm.nih.gov/research/coronavirus/ The LitCovid project@PubAnnotation is a collection of the titles and abstracts of the LitCovid dataset, for people who want to perform text mining analysis. Please note that if you produce annotations for the documents in this project and contribute them back to PubAnnotation, they will become publicly available together with contributions from other people. If you want to contribute your annotations to PubAnnotation, please refer to the documentation page: http://www.pubannotation.org/docs/submit-annotation/ The list of PMIDs is sourced from here. The 6 entries of the following PMIDs could not be included because they were not available from PubMed: 32161394, 32104909, 32090470, 32076224, 32161394, 32188956, 32238946. Below is a notice from the original LitCovid dataset: PUBLIC DOMAIN NOTICE National Center for Biotechnology Information This software/database is a "United States Government Work" under the terms of the United States Copyright Act. It was written as part of the author's official duties as a United States Government employee and thus cannot be copyrighted. This software/database is freely available to the public for use. The National Library of Medicine and the U.S. Government have not placed any restriction on its use or reproduction. Although all reasonable efforts have been taken to ensure the accuracy and reliability of the software and data, the NLM and the U.S. Government do not and cannot warrant the performance or results that may be obtained by using this software or data. The NLM and the U.S. Government disclaim all warranties, express or implied, including warranties of performance, merchantability or fitness for any particular purpose. Please cite the authors in any work or product based on this material: Chen Q, Allot A, & Lu Z. (2020) Keep up with the latest coronavirus research, Nature 579:193 | 18 | | Jin-Dong Kim | 2023-11-28 | Testing
CORD-19_bioRxiv_medRxiv_subset | The bioRxiv/medRxiv subset of the CORD-19 dataset: pre-prints that are not peer reviewed. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released
CORD-19_Commercial_use_subset | The Commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released
CORD-19_Custom_license_subset | The Custom license subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 5.08 M | | Jin-Dong Kim | 2023-11-24 | Released
CORD-19_Non-commercial_use_subset | The Non-commercial use subset of the CORD-19 dataset. The documents in this project will be updated as the CORD-19 dataset grows. See the COVID DATASET LICENSE AGREEMENT. | 0 | | Jin-Dong Kim | 2023-11-29 | Released
ykjeong_test | pub_annotation_test | 276 | | | 2023-11-28 | Testing
LitCovid_Glycan-Motif-Structure | PubDictionaries annotation for glycan-Motif terms. | 6.51 K | | ISSAKU YAMADA | 2023-11-29 | Beta
bionlp-st-ge-2016-uniprot | UniProt protein annotation to the benchmark data set of BioNLP-ST 2016 GE task: reference data set (bionlp-st-ge-2016-reference) and test data set (bionlp-st-ge-2016-test). The annotations are produced based on a dictionary which is semi-automatically compiled for the 34 full paper articles included in the benchmark data set (20 in the reference data set + 14 in the test data set). For detailed information about BioNLP-ST GE 2016 task data sets, please refer to the benchmark reference data set (bionlp-st-ge-2016-reference) and benchmark test data set (bionlp-st-ge-2016-test). | 16.2 K | DBCLS | Jin-Dong Kim | 2023-11-29 | Beta
QFMC_MEDLINE | Quaero French Medical Corpus: Annotation of MEDLINE titles | 5.9 K | Aurélie Névéol | Pierre Zweigenbaum | 2023-11-29 | Beta
tees-test | Random PMC document used for testing during the development of a RESTful TEES parsing web service. | 3.39 K | Nico Colic | Nico Colic | 2023-11-24 | Developing
spacy-test | Random set of articles used for testing in the development of the RESTful spaCy parsing web service. Since development is now finished, they are released for the community to use. | 131 K | Nico Colic | Nico Colic | 2023-11-29 | Released
glytoucan-iupac | Retrying glytoucan-iupac annotation as of March 9, 2018 | 0 | | kiyoko | 2023-11-29 | Testing
metamap-sample | Sample annotation of MetaMap, produced by Aronson, et al. An overview of MetaMap: historical perspective and recent advances, JAMIA 2010 | 10.9 K | Alan R Aronson | Jin-Dong Kim | 2023-11-27 | Testing
pubtator-sample | Sample annotation of PubTator produced by Zhiyong Lu et al. | 28 | Zhiyong Lu | Jin-Dong Kim | 2023-11-27 | Testing
semrep-sample | Sample annotation of SemRep, produced by Rindflesch, et al. Rindflesch, T.C. and Fiszman, M. (2003). The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 36(6):462-477. | 11.1 K | Rindflesch et al. | Jin-Dong Kim | 2023-11-29 | Testing
PubMed_Structured_Abstracts | Sections (zones) as retrieved from PubMed. | 131 K | | zebet | 2023-11-28 | Released
sentences | Sentence segmentation annotation. Automatic annotation by TextSentencer. | 6.96 M | DBCLS | Jin-Dong Kim | 2023-11-24 | Developing
LitCovid-sentences-v1 | Sentence segmentation of all the texts in the LitCovid literature. The segmentation is automatically obtained using the TextSentencer annotation service developed and maintained by DBCLS. | 16.5 K | | Jin-Dong Kim | 2023-11-27 | Released