LitCovid-sentences | | | 5.63 M | 2023-11-24 | Developing | |
CORD-19_Custom_license_subset | | The Custom license subset of the CORD-19 dataset.
The documents in this project will be updated as the CORD-19 dataset grows.
See the COVID DATASET LICENSE AGREEMENT. | 5.08 M | 2023-11-24 | Released | |
CORD-19-PD-UBERON | | PubDictionaries annotation for UBERON terms - updated on 2020-04-30
Anatomy term annotation based on Uberon.
The terms in Uberon are uploaded to PubDictionaries
(Uberon), with which the annotations in this project are produced.
The parameter configuration used for this project is here.
Note that this is an automatically generated, dictionary-based annotation. It will be updated periodically as documents are added and the dictionary is improved. | 1.42 M | 2023-11-24 | Released | |
PubMed-German-test | | A collection of PubMed abstracts written in German. | 0 | 2023-11-24 | Developing | |
PubMed-2017 | | Abstracts published in 2017. | 0 | 2023-11-24 | Developing | |
speech-test | | | 6 | 2023-11-26 | Testing | |
CORD-19-SciBite-sentences | | | 11.2 K | 2023-11-26 | Testing | |
LitCovid-PD-FMA-UBERON-v1 | | PubDictionaries annotation for anatomy terms - updated on 2020-04-20
Anatomy term annotation based on FMA and Uberon. Version 2020-04-20.
The terms in FMA and Uberon are loaded into PubDictionaries
(FMA and Uberon), with which the annotations in this project are produced.
The parameter configuration used for this project is here for FMA and there for Uberon.
Note that this is an automatically generated, dictionary-based annotation. It will be updated periodically as documents are added and the dictionary is improved. | 4.3 K | 2023-11-27 | Released | |
GlyCosmos600-GlycoProteins | | GlycoProtein annotations were made using the glycoprotein-name dictionary on PubDictionaries:
http://pubannotation.org/projects/GlyCosmos600-docs
The documents were imported from the GlyCosmos600-docs project:
http://pubannotation.org/projects/GlyCosmos600-docs | 3.68 K | 2023-11-27 | Testing | |
bionlp-st-ge-2016-uniprot | | UniProt protein annotations for the benchmark data sets of the BioNLP-ST 2016 GE task: the reference data set (bionlp-st-ge-2016-reference) and the test data set (bionlp-st-ge-2016-test).
The annotations are produced from a dictionary that was semi-automatically compiled for the 34 full-paper articles included in the benchmark data sets (20 in the reference data set + 14 in the test data set).
For detailed information about the BioNLP-ST 2016 GE task data sets, please refer to the benchmark reference data set (bionlp-st-ge-2016-reference) and the benchmark test data set (bionlp-st-ge-2016-test).
| 16.2 K | 2023-11-29 | Beta | |