PA-LLM-test | | | 0 | 2024-01-19 | Testing | |
LitCovid-PMC-OGER-BB | | Annotating PMC articles with OGER and BioBert, according to an hand-crafted Covid-specific dictionary and the 10 different CRAFT ontologies (http://bionlp-corpora.sourceforge.net/CRAFT/):
Chemical Entities of Biological Interest (CHEBI),
Cell Ontology (CL),
Entrez Gene (UBERON),
Gene Ontology (biological process (GO-BP), cellular component (GO-CC), and molecular function (GO-MF),
NCBI Taxonomy (NCBITaxon),
Protein Ontology (PR),
Sequence Ontology (SO) | 3.14 M | 2023-11-24 | Developing | |
bionlp-st-ge-2016-spacy-parsed | | Dependency parses produced by spaCy parser, and part-of-speech tags produced by Stanford tagger (with the wsj-0-18-left3words-nodistsim model). The exact procedure is described here. Data set contains the 34 full paper articles used in the BioNLP 2016 GE task.
| 225 K | 2023-11-29 | Released | |
PA-LLM | | 🖐️ LLMs for biomedical text summarisation | 0 | 2024-01-19 | Developing | |
bionlp-st-ge-2016-test-tees | | NER and event extraction produced by TEES (with the default GE11 model) for the 14 full papers used in the BioNLP 2016 GE task test corpus. | 9.17 K | 2023-11-29 | Released | |
bionlp-st-ge-2016-reference-tees | | NER and event extraction produced by TEES (with the default GE11 model) for the 20 full papers used in the BioNLP 2016 GE task reference corpus. | 14.6 K | 2023-11-29 | Released | |
tees-test | | Random PMC document used for testing during the development of a RESTful TEES parsing web service. | 3.39 K | 2023-11-24 | Developing | |
spacy-test | | Random set of articles used for testing in the development of the RESTful spaCy parsing web service. Since development is now finished, they are released for the community to use. | 131 K | 2023-11-29 | Released | |
oger-json-test | | Test corpus for testing OGER web service | 97.6 K | 2023-11-29 | Testing | |
2015-BEL-Sample-2 | | The 295 BEL statements for sample set used for the 2015 BioCreative challenge. | 11.4 K | 2023-11-28 | Released | |