| biored-valid-gpt-r-ng | | BioRed: 100 documents from Validation set<br>Model: GPT-5<br>Reasoning: high<br>Guidelines: x | 3.47 K | 2026-01-01 | | |
| biored-valid-gpt-r-g | | BioRed: 100 documents from Validation set<br>Model: GPT-5<br>Reasoning: high<br>Guidelines: o | 3.15 K | 2026-01-01 | | |
| ncbi-valid | | NCBI Disease: 100 documents from Validation Set | 791 | 2026-01-01 | | |
| ncbi-valid-gpt-nr-ng | | NCBI Disease: 100 documents from Validation Set<br>Model: GPT-5<br>Reasoning: low<br>Guidelines: x | 534 | 2026-01-01 | | |
| bc5cdr-valid-gpt-r-ng | | BC5CDR: 100 randomly selected documents from Validation set<br>Model: GPT-5<br>Reasoning: high<br>Guidelines: x | 1.99 K | 2025-12-31 | | |
| ncbi-valid-gpt-nr-g | | NCBI Disease: 100 documents from Validation Set<br>Model: GPT-5<br>Reasoning: low<br>Guidelines: o | 625 | 2026-01-01 | | |
| bc5cdr-valid-deepseek-nr-ng | | BC5CDR: 100 randomly selected documents from Validation set<br>Model: deepseek-chat (deepseek-v3.2)<br>Reasoning: low<br>Guidelines: x | 1.49 K | 2025-12-31 | | |
| bc5cdr-valid-gpt-nr-g | | BC5CDR: 100 randomly selected documents from Validation set<br>Model: GPT-5<br>Reasoning: low<br>Guidelines: o | 1.74 K | 2025-12-31 | | |
| bc5cdr-valid-deepseek-nr-g | | BC5CDR: 100 randomly selected documents from Validation set<br>Model: deepseek-chat (deepseek-v3.2)<br>Reasoning: low<br>Guidelines: o | 1.84 K | 2026-01-01 | | |
| bc5cdr-valid-gemini-r-g | | BC5CDR: 100 randomly selected documents from Validation set<br>Model: gemini-2.5-pro<br>Reasoning: high<br>Guidelines: o | 1.75 K | 2025-12-31 | | |