> top > docs > PMC:7116472 > annotations

PMC:7116472 JSONTXT

Annnotations TAB JSON ListView MergeView

LitCovid-PubTator

Id Subject Object Predicate Lexical cue tao:has_database_id
3 23-31 Species denotes patients Tax:9606
4 58-66 Disease denotes covid-19 MESH:C000657245
5 161-170 Disease denotes Mortality MESH:D003643
10 268-276 Species denotes patients Tax:9606
11 255-264 Disease denotes mortality MESH:D003643
12 303-327 Disease denotes coronavirus disease 2019 MESH:C000657245
13 329-337 Disease denotes covid-19 MESH:C000657245
18 590-601 Species denotes Coronavirus Tax:11118
19 748-756 Species denotes patients Tax:9606
20 851-859 Species denotes patients Tax:9606
21 449-459 Disease denotes Infections MESH:D007239
23 990-998 Disease denotes covid-19 MESH:C000657245
25 1083-1092 Disease denotes mortality MESH:D003643
38 1470-1488 Gene denotes C reactive protein Gene:1401
39 1110-1118 Species denotes patients Tax:9606
40 1808-1816 Species denotes Patients Tax:9606
41 1411-1417 Chemical denotes oxygen MESH:D010100
42 1454-1458 Chemical denotes urea MESH:D014508
43 1160-1169 Disease denotes mortality MESH:D003643
44 1220-1229 Disease denotes mortality MESH:D003643
45 1256-1265 Disease denotes Mortality MESH:D003643
46 1560-1569 Disease denotes mortality MESH:D003643
47 1869-1878 Disease denotes mortality MESH:D003643
48 1928-1937 Disease denotes mortality MESH:D003643
49 2210-2218 Disease denotes covid-19 MESH:C000657245
53 2563-2571 Species denotes patients Tax:9606
54 2429-2438 Disease denotes Mortality MESH:D003643
55 2598-2606 Disease denotes covid-19 MESH:C000657245
73 2819-2866 Species denotes severe acute respiratory syndrome coronavirus 2 Tax:2697049
74 2868-2878 Species denotes SARS-CoV-2 Tax:2697049
75 3004-3010 Species denotes people Tax:9606
76 3092-3098 Species denotes people Tax:9606
77 3186-3194 Species denotes patients Tax:9606
78 3319-3327 Species denotes patients Tax:9606
79 3342-3352 Species denotes SARS-CoV-2 Tax:2697049
80 2804-2813 Disease denotes infection MESH:D007239
81 2891-2900 Disease denotes mortality MESH:D003643
82 2911-2917 Disease denotes deaths MESH:D003643
83 2942-2961 Disease denotes respiratory failure MESH:D012131
84 3025-3049 Disease denotes coronavirus disease 2019 MESH:C000657245
85 3051-3059 Disease denotes covid-19 MESH:C000657245
86 3103-3107 Disease denotes died MESH:D003643
87 3200-3208 Disease denotes covid-19 MESH:C000657245
88 3328-3336 Disease denotes infected MESH:D007239
89 3384-3389 Disease denotes death MESH:D003643
101 3804-3811 Species denotes patient Tax:9606
102 3900-3908 Species denotes patients Tax:9606
103 3627-3635 Disease denotes covid-19 MESH:C000657245
104 3725-3730 Disease denotes death MESH:D003643
105 3817-3825 Disease denotes covid-19 MESH:C000657245
106 3852-3861 Disease denotes pneumonia MESH:D011014
107 3886-3892 Disease denotes sepsis MESH:D018805
108 3921-3929 Disease denotes covid-19 MESH:C000657245
109 3981-3992 Disease denotes pneumonitis MESH:D011014
110 4003-4010 Disease denotes hypoxia MESH:D000860
111 4025-4037 Disease denotes inflammation MESH:D007249
116 4712-4720 Species denotes patients Tax:9606
117 4125-4133 Disease denotes covid-19 MESH:C000657245
118 4433-4441 Disease denotes covid-19 MESH:C000657245
119 4747-4755 Disease denotes covid-19 MESH:C000657245
123 5020-5028 Species denotes patients Tax:9606
124 5007-5016 Disease denotes mortality MESH:D003643
125 5055-5063 Disease denotes covid-19 MESH:C000657245
128 5424-5435 Species denotes Coronavirus Tax:11118
129 5225-5235 Disease denotes Infections MESH:D007239
133 6001-6009 Species denotes patients Tax:9606
134 6313-6320 Species denotes patient Tax:9606
135 6395-6404 Disease denotes infection MESH:D007239
149 7113-7120 Gene denotes insulin Gene:3630
150 7282-7290 Species denotes patients Tax:9606
151 6846-6861 Disease denotes cardiac disease MESH:D006331
152 6863-6890 Disease denotes chronic respiratory disease MESH:D012140
153 6902-6908 Disease denotes asthma MESH:D001249
154 6911-6932 Disease denotes chronic renal disease MESH:D051436
155 6992-7005 Disease denotes liver disease MESH:D008107
156 7007-7015 Disease denotes dementia MESH:D003704
157 7077-7094 Disease denotes diabetes mellitus MESH:D003920
158 7134-7137 Disease denotes HIV MESH:D015658
159 7141-7145 Disease denotes AIDS MESH:D000163
160 7151-7161 Disease denotes malignancy MESH:D009369
161 7346-7355 Disease denotes infection MESH:D007239
165 7524-7532 Species denotes patients Tax:9606
166 7422-7429 Disease denotes obesity MESH:D009765
167 7538-7546 Disease denotes covid-19 MESH:C000657245
173 8098-8106 Species denotes patients Tax:9606
174 8229-8237 Species denotes patients Tax:9606
175 8336-8344 Species denotes patients Tax:9606
176 8004-8013 Disease denotes mortality MESH:D003643
177 8145-8165 Disease denotes SARS-CoV-2 infection MESH:C000657245
186 8496-8503 Species denotes patient Tax:9606
187 8795-8802 Species denotes patient Tax:9606
188 8978-8986 Species denotes patients Tax:9606
189 9046-9054 Species denotes patients Tax:9606
190 8627-8635 Disease denotes covid-19 MESH:C000657245
191 8856-8865 Disease denotes pneumonia MESH:D011014
192 8870-8885 Disease denotes flulike illness MESH:D002908
193 8992-9000 Disease denotes covid-19 MESH:C000657245
197 9236-9243 Species denotes patient Tax:9606
198 9407-9415 Species denotes patients Tax:9606
199 9421-9429 Disease denotes covid-19 MESH:C000657245
202 11415-11426 Species denotes Coronavirus Tax:11118
203 11378-11387 Disease denotes Mortality MESH:D003643
205 12221-12230 Disease denotes mortality MESH:D003643
209 13506-13515 Disease denotes mortality MESH:D003643
210 13680-13689 Disease denotes mortality MESH:D003643
211 13774-13783 Disease denotes pneumonia MESH:D011014
215 13987-13995 Species denotes Patients Tax:9606
216 14261-14269 Species denotes patients Tax:9606
217 14368-14376 Species denotes patients Tax:9606
220 14795-14803 Disease denotes covid-19 MESH:C000657245
221 15206-15214 Disease denotes covid-19 MESH:C000657245
223 15369-15373 Species denotes mice Tax:10090
229 16041-16051 Species denotes SARS-CoV-2 Tax:2697049
230 16055-16066 Species denotes coronavirus Tax:11118
231 15993-16002 Disease denotes pneumonia MESH:D011014
232 16006-16012 Disease denotes sepsis MESH:D018805
233 16029-16034 Disease denotes COVID MESH:C000657245
236 16590-16597 Species denotes patient Tax:9606
237 16686-16694 Species denotes Patients Tax:9606
240 17243-17250 Species denotes patient Tax:9606
241 17345-17353 Species denotes patients Tax:9606
243 17381-17388 Species denotes Patient Tax:9606
245 17527-17535 Species denotes Patients Tax:9606
251 17677-17685 Species denotes patients Tax:9606
252 17764-17772 Species denotes patients Tax:9606
253 17869-17877 Species denotes patients Tax:9606
254 17898-17906 Species denotes patients Tax:9606
255 17836-17845 Disease denotes mortality MESH:D003643
261 18706-18724 Gene denotes C reactive protein Gene:1401
262 18651-18657 Chemical denotes oxygen MESH:D010100
263 18690-18694 Chemical denotes urea MESH:D014508
264 18505-18514 Disease denotes mortality MESH:D003643
265 18678-18682 Disease denotes coma MESH:D003128
267 19311-19320 Disease denotes Mortality MESH:D003643
271 19363-19372 Disease denotes Mortality MESH:D003643
272 19410-19415 Disease denotes death MESH:D003643
273 19531-19540 Disease denotes Mortality MESH:D003643
279 19775-19783 Species denotes patients Tax:9606
280 19884-19892 Species denotes patients Tax:9606
281 19987-19995 Species denotes patients Tax:9606
282 20016-20024 Species denotes patients Tax:9606
283 19956-19965 Disease denotes mortality MESH:D003643
287 20190-20199 Disease denotes Mortality MESH:D003643
288 20402-20411 Disease denotes mortality MESH:D003643
289 20548-20557 Disease denotes Mortality MESH:D003643
293 20712-20721 Disease denotes mortality MESH:D003643
294 20771-20780 Disease denotes mortality MESH:D003643
295 21064-21073 Disease denotes mortality MESH:D003643
302 21088-21096 Species denotes Patients Tax:9606
303 21219-21227 Species denotes Patients Tax:9606
304 21352-21360 Species denotes patients Tax:9606
305 21161-21170 Disease denotes mortality MESH:D003643
306 21287-21296 Disease denotes mortality MESH:D003643
307 21404-21413 Disease denotes mortality MESH:D003643
316 21902-21910 Species denotes patients Tax:9606
317 21704-21713 Disease denotes Mortality MESH:D003643
318 21810-21819 Disease denotes mortality MESH:D003643
319 21935-21944 Disease denotes pneumonia MESH:D011014
320 22002-22011 Disease denotes mortality MESH:D003643
321 22212-22220 Disease denotes covid-19 MESH:C000657245
322 22300-22310 Disease denotes COVID-GRAM MESH:C000657245
323 22401-22410 Disease denotes Mortality MESH:D003643
327 22532-22540 Species denotes patients Tax:9606
328 22799-22807 Species denotes patients Tax:9606
329 23013-23022 Disease denotes Mortality MESH:D003643
331 23605-23614 Disease denotes Mortality MESH:D003643
335 23923-23926 Species denotes men Tax:9606
336 24031-24036 Species denotes women Tax:9606
337 23837-23846 Disease denotes Mortality MESH:D003643
345 24434-24442 Species denotes patients Tax:9606
346 24507-24514 Species denotes patient Tax:9606
347 24685-24693 Species denotes patients Tax:9606
348 24375-24384 Disease denotes Mortality MESH:D003643
349 24469-24477 Disease denotes covid-19 MESH:C000657245
350 24486-24495 Disease denotes Mortality MESH:D003643
351 24710-24715 Disease denotes death MESH:D003643
353 25388-25397 Disease denotes Mortality MESH:D003643
367 26330-26348 Gene denotes C reactive protein Gene:1401
368 25593-25600 Species denotes patient Tax:9606
369 26189-26197 Species denotes patients Tax:9606
370 26071-26075 Chemical denotes urea MESH:D014508
371 25546-25555 Disease denotes Mortality MESH:D003643
372 25644-25656 Disease denotes inflammation MESH:D007249
373 25742-25748 Disease denotes sepsis MESH:D018805
374 25772-25781 Disease denotes pneumonia MESH:D011014
375 26003-26009 Disease denotes sepsis MESH:D018805
376 26033-26042 Disease denotes pneumonia MESH:D011014
377 26162-26171 Disease denotes mortality MESH:D003643
378 26212-26220 Disease denotes covid-19 MESH:C000657245
379 26264-26272 Disease denotes covid-19 MESH:C000657245
383 26834-26840 Chemical denotes oxygen MESH:D010100
384 26393-26401 Disease denotes covid-19 MESH:C000657245
385 26493-26503 Disease denotes COVID-GRAM MESH:C000657245
387 27414-27422 Disease denotes covid-19 MESH:C000657245
391 28029-28037 Species denotes patients Tax:9606
392 27826-27834 Disease denotes covid-19 MESH:C000657245
393 28043-28051 Disease denotes covid-19 MESH:C000657245
396 28721-28728 Species denotes patient Tax:9606
397 28479-28487 Disease denotes covid-19 MESH:C000657245
401 29293-29300 Species denotes patient Tax:9606
402 28909-28918 Disease denotes Mortality MESH:D003643
403 28976-28984 Disease denotes covid-19 MESH:C000657245
406 29661-29668 Species denotes patient Tax:9606
407 29641-29650 Disease denotes infection MESH:D007239
413 30144-30152 Disease denotes covid-19 MESH:C000657245
414 30311-30323 Disease denotes hypertension MESH:D006973
415 30334-30355 Disease denotes myocardial infarction MESH:D009203
416 30361-30367 Disease denotes stroke MESH:D020521
417 30530-30539 Disease denotes Mortality MESH:D003643
429 30583-30591 Species denotes patients Tax:9606
430 30654-30662 Species denotes patients Tax:9606
431 30836-30843 Species denotes patient Tax:9606
432 30913-30920 Species denotes patient Tax:9606
433 30982-30990 Species denotes patients Tax:9606
434 31336-31344 Species denotes patients Tax:9606
435 30757-30766 Disease denotes mortality MESH:D003643
436 30944-30953 Disease denotes Mortality MESH:D003643
437 31036-31045 Disease denotes mortality MESH:D003643
438 31214-31219 Disease denotes death MESH:D003643
439 31289-31298 Disease denotes Mortality MESH:D003643
443 31520-31528 Species denotes patients Tax:9606
444 31534-31542 Disease denotes covid-19 MESH:C000657245
445 31567-31576 Disease denotes mortality MESH:D003643
456 31915-31923 Species denotes Patients Tax:9606
457 32374-32382 Species denotes patients Tax:9606
458 32521-32529 Chemical denotes steroids MESH:D013256
459 31934-31943 Disease denotes Mortality MESH:D003643
460 31986-31995 Disease denotes mortality MESH:D003643
461 32125-32134 Disease denotes mortality MESH:D003643
462 32136-32145 Disease denotes mortality MESH:D003643
463 32232-32241 Disease denotes mortality MESH:D003643
464 32304-32313 Disease denotes pneumonia MESH:D011014
465 32432-32437 Disease denotes death MESH:D003643
504 34923-34926 Gene denotes SRK Gene:6199
505 34962-34964 Gene denotes CS Gene:1431
506 35047-35050 Gene denotes CAS Gene:9564
507 35084-35087 Gene denotes SRK Gene:6199
508 35112-35115 Gene denotes MGP Gene:4256
509 35203-35206 Gene denotes SRK Gene:6199
510 35267-35270 Gene denotes SRK Gene:6199
511 35289-35292 Gene denotes CAS Gene:9564
512 35298-35300 Gene denotes CS Gene:1431
513 35362-35365 Gene denotes CAS Gene:9564
514 35420-35423 Gene denotes SRK Gene:6199
515 35452-35455 Gene denotes SRK Gene:6199
516 35516-35519 Gene denotes SRK Gene:6199
517 35525-35528 Gene denotes MGP Gene:4256
518 35559-35562 Gene denotes SRK Gene:6199
519 35597-35600 Gene denotes SRK Gene:6199
520 35663-35666 Gene denotes MGP Gene:4256
521 35677-35679 Gene denotes CS Gene:1431
522 35718-35721 Gene denotes SRK Gene:6199
523 34928-34930 Disease denotes AH MESH:D007039
524 34977-34980 Disease denotes MGS MESH:C548078
525 35052-35055 Disease denotes MGS MESH:C548078
526 35089-35091 Disease denotes AH MESH:D007039
527 35097-35100 Disease denotes TMD MESH:D049310
528 35178-35181 Disease denotes MGS MESH:C548078
529 35208-35210 Disease denotes AH MESH:D007039
530 35239-35242 Disease denotes MGS MESH:C548078
531 35272-35274 Disease denotes AH MESH:D007039
532 35367-35370 Disease denotes MGS MESH:C548078
533 35400-35403 Disease denotes MGS MESH:C548078
534 35491-35494 Disease denotes MGS MESH:C548078
535 35606-35608 Disease denotes AH MESH:D007039
536 35622-35625 Disease denotes TMD MESH:D049310
537 35703-35706 Disease denotes MGS MESH:C548078
538 35726-35728 Disease denotes AH MESH:D007039
539 35754-35757 Disease denotes MGS MESH:C548078
540 35798-35801 Disease denotes MGS MESH:C548078
541 35645-35647 CellLine denotes MN CVCL:U508
556 36269-36288 Disease denotes Zoonotic Infections MESH:D015047
557 36420-36442 Disease denotes Respiratory Infections MESH:D012141
558 36591-36597 Disease denotes Cancer MESH:D009369
559 37713-37734 Disease denotes Respiratory Infection MESH:D012141
560 37975-37978 Disease denotes MGS MESH:C548078
561 38066-38085 Disease denotes Zoonotic Infections MESH:D015047
562 38276-38295 Disease denotes Zoonotic Infections MESH:D015047
563 36199-36202 CellLine denotes MRC CVCL:0440
564 37350-37352 CellLine denotes MN CVCL:U508
565 37585-37588 CellLine denotes MRC CVCL:0440
566 37590-37593 CellLine denotes MRC CVCL:0440
567 37640-37643 CellLine denotes MRC CVCL:0440
568 37967-37970 CellLine denotes MRC CVCL:0440
569 38025-38028 CellLine denotes MRC CVCL:0440
572 39516-39528 Species denotes participants Tax:9606
573 39541-39548 Species denotes patient Tax:9606
575 39929-39935 Species denotes People Tax:9606
700 40143-40151 Species denotes patients Tax:9606
701 40278-40286 Species denotes patients Tax:9606
702 40157-40181 Disease denotes coronavirus disease 2019 MESH:C000657245
703 40183-40191 Disease denotes covid-19 MESH:C000657245
704 40224-40233 Disease denotes mortality MESH:D003643
705 40292-40300 Disease denotes covid-19 MESH:C000657245
707 40573-40577 Disease denotes ADDS
711 40709-40716 Species denotes patient Tax:9606
712 40592-40600 Disease denotes covid-19 MESH:C000657245
713 40769-40777 Disease denotes covid-19 MESH:C000657245
719 40786-40797 Species denotes Coronavirus Tax:11118
720 40947-40955 Species denotes patients Tax:9606
721 40836-40845 Disease denotes Mortality MESH:D003643
722 40912-40921 Disease denotes mortality MESH:D003643
723 41014-41019 Disease denotes death MESH:D003643
726 41325-41349 Disease denotes coronavirus disease 2019 MESH:C000657245
727 41417-41427 Disease denotes Infections MESH:D007239
734 41575-41583 Species denotes patients Tax:9606
735 41603-41612 Disease denotes Mortality MESH:D003643
736 41676-41685 Disease denotes mortality MESH:D003643
737 41705-41714 Disease denotes Mortality MESH:D003643
738 41809-41818 Disease denotes mortality MESH:D003643
739 41850-41859 Disease denotes Mortality MESH:D003643
742 42258-42266 Species denotes patients Tax:9606
743 42297-42305 Species denotes patients Tax:9606
746 42415-42423 Species denotes patients Tax:9606
747 42450-42458 Disease denotes covid-19 MESH:C000657245
762 45313-45331 Gene denotes C reactive protein Gene:1401
763 42519-42527 Species denotes patients Tax:9606
764 42569-42577 Species denotes patients Tax:9606
765 45171-45175 Chemical denotes Urea MESH:D014508
766 45237-45247 Chemical denotes Creatinine MESH:D003404
767 42612-42621 Disease denotes Mortality MESH:D003643
768 43254-43269 Disease denotes cardiac disease MESH:D006331
769 43328-43350 Disease denotes Chronic kidney disease MESH:D051436
770 43407-43425 Disease denotes Malignant neoplasm MESH:D009369
771 43501-43514 Disease denotes liver disease MESH:D008107
772 43651-43676 Disease denotes Chronic pulmonary disease MESH:D008171
773 43682-43688 Disease denotes asthma MESH:D001249
774 43746-43754 Disease denotes Diabetes MESH:D003920
775 44483-44487 Disease denotes coma MESH:D003128
777 45406-45430 Disease denotes coronavirus disease 2019 MESH:C000657245
779 45561-45568 Disease denotes obesity MESH:D009765
784 45633-45641 Species denotes patients Tax:9606
785 45588-45597 Disease denotes Mortality MESH:D003643
786 45620-45629 Disease denotes mortality MESH:D003643
787 45647-45655 Disease denotes covid-19 MESH:C000657245
792 46170-46188 Gene denotes C reactive protein Gene:1401
793 46119-46123 Chemical denotes Urea MESH:D014508
794 45744-45753 Disease denotes Mortality MESH:D003643
795 46079-46083 Disease denotes coma MESH:D003128
797 46245-46269 Disease denotes coronavirus disease 2019 MESH:C000657245
799 46375-46382 Disease denotes obesity MESH:D009765
801 46562-46571 Disease denotes Mortality MESH:D003643
804 46876-46885 Disease denotes Mortality MESH:D003643
805 46916-46925 Disease denotes mortality MESH:D003643
810 46996-47004 Species denotes patients Tax:9606
811 47078-47087 Disease denotes Mortality MESH:D003643
812 47101-47110 Disease denotes mortality MESH:D003643
813 47526-47535 Disease denotes mortality MESH:D003643
816 48121-48130 Disease denotes mortality MESH:D003643
817 48144-48153 Disease denotes Mortality MESH:D003643
822 48268-48276 Species denotes patients Tax:9606
823 48306-48314 Species denotes patients Tax:9606
824 48288-48294 Disease denotes deaths MESH:D003643
825 48326-48332 Disease denotes deaths MESH:D003643
829 48775-48783 Species denotes patients Tax:9606
830 48762-48771 Disease denotes mortality MESH:D003643
831 48789-48797 Disease denotes covid-19 MESH:C000657245
836 49114-49118 Gene denotes SCAP Gene:22937
837 48830-48838 Species denotes patients Tax:9606
838 49186-49191 Disease denotes COVID MESH:C000657245
839 49463-49472 Disease denotes Mortality MESH:D003643
841 49582-49606 Disease denotes coronavirus disease 2019 MESH:C000657245
843 49669-49677 Disease denotes covid-19 MESH:C000657245

LitCovid-sentences

Id Subject Object Predicate Lexical cue
T1 0-176 Sentence denotes Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: development and validation of the 4C Mortality Score
T2 178-186 Sentence denotes Abstract
T3 187-196 Sentence denotes Objective
T4 197-339 Sentence denotes To develop and validate a pragmatic risk score to predict mortality in patients admitted to hospital with coronavirus disease 2019 (covid-19).
T5 341-347 Sentence denotes Design
T6 348-387 Sentence denotes Prospective observational cohort study.
T7 389-396 Sentence denotes Setting
T8 397-703 Sentence denotes International Severe Acute Respiratory and emerging Infections Consortium (ISARIC) World Health Organization (WHO) Clinical Characterisation Protocol UK (CCP-UK) study (performed by the ISARIC Coronavirus Clinical Characterisation Consortium—ISARIC-4C) in 260 hospitals across England, Scotland, and Wales.
T9 704-926 Sentence denotes Model training was performed on a cohort of patients recruited between 6 February and 20 May 2020, with validation conducted on a second cohort of patients recruited after model development between 21 May and 29 June 2020.
T10 928-940 Sentence denotes Participants
T11 941-1048 Sentence denotes Adults (age ≥18 years) admitted to hospital with covid-19 at least four weeks before final data extraction.
T12 1050-1070 Sentence denotes Main Outcome Measure
T13 1071-1093 Sentence denotes In-hospital mortality.
T14 1095-1102 Sentence denotes Results
T15 1103-1242 Sentence denotes 35 463 patients were included in the derivation dataset (mortality rate 32.2%) and 22 361 in the validation dataset (mortality rate 30.1%).
T16 1243-1515 Sentence denotes The final 4C Mortality Score included eight variables readily available at initial hospital assessment: age, sex, number of comorbidities, respiratory rate, peripheral oxygen saturation, level of consciousness, urea level, and C reactive protein (score range 0-21 points).
T17 1516-1707 Sentence denotes The 4C Score showed high discrimination for mortality (derivation cohort: area under the receiver operating characteristic curve 0.79, 95% confidence interval 0.78 to 0.79; validation cohort:
T18 1708-1807 Sentence denotes 0.77, 0.76 to 0.77) with excellent calibration (validation: calibration-in-the-large=0, slope=1.0).
T19 1808-2018 Sentence denotes Patients with a score of at least 15 (n=4158, 19%) had a 62% mortality (positive predictive value 62%) compared with 1% mortality for those with a score of 3 or less (n=1650, 7%; negative predictive value 99%).
T20 2019-2269 Sentence denotes Discriminatory performance was higher than 15 pre-existing risk stratification scores (area under the receiver operating characteristic curve range 0.61-0.76), with scores developed in other covid-19 cohorts often performing poorly (range 0.63-0.73).
T21 2271-2282 Sentence denotes Conclusions
T22 2283-2421 Sentence denotes An easy-to-use risk stratification score has been developed and validated based on commonly available parameters at hospital presentation.
T23 2422-2640 Sentence denotes The 4C Mortality Score outperformed existing scores, showed utility to directly inform clinical decision making, and can be used to stratify patients admitted to hospital with covid-19 into different management groups.
T24 2641-2731 Sentence denotes The score should be further validated to determine its applicability in other populations.
T25 2733-2751 Sentence denotes Study Registration
T26 2752-2766 Sentence denotes ISRCTN66726260
T27 2768-2780 Sentence denotes Introduction
T28 2781-3443 Sentence denotes Disease resulting from infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has a high mortality rate with deaths predominantly caused by respiratory failure.1 As of 1 September 2020, over 25 million people had confirmed coronavirus disease 2019 (covid-19) worldwide and at least 850 000 people had died from the disease.23 As hospitals around the world are faced with an influx of patients with covid-19, there is an urgent need for a pragmatic risk stratification tool that will allow the early identification of patients infected with SARS-CoV-2 who are at the highest risk of death to guide management and optimise resource allocation.
T29 3444-3540 Sentence denotes Prognostic scores attempt to transform complex clinical pictures into tangible numerical values.
T30 3541-3741 Sentence denotes Prognostication is more difficult when dealing with a severe pandemic illness such as covid-19 because strain on healthcare resources and rapidly evolving treatments alter the risk of death over time.
T31 3742-4065 Sentence denotes Early information has suggested that the clinical course of a patient with covid-19 is different from that of pneumonia, seasonal influenza, or sepsis.4 Most patients with severe covid-19 have developed a clinical picture characterised by pneumonitis, profound hypoxia, and systemic inflammation affecting multiple organs.1
T32 4066-4231 Sentence denotes A recent review identified many prognostic scores used for covid-19,5 which varied in their setting, predicted outcome measure, and the clinical parameters included.
T33 4232-4812 Sentence denotes The large number of risk stratification tools reflects difficulties in their application, with most scores showing moderate performance at best and no benefit to clinical decision making.67 Many novel covid-19 prognostic scores have been found to have a high risk of bias, which could reflect development in small cohorts, and many have been published without clear details of model derivation and testing.5 Therefore, a risk stratification tool within a large national cohort of patients admitted to hospital with covid-19 is needed with clear development and validation details.
T34 4813-5064 Sentence denotes Our aim was to develop and validate a pragmatic, clinically relevant risk stratification score that uses routinely available clinical information at hospital presentation to predict in-hospital mortality in patients admitted to hospital with covid-19.
T35 5065-5133 Sentence denotes We then aimed to compare this score with existing prognostic models.
T36 5135-5142 Sentence denotes Methods
T37 5144-5168 Sentence denotes Study design and setting
T38 5169-5380 Sentence denotes The International Severe Acute Respiratory and emerging Infections Consortium (ISARIC) World Health Organization (WHO) Clinical Characterisation Protocol UK (CCP-UK) study is an ongoing prospective cohort study.
T39 5381-5650 Sentence denotes The study is being performed by the ISARIC Coronavirus Clinical Characterisation Consortium (ISARIC-4C) in 260 hospitals across England, Scotland, and Wales (National Institute for Health Research Clinical Research Network Central Portfolio Management System ID 14152).
T40 5651-5954 Sentence denotes The protocol and further study details are available online.8 Model development and reporting followed the TRIPOD (transparent reporting of a multivariable prediction model for individual prediction or diagnosis) guidelines.9 The study is being conducted according to a predefined protocol (appendix 1).
T41 5956-5968 Sentence denotes Participants
T42 5969-6238 Sentence denotes The study recruited consecutive patients aged 18 years and older with a completed index admission to one of 260 hospitals in England, Scotland, or Wales.8 Reverse transcriptase polymerase chain reaction was the only mode of testing available during the period of study.
T43 6239-6350 Sentence denotes The decision to test was at the discretion of the clinician attending the patient, and not defined by protocol.
T44 6351-6520 Sentence denotes The enrolment criterion “high likelihood of infection” reflected that a preparedness protocol cannot assume a diagnostic test will be available for an emergent pathogen.
T45 6521-6613 Sentence denotes In this activation, site training emphasised the importance of only recruiting proven cases.
T46 6615-6630 Sentence denotes Data collection
T47 6631-6727 Sentence denotes Demographic, clinical, and outcome data were collected by using a prespecified case report form.
T48 6728-7162 Sentence denotes Comorbidities were defined according to a modified Charlson comorbidity index.10 Comorbidities collected were chronic cardiac disease, chronic respiratory disease (excluding asthma), chronic renal disease (estimated glomerular filtration rate ≤30), mild to severe liver disease, dementia, chronic neurological conditions, connective tissue disease, diabetes mellitus (diet, tablet, or insulin controlled), HIV or AIDS, and malignancy.
T49 7163-7403 Sentence denotes These conditions were selected a priori by a global consortium to provide rapid, coordinated clinical investigation of patients presenting with any severe or potentially severe acute infection of public interest and enabled standardisation.
T50 7404-7811 Sentence denotes Clinician defined obesity was also included as a comorbidity owing to its probable association with adverse outcomes in patients with covid-19.1112 The clinical information used to calculate prognostic scores was taken from the day of admission to hospital.13 A practical approach was taken to sample size requirements.14 We used all available data to maximise the power and generalisability of our results.
T51 7812-7957 Sentence denotes Model reliability was assessed by using a temporally distinct validation cohort with geographical subsetting, together with sensitivity analyses.
T52 7959-7967 Sentence denotes Outcomes
T53 7968-8014 Sentence denotes The primary outcome was in-hospital mortality.
T54 8015-8183 Sentence denotes This outcome was selected because of the importance of the early identification of patients likely to develop severe illness from SARS-CoV-2 infection (a rule in test).
T55 8184-8382 Sentence denotes We chose to restrict analysis of outcomes to patients who were admitted more than four weeks before final data extraction (29 June 2020) to enable most patients to complete their hospital admission.
T56 8384-8415 Sentence denotes Independent predictor variables
T57 8416-9084 Sentence denotes A reduced set of potential predictor variables was selected a priori, including patient demographic information, common clinical investigations, and parameters consistently identified as clinically important in covid-19 cohorts following the methods described by Wynants and colleagues (appendix 2).5 Candidate predictor variables were selected based on three common criteria15: patient and clinical variables known to influence outcome in pneumonia and flulike illness; clinical biomarkers previously identified within the literature as potential predictors in patients with covid-19; values available for at least two thirds of patients within the derivation cohort.
T58 9085-9344 Sentence denotes Because our overall aim was to develop an easy-to-use risk stratification score, we made the decision to include an overall comorbidity count for each patient within model development giving each comorbidity equal weight, rather than individual comorbidities.
T59 9345-9505 Sentence denotes Recent evidence suggests an additive effect of comorbidity in patients with covid-19, with increasing number of comorbidities associated with poorer outcomes.16
T60 9507-9524 Sentence denotes Model development
T61 9525-9692 Sentence denotes Missing values for potential candidate variables were handled by using multiple imputation with chained equations, under the missing at random assumption (appendix 6).
T62 9693-9824 Sentence denotes Ten sets, each with 10 iterations, were imputed using available explanatory variables for both cohorts (derivation and validation).
T63 9825-9931 Sentence denotes The outcome variable was included as a predictor in the derivation dataset but not the validation dataset.
T64 9932-10048 Sentence denotes All model derivation and validation was performed in imputed datasets, with Rubin’s rules17 used to combine results.
T65 10049-10115 Sentence denotes Models were trained by using all available data up to 20 May 2020.
T66 10116-10262 Sentence denotes The primary intention was to create a pragmatic model for bedside use not requiring complex equations, online calculators, or mobile applications.
T67 10263-10368 Sentence denotes An a priori decision was therefore made to categorise continuous variables in the final prognostic score.
T68 10369-10422 Sentence denotes We used a three stage model building process (fig 1).
T69 10423-10611 Sentence denotes Firstly, generalised additive models were built incorporating continuous smoothed predictors (penalised thin plate splines) in combination with categorical predictors as linear components.
T70 10612-10796 Sentence denotes A criterion based approach to variable selection was taken based on the deviance explained, the unbiased risk estimator, and the area under the receiver operating characteristic curve.
T71 10797-10978 Sentence denotes Secondly, we visually inspected plots of component smoothed continuous predictors for linearity, and selected optimal cut-off values by using the methods of Barrio and colleagues.18
T72 10979-11116 Sentence denotes Lastly, final models using categorised variables were specified with least absolute shrinkage and selection operator logistic regression.
T73 11117-11270 Sentence denotes L1 penalised coefficients were derived using 10-fold cross validation to select the value of lambda (minimised cross validated sum of squared residuals).
T74 11271-11465 Sentence denotes We converted shrunk coefficients to a prognostic index with appropriate scaling to create the pragmatic 4C Mortality Score (where 4C stands for Coronavirus Clinical Characterisation Consortium).
T75 11466-11555 Sentence denotes We used machine learning approaches in parallel for comparison of predictive performance.
T76 11556-11730 Sentence denotes Given issues with interpretability, this was intended to provide a best-in-class comparison of predictive performance when accounting for any complex underlying interactions.
T77 11731-11784 Sentence denotes Gradient boosting decision trees were used (XGBoost).
T78 11785-11911 Sentence denotes All candidate predictor variables identified were included within the model, except for those with high missing values (>33%).
T79 11912-12043 Sentence denotes We retained individual major comorbidity variables within the model to determine whether inclusion improved predictive performance.
T80 12044-12133 Sentence denotes An 80%/20% random split of the derivation dataset was used to define train and test sets.
T81 12134-12210 Sentence denotes The validation datasets were held back and not used in the training process.
T82 12211-12499 Sentence denotes We used a mortality label and design matrix of centred or standardised continuous and categorical variables including all candidate variables to train gradient boosted trees minimising the binary classification error rate (defined as number of wrong cases divided by number of all cases).
T83 12500-12667 Sentence denotes Hyperparameters were tuned, including the learning rate and maximum tree depth, to maximise the area under the receiver operating characteristic curve in the test set.
T84 12668-12866 Sentence denotes This approach affords flexibility in the handling of missing data; therefore, two models were trained and optimised, one using imputed data and the other modelling missingness in complete case data.
T85 12867-13086 Sentence denotes We assessed discrimination for all models by using the area under the receiver operating characteristic curve in the derivation cohort, with 95% confidence intervals calculated by bootstrapped resampling (2000 samples).
T86 13087-13302 Sentence denotes A value of 0.5 indicates no predictive ability, 0.8 is considered good, and 1.0 is perfect.19 We assessed overall goodness of fit with the Brier score,20 a measure to quantify how close predictions are to the truth.
T87 13303-13394 Sentence denotes The score ranges between 0 and 1, where smaller values indicate superior model performance.
T88 13395-13575 Sentence denotes We plotted model calibration curves to examine agreement between predicted and observed risk across deciles of mortality risk to determine the presence of over or under prediction.
T89 13576-13815 Sentence denotes Risk cut-off values were defined by the total point score for an individual, which represented low (<2% mortality rate), intermediate (2-14.9%), or high risk (≥15%) groups, similar to commonly used pneumonia risk stratification scores.2122
T90 13816-13878 Sentence denotes We performed sensitivity analyses by using complete case data.
T91 13879-13968 Sentence denotes Model discrimination was also checked in ethnic groups and by sex using imputed datasets.
T92 13970-13986 Sentence denotes Model validation
T93 13987-14109 Sentence denotes Patients entered into the ISARIC WHO CCP-UK study after 20 May 2020 were included in a separate validation cohort (fig 1).
T94 14110-14215 Sentence denotes We determined discrimination, calibration, and performance across a range of clinically relevant metrics.
T95 14216-14355 Sentence denotes To avoid bias in the assessment of outcomes, patients who were admitted within four weeks of data extraction on 29 June 2020 were excluded.
T96 14356-14449 Sentence denotes We included patients without an outcome after four weeks and considered to have had no event.
T97 14450-14563 Sentence denotes A sensitivity analysis was also performed, with stratification of the validation cohort by geographical location.
T98 14564-14933 Sentence denotes We selected this geographical categorisation based on well described economic and health inequalities between the north and south of the United Kingdom.2324 Recent analysis has shown the impact of deprivation on risk of dying with covid-19.25 As a result, population differences between regions could change the discriminatory performance of risk stratification scores.
T99 14934-15226 Sentence denotes Two geographical cohorts were created, based on north-south geographical locations across the UK as defined by Hacking and colleagues.23 We performed a further sensitivity analysis to determine model performance in ethnic minority groups given the reported differences in covid-19 outcomes.26
T100 15227-15323 Sentence denotes All tests were two tailed and P values less than 0.05 were considered statistically significant.
T101 15324-15465 Sentence denotes We used R (version 3.6.3) with the finalfit, mice, glmnet, pROC, recipes, xgboost, rmda, and tidyverse packages for all statistical analysis.
T102 15467-15518 Sentence denotes Comparison with existing risk stratification scores
T103 15519-15628 Sentence denotes All derived models in the derivation dataset were compared within the validation cohort with existing scores.
T104 15629-15821 Sentence denotes We assessed model performance by using the area under the receiver operating characteristic curve statistic, sensitivity, specificity, positive predictive value, and negative predictive value.
T105 15822-15966 Sentence denotes Existing risk stratification scores were identified through a systematic literature search of Embase, WHO Medicus, and Google Scholar databases.
T106 15967-16151 Sentence denotes We used the search terms “pneumonia,” “sepsis,” “influenza,” “COVID-19,” “SARS-CoV-2,” “coronavirus” combined with “score” and “prognosis.” We applied no language or date restrictions.
T107 16152-16197 Sentence denotes The last search was performed on 1 July 2020.
T108 16198-16332 Sentence denotes Risk stratification tools were included whose variables were available within the database and had accessible methods for calculation.
T109 16333-16539 Sentence denotes We calculated performance characteristics according to original publications, and selected score cutoff values for adverse outcomes based on the most commonly used criteria identified within the literature.
T110 16540-16685 Sentence denotes Cut-off values were the score value for which the patient was considered at low or high risk of adverse outcome, as defined by the study authors.
T111 16686-16775 Sentence denotes Patients with one or more missing input variables were omitted for that particular score.
T112 16776-16937 Sentence denotes We also performed a decision curve analysis.27 Briefly, assessment of the adequacy of clinical prediction models can be extended by determining clinical utility.
T113 16938-17149 Sentence denotes By using decision curve analysis, we can make a clinical judgment about the relative value of benefits (treating a true positive) and harms (treating a false positive) associated with a clinical prediction tool.
T114 17150-17379 Sentence denotes The standardised net benefit was plotted against the threshold probability for considering a patient high risk for age alone and for the best discriminating models applicable to more than 50% of patients in the validation cohort.
T115 17381-17411 Sentence denotes Patient and public involvement
T116 17412-17526 Sentence denotes This was an urgent public health research study in response to a Public Health Emergency of International Concern.
T117 17527-17637 Sentence denotes Patients or the public were not involved in the design, conduct, or reporting of this rapid response research.
T118 17639-17646 Sentence denotes Results
T119 17647-17823 Sentence denotes We collected data from 35 463 patients between 6 February 2020 and 20 May 2020 in the derivation cohort; 1275 (3.6%) patients had no outcome recorded and were considered alive.
T120 17824-17879 Sentence denotes The overall mortality rate was 32.2% (11 426 patients).
T121 17880-18038 Sentence denotes The median age of patients in the cohort was 73 years (interquartile range 59-83); 41.7% (14 741) were female and 76.0% (26 966) had at least one comorbidity.
T122 18039-18137 Sentence denotes Table 1 shows demographic and clinical characteristics for the derivation and validation datasets.
T123 18139-18156 Sentence denotes Model development
T124 18157-18274 Sentence denotes We identified 41 candidate predictor variables measured at hospital admission for model creation (fig 1, appendix 2).
T125 18275-18460 Sentence denotes After the creation of a composite variable containing all seven individual comorbidities and the exclusion of 13 variables owing to high levels of missing values, 21 variables remained.
T126 18461-18774 Sentence denotes We identified eight important predictors of mortality by using generalised additive modelling with multiply imputed datasets: age, sex, number of comorbidities, respiratory rate, peripheral oxygen saturation, Glasgow coma scale, urea level, and C reactive protein (for variable selection process, see appendix 3).
T127 18775-19025 Sentence denotes Given the need for a pragmatic score for use at the bedside, continuous variables were converted to factors with cut-off values chosen by using component smoothed functions (on linear predictor scale) from generalised additive modelling (appendix 4).
T128 19026-19206 Sentence denotes On entering variables into a penalised logistic regression model (least absolute shrinkage and selection operator), all variables were retained within the final model (appendix 5).
T129 19207-19355 Sentence denotes We converted penalised regression coefficients into a prognostic index by using appropriate scaling (4C Mortality Score range 0-21 points; table 2).
T130 19356-19523 Sentence denotes The 4C Mortality Score showed good discrimination for death in hospital within the derivation cohort (table 3), with performance approaching that of the XGBoost model.
T131 19524-19708 Sentence denotes The 4C Mortality Score showed good calibration (calibration intercept=0, slope=1, Brier score 0.170) across the range of risk and no adjustment to the model was required (appendix 11).
T132 19710-19726 Sentence denotes Model validation
T133 19727-19943 Sentence denotes The validation cohort included data from 22 361 patients collected between 21 May 2020 and 29 June 2020 who had at least four weeks of follow-up; 743 (3.3%) patients had no outcome recorded and were considered alive.
T134 19944-19997 Sentence denotes The overall mortality rate was 30.1% (6729 patients).
T135 19998-20164 Sentence denotes The median age of patients in the cohort was 76 (interquartile range 60-85) years; 10 178 (45.6%) were female and 17 263 (77%) had at least one comorbidity (table 1).
T136 20165-20282 Sentence denotes Discrimination of the 4C Mortality Score in the validation cohort was similar to that of the XGBoost model (table 3).
T137 20283-20540 Sentence denotes Calibration was also found to be excellent in the validation cohort: overall observed (30.1%) versus predicted (30.1%) mortality was equal (calibration-in-the-large=0) and calibration was excellent over the range of risk (slope=1, Brier score 0.171; fig 2).
T138 20541-20662 Sentence denotes The 4C Mortality Score showed good performance in clinically relevant metrics across a range of cut-off values (table 4).
T139 20663-20899 Sentence denotes Four risk groups were defined with corresponding mortality rates determined (table 5): low risk (0-3 score, mortality rate 1.2%), intermediate risk (4-8 score, 9.9%), high risk (9-14 score, 31.4%), and very high risk (≥15 score, 61.5%).
T140 20900-21087 Sentence denotes Performance metrics showed a high sensitivity (99.7%) and negative predictive value (98.8%) for the low risk group, covering 7.4% of the cohort and a corresponding mortality rate of 1.2%.
T141 21088-21218 Sentence denotes Patients in the intermediate risk group (score 4-8, n=4889, 21.9%) had a mortality rate of 9.9% (negative predictive value 90.1%).
T142 21219-21462 Sentence denotes Patients in the high risk group (score 9-14, n=11 664, 52.2%) had a mortality rate of 31.4% (negative predictive value 68.6%), while patients scoring 15 or higher (n=4158, 18.6%) had a mortality rate of 61.5% (positive predictive value 61.5%).
T143 21463-21531 Sentence denotes An interactive infographic is available at https://isaric4c.net/risk
T144 21533-21563 Sentence denotes Comparison with existing tools
T145 21564-21850 Sentence denotes We performed a systematic literature search and identified 15 risk stratification scores that could beapplied to these data.62228-40 The 4C Mortality Score compared well against these existing risk stratification scores in predicting in-hospital mortality (table 6, fig 3, upper panel).
T146 21851-22234 Sentence denotes Risk stratification scores originally validated in patients with community acquired pneumonia (n=9) generally had higher discrimination for inhospital mortality in the validation cohort (eg, A-DROP (area under the receiver operating characteristic curve 0.74, 95% confidence interval 0.73 to 0.74) and E-CURB65 (0.76, 0.74 to 0.79)) than those developed within covid-19 cohorts (n=4:
T147 22235-22369 Sentence denotes Surgisphere (0.63, 0.62 to 0.64), DL score (0.67, 0.66 to 0.68), COVID-GRAM (0.71, 0.68 to 0.74), and Xie score (0.73, 0.70 to 0.75)).
T148 22370-22517 Sentence denotes Performance metrics for the 4C Mortality Score compared well against existing risk stratification scores at specified cut-off values (appendix 13).
T149 22518-22749 Sentence denotes The number of patients in whom risk stratification scores could be applied differed owing to certain variables not being available, either because of missingness or because they were not tested for or recorded in clinical practice.
T150 22750-22969 Sentence denotes Seven scores could be applied to fewer than 2000 patients (<10%) in the validation cohort owing to the requirement for biomarkers or physiological parameters that were not routinely captured (eg, lactate dehydrogenase).
T151 22970-23239 Sentence denotes Decision curve analysis showed that the 4C Mortality Score had better clinical utility across a wide range of threshold risks compared with the best performing existing scores applicable to more than 50% of the validation cohort (A-DROP and CURB65; fig 3, lower panel).
T152 23241-23261 Sentence denotes Sensitivity analysis
T153 23262-23442 Sentence denotes Sensitivity analyses that used complete case data showed similar discrimination (appendix 7) and performance metrics (appendices 8 and 9) to analyses that used the imputed dataset.
T154 23443-23791 Sentence denotes After stratification of the validation cohort into two geographical cohorts (validation north and south; appendix 14), discrimination remained similar for the 4C Mortality Score in the north subset (area under the receiver operating characteristic curve 0.77, 95% confidence interval 0.76 to 0.78) and south subset (0.76, 0.75 to 0.77; appendix 6).
T155 23792-23891 Sentence denotes Finally, we checked discrimination of the 4C Mortality Score by sex and ethnic group (appendix 10).
T156 23892-24058 Sentence denotes Discrimination was the same in men (area under the receiver operating characteristic curve 0.77, 95% confidence interval 0.76 to 0.78) and women (0.76, 0.75 to 0.77).
T157 24059-24145 Sentence denotes Discrimination was better in all nonwhite ethnic groups compared with the white group:
T158 24146-24288 Sentence denotes South Asian (0.82, 0.80 to 0.85), East Asian (0.85, 0.79 to 0.91), Black (0.83, 0.80 to 0.86), and other ethnic minority (0.81, 0.79 to 0.84).
T159 24290-24300 Sentence denotes Discussion
T160 24302-24320 Sentence denotes Principal findings
T161 24321-24478 Sentence denotes We have developed and validated the eight variable 4C Mortality Score in a UK prospective cohort study of 57 824 patients admitted to hospital with covid-19.
T162 24479-24728 Sentence denotes The 4C Mortality Score uses patient demographics, clinical observations, and blood parameters that are commonly available at the time of hospital admission and can accurately characterise the population of patients at high risk of death in hospital.
T163 24729-24957 Sentence denotes The score compared favourably with other models, including best-in-class machine learning techniques, and showed consistent performance across the validation cohorts, including good clinical utility in a decision curve analysis.
T164 24958-25087 Sentence denotes Model performance compared well against other generated models, with minimal loss in discrimination despite its pragmatic nature.
T165 25088-25380 Sentence denotes A machine learning approach showed a marginal improvement in discrimination, but at the cost of interpretability, the requirement for many more input variables, and the need for an app or website calculator that might limit use at the bedside given personal protective equipment requirements.
T166 25381-25507 Sentence denotes The 4C Mortality Score showed good applicability within the validation cohort and consistency across all performance measures.
T167 25509-25538 Sentence denotes Comparison with other studies
T168 25539-25820 Sentence denotes The 4C Mortality Score contains parameters reflecting patient demographics, comorbidity, physiology, and inflammation at hospital admission; it shares characteristics with existing prognostic scores for sepsis and community acquired pneumonia but has important differences as well.
T169 25821-25902 Sentence denotes No preexisting score appears to use this combination of variables and weightings.
T170 25903-26353 Sentence denotes Altered consciousness and high respiratory rate are included in most risk stratification scores for sepsis and community acquired pneumonia,21222829323336 while raised urea is also a common component.212228 Increasing age is a strong predictor of in-hospital mortality in our cohort of patients admitted with covid-19 and is commonly included in other existing covid-19 scores,374142 together with comorbidity374142 and raised C reactive protein.4043
T171 26354-26853 Sentence denotes Discriminatory performance of existing covid-19 scores applied to our cohort was lower than reported in derivation cohorts (DL score 0.74, COVID-GRAM 0.88, Xie score 0.98).373840 The use of small inpatient cohorts from Wuhan, China for model development might have resulted in overfitting, limiting generalisability in other cohorts.3840 The Xie score demonstrated the highest discriminatory power (0.73), and included age, lymphocyte count, lactate dehydrogenase, and peripheral oxygen saturations.
T172 26854-27032 Sentence denotes However, we were only able to apply this score for less than 10% of the validation cohort because lactate dehydrogenase is not routinely measured on hospital admission in the UK.
T173 27033-27572 Sentence denotes Owing to challenges of clinical data collection during an epidemic, missing data are common, with choice of predictors influenced by data availability.40 Complete case analysis often leads to exclusion of a substantial proportion of the original sample, subsequently leading to a loss of precision and power.44 However, the assessment of missing data on model performance in novel covid-19 risk stratification scores has been limited37 or unexplored,3840 potentially introducing bias and further limiting generalisability to other cohorts.
T174 27573-27769 Sentence denotes We found discriminatory performance in both derivation and validation cohorts remained similar after the imputation of a wide range of variables,41 further supporting the validity of our findings.
T175 27770-28333 Sentence denotes The presence of comorbidities is handled differently in covid-19 prognostic scores; comorbidities might be included individually,4042 given equal weight,37 or found to have no predictive effect.38 Recent evidence suggests an additive effect of comorbidity in patients with covid-19, with increasing number of comorbidities associated with poorer outcomes.16 In our cohort, the inclusion of individual comorbidities within the machine learning model conferred minimal additional discriminatory performance, supporting the inclusion of an overall comorbidity count.
T176 28335-28374 Sentence denotes Strengths and limitations of this study
T177 28375-28580 Sentence denotes The ISARIC WHO CCP-UK study represents a large prospectively collected cohort admitted to hospital with covid-19 and reflects the clinical data available in most economically developed healthcare settings.
T178 28581-28757 Sentence denotes We derived a clinically applicable prediction score with clear methods and tested it against existing risk stratification scores in a large patient cohort admitted to hospital.
T179 28758-28901 Sentence denotes The score compared favourably with other prognostic tools, with good to excellent discrimination, calibration, and performance characteristics.
T180 28902-29003 Sentence denotes The 4C Mortality Score has several methodological advantages over current covid-19 prognostic scores.
T181 29004-29435 Sentence denotes The use of penalised regression methods and an event-to-variable ratio greater than 100 reduce the risk of overfitting.4546 The use of parameters commonly available at first assessment increases its clinical applicability, avoiding the requirement for markers often only available after a patient has been admitted to a critical care facility.447 Of course a model developed in a specific dataset should describe that dataset best.
T182 29436-29602 Sentence denotes However, by including comparisons with pre-existing models, reassurance is provided that equivalent performance cannot be delivered with a simple tool already in use.
T183 29603-29720 Sentence denotes Additionally, in a pandemic, baseline infection rates and patient characteristics might change by time and geography.
T184 29721-29826 Sentence denotes This motivated the temporal and geographical validation, which is crucial to the reporting of this study.
T185 29827-29931 Sentence denotes These sensitivity analyses showed that score performance continued to be robust over time and geography.
T186 29932-29958 Sentence denotes Our study has limitations.
T187 29959-30408 Sentence denotes Firstly, we were unable to evaluate the predictive performance of several existing scores that require a large number of parameters (for example, APACHE II48), as well as several other covid-19 prognostic scores that use computed tomography findings or uncommonly measured biomarkers.5 Additionally, several potentially relevant comorbidities, such as hypertension, previous myocardial infarction, and stroke,16 were not included in data collection.
T188 30409-30546 Sentence denotes The inclusion of these comorbidities might have impacted upon or improved the performance and generalisability of the 4C Mortality Score.
T189 30547-30623 Sentence denotes Secondly, a proportion of recruited patients (3.3%) had incomplete episodes.
T190 30624-30805 Sentence denotes Selection bias is possible if patients with incomplete episodes, such as those with prolonged hospital admission, had a differential mortality risk to those with completed episodes.
T191 30806-30908 Sentence denotes Nevertheless, the size of our patient cohort compares favourably to other datasets for model creation.
T192 30909-31108 Sentence denotes The patient cohort on which the 4C Mortality Score was derived comprised patients admitted to hospital who were seriously ill (mortality rate of 32.2%) and were of advanced age (median age 73 years).
T193 31109-31220 Sentence denotes This model is not for use in the community and could perform differently in populations at lower risk of death.
T194 31221-31376 Sentence denotes Further external validation is required to determine whether the 4C Mortality Score is generalisable among younger patients and in settings outside the UK.
T195 31378-31413 Sentence denotes Conclusions and policy implications
T196 31414-31607 Sentence denotes We have derived and validated an easy-to-use eight variable score that enables accurate stratification of patients with covid-19 admitted to hospital by mortality risk at hospital presentation.
T197 31608-31731 Sentence denotes Application within the validation cohorts showed this tool could guide clinician decisions, including treatment escalation.
T198 31732-31809 Sentence denotes A key aim of risk stratification is to support clinical management decisions.
T199 31810-31914 Sentence denotes Four risk classes were identified and showed similar adverse outcome rates across the validation cohort.
T200 31915-32223 Sentence denotes Patients with a 4C Mortality Score falling within the low risk groups (mortality rate 1%) might be suitable for management in the community, while those within the intermediate risk group were at lower risk of mortality (mortality rate 10%; 22% of the cohort) and might be suitable for ward level monitoring.
T201 32224-32585 Sentence denotes Similar mortality rates have been identified as an appropriate cut-off value in pneumonia risk stratification scores (CURB-65 and PSI).2122 Meanwhile patients with a score of 9 or higher were at high risk of death (around 40%), which could prompt aggressive treatment, including the initiation of steroids49 and early escalation to critical care if appropriate.
T202 32587-32609 Sentence denotes Supplementary Material
T203 32610-32632 Sentence denotes Supplementary material
T204 32634-32754 Sentence denotes The study protocol is available at https://isaric4c.net/protocols; study registry https://www.isrctn.com/ISRCTN66726260.
T205 32755-32871 Sentence denotes This work uses data provided by patients and collected by the NHS as part of their care and support #DataSavesLives.
T206 32872-33144 Sentence denotes We are grateful to the 2648 frontline NHS clinical and research staff and volunteer medical students who collected the data in challenging circumstances; and the generosity of the participants and their families for their individual contributions in these difficult times.
T207 33145-33460 Sentence denotes We also acknowledge the support of Jeremy J Farrar, Nahoko Shindo, Devika Dixit, Nipunie Rajapakse, Lyndsey Castle, Martha Buckley, Debbie Malden, Katherine Newell, Kwame O’Neill, Emmanuelle Denis, Claire Petersen, Scott Mullaney, Sue MacFarlane, Nicole Maziere, Julien Martinez, Oslem Dincarslan, and Annette Lake.
T208 33462-33469 Sentence denotes Funding
T209 33470-34476 Sentence denotes This work is supported by grants from: the National Institute for Health Research (NIHR; award CO-CIN-01), the Medical Research Council (MRC; grant MC_PC_19059), and by the NIHR Health Protection Research Unit (HPRU) in Emerging and Zoonotic Infections at University of Liverpool in partnership with Public Health England (PHE), in collaboration with Liverpool School of Tropical Medicine and the University of Oxford (award 200907), NIHR HPRU in Respiratory Infections at Imperial College London with PHE (award 200927), Wellcome Trust and Department for International Development (DID; 215091/Z/18/Z), the Bill and Melinda Gates Foundation (OPP1209135), Liverpool Experimental Cancer Medicine Centre (grant reference C18616/A25153), NIHR Biomedical Research Centre at Imperial College London (IS-BRC-1215-20013), EU Platform for European Preparedness Against (Re-)emerging Epidemics (PREPARE; FP7 project 602525), and NIHR Clinical Research Network for providing infrastructure support for this research.
T210 34477-34540 Sentence denotes PJMO is supported by a NIHR senior investigator award (201385).
T211 34541-34601 Sentence denotes LT is supported by the Wellcome Trust (award 205228/Z/16/Z).
T212 34602-34734 Sentence denotes MN is funded by a WT investigator award (207511/Z/17/Z) and the NIHR University College London Hospitals Biomedical Research Centre.
T213 34735-34890 Sentence denotes The views expressed are those of the authors and not necessarily those of the Department of Health and Social Care, DID, NIHR, MRC, Wellcome Trust, or PHE.
T214 34891-34903 Sentence denotes Contributors
T215 34904-35127 Sentence denotes Conceptualisation: SRK, AH, RP, GC, JD, PWH, LM, JSN-V-T, CS, PJMO, JKB, MGS, ABD, EMH; data curation: RP, SH, KAH, CJ, CAG, KAM, LM, LN, CDR, CAS, MGS, ABD, EMH; formal analysis: SRK, AH, RP, TMD, CJF, KAM, MGP, LCWT, EMH.
T216 35128-35187 Sentence denotes Funding acquisition: PWH, JSN-V-T, TS, PJMO, JKB, MGS, ABD.
T217 35188-35717 Sentence denotes Investigation: SRK, AH, CDR, OVS, LCWT, PJMO, JKB, MGS, ABD, EMH; methodology: SRK, AH, RP, IB, KAM, CAS, AS, CS, ABD, EMH; project administration: SH, HEH, CJ, LM, LN, CDR, CAS, MGS; resources: CG, PWH, LM, TS, MGS, EMH; software: SRK, RP, KAM, OVS; supervision: SRK, RP, CG, HEH, PWH, PLO, PJMO, JKB, MGS, EMH; visualisation: SRK, RP, MGP, EMH; writing-original draft: SRK, EMH; writing-review and editing: SRK, RG, AH, RP, IB, GC, TMD, CAG, JD, CJF, LM, MN, JSN-V-T, PLO, MGP, CDR, AS, CS, OVS, LCWT, PJMO, JKB, MGS, ABD, EMH.
T218 35718-35753 Sentence denotes SRK and AH are joint first authors.
T219 35754-35797 Sentence denotes MGS, ABD, and EMH are joint senior authors.
T220 35798-35974 Sentence denotes MGS is guarantor and corresponding author for this work, and attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.
T221 35975-35994 Sentence denotes Competing interests
T222 35995-38765 Sentence denotes All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: support from the National Institute for Health Research (NIHR), the Medical Research Council (MRC), the NIHR Health Protection Research Unit (HPRU) in Emerging and Zoonotic Infections at University of Liverpool, Public Health England (PHE), Liverpool School of Tropical Medicine, University of Oxford, NIHR HPRU in Respiratory Infections at Imperial College London, Wellcome Trust, Department for International Development, the Bill and Melinda Gates Foundation, Liverpool Experimental Cancer Medicine Centre, NIHR Biomedical Research Centre at Imperial College London, EU Platform for European Preparedness Against (Re-)emerging Epidemics (PREPARE), and NIHR Clinical Research Network for the submitted work; ABD reports grants from Department of Health and Social Care (DHSC), during the conduct of the study, grants from Wellcome Trust, outside the submitted work; CAG reports grants from DHSC National Institute of Health Research (NIHR) UK, during the conduct of the study; PWH reports grants from Wellcome Trust, Department for International Development, Bill and Melinda Gates Foundation, NIHR, during the conduct of the study; JSN-V-T reports grants from DHSC, England, during the conduct of the study, and is seconded to DHSC, England; MN is supported by a Wellcome Trust investigator award and the NIHR University College London Hospitals Biomedical Research Centre (BRC); PJMO reports personal fees from consultancies and from European Respiratory Society, grants from MRC, MRC Global Challenge Research Fund, EU, NIHR BRC, MRC/GSK, Wellcome Trust, NIHR (Health Protection Research Unit (HPRU) in Respiratory Infection), and is NIHR senior investigator outside the submitted work; his role as President of the British Society for Immunology was unpaid but travel and accommodation at some meetings was provided by the Society; JKB reports grants from MRC UK; MGS reports grants from DHSC NIHR UK, grants from MRC UK, grants from HPRU in Emerging and Zoonotic Infections, University of Liverpool, during the conduct of the study, other from Integrum Scientific LLC, Greensboro, NC, USA, outside the submitted work; LCWT reports grants from HPRU in Emerging and Zoonotic Infections, University of Liverpool, during the conduct of the study, and grants from Wellcome Trust outside the submitted work; EMH, HEH, JD, RG, RP, LN, KAH, GC, LM, SH, CJ, and CG, all declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; and no other relationships or activities that could appear to have influenced the submitted work.
T223 38766-38782 Sentence denotes Ethical approval
T224 38783-38973 Sentence denotes Ethical approval was given by the South Central-Oxford C Research Ethics Committee in England (reference 13/ SC/0149), and by the Scotland A Research Ethics Committee (reference 20/SS/0028).
T225 38974-39040 Sentence denotes The study was registered at https://www.isrctn.com/ISRCTN66726260.
T226 39041-39053 Sentence denotes Data sharing
T227 39054-39185 Sentence denotes We welcome applications for data and material access via our Independent Data and Material Access Committee (https://isaric4c.net).
T228 39186-39498 Sentence denotes The lead author (the manuscript’s guarantor) affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.
T229 39499-39571 Sentence denotes Dissemination to participants and related patient and public communities
T230 39572-39657 Sentence denotes ISARIC-4C has a public facing website isaric4c.net and twitter account (@CCPUKstudy).
T231 39658-39763 Sentence denotes We are engaging with print and internet press, television, radio, news, and documentary programme makers.
T232 39764-39954 Sentence denotes We will explore distribution of findings with The Asthma UK and British Lung Foundation Partnership and take advice from NIHR Involve and GenerationR Alliance Young People’s Advisory Groups.
T233 39955-39991 Sentence denotes Provenance and peer review statement
T234 39992-40035 Sentence denotes Not commissioned; externally peer reviewed.
T235 40037-40072 Sentence denotes WHAT IS ALREADY KNOWN ON THIS TOPIC
T236 40073-40406 Sentence denotes Robust, validated clinical prediction tools are lacking that identify patients with coronavirus disease 2019 (covid-19) who are at the highest risk of mortality Given the uncertainty about how to stratify patients with covid-19, considerable interest exists in risk stratification scores to support frontline clinical decision making
T237 40407-40555 Sentence denotes Available risk stratification tools have a high risk of bias, small sample size resulting in uncertainty, poor reporting, and lack formal validation
T238 40557-40577 Sentence denotes WHAT THIS STUDY ADDS
T239 40578-40777 Sentence denotes Most existing covid-19 risk stratification tools performed poorly in our cohort; caution is needed when novel tools based on small patient populations are applied to cohorts in hospital with covid-19
T240 40778-41019 Sentence denotes The 4C (Coronavirus Clinical Characterisation Consortium) Mortality Score is an easy-to-use and valid prediction tool for in-hospital mortality, accurately categorising patients as being at low, intermediate, high, or very high risk of death
T241 41020-41204 Sentence denotes This pragmatic and clinically applicable score outperformed other risk stratification tools, showed clinical decision making utility, and had similar performance to more complex models
T242 41205-41253 Sentence denotes Fig 1 Model derivation and validation workflow.
T243 41254-41538 Sentence denotes AuROc=area under the receiver operating characteristic curve; covid-19=coronavirus disease 2019; ISARIC CCP-UK=lnternational severe acute respiratory and emerging Infections consortium clinical characterisation Protocol uK; NPv=negative predictive value; PPv=positive predictive value
T244 41539-41890 Sentence denotes Fig 2 Upper panel: distribution of patients across range of 4C Mortality Score in validation cohort; middle panel: observed in-hospital mortality across range of 4C Mortality Score in validation cohort; lower panel: predicted versus observed probability of in-hospital mortality (calibration; red line) for 4C Mortality Score within validation cohort
T245 41891-42152 Sentence denotes Fig 3 Receiver operator characteristic curves (upper panel) and decision curve analysis (lower panel) for most discriminating three models applicable to more than 50% of validation population compared with age alone (restricted cubic spline; imputed datasets).
T246 42153-42323 Sentence denotes In lower panel, lines are shown for standardised net benefit at different risk thresholds of treating no patients (black line) and treating all patients (red dashed line)
T247 42324-42458 Sentence denotes Table 1 Demographic and clinical characteristics for derivation and validation cohorts of patients admitted to hospital with covid-19
T248 42459-42512 Sentence denotes characteristics Derivation cohort validation cohort
T249 42513-42611 Sentence denotes No of patients (%) or median (iQR) Total No (%) No of patients (%) or median (iQR) total No (%)
T250 42612-42693 Sentence denotes Mortality in hospital 11 426 (32.2) 35 463 (100.0) 6729 (30.1) 22 361 (100.0)
T251 42694-42705 Sentence denotes Age (years)
T252 42706-42770 Sentence denotes     <50 4876 (13.8) 35 277 (99.5) 2808 (12.6) 22 361 (100.0)
T253 42771-42814 Sentence denotes     50-69 10 183 (28.9) — 5762 (25.8) —
T254 42815-42856 Sentence denotes     70-79 8017 (22.7) — 4951 (22.1) —
T255 42857-42898 Sentence denotes     ≥80 12 201 (34.6) — 8840 (39.5) —
T256 42899-42911 Sentence denotes Sex at birth
T257 42912-42982 Sentence denotes     Female 14 741 (41.7) 35 356 (99.7) 10 178 (45.6) 22 319 (99.8)
T258 42983-42992 Sentence denotes Ethnicity
T259 42993-43062 Sentence denotes     White 26 300 (82.2) 31 987 (90.2) 16 831 (84.9) 19 818 (88.6)
T260 43063-43107 Sentence denotes     South Asian 1647 (5.1) — 811 (4.1) —
T261 43108-43150 Sentence denotes     East Asian 271 (0.8) — 140 (0.7) —
T262 43151-43189 Sentence denotes     Black 1256 (3.9) — 769 (3.9) —
T263 43190-43245 Sentence denotes     Other ethnic minority 2513 (7.9) — 1267 (6.4) —
T264 43246-43327 Sentence denotes Chronic cardiac disease 10 513 (31.8) 33 090 (93.3) 7019 (34.0) 20 616 (92.2)
T265 43328-43406 Sentence denotes Chronic kidney disease 5653 (17.2) 32 834 (92.6) 3769 (18.4) 20 444 (91.4)
T266 43407-43481 Sentence denotes Malignant neoplasm 3312 (10.2) 32 556 (91.8) 2187 (10.8) 20 297 (90.8)
T267 43482-43566 Sentence denotes Moderate or severe liver disease 604 (1.9) 32 538 (91.8) 434 (2.1) 20 218 (90.4)
T268 43567-43650 Sentence denotes Obesity (clinician defined) 3414 (11.4) 29 829 (84.1) 2234 (12.2) 18 304 (81.9)
T269 43651-43745 Sentence denotes Chronic pulmonary disease (not asthma) 5830 (17.7) 32 990 (93.0) 3737 (18.2) 20 502 (91.7)
T270 43746-43825 Sentence denotes Diabetes (type 1 and 2) 8487 (26.0) 32 622 (92.0) 4275 (21.9) 19 511 (87.3)
T271 43826-43845 Sentence denotes No of comorbidities
T272 43846-43909 Sentence denotes     0 8497 (24.0) 35 463 (100.0) 5098 (22.8) 22 361 (100.0)
T273 43910-43947 Sentence denotes     1 9941 (28.0) — 6114 (27.3) —
T274 43948-43990 Sentence denotes     ≥2 17 025 (48.0) — 11 149 (49.9) —
T275 43991-44075 Sentence denotes Respiratory rate (breaths/min) 22.0 (9.0) 33 330 (94.0) 20.0 (8.0) 20 970 (93.8)
T276 44076-44151 Sentence denotes Oxygen saturation (%) 94.0 (6.0) 33 696 (95.0) 94.0 (5.0) 21 197 (94.8)
T277 44152-44241 Sentence denotes Systolic blood pressure (mm Hg) 124.0 (33.0) 33 637 (94.9) 129.0 (33.0) 21 073 (94.2)
T278 44242-44330 Sentence denotes Diastolic blood pressure (mm Hg) 70.0 (19.0) 33 568 (94.7) 73.0 (20.0) 21 026 (94.0)
T279 44331-44401 Sentence denotes Temperature (°C) 37.3 (1.5) 33 467 (94.4) 37.1 (1.5) 21 139 (94.5)
T280 44402-44474 Sentence denotes Heart rate (bpm) 90.0 (27.0) 33 405 (94.2) 90.0 (28.0) 20 991 (93.9)
T281 44475-44553 Sentence denotes Glasgow coma scale score 15.0 (0.0) 30 819 (86.9) 15.0 (0.0) 20 015 (89.5)
T282 44554-44629 Sentence denotes Haemoglobin (g/L) 129.0 (30.0) 29 924 (84.4) 127.0 (31.0) 18 480 (82.6)
T283 44630-44712 Sentence denotes White blood cell count (109/L) 7.4 (5.1) 29 740 (83.9) 7.6 (5.3) 18 362 (82.1)
T284 44713-44789 Sentence denotes Neutrophil count (109/L) 5.6 (4.6) 29 594 (83.5) 5.8 (4.9) 18 354 (82.1)
T285 44790-44866 Sentence denotes Lymphocyte count (109/L) 0.9 (0.7) 29 553 (83.3) 0.9 (0.7) 18 348 (82.1)
T286 44867-44949 Sentence denotes Platelet count (109/L) 216.0 (120.0) 29 582 (83.4) 223.0 (126.0) 18 281 (81.8)
T287 44950-45021 Sentence denotes Sodium (mmol/L) 137.0 (6.0) 29 522 (83.2) 137.0 (6.0) 18 409 (82.3)
T288 45022-45092 Sentence denotes Potassium (mmol/L) 4.1 (0.8) 27 224 (76.8) 4.1 (0.8) 16 926 (75.7)
T289 45093-45170 Sentence denotes Total bilirubin (mg/dL) 10.0 (7.0) 24 446 (68.9) 10.0 (7.0) 15 404 (68.9)
T290 45171-45236 Sentence denotes Urea (mmol/L) 7.0 (6.3) 26 122 (73.7) 7.3 (6.8) 16 863 (75.4)
T291 45237-45312 Sentence denotes Creatinine (pmol/L) 86.0 (53.0) 29 439 (83.0) 86.0 (56.0) 18 225 (81.5)
T292 45313-45396 Sentence denotes C reactive protein (mg/L) 84.9 (122.0) 27 856 (78.5) 78.0 (120.0) 17 119 (76.6)
T293 45397-45456 Sentence denotes Covid-19=coronavirus disease 2019; IQR=interquartile range.
T294 45457-45569 Sentence denotes Comorbidities were defined using the Charlson comorbidity index, with the addition of clinician defined obesity.
T295 45570-45656 Sentence denotes Table 2 Final 4C Mortality Score for in-hospital mortality in patients with covid-19.
T296 45657-45730 Sentence denotes Prognostic index derived from penalised logistic regression (LASSO) model
T297 45731-45759 Sentence denotes Variable 4C Mortality Score
T298 45760-45771 Sentence denotes Age (years)
T299 45772-45782 Sentence denotes     <50 —
T300 45783-45797 Sentence denotes     50-59 + 2
T301 45798-45811 Sentence denotes     60-69 +4
T302 45812-45825 Sentence denotes     70-79 +6
T303 45826-45837 Sentence denotes     ≥80 +7
T304 45838-45850 Sentence denotes Sex at birth
T305 45851-45864 Sentence denotes     Female —
T306 45865-45878 Sentence denotes     Male + 1
T307 45879-45899 Sentence denotes No of comorbidities*
T308 45900-45908 Sentence denotes     0 —
T309 45909-45919 Sentence denotes     1 + 1
T310 45920-45931 Sentence denotes     ≥2 + 2
T311 45932-45962 Sentence denotes Respiratory rate (breaths/min)
T312 45963-45973 Sentence denotes     <20 —
T313 45974-45988 Sentence denotes     20-29 + 1
T314 45989-46001 Sentence denotes     ≥30 + 2
T315 46002-46046 Sentence denotes Peripheral oxygen saturation on room air (%)
T316 46047-46057 Sentence denotes     ≥92 —
T317 46058-46070 Sentence denotes     <92 + 2
T318 46071-46095 Sentence denotes Glasgow coma scale score
T319 46096-46105 Sentence denotes     15 —
T320 46106-46118 Sentence denotes     <15 + 2
T321 46119-46132 Sentence denotes Urea (mmol/L)
T322 46133-46142 Sentence denotes     <7 —
T323 46143-46156 Sentence denotes     7-14 + 1
T324 46157-46169 Sentence denotes     ≥14 + 3
T325 46170-46195 Sentence denotes C reactive protein (mg/L)
T326 46196-46206 Sentence denotes     <50 —
T327 46207-46221 Sentence denotes     50-99 + 1
T328 46222-46235 Sentence denotes     ≥100 + 2
T329 46236-46270 Sentence denotes Covid-19=coronavirus disease 2019.
T330 46271-46383 Sentence denotes *Comorbidities were defined by using Charlson comorbidity index, with the addition of clinician defined obesity.
T331 46384-46450 Sentence denotes Table 3 Model discrimination in derivation and validation cohorts
T332 46451-46494 Sentence denotes Model Derivation cohort Validation cohort
T333 46495-46558 Sentence denotes Model AUROc (95% CI) Brier score AUROc (95% CI) Brier score
T334 46559-46639 Sentence denotes 4C Mortality Score 0.786 (0.781 to 0.790) 0.170 0.767 (0.760 to 0.773) 0.171
T335 46640-46730 Sentence denotes Machine learning comparison* 0.796 (0.786 to 0.807) 0.191 0.779 (0.772 to 0.785) 0.197
T336 46731-46796 Sentence denotes AUROC=area under receiver operator curve; CI=confidence interval.
T337 46797-46840 Sentence denotes *Gradient boosting decision tree (XGBoost).
T338 46841-46974 Sentence denotes Table 4 Performance metrics of 4C Mortality Score to rule out and rule in mortality at different cut-off values in validation cohort
T339 46975-47091 Sentence denotes Cut-off value No of patients (%) TP TN FP FN Sensitivity (%) Specificity (%) PPV (%) NPV (%) Mortality (%)
T340 47092-47110 Sentence denotes Rule out mortality
T341 47111-47175 Sentence denotes ≤2 1001 (4.5) 6724 996 14 636 5 99.9 6.4 31.5 99.5 0.5
T342 47176-47243 Sentence denotes ≤3 1650 (7.4) 6709 1630 14 002 20 99.7 10.4 32.4 98.8 1.2
T343 47244-47312 Sentence denotes ≤4 2420 (10.8) 6672 2363 13 269 57 99.2 15.1 33.5 97.6 2.4
T344 47313-47382 Sentence denotes ≤6 4121 (18.4) 6542 3934 11 698 187 97.2 25.2 35.9 95.5 4.5
T345 47383-47450 Sentence denotes ≤8 6539 (29.2) 6223 6033 9599 506 92.5 38.6 39.3 92.3 7.7
T346 47451-47517 Sentence denotes ≤9 8167 (36.5) 5911 7349 8283 818 87.8 47 41.6 90.0 10.0
T347 47518-47535 Sentence denotes Rule in mortality
T348 47536-47606 Sentence denotes ≥9 15 822 (70.8) 6223 6033 9599 506 92.5 38.6 39.3 92.3 39.3
T349 47607-47679 Sentence denotes ≥11 12 325 (55.1) 5483 8790 6842 1246 81.5 56.2 44.5 87.6 44.5
T350 47680-47752 Sentence denotes ≥13 8069 (36.1) 4206 11 769 3863 2523 62.5 75.3 52.1 82.3 52.1
T351 47753-47823 Sentence denotes ≥15 4158 (18.6) 2557 14 031 1601 4172 38 89.8 61.5 77.1 61.5
T352 47824-47892 Sentence denotes ≥17 1579 (7.1) 1142 15 195 437 5587 17 97.2 72.3 73.1 72.3
T353 47893-47959 Sentence denotes ≥19 381 (1.7) 305 15 556 76 6424 4.5 99.5 80.1 70.8 80.1
T354 47960-48097 Sentence denotes FN=false negative; FP=false positive; NPV=i negative predictive value; PPV=positive predictive value; TN=true negative; TP=true positive.
T355 48098-48212 Sentence denotes Table 5 Comparison of mortality rates for 4C Mortality Score risk groups across derivation and validation cohorts
T356 48213-48261 Sentence denotes Risk group Derivation cohort Validation cohort
T357 48262-48336 Sentence denotes No of patients (%) No of deaths (%) No of patients (%) No of deaths (%)
T358 48337-48390 Sentence denotes Low (0-3) 2574 (7.3) 45 (1.7) 1650 (7.4) 20 (1.2)
T359 48391-48457 Sentence denotes Intermediate (4-8) 8277 (23.3) 751 (9.1) 4889 (21.9) 486 (9.9)
T360 48458-48525 Sentence denotes High (9-14) 18 091 (51.0) 6310 (34.9) 11 664 (52.2) 3666 (31.4)
T361 48526-48593 Sentence denotes Very high (≥15) 6521 (18.4) 4320 (66.2) 4158 (18.6) 2557 (61.5)
T362 48594-48631 Sentence denotes Overall 35 463 11 426 22 361 6729
T363 48632-48797 Sentence denotes Table 6 Discriminatory performance of risk stratification scores within validation cohort (complete case) to predict in-hospital mortality in patients with covid-19
T364 48798-48823 Sentence denotes Model Validation cohort*
T365 48824-48879 Sentence denotes No of patients with required parameters AUROC (95% CI)
T366 48880-48913 Sentence denotes SOFA 197 0.614 (0.530 to 0.698)
T367 48914-48951 Sentence denotes qSOFA 19 361 0.622 (0.615 to 0.630)
T368 48952-48996 Sentence denotes Surgispheret 18 986 0.630 (0.622 to 0.639)
T369 48997-49034 Sentence denotes SMARTCOP 486 0.645 (0.593 to 0.697)
T370 49035-49071 Sentence denotes NEWS 19 074 0.654 (0.645 to 0.662)
T371 49072-49113 Sentence denotes DL scoret 16 345 0.669 (0.660 to 0.678)
T372 49114-49147 Sentence denotes SCAP 370 0.675 (0.620 to 0.729)
T373 49148-49185 Sentence denotes CRB65 19 361 0.683 (0.676 to 0.691)
T374 49186-49227 Sentence denotes COVID-GRAMt 1239 0.706 (0.675 to 0.736)
T375 49228-49268 Sentence denotes DS-CRB65 18 718 0.718 (0.710 to 0.725)
T376 49269-49307 Sentence denotes CURB65 15 560 0.720 (0.713 to 0.728)
T377 49308-49348 Sentence denotes Xie scoret 1753 0.727 (0.701 to 0.753)
T378 49349-49387 Sentence denotes A-DROP 15 572 0.736 (0.728 to 0.744)
T379 49388-49420 Sentence denotes PSI 360 0.736 (0.683 to 0.790)
T380 49421-49459 Sentence denotes E-CURB65 1553 0.764 (0.740 to 0.788)
T381 49460-49510 Sentence denotes 4C Mortality Score 14 398 0.774 (0.767 to 0.782)
T382 49511-49607 Sentence denotes AUROC=area under the receiver operating characteristic curve; covid-19=coronavirus disease 2019.
T383 49608-49642 Sentence denotes See appendix 13 for other metrics.
T384 49643-49661 Sentence denotes * Available data.
T385 49662-49704 Sentence denotes †Novel covid-19 risk stratification score.

LitCovid-PD-HP

Id Subject Object Predicate Lexical cue hp_id
T1 2942-2961 Phenotype denotes respiratory failure http://purl.obolibrary.org/obo/HP_0002878
T2 3852-3861 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T3 3886-3892 Phenotype denotes sepsis http://purl.obolibrary.org/obo/HP_0100806
T4 4003-4010 Phenotype denotes hypoxia http://purl.obolibrary.org/obo/HP_0012418
T5 6902-6908 Phenotype denotes asthma http://purl.obolibrary.org/obo/HP_0002099
T6 6992-7005 Phenotype denotes liver disease http://purl.obolibrary.org/obo/HP_0001392
T7 7007-7015 Phenotype denotes dementia http://purl.obolibrary.org/obo/HP_0000726
T8 7077-7094 Phenotype denotes diabetes mellitus http://purl.obolibrary.org/obo/HP_0000819
T9 7422-7429 Phenotype denotes obesity http://purl.obolibrary.org/obo/HP_0001513
T10 8856-8865 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T11 13774-13783 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T12 15993-16002 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T13 16006-16012 Phenotype denotes sepsis http://purl.obolibrary.org/obo/HP_0100806
T14 18678-18682 Phenotype denotes coma http://purl.obolibrary.org/obo/HP_0001259
T15 21935-21944 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T16 25742-25748 Phenotype denotes sepsis http://purl.obolibrary.org/obo/HP_0100806
T17 25772-25781 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T18 26003-26009 Phenotype denotes sepsis http://purl.obolibrary.org/obo/HP_0100806
T19 26033-26042 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T20 30311-30323 Phenotype denotes hypertension http://purl.obolibrary.org/obo/HP_0000822
T21 30334-30355 Phenotype denotes myocardial infarction http://purl.obolibrary.org/obo/HP_0001658
T22 30361-30367 Phenotype denotes stroke http://purl.obolibrary.org/obo/HP_0001297
T23 32304-32313 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T24 33917-33939 Phenotype denotes Respiratory Infections http://purl.obolibrary.org/obo/HP_0011947
T25 34149-34155 Phenotype denotes Cancer http://purl.obolibrary.org/obo/HP_0002664
T26 35097-35100 Phenotype denotes TMD http://purl.obolibrary.org/obo/HP_0005534
T27 35622-35625 Phenotype denotes TMD http://purl.obolibrary.org/obo/HP_0005534
T28 36420-36442 Phenotype denotes Respiratory Infections http://purl.obolibrary.org/obo/HP_0011947
T29 36591-36597 Phenotype denotes Cancer http://purl.obolibrary.org/obo/HP_0002664
T30 37713-37734 Phenotype denotes Respiratory Infection http://purl.obolibrary.org/obo/HP_0011947
T31 39814-39820 Phenotype denotes Asthma http://purl.obolibrary.org/obo/HP_0002099
T32 43328-43350 Phenotype denotes Chronic kidney disease http://purl.obolibrary.org/obo/HP_0012622
T33 43417-43425 Phenotype denotes neoplasm http://purl.obolibrary.org/obo/HP_0002664
T34 43501-43514 Phenotype denotes liver disease http://purl.obolibrary.org/obo/HP_0001392
T35 43567-43574 Phenotype denotes Obesity http://purl.obolibrary.org/obo/HP_0001513
T36 43682-43688 Phenotype denotes asthma http://purl.obolibrary.org/obo/HP_0002099
T37 44483-44487 Phenotype denotes coma http://purl.obolibrary.org/obo/HP_0001259
T38 45561-45568 Phenotype denotes obesity http://purl.obolibrary.org/obo/HP_0001513
T39 46079-46083 Phenotype denotes coma http://purl.obolibrary.org/obo/HP_0001259
T40 46375-46382 Phenotype denotes obesity http://purl.obolibrary.org/obo/HP_0001513