> top > docs > PMC:7782580 > spans > 1580-64464 > annotations

PMC:7782580 / 1580-64464 JSONTXT

Annnotations TAB JSON ListView MergeView

LitCovid-PubTator

Id Subject Object Predicate Lexical cue tao:has_database_id
44 232-243 Species denotes coronavirus Tax:11118
45 368-373 Species denotes human Tax:9606
46 484-490 Species denotes people Tax:9606
47 884-898 Species denotes COVID-19 virus Tax:2697049
48 2263-2270 Species denotes patient Tax:9606
49 13-37 Disease denotes Coronavirus disease 2019 MESH:C000657245
50 39-47 Disease denotes COVID-19 MESH:C000657245
51 59-77 Disease denotes infectious disease MESH:D003141
52 501-509 Disease denotes infected MESH:D007239
53 513-521 Disease denotes COVID-19 MESH:C000657245
54 549-555 Disease denotes deaths MESH:D003643
55 658-666 Disease denotes COVID-19 MESH:C000657245
56 1030-1038 Disease denotes COVID-19 MESH:C000657245
57 1098-1106 Disease denotes COVID-19 MESH:C000657245
58 1587-1595 Disease denotes COVID-19 MESH:C000657245
59 1735-1743 Disease denotes COVID-19 MESH:C000657245
60 1935-1943 Disease denotes COVID-19 MESH:C000657245
61 2046-2054 Disease denotes COVID-19 MESH:C000657245
62 2301-2309 Disease denotes COVID-19 MESH:C000657245
63 2425-2433 Disease denotes COVID-19 MESH:C000657245
64 2434-2443 Disease denotes infection MESH:D007239
65 2512-2520 Disease denotes COVID-19 MESH:C000657245
79 2582-2590 Disease denotes COVID-19 MESH:C000657245
80 2599-2604 Disease denotes fever MESH:D005334
81 2606-2615 Disease denotes dry cough MESH:D003371
82 2617-2624 Disease denotes dyspnea MESH:D004417
83 2630-2639 Disease denotes pneumonia MESH:D011014
84 2801-2822 Disease denotes respiratory infection MESH:D012141
85 2845-2853 Disease denotes COVID-19 MESH:C000657245
86 2886-2894 Disease denotes COVID-19 MESH:C000657245
87 2977-2991 Disease denotes lung infection MESH:D012141
88 2995-3003 Disease denotes COVID-19 MESH:C000657245
89 3113-3121 Disease denotes COVID-19 MESH:C000657245
90 3942-3950 Disease denotes COVID-19 MESH:C000657245
91 4661-4669 Disease denotes COVID-19 MESH:C000657245
96 5438-5449 Disease denotes lung cancer MESH:D008175
97 5468-5477 Disease denotes pneumonia MESH:D011014
98 5501-5521 Disease denotes diabetic retinopathy MESH:D003920
99 5527-5553 Disease denotes retinal fundus photographs MESH:D012173
101 6635-6643 Disease denotes COVID-19 MESH:C000657245
104 7342-7350 Disease denotes COVID-19 MESH:C000657245
105 7524-7532 Disease denotes COVID-19 MESH:C000657245
107 8102-8110 Species denotes patients Tax:9606
112 9017-9026 Disease denotes Pneumonia MESH:D011014
113 9060-9068 Disease denotes COVID-19 MESH:C000657245
114 9177-9186 Disease denotes Pneumonia MESH:D011014
115 9222-9230 Disease denotes COVID-19 MESH:C000657245
119 9386-9394 Disease denotes COVID-19 MESH:C000657245
120 9410-9419 Disease denotes pneumonia MESH:D011014
121 9461-9469 Disease denotes COVID-19 MESH:C000657245
123 9540-9548 Disease denotes COVID-19 MESH:C000657245
127 9640-9645 Species denotes human Tax:9606
128 9668-9673 Species denotes human Tax:9606
129 10961-10969 Disease denotes COVID-19 MESH:C000657245
131 12182-12190 Disease denotes COVID-19 MESH:C000657245
135 13047-13055 Disease denotes COVID-19 MESH:C000657245
136 13140-13148 Disease denotes COVID-19 MESH:C000657245
137 13425-13435 Disease denotes infections MESH:D007239
141 12321-12329 Disease denotes COVID-19 MESH:C000657245
142 12588-12596 Disease denotes COVID-19 MESH:C000657245
143 12683-12691 Disease denotes COVID-19 MESH:C000657245
146 15452-15460 Disease denotes COVID-19 MESH:C000657245
147 15648-15656 Disease denotes COVID-19 MESH:C000657245
150 15213-15217 Species denotes CATS Tax:9685
151 15129-15133 Disease denotes XPDS OMIM:300911
154 18041-18044 Gene denotes Rad Gene:6236
155 17913-17916 Gene denotes Rad Gene:6236
158 18274-18282 Disease denotes COVID-19 MESH:C000657245
159 18362-18370 Disease denotes COVID-19 MESH:C000657245
163 15782-15790 Disease denotes COVID-19 MESH:C000657245
164 15811-15816 Disease denotes COVID MESH:C000657245
165 15918-15926 Disease denotes COVID-19 MESH:C000657245
167 19907-19915 Disease denotes COVID-19 MESH:C000657245
169 20795-20803 Disease denotes COVID-19 MESH:C000657245
174 19398-19401 Gene denotes Rad Gene:6236
175 18941-18949 Disease denotes COVID-19 MESH:C000657245
176 19058-19066 Disease denotes COVID-19 MESH:C000657245
177 19185-19193 Disease denotes COVID-19 MESH:C000657245
181 21759-21767 Disease denotes COVID-19 MESH:C000657245
182 21880-21888 Disease denotes COVID-19 MESH:C000657245
183 21983-21991 Disease denotes COVID-19 MESH:C000657245
185 23030-23038 Disease denotes COVID-19 MESH:C000657245
188 23096-23104 Disease denotes COVID-19 MESH:C000657245
189 23184-23192 Disease denotes COVID-19 MESH:C000657245
195 24324-24333 Disease denotes pneumonia MESH:D011014
196 24536-24545 Disease denotes pneumonia MESH:D011014
197 24556-24564 Disease denotes COVID-19 MESH:C000657245
198 24940-24948 Disease denotes COVID-19 MESH:C000657245
199 24950-24959 Disease denotes pneumonia MESH:D011014
201 25598-25606 Disease denotes COVID-19 MESH:C000657245
205 25836-25844 Disease denotes COVID-19 MESH:C000657245
206 26194-26202 Disease denotes COVID-19 MESH:C000657245
207 26422-26430 Disease denotes COVID-19 MESH:C000657245
210 27696-27714 Gene denotes C-reactive protein Gene:1401
211 27719-27727 Disease denotes COVID-19 MESH:C000657245
213 28569-28573 Species denotes CATS Tax:9685
215 28894-28902 Disease denotes COVID-19 MESH:C000657245
217 29359-29362 Gene denotes Tag Gene:404663
219 31389-31397 Disease denotes COVID-19 MESH:C000657245
222 33753-33761 Species denotes patients Tax:9606
223 33670-33678 Disease denotes COVID-19 MESH:C000657245
227 35710-35716 Species denotes humans Tax:9606
228 36198-36217 Disease denotes infectious diseases MESH:D003141
229 36258-36266 Disease denotes COVID-19 MESH:C000657245
249 38447-38455 Species denotes patients Tax:9606
250 39126-39131 Species denotes human Tax:9606
251 39779-39787 Species denotes patients Tax:9606
252 39990-39996 Species denotes people Tax:9606
253 40476-40482 Species denotes people Tax:9606
254 37384-37392 Disease denotes COVID-19 MESH:C000657245
255 38461-38469 Disease denotes COVID-19 MESH:C000657245
256 38989-38997 Disease denotes COVID-19 MESH:C000657245
257 39054-39062 Disease denotes COVID-19 MESH:C000657245
258 39364-39378 Disease denotes high infection MESH:D007239
259 39438-39446 Disease denotes COVID-19 MESH:C000657245
260 39447-39456 Disease denotes infection MESH:D007239
261 39551-39556 Disease denotes fever MESH:D005334
262 39656-39664 Disease denotes COVID-19 MESH:C000657245
263 39793-39798 Disease denotes fever MESH:D005334
264 39823-39837 Disease denotes high infection MESH:D007239
265 40002-40016 Disease denotes high infection MESH:D007239
266 40223-40234 Disease denotes lung cancer MESH:D008175
267 40236-40248 Disease denotes liver cancer MESH:D006528
282 40812-40820 Species denotes patients Tax:9606
283 41223-41231 Species denotes patients Tax:9606
284 40751-40759 Disease denotes COVID-19 MESH:C000657245
285 40836-40844 Disease denotes COVID-19 MESH:C000657245
286 40995-41004 Disease denotes pneumonia MESH:D011014
287 41146-41154 Disease denotes COVID-19 MESH:C000657245
288 41199-41208 Disease denotes pneumonia MESH:D011014
289 41214-41222 Disease denotes COVID-19 MESH:C000657245
290 41401-41409 Disease denotes COVID-19 MESH:C000657245
291 41549-41558 Disease denotes pneumonia MESH:D011014
292 41590-41598 Disease denotes COVID-19 MESH:C000657245
293 41670-41674 Disease denotes XPDS OMIM:300911
294 41729-41738 Disease denotes pneumonia MESH:D011014
295 41770-41778 Disease denotes COVID-19 MESH:C000657245
311 42647-42655 Species denotes patients Tax:9606
312 42725-42733 Species denotes patients Tax:9606
313 42762-42770 Species denotes patients Tax:9606
314 42804-42812 Species denotes patients Tax:9606
315 42273-42277 Disease denotes iCAD
316 42458-42466 Disease denotes COVID-19 MESH:C000657245
317 42468-42477 Disease denotes pneumonia MESH:D011014
318 42556-42565 Disease denotes pneumonia MESH:D011014
319 42606-42614 Disease denotes COVID-19 MESH:C000657245
320 42637-42646 Disease denotes pneumonia MESH:D011014
321 42749-42757 Disease denotes COVID-19 MESH:C000657245
322 42828-42837 Disease denotes pneumonia MESH:D011014
323 43206-43221 Disease denotes keystroke-entry MESH:C557826
324 43370-43379 Disease denotes pneumonia MESH:D011014
325 43484-43492 Disease denotes COVID-19 MESH:C000657245
330 43850-43868 Gene denotes C-reactive protein Gene:1401
331 43876-43884 Disease denotes COVID-19 MESH:C000657245
332 44003-44011 Disease denotes COVID-19 MESH:C000657245
333 44180-44188 Disease denotes COVID-19 MESH:C000657245
338 44758-44762 Species denotes CATS Tax:9685
339 44604-44609 Chemical denotes CTHTS
340 44344-44348 Disease denotes XPDS OMIM:300911
341 44685-44689 Disease denotes XPDS OMIM:300911
344 47848-47856 Disease denotes COVID-19 MESH:C000657245
345 48171-48179 Disease denotes COVID-19 MESH:C000657245
350 52835-52838 Chemical denotes ths MESH:D013910
351 52916-52919 Chemical denotes ths MESH:D013910
352 53414-53417 Chemical denotes ths MESH:D013910
353 53482-53485 Chemical denotes ths MESH:D013910
355 53952-53954 Chemical denotes BN
358 48363-48371 Disease denotes COVID-19 MESH:C000657245
359 48526-48534 Disease denotes COVID-19 MESH:C000657245
361 55054-55081 Disease denotes cross-entropy loss function MESH:C537866
363 57403-57406 Gene denotes CGB Gene:93659

LitCovid-PD-HP

Id Subject Object Predicate Lexical cue hp_id
T2 2599-2604 Phenotype denotes fever http://purl.obolibrary.org/obo/HP_0001945
T3 2606-2615 Phenotype denotes dry cough http://purl.obolibrary.org/obo/HP_0031246
T4 2617-2624 Phenotype denotes dyspnea http://purl.obolibrary.org/obo/HP_0002094
T5 2630-2639 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T6 2801-2822 Phenotype denotes respiratory infection http://purl.obolibrary.org/obo/HP_0011947
T7 5438-5449 Phenotype denotes lung cancer http://purl.obolibrary.org/obo/HP_0100526
T8 5468-5477 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T9 5510-5521 Phenotype denotes retinopathy http://purl.obolibrary.org/obo/HP_0000488
T10 9017-9026 Phenotype denotes Pneumonia http://purl.obolibrary.org/obo/HP_0002090
T11 9177-9186 Phenotype denotes Pneumonia http://purl.obolibrary.org/obo/HP_0002090
T12 9410-9419 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T13 24324-24333 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T14 24536-24545 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T15 24950-24959 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T16 29630-29646 Phenotype denotes highly sensitive http://purl.obolibrary.org/obo/HP_0041092
T17 39551-39556 Phenotype denotes fever http://purl.obolibrary.org/obo/HP_0001945
T18 39793-39798 Phenotype denotes fever http://purl.obolibrary.org/obo/HP_0001945
T19 40223-40234 Phenotype denotes lung cancer http://purl.obolibrary.org/obo/HP_0100526
T20 40236-40248 Phenotype denotes liver cancer http://purl.obolibrary.org/obo/HP_0002896
T21 40995-41004 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T22 41199-41208 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T23 41549-41558 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T24 41729-41738 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T25 42468-42477 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T26 42556-42565 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T27 42637-42646 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T28 42828-42837 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090
T29 43370-43379 Phenotype denotes pneumonia http://purl.obolibrary.org/obo/HP_0002090

LitCovid-sentences

Id Subject Object Predicate Lexical cue
T12 0-12 Sentence denotes Introduction
T13 13-338 Sentence denotes Coronavirus disease 2019 (COVID-19), a highly infectious disease with the basic reproductive number (R0) of 5.7 (reported by the US Centers for Disease Control and Prevention), is caused by the most recently discovered coronavirus1 and was declared a global pandemic by the World Health Organization (WHO) on March 11, 20202.
T14 339-448 Sentence denotes It poses a serious threat to human health worldwide, as well as substantial economic losses to all countries.
T15 449-611 Sentence denotes As of 7 September 2020, 27,032,617 people have been infected by COVID-19 after testing, and 881,464 deaths have occurred, according to the statistics of the WHO3.
T16 612-755 Sentence denotes The Wall Street banks have estimated that the COVID-19 pandemic may cause losses of $5.5 trillion to the global economy over the next 2 years4.
T17 756-982 Sentence denotes The WHO recommends using real-time reverse transcriptase-polymerase chain reaction (rRT-PCR) for laboratory confirmation of the COVID-19 virus in respiratory specimens obtained by the preferred method of nasopharyngeal swabs5.
T18 983-1108 Sentence denotes Laboratories performing diagnostic testing for COVID-19 should strictly comply with the WHO biosafety guidance for COVID-196.
T19 1109-1397 Sentence denotes It is also necessary to follow the standard operating procedures (SOPs) for specimen collection, storage, packaging, and transport because the specimens should be regarded as potentially infectious, and the testing process can only be performed in a Biosafety Level 3 (BSL-3) laboratory7.
T20 1398-1495 Sentence denotes Not all cities worldwide have adequate medical facilities to follow the WHO biosafety guidelines.
T21 1496-1755 Sentence denotes According to an early report (Feb 17, 2020), the sensitivity of tests for the detection of COVID-19 using rRT-PCR analysis of nasopharyngeal swab specimens is around 30–60% due to irregularities during the collection and transportation of COVID-19 specimens8.
T22 1756-1857 Sentence denotes Recent studies reported a higher sensitivity range from 71% (Feb 19, 2020) to 91% (Mar 27, 2020)9,10.
T23 1858-2093 Sentence denotes A recent systematic review reported that the sensitivity of the PCR test for COVID-19 might be in the range of 71–98% (Apr 21, 2020), whereas the specificity of tests for the detection of COVID-19 using rRT-PCR analysis is about 95%11.
T24 2094-2325 Sentence denotes Yang et al.8 discovered that although no viral ribonucleic acid (RNA) was detected by rRT-PCR in the first three or all nasopharyngeal swab specimens in mild cases, the patient was eventually diagnosed with COVID-19 (Feb 17, 2020).
T25 2326-2446 Sentence denotes Therefore, the WHO has stated that one or more negative results do not rule out the possibility of COVID-19 infection12.
T26 2447-2543 Sentence denotes Additional auxiliary tests with relatively higher sensitivity to COVID-19 are urgently required.
T27 2544-2693 Sentence denotes The clinical symptoms associated with COVID-19 include fever, dry cough, dyspnea, and pneumonia, as described in the guideline released by the WHO13.
T28 2694-2856 Sentence denotes It has been recommended to use the WHO’s case definition for influenza-like illness (ILI) and severe acute respiratory infection (SARI) for monitoring COVID-1913.
T29 2857-3012 Sentence denotes As reported by the CHINA-WHO COVID-19 joint investigation group (February 28, 2020)14, autopsies showed the presence of lung infection in COVID-19 victims.
T30 3013-3191 Sentence denotes Therefore, medical imaging of the lungs might be a suitable auxiliary diagnostic testing method for COVID-19 since it uses available medical technology and clinical examinations.
T31 3192-3362 Sentence denotes Chest radiography (CXR) and chest computed tomography (CT) are the most common medical imaging examinations for the lungs and are available in most hospitals worldwide15.
T32 3363-3539 Sentence denotes Different tissues of the body absorb X-rays to different degrees16, resulting in grayscale images that allow for the detection of anomalies based on the contrast in the images.
T33 3540-3660 Sentence denotes CT differs from normal CXR in that it has superior tissue contrast with different shades of gray (about 32–64 levels)17.
T34 3661-3749 Sentence denotes The CT images are digitally processed18 to create a three-dimensional image of the body.
T35 3750-3818 Sentence denotes However, CT examinations are more expensive than CXR examinations19.
T36 3819-3956 Sentence denotes Recent studies reported that the use of CXR and CT images resulted in improved diagnostic sensitivity for the detection of COVID-1920,21.
T37 3957-4051 Sentence denotes The interpretation of medical images is time-consuming, labor-intensive, and often subjective.
T38 4052-4151 Sentence denotes The medical images are first annotated by experts to generate a report of the radiography findings.
T39 4152-4265 Sentence denotes Subsequently, the radiography findings are analyzed, and clinical factors are considered to obtain a diagnosis15.
T40 4266-4504 Sentence denotes However, during the current pandemic, the frontline expert physicians are faced with a massive workload and lack of time, which increases the physical and psychological burden on staff and might adversely affect the diagnostic efficiency.
T41 4505-4706 Sentence denotes Since modern hospitals have advanced digital imaging technology, medical image processing methods may have the potential for fast and accurate diagnosis of COVID-19 to reduce the burden on the experts.
T42 4707-5046 Sentence denotes Deep learning (DL) methods, especially convolutional neural networks (CNNs), are effective approaches for representation learning using multilayer neural networks22 and have provided excellent performance solutions to many problems in image classification23,24, object detection25, games and decisions26, and natural language processing27.
T43 5047-5177 Sentence denotes A deep residual network28 is a type of CNN architecture that uses the strategy of skip connections to avoid degradation of models.
T44 5178-5349 Sentence denotes However, the applications of DL for clinical diagnoses remains limited due to the lack of interpretability of the DL model and the multi-modal properties of clinical data.
T45 5350-5556 Sentence denotes Some studies have demonstrated excellent performance of DL methods for the detection of lung cancer with CT images29, pneumonia with CXR images30, and diabetic retinopathy with retinal fundus photographs31.
T46 5557-5714 Sentence denotes To the best of our knowledge, the DL method has been validated only on single modal data, and no correlation analysis with clinical indicators was performed.
T47 5715-5863 Sentence denotes Traditional machine learning methods are more constrained and better suited than DL methods to specific, practical computing tasks using features32.
T48 5864-6098 Sentence denotes As demonstrated by Jin et al., the traditional machine learning algorithm using the scale-invariant feature transform (SIFT)33 and random sample consensus (RANSAC)34 may outperform the state-of-the-art DL methods for image matching35.
T49 6099-6283 Sentence denotes We designed a general end-to-end DL framework for information extraction from CXR images (X-data) and CT images (CT-data) that can be considered a cross-domain transfer learning model.
T50 6284-6531 Sentence denotes In this study, we developed a custom platform for rapid expert annotation and proposed the modular CNN-based multi-stage framework (classification framework and regression framework) consisting of basic component units and special component units.
T51 6532-6644 Sentence denotes The framework represents an auxiliary examination method for high precision and automated detection of COVID-19.
T52 6645-6690 Sentence denotes This study makes the following contributions:
T53 6691-6914 Sentence denotes First, a multi-stage CNN-based classification framework consisting of two basic units (ResBlock-A and ResBlock-B) and a special unit (control gate block) was established for use with multi-modal images (X-data and CT-data).
T54 6915-7020 Sentence denotes The classification results were compared with evaluations by experts with different levels of experience.
T55 7021-7197 Sentence denotes Different optimization goals were established for the different stages in the framework to obtain good performances, which were evaluated using multiple statistical indicators.
T56 7198-7367 Sentence denotes Second, principal component analysis (PCA) was used to determine the characteristics of the X-data and CT-data of different categories (normal, COVID-19, and influenza).
T57 7368-7533 Sentence denotes Gradient-weighted class activation mapping (Grad-CAM) was used to visualize the salient features in the images and extract the lesion areas associated with COVID-19.
T58 7534-7776 Sentence denotes Third, data preprocessing methods, including pseudo-coloring and dimension normalization, were developed to facilitate the interpretability of the medical images and adapt the proposed framework to the multi-modal images (X-data and CT-data).
T59 7777-7955 Sentence denotes Fourth, A knowledge distillation method was adopted as a training strategy to obtain high performance with low computational requirements and improve the usability of the method.
T60 7956-8111 Sentence denotes Last, The CNN-based regression framework was used to describe the relationships between the radiography findings and the clinical symptoms of the patients.
T61 8112-8241 Sentence denotes Multiple evaluation indicators were used to assess the correlations between the radiography findings and the clinical indicators.
T62 8243-8250 Sentence denotes Results
T63 8252-8271 Sentence denotes Data set properties
T64 8272-8335 Sentence denotes Multi-modal data from multiple sources were used in this study.
T65 8336-8483 Sentence denotes X-data, CT-data, and clinical data used in our research were collected from four public data sets and one frontline hospital data (Youan hospital).
T66 8484-8650 Sentence denotes Each data set was divided into two parts: train-val part and test part using a train-test-split function (TTSF) of the scikit-learn library which is shown in Table 1.
T67 8651-8768 Sentence denotes The details of the multi-modal data types are described in the “Methods” section (see “Data sets splitting” section).
T68 8769-8886 Sentence denotes Table 1 Number of cases from four public data sets and the Youan hospital (X-data, CT-data, clinical indicator data).
T69 8887-8921 Sentence denotes Study X-data CT-data Clinical data
T70 8922-8972 Sentence denotes Train + Val Test Train + Val Test Train + Val Test
T71 8973-9016 Sentence denotes *Normal (RSNA + LUNA16) 5000 100 100 20 – –
T72 9017-9059 Sentence denotes Pneumonia (RSNA + ICNP) 3000 100 83 20 – –
T73 9060-9089 Sentence denotes COVID-19 (CCD) 150 62 – – – –
T74 9090-9133 Sentence denotes Influenza (Youan Hospital) 100 45 35 15 – –
T75 9134-9176 Sentence denotes *Normal (Youan Hospital) 478 25 139 20 – –
T76 9177-9221 Sentence denotes Pneumonia (Youan Hospital) 380 55 180 35 – –
T77 9222-9265 Sentence denotes COVID-19 (Youan Hospital) 35 10 75 20 75 20
T78 9266-9294 Sentence denotes Total 9143 397 612 130 75 20
T79 9295-9482 Sentence denotes The term *Normal in this work means the cases where the lungs are not manifest evidence of COVID-19, influenza, or pneumonia on imaging and the RT-PCR testing of the COVID-19 is negative.
T80 9484-9584 Sentence denotes A platform was developed for annotating lesion areas of COVID-19 in medical images (X-data, CT-data)
T81 9585-9748 Sentence denotes Medical imaging uses images of internal tissues of the human body or a part of the human body in a non-invasive manner for clinical diagnoses or treatment plans36.
T82 9749-9935 Sentence denotes Medical images (e.g., X-data and CT-data) are usually acquired using computed radiography and are typically stored in the Digital Imaging and Communications in Medicine (DICOM) format37.
T83 9936-10115 Sentence denotes X-data are two-dimensional grayscale images, and CT-data are three-dimensional data, consisting of slices of the data in the z axis direction of a two-dimensional grayscale image.
T84 10116-10231 Sentence denotes Machine learning methods are playing increasingly important roles in medical image analysis, especially DL methods.
T85 10232-10352 Sentence denotes DL uses multiple non-linear transformations to create a mapping relationship between the input data and output labels38.
T86 10353-10447 Sentence denotes The objective of this study was to annotate lesion areas in medical images with high accuracy.
T87 10448-10636 Sentence denotes Therefore, we developed a pseudo-coloring method, which is a technique that helps enhance medical images for physicians to isolate relevant tissues and groups different tissues together39.
T88 10637-10797 Sentence denotes We converted the original grayscale images to color images using the open-source image processing tools Open Source Computer Vision Library (OpenCV) and Pillow.
T89 10798-10855 Sentence denotes Examples of the pseudo-color images are shown in Fig. 1a.
T90 10856-10995 Sentence denotes We developed a platform that uses a client-server architecture to annotate the potential lesion areas of COVID-19 on the CXR and CT images.
T91 10996-11075 Sentence denotes The platform can be deployed on a private cloud for security and local sharing.
T92 11076-11234 Sentence denotes All the images were annotated by two experienced radiologists (one was a 5th-year radiologist and the other was a 3rd-year radiologist) in the Youan Hospital.
T93 11235-11397 Sentence denotes If there was disagreement about a result, a senior radiologist and a respiratory doctor made the final decision to ensure the precision of the annotation process.
T94 11398-11471 Sentence denotes The details of the annotation pipeline are shown in Supplementary Fig. 1.
T95 11472-11578 Sentence denotes Fig. 1 Demonstrations of data preprocessing methods including pseudo-coloring and dimension normalization.
T96 11579-11644 Sentence denotes a Pseudo-coloring for abnormal examples in the CXR and CT images.
T97 11645-11777 Sentence denotes The original grayscale images were transformed into color images using the pseudo-coloring method and were annotated by the experts.
T98 11778-11921 Sentence denotes The scale bar on the right is the range of pixel values of the image data. b Dimension normalization to reduce the dimensions in the CT images.
T99 11922-12025 Sentence denotes The number of CT images were first resampled to a multiple of three and then divided into three groups.
T100 12026-12104 Sentence denotes Followed by the 1 × 1 convolution layers to reduce the dimensions of the data.
T101 12106-12219 Sentence denotes PCA was used to determine the characteristics of the medical images for the COVID-19, influenza, and normal cases
T102 12220-12381 Sentence denotes PCA was used to visually compare the characteristics of the medical images (X-data, CT-data) for the COVID-19 cases with those of the normal and influenza cases.
T103 12382-12538 Sentence denotes Figure 2a shows the mean image of each category and the five eigenvectors that represent the principal components of PCA in the corresponding feature space.
T104 12539-12730 Sentence denotes Significant differences are observed between the COVID-19, influenza, and normal cases, indicating the possibility of being able to distinguish COVID-19 cases from normal and influenza cases.
T105 12731-12805 Sentence denotes Fig. 2 PCA visualizations and example heatmaps of both X-data and CT-data.
T106 12806-12868 Sentence denotes a Mean image and eigenvectors of five different sub-data sets.
T107 12869-12951 Sentence denotes The first column shows the mean image and the other columns show the eigenvectors.
T108 12952-13159 Sentence denotes The first row shows the mean image and five eigenvectors of the normal CXR images; second row: COVID-19 CXR images, third row: normal CT images, fourth row: influenza CT images, last row: COVID-19 CT images.
T109 13160-13346 Sentence denotes The scale bar on the right is the range of pixel values of the image data. b Heatmaps of both X-data and CT-data were demonstrated for better interpretability of the proposed frameworks.
T110 13347-13436 Sentence denotes The scale bar on the right is the probability of the areas being suspected as infections.
T111 13438-13607 Sentence denotes The CNN-based classification framework exhibited excellent performance based on the validation by experts using multi-modal data from public data sets and Youan hospital
T112 13608-13826 Sentence denotes The structure of the proposed framework, consisting of the stage I sub-framework and the stage II sub-framework is shown in Fig. 3a, where Q, L, M, and N are the hyper-parameters of the framework for general use cases.
T113 13827-13956 Sentence denotes The values of Q, L, M, and N were 1, 1, 2, and 2, respectively, in this study; this framework referred to as the CNNCF framework.
T114 13957-14115 Sentence denotes The stage I and stage II sub-frameworks were designed to extract features corresponding to different optimization goals in the analysis of the medical images.
T115 14116-14361 Sentence denotes The performance of the CNNCF was evaluated using multi-modal data sets (X-data and CT-data) to ensure the generalization and transferability of the model, and five evaluation indicators were used (sensitivity, precision, specificity, F1, kappa).
T116 14362-14486 Sentence denotes The salient features of the images extracted by the CNNCF were visualized in a heatmap (four examples are shown in Fig. 2b).
T117 14487-14735 Sentence denotes In this study, multiple experiments were conducted (including experiments that included data from the same source and from different sources) to validate the generalization ability of the framework while avoiding the possible sample selection bias.
T118 14736-14972 Sentence denotes Five experts evaluated the images, i.e., a 7th-year respiratory resident (Respira.), a 3rd-year emergency resident (Emerg.), a 1st-year respiratory intern (Intern), a 5th-year radiologist (Rad-5th), and a 3rd-year radiologist (Rad-3rd).
T119 14973-15045 Sentence denotes The definition of the expert group can be found in Supplementary Note 1.
T120 15046-15328 Sentence denotes The abbreviations of all the data sets used in the following experiments including XPDS, XPTS, XPVS, XHDS, XHTS, XHVS, CTPDS, CTPTS, CTPVS, CTHDS, CTHTS, CTHVS, CADS, CATS, CAVS, XMTS, XMVS, CTMTS, and CTMVS were defined in the “Methods” section (see “Data sets splitting” section).
T121 15329-15365 Sentence denotes The following results were obtained.
T122 15366-15394 Sentence denotes Fig. 3 CNN-based frameworks.
T123 15395-15657 Sentence denotes a The classification framework for the identification of COVID-19. b The regression framework for the correlation analysis between the lesion areas and the clinical indicators. c is the workflow of the classification framework for the identification of COVID-19.
T124 15659-15671 Sentence denotes Experiment-A
T125 15672-15845 Sentence denotes In this experiment, we used the X-data of the XPVS where the normal cases were from the RSNA data set and the COVID-19 cases were from the COVID CXR data set (CCD) data set.
T126 15846-15983 Sentence denotes The results of the five evaluation indicators for the comparison of the COVID-19 cases and normal cases of the XPVS are shown in Table 2.
T127 15984-16094 Sentence denotes An excellent performance was obtained, with the best score of specificity of 99.33% and a precision of 98.33%.
T128 16095-16284 Sentence denotes The F1 score was 96.72%, which was higher than that of the Respire. (96.12%), the Emerg. (93.94%), the Intern (84.67%), and the Rad-3rd (85.93%) and lower than that of the Rad-5th (98.41%).
T129 16285-16478 Sentence denotes The kappa index was 95.40%, which was higher than that of the Respire. (94.43%), the Emerg. (91.21%), the Intern (77.45%), and the Rad-3rd (79.42%), and lower than that of the Rad-5th (97.74%).
T130 16479-16669 Sentence denotes The sensitivity index was 95.16%, which was higher than that of the Intern (93.55%) and the Rad-3rd (93.55%) and lower than that of the Respire. (100%), the Emerg. (100%) and Rad-5th (100%).
T131 16670-16835 Sentence denotes The receiver operating characteristic (ROC) scores for the CNNCF and the experts are plotted in Fig. 4a; the area under the ROC curve (AUROC) of the CNNCF is 0.9961.
T132 16836-16991 Sentence denotes The precision-recall scores for the CNNCF and the experts are plotted in Fig. 4d; the area under the precision-recall curve (AUPRC) of the CNNCF is 0.9910.
T133 16992-17312 Sentence denotes Table 2 Performance indices of the classification framework (CNNCF) of experiment A and the average performance of the 7th-year respiratory resident (Respira.), the 3rd-year emergency resident (Emerg.), the 1st-year respiratory intern (Intern), the 5th-year radiologist (Rad-5th), and the 3rd-year radiologist (Rad-3rd).
T134 17313-17400 Sentence denotes F1 (95% CI) Kappa (95% CI) Specificity (95% CI) Sensitivity (95% CI) Precision (95% CI)
T135 17401-17409 Sentence denotes CNNCF 0.
T136 17410-17506 Sentence denotes 9672 (0.9307, 0.9890) 0.9540 (0.9030, 0.9924) 0.9933 (0.9792, 1.0000) 0.9516 (0.8889, 1.0000) 0.
T137 17507-17528 Sentence denotes 9833 (0.9444, 1.0000)
T138 17529-17537 Sentence denotes Respire.
T139 17538-17657 Sentence denotes 0.9612 (0.9231, 0.9920) 0.9443 (0.8912, 0.9887) 0.9667 (0.9363, 0.9933) 1.0000 (1.0000, 1.0000) 0.9254 (0.8095, 0.9571)
T140 17658-17664 Sentence denotes Emerg.
T141 17665-17667 Sentence denotes 0.
T142 17668-17785 Sentence denotes 9394 (0.8947, 0.9781) 0.9121 (0.8492, 0.9677) 0.9467 (0.9091, 0.9797) 1.0000 (1.0000, 1.0000) 0.8857 (0.8095, 0.9571)
T143 17786-17793 Sentence denotes Intern.
T144 17794-17912 Sentence denotes 0.8467 (0.7692, 0.9041) 0.7745 (0.6730, 0.8592) 0.8867 (0.8333, 0.9343) 0.9355 (0.8596, 0.984) 0.7733 (0.6708, 0.8649)
T145 17913-18040 Sentence denotes Rad-5th 0.9841 (0.9593, 1.0000) 0.9774 (0.9433, 1.0000) 0.9867 (0.9662, 1.0000) 1.0000 (1.0000, 1.0000) 0.9688 (0.9219, 1.0000)
T146 18041-18168 Sentence denotes Rad-3rd 0.8593 (0.7931, 0.9180) 0.7942 (0.7062, 0.8779) 0.9000 (0.8541, 0.9481) 0.9355 (0.8666, 0.9841) 0.7945 (0.6974, 0.8873)
T147 18169-18232 Sentence denotes Fig. 4 ROC and PRC curves for the CNNCF of the experiments A-C.
T148 18233-18322 Sentence denotes NC indicates that the positive case is a COVID-19 case, and the negative case is *Normal.
T149 18323-18407 Sentence denotes CI indicates that the positive case is COVID-19, and the negative case is influenza.
T150 18408-18494 Sentence denotes The points are the results of experts, corresponding to the results in Tables 2 and 3.
T151 18495-18804 Sentence denotes The background gray dashed curves in the PRC curve correspond to the iso-F1 curves. a ROC curve for the NC using X-data. b ROC curve for the NC using CT-data. c ROC curve for the CI using CT-data. d PRC curve for the NC using X-data. e PRC curve for the NC using CT-data. f PRC curve for the CI using CT-data.
T152 18806-18818 Sentence denotes Experiment-B
T153 18819-18985 Sentence denotes In this experiment, we used the CT-data of the CTPVS and CTHVS where the normal cases were from the LUNA data set and the COVID-19 cases were from the Youan hospital.
T154 18986-19219 Sentence denotes The results of the five evaluation indicators for the comparison of the COVID-19 cases and normal cases of the CTHVS and the CTPVS are shown in Table 3, where the normal cases are from CTPVS and the COVID-19 cases are from the CTHVS.
T155 19220-19406 Sentence denotes The CNNCF exhibits good performance for the five evaluation indices, which are similar to that of the Respire. and the Rad-5th and higher than that of the Intern, the Emerg. and Rad-3rd.
T156 19407-19476 Sentence denotes The ROC scores are plotted in Fig. 4b; the AUROC of the CNNCF is 1.0.
T157 19477-19557 Sentence denotes The precision-recall scores are shown in Fig. 4e; the AUPRC of the CNNCF is 1.0.
T158 19558-19890 Sentence denotes Table 3 Performance indices of the classification framework (CNNCF) of the experiments B and C, and the average performance of the 7th-year respiratory resident (Respira.), the 3rd-year emergency resident (Emerg.), the 1st-year respiratory intern (Intern), the 5th-year radiologist (Rad-5th), and the 3rd-year radiologist (Rad-3rd).
T159 19891-19922 Sentence denotes CT (*Normal and COVID-19 cases)
T160 19923-19937 Sentence denotes CNNCF Respire.
T161 19938-19944 Sentence denotes Emerg.
T162 19945-19952 Sentence denotes Intern.
T163 19953-19968 Sentence denotes Rad-5th Rad-3rd
T164 19969-20124 Sentence denotes F1 (95% CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8571, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8667, 1.0000)
T165 20125-20283 Sentence denotes Kappa (95% CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.7422, 1.0000) 1.0000 (1.0000, 1.0000) 0.9000 (0.7487, 1.0000)
T166 20284-20448 Sentence denotes Specificity (95% CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8333, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8333, 1.0000)
T167 20449-20613 Sentence denotes Sensitivity (95% CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8333, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8421, 1.0000)
T168 20614-20776 Sentence denotes Precision (95% CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8235, 1.0000) 1.0000 (1.0000, 1.0000) 0.9500 (0.8333, 1.0000)
T169 20777-20810 Sentence denotes CT (Influenza and COVID-19 cases)
T170 20811-20825 Sentence denotes CNNCF Respire.
T171 20826-20832 Sentence denotes Emerg.
T172 20833-20840 Sentence denotes Intern.
T173 20841-20856 Sentence denotes Rad-5th Rad-3rd
T174 20857-21011 Sentence denotes F1 (95%CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.8966 (0.7332, 1.0000) 0.8000 (0.6207, 0.9412) 0.9677 (0.8889, 1.0000) 0.8667 (0.7199, 0.9744)
T175 21012-21100 Sentence denotes Kappa (95%CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.8236 (0.5817, 1.0000) 0.
T176 21101-21170 Sentence denotes 6500 (0.3698, 0.8852) 0.9421 (0.8148, 1.0000) 0.7667 (0.5349, 0.9429)
T177 21171-21334 Sentence denotes Specificity (95%CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9048 (0.7619, 1.0000) 0.8500 (0.6818, 1.0000) 0.9500 (0.8333, 1.0000) 0.9000 (0.7619, 1.0000)
T178 21335-21498 Sentence denotes Sensitivity (95%CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.9286 (0.7500, 1.0000) 0.8000 (0.5714, 1.0000) 1.0000 (1.0000, 1.0000) 0.8667 (0.6667, 1.0000)
T179 21499-21660 Sentence denotes Precision (95%CI) 1.0000 (1.0000, 1.0000) 1.0000 (1.0000, 1.0000) 0.8667 (0.6874, 1.0000) 0.8000 (0.5881, 1.0000) 0.9375 (0.8000, 1.0000) 0.8667 (0.6667, 1.0000)
T180 21662-21674 Sentence denotes Experiment-C
T181 21675-21807 Sentence denotes In this experiment, we used the CT-data of the CTHVS where the normal cases and the COVID-19 cases were all from the Youan hospital.
T182 21808-22021 Sentence denotes The results of the five evaluation indicators for the comparison of the COVID-19 cases and influenza cases of the CTHVS are shown in Table 3 where the influenza cases and the COVID-19 cases are all from the CTHVS.
T183 22022-22115 Sentence denotes The CNNCF achieved the highest performance and the best score of all five evaluation indices.
T184 22116-22185 Sentence denotes The ROC scores are plotted in Fig. 4c; the AUROC of the CNNCF is 1.0.
T185 22186-22270 Sentence denotes The precision-recall scores are shown in Fig. 4f, and the AUPRC of the CNNCF is 1.0.
T186 22272-22284 Sentence denotes Experiment-D
T187 22285-22546 Sentence denotes The boxplots of the five evaluation indicators, the F1 score (Fig. 5a, d, g), the kappa coefficient (Fig. 5b, e, h), and the specificity (Fig. 5c, f, i) of experiments A–C are shown in Fig. 5, and the precision and sensitivity are shown in Supplementary Fig. 2.
T188 22547-22714 Sentence denotes A bootstrapping method40 was used to calculate the empirical distributions, and McNemar’s test41 was used to analyze the differences between the CNNCF and the experts.
T189 22715-22930 Sentence denotes The p-values of the McNemar’s test (Supplementary Tables 1–3) for the five evaluation indicators were all 1.0, indicating no statistically significant difference between the CNNCF results and the expert evaluations.
T190 22931-23054 Sentence denotes Fig. 5 Boxplots of the F1 score, kappa score, and specificity for the CNNCF and expert results for COVID-19 identification.
T191 23055-23144 Sentence denotes NC indicates that the positive case is a COVID-19 case, and the negative case is *Normal.
T192 23145-23229 Sentence denotes CI indicates that the positive case is COVID-19, and the negative case is influenza.
T193 23230-23684 Sentence denotes Bootstrapping is used to generate n = 1000 resampled independent validation sets for the XVS and the CTVS. a F1 score for the NC using X-data. b Kappa score for the NC using X-data. c Specificity for the NC using X-data. d F1 score for the NC using CT-data. e Kappa score for the NC using CT-data. f Specificity for the NC using CT-data. g F1 score for the CI using CT-data. h Kappa score for the CI using CT-data. i Specificity for the CI using CT-data.
T194 23685-23946 Sentence denotes We also conducted extra experiments with both configurations of the same data source and different data sources: the descriptions and graph charts can be found in the Supplementary Experiments and Tables (Supplementary Tables 4–19 and Supplementary Figs. 3–18).
T195 23947-24037 Sentence denotes The data used in experiments E–G were CTHVS and the data were all from the Youan hospital.
T196 24038-24127 Sentence denotes The data used in experiments H–K were XHVS and the data were all from the Youan hospital.
T197 24128-24181 Sentence denotes The data used in experiments L–N were XPVS and CTPVS.
T198 24182-24398 Sentence denotes The data used in the experiment L was from the same data set RSNA, while the data used in experiment M was from different data sets where the pneumonia cases were from the ICNP, and the normal cases were from LUNA16.
T199 24399-24593 Sentence denotes The data used in the experiments O–R, from the four public data sets and one hospital (Youan hospital) data set (including normal cases, pneumonia cases and COVID-19 cases), were XMVS and CTMVS.
T200 24594-24672 Sentence denotes In all the experiments (experiments A–R), the CNNCF achieved good performance.
T201 24673-24833 Sentence denotes Notably, in order to obtain a more comprehensive evaluation of the CNNCF while further improving the usability in clinical practice, experiment-R was performed.
T202 24834-25006 Sentence denotes In the experiment-R, the CNNCF was used to distinguish three types of cases simultaneously (Including the COVID-19, pneumonia, and normal cases) on both the XMVS and CTMVS.
T203 25007-25204 Sentence denotes Good performances were obtained on the XMVS, with the best score of F1 score of 91.89%, kappa score of 89.74%, specificity of 97.14%, sensitivity of 94.44%, and a precision of 89.47%, respectively.
T204 25205-25327 Sentence denotes Excellent performances were obtained on the CTMVS, with the best score of the five evaluation indicators were all 100.00%.
T205 25328-25441 Sentence denotes The ROC score and PRC score in the experiment-R were also satisfactory which were shown in Supplementary Fig. 18.
T206 25442-25550 Sentence denotes The results of the experiment-R further demonstrated the effectiveness and robustness of the proposed CNNCF.
T207 25552-25606 Sentence denotes Image analysis identifies salient features of COVID-19
T208 25607-25746 Sentence denotes In clinical practice, the diagnostic decision of a clinician relies on the identification of the SAs in the medical images by radiologists.
T209 25747-25879 Sentence denotes The statistical results show that the performance of the CNNCF for the identification of COVID-19 is as good as that of the experts.
T210 25880-25983 Sentence denotes A comparison consisting of two parts was performed to evaluate the discriminatory ability of the CNNCF.
T211 25984-26144 Sentence denotes In the first part, we used Grad-CAM, which is a non-intrusive method to extract the salient features in medical images, to create a heatmap of the CNNCF result.
T212 26145-26235 Sentence denotes Figure 2b shows the heatmaps of four examples of COVID-19 cases in the X-data and CT-data.
T213 26236-26431 Sentence denotes In the second part, we used density-based spatial clustering of applications with noise (DBSCAN) to calculate the center pixel coordinates (CPC) of the salient features corresponding to COVID-19.
T214 26432-26478 Sentence denotes All CPCs were normalized to a range of 0 to 1.
T215 26479-26629 Sentence denotes Subsequently, we used a significance test (ST)42 to analyze the relationship between the CPC of the CNNCF output and the CPC annotated by the experts.
T216 26630-26879 Sentence denotes A good performance was obtained, with a mean square error (MSE) of 0.0108, a mean absolute error (MAE) of 0.0722, a root mean squared error (RMSE) of 0.1040, a correlation coefficient (r) of 0.9761, and a coefficient of determination (R2) of 0.8801.
T217 26881-27002 Sentence denotes A strong correlation was observed between the lesion areas detected by the proposed framework and the clinical indicators
T218 27003-27154 Sentence denotes In clinical practice, multiple clinical indicators are analyzed to determine whether further examinations (i.e., medical image examination) are needed.
T219 27155-27230 Sentence denotes These indicators can be used to assess the predictive ability of the model.
T220 27231-27332 Sentence denotes In addition, various examinations are required to perform an accurate diagnosis in clinical practice.
T221 27333-27423 Sentence denotes However, the correlations between the results of various examinations are often not clear.
T222 27424-27743 Sentence denotes We used the stage II sub-framework and the regressor block of the CNNRF to conduct a correlation analysis between the lesion areas detected by the framework and five clinical indicators (white blood cell count, neutrophil percentage, lymphocyte percentage, procalcitonin, C-reactive protein) of COVID-19 using the CADS.
T223 27744-27937 Sentence denotes The inputs of the CNNRF were the lesion area images of each case, and the output was a 5-dimensional vector describing the correlation between the lesion areas and the five clinical indicators.
T224 27938-28002 Sentence denotes The MAE, MSE, RMSE, r, and R2 were used to evaluate the results.
T225 28003-28150 Sentence denotes The ST and the Pearson correlation coefficient (PCC)43 were used to determine the correlation between the lesion areas and the clinical indicators.
T226 28151-28262 Sentence denotes A strong correlation was obtained, with MSE = 0.0163, MAE = 0.0941, RMSE = 0.1172, r = 0.8274, and R2 = 0.6465.
T227 28263-28356 Sentence denotes At a significance level of 0.001, the value of r was 1.27 times the critical value of 0.6524.
T228 28357-28467 Sentence denotes This result indicates a high and significant correlation between the lesion areas and the clinical indicators.
T229 28468-28539 Sentence denotes The PCC was 0.8274 (range of 0.8–1.0), indicating a strong correlation.
T230 28540-28603 Sentence denotes The CNNRF was trained on the CATS and evaluated using the CAVS.
T231 28604-28721 Sentence denotes The initial learning rate was 0.01, and the optimization function was the stochastic gradient descent (SGD) method44.
T232 28722-28808 Sentence denotes The parameters of the CNNRF were initialized using the Xavier initialization method45.
T233 28810-28820 Sentence denotes Discussion
T234 28821-28941 Sentence denotes We developed a computer-aided diagnosis method for the identification of COVID-19 in medical images using DL techniques.
T235 28942-29067 Sentence denotes Strong correlations were obtained between the lesion areas identified by the proposed CNNRF and the five clinical indicators.
T236 29068-29149 Sentence denotes An excellent agreement was observed between the model results and expert opinion.
T237 29150-29434 Sentence denotes Popular image annotation tools (e.g., Labelme46 and VOTT47) are used to annotate various images and support common formats, such as Joint Photographic Experts Group (JPG), Portable Network Graphics (PNG), and Tag Image File Format (TIFF); these formats are not used in the DICOM data.
T238 29435-29612 Sentence denotes Therefore, we developed an annotation platform that does not require much storage space or transformations and can be deployed on a private cloud for security and local sharing.
T239 29613-29762 Sentence denotes Our eyes are not highly sensitive to grayscale images in regions with high average brightness48, resulting in relatively low identification accuracy.
T240 29763-29902 Sentence denotes The proposed pseudo-color method increased the information content of the medical images and facilitated the identification of the details.
T241 29903-30002 Sentence denotes PCA has been widely used for feature extraction and dimensionality reduction in image processing49.
T242 30003-30067 Sentence denotes We used PCA to determine the feature space of the sub-data sets.
T243 30068-30167 Sentence denotes Each image in a specified sub-data set was represented as a linear combination of the eigenvectors.
T244 30168-30285 Sentence denotes Since the eigenvectors describe the most informative regions in the medical images, they represent each sub-data set.
T245 30286-30373 Sentence denotes We visualized the top-five eigenvectors of each sub-data set using an intuitive method.
T246 30374-30525 Sentence denotes The CNNCF is a modular framework consisting of two stages that were trained with different optimization goals and controlled by the control gate block.
T247 30526-30703 Sentence denotes Each stage consisted of multiple residual blocks (ResBlock-A and ResBlock-B) that retained the features in the different layers, thereby preventing the degradation of the model.
T248 30704-30811 Sentence denotes The design of the control gate block was inspired by the synaptic frontend structure in the nervous system.
T249 30812-30920 Sentence denotes We calculated the score of the optimization target, and a score above a predefined threshold was acceptable.
T250 30921-31064 Sentence denotes If the times of the neurotransmitter were above another predefined threshold, the control gate was opened to let the features information pass.
T251 31065-31116 Sentence denotes The framework was trained in a step-by-step manner.
T252 31117-31326 Sentence denotes Training occurred at each stage for a specified goal, and the second stage used the features extracted by the first stage, thereby reusing the features and increasing the convergence speed of the second stage.
T253 31327-31444 Sentence denotes The CNNCF exhibited excellent performance for identifying the COVID-19 cases automatically in the X-data and CT-data.
T254 31445-31632 Sentence denotes Unlike traditional machine learning methods, the CNNCF was trained in an end-to-end manner, which ensured the flexibility of the framework for different data sets without much adjustment.
T255 31633-31866 Sentence denotes We adopted a knowledge distillation method in the training phrase; a small model (called a student network) was trained to mimic the ensemble of multiple models (called teacher networks) to obtain a small model with high performance.
T256 31867-31998 Sentence denotes In the distillation process, knowledge was transferred from the teacher networks to the student network to minimize knowledge loss.
T257 31999-32088 Sentence denotes The target was the output of the teacher networks; these outputs were called soft labels.
T258 32089-32278 Sentence denotes The student network also learned from the ground-truth labels (also called hard labels), thereby minimizing the knowledge loss from the student networks, whose targets were the hard labels.
T259 32279-32425 Sentence denotes Therefore, the overall loss function of the student network incorporated both knowledge distillation and knowledge loss from the student networks.
T260 32426-32669 Sentence denotes After the student network had been well-trained, the task of the teacher networks was complete, and the student model could be used on a regular computer with a fast speed, which is suitable for hospitals without extensive computing resources.
T261 32670-32801 Sentence denotes As a result of the knowledge distillation method, the CNNCF achieved high performance with a few parameters in the teacher network.
T262 32802-32930 Sentence denotes The CNNRF is a modular framework consisting of one stage II sub-framework and one regressor block to handle the regression task.
T263 32931-33202 Sentence denotes In the regressor block, we used skip connections that consisted of a convolution layer with multiple 1 × 1 convolution kernels for retaining the features extracted by the stage II sub-framework while improving the non-linear representation ability of the regressor block.
T264 33203-33402 Sentence denotes We made use of flexible blocks to achieve good performance for the classification and regression tasks, unlike traditional machine learning methods, which are commonly used for either of these tasks.
T265 33403-33553 Sentence denotes Five statistical indices, including sensitivity, specificity, precision, kappa coefficient, and F1 were used to evaluate the performance of the CNNCF.
T266 33554-33679 Sentence denotes The sensitivity is related to the positive detection rate and is of great significance in the diagnostic testing of COVID-19.
T267 33680-33779 Sentence denotes The specificity refers to the ability of the model to correctly identify patients with the disease.
T268 33780-33862 Sentence denotes The precision indicates the ability of the model to provide a positive prediction.
T269 33863-33926 Sentence denotes The kappa demonstrates the stability of the model’s prediction.
T270 33927-33984 Sentence denotes The F1 is the harmonic mean of precision and sensitivity.
T271 33985-34123 Sentence denotes Good performance was achieved by the CNNCF based on the five statistical indices for the multi-modal image data sets (X-data and CT-data).
T272 34124-34228 Sentence denotes The consistency between the model results and the expert evaluation was determined using McNemar’s test.
T273 34229-34404 Sentence denotes The good performance demonstrated the model’s capacity of learning from the experts using the labels of the image data and mimicking the experts in diagnostic decision-making.
T274 34405-34502 Sentence denotes The ROC and PRC of the CNNCF were used to evaluate the performance of the classification model50.
T275 34503-34661 Sentence denotes The ROC is a probability curve that shows the trade-off between the true positive rate (TPR) and false-positive rate (FPR) using different threshold settings.
T276 34662-34780 Sentence denotes The AUROC provides a measure of separability and demonstrated the discriminative capacity of the classification model.
T277 34781-34913 Sentence denotes The larger the AUROC, the better the performance of the model is for predicting the true positive (TP) and true negative (TN) cases.
T278 34914-35033 Sentence denotes The PRC shows the trade-off between the TPR and the positive predictive value (PPV) using different threshold settings.
T279 35034-35120 Sentence denotes The larger the AUPRC, the higher the capacity of the model is to predict the TP cases.
T280 35121-35235 Sentence denotes In our experiments, the CNNCF achieved high scores for both the AUPRC and AUROC (>99%) for the X-data and CT-data.
T281 35236-35362 Sentence denotes DL has made significant progress in numerous areas in recent years and has provided best-performance solutions for many tasks.
T282 35363-35546 Sentence denotes In areas that require high interpretability, such as autonomous driving and medical diagnosis, DL has disadvantages because it is a black-box approach and lacks good interpretability.
T283 35547-35734 Sentence denotes The strong correlation obtained between the CNNCF output and the experts’ evaluation suggested that the mechanism of the proposed CNNCF is similar to that used by humans analyzing images.
T284 35735-35899 Sentence denotes The combination of the visual interpretation and the correlation analysis enhanced the ability of the framework to interpret the results, making it highly reliable.
T285 35900-36026 Sentence denotes The CNNCF has a promising potential for clinical diagnosis considering its high performance and hybrid interpretation ability.
T286 36027-36459 Sentence denotes We have explored the potential use of the CNNCF for clinical diagnosis with the support of the Beijing Youan hospital (which is an authoritative hospital for the study of infectious diseases and one of the designated hospitals for COVID-19 treatment) using both real data after privacy masking and input from experts under experimental conditions and provided a suitable schedule for assisting experts with the radiography analysis.
T287 36460-36545 Sentence denotes However, medical diagnosis in a real situation is more complex than in an experiment.
T288 36546-36745 Sentence denotes Therefore, further studies will be conducted in different hospitals with different complexities and uncertainties to obtain more experience in multiple clinical use cases with the proposed framework.
T289 36746-36940 Sentence denotes The objective of this study was to use statistical methods to analyze the relationship between salient features in images and expert evaluations and test the discriminative ability of the model.
T290 36941-37136 Sentence denotes The CNNRF can be considered a cross-modal prediction model, which is a challenging research area that requires more attention because it is closely related to associative thinking and creativity.
T291 37137-37299 Sentence denotes In addition, the correlation analysis might be a possible optimization direction to improve the interpretability performance of the classification model using DL.
T292 37300-37524 Sentence denotes In conclusion, we proposed a complete framework for the computer-aided diagnosis of COVID-19, including data annotation, data preprocessing, model design, correlation analysis, and assessment of the model’s interpretability.
T293 37525-37664 Sentence denotes We developed a pseudo-color tool to convert the grayscale medical images to color images to facilitate image interpretation by the experts.
T294 37665-37791 Sentence denotes We developed a platform for the annotation of medical images characterized by high security, local sharing, and expandability.
T295 37792-37927 Sentence denotes We designed a simple data preprocessing method for converting multiple types of images (X-data, CT-data) to three-channel color images.
T296 37928-38095 Sentence denotes We established a modular CNN-based classification framework with high flexibility and wide use cases, consisting of the ResBlock-A, ResBlock-B, and Control Gate Block.
T297 38096-38255 Sentence denotes A knowledge distillation method was used as a training strategy for the proposed classification framework to ensure high performance with fast inference speed.
T298 38256-38503 Sentence denotes A CNN-based regression framework that required minimal changes to the architecture of the classification framework was employed to determine the correlation between the lesion area images of patients with COVID-19 and the five clinical indicators.
T299 38504-38723 Sentence denotes The three evaluation indices (F1, kappa, specificity) of the classification framework were similar to those of the respiratory resident and the emergency resident and slightly higher than that of the respiratory intern.
T300 38724-38853 Sentence denotes We visualized the salient features that contributed most to the CNNCF output in a heatmap for easy interpretability of the CNNCF.
T301 38854-39033 Sentence denotes The proposed CNNCF computer-aided diagnosis method showed relatively high precision and has a potential for the automatic diagnosis of COVID-19 in clinical practice in the future.
T302 39034-39143 Sentence denotes The outbreak of the COVID-19 epidemic poses serious threats to the safety and health of the human population.
T303 39144-39287 Sentence denotes At present, popular methods for the diagnosis and monitoring of viruses include the detection of viral RNAs using PCR or a test for antibodies.
T304 39288-39457 Sentence denotes However, one negative result of the RT-PCR test (especially in the areas of high infection risk) might not be enough to rule out the possibility of a COVID-19 infection.
T305 39458-39578 Sentence denotes On June 14, 2020, the Beijing Municipal Health Commission declared that strict management of fever clinics was required.
T306 39579-39845 Sentence denotes All medical institutions in Beijing were required to conduct tests to detect COVID-19 nucleic acids and antibodies, CT examinations, and the routine blood test (also referred to as “1 + 3 tests”) for patients with fever that live in areas with high infection risk51.
T307 39846-40047 Sentence denotes Therefore, the proposed computer-aided diagnosis using medical imaging could be used as an auxiliary diagnosis tool to help physicians identify people with high infection risk in the clinical workflow.
T308 40048-40123 Sentence denotes There is also a potential for broader applicability of the proposed method.
T309 40124-40277 Sentence denotes Once the method has been improved, it might be used in other diagnostic decision-making scenarios (lung cancer, liver cancer, etc.) using medical images.
T310 40278-40363 Sentence denotes The expertise of a specialist will be required in clinical cases in future scenarios.
T311 40364-40525 Sentence denotes However, we are optimistic about the potential of using DL methods in intelligent medicine and expect that many people will benefit from the advanced technology.
T312 40527-40534 Sentence denotes Methods
T313 40536-40555 Sentence denotes Data sets splitting
T314 40556-40732 Sentence denotes We used the multi-modal data sets from four public data sets and one hospital (Youan hospital) in our research and split the hybrid data set in the following manner.For X-data:
T315 40733-40875 Sentence denotes The CXR images of COVID-19 cases collected from the public CCD52 contained 212 patients diagnosed with COVID-19 and were resized to 512 × 512.
T316 40876-40949 Sentence denotes Each image contained 1–2 suspected areas with inflammatory lesions (SAs).
T317 40950-41049 Sentence denotes We also collected 5100 normal cases and 3100 pneumonia cases from another public data set (RSNA)53.
T318 41050-41271 Sentence denotes In addition, The CXR images collected from the Youan hospital contained 45 cases diagnosed with COVID-19, 503 normal cases, 435 cases diagnosed with pneumonia (not COVID-19 patients), and 145 cases diagnosed as influenza.
T319 41272-41378 Sentence denotes The CXR images collected from the Youan hospital were obtained using the Carestream DRX-Revolution system.
T320 41379-41496 Sentence denotes All the CXR images of COVID-19 cases were analyzed by the two experienced radiologists to determine the lesion areas.
T321 41497-41676 Sentence denotes The X-data of the normal cases (XNPDS), that of the pneumonia cases (XPPDS), and that of the COVID-19 cases (XCPDS) from public data sets constituted the X public data set (XPDS).
T322 41677-41860 Sentence denotes The X-data of the normal cases (XNHDS), that of the pneumonia cases (XPHDS), and that of the COVID-19 cases (XCHDS) from the Youan hospital constituted the X hospital data set (XHDS).
T323 41861-41873 Sentence denotes For CT-data:
T324 41874-42356 Sentence denotes We collected CT-data of 120 normal cases from a public lung CT-data set (LUNA16, a large data set for automatic nodule detection in the lungs54), which was a subset of LIDC-IDRI (The LIDC-IDRI contains a total of 1018 helical thoracic CT scans collected using manufacturers from eight medical imaging companies including AGFA Healthcare, Carestream Health, Inc., Fuji Photo Film Co., GE Healthcare, iCAD, Inc., Philips Healthcare, Riverain Medical, and Siemens Medical Solutions)55.
T325 42357-42522 Sentence denotes It was confirmed by the two experienced radiologists from the Youan Hospital that no lesion areas of COVID-19, pneumonia, or influenza were present in the 120 cases.
T326 42523-42665 Sentence denotes We also collected the CT-data of pneumonia cases from a public data set (images of COVID-19 positive and negative pneumonia patients: ICNP)56.
T327 42666-42838 Sentence denotes The CT-data collected from the Youan hospital contained 95 patients diagnosed with COVID-19, 50 patients diagnosed with influenza and 215 patients diagnosed with pneumonia.
T328 42839-43007 Sentence denotes The images of the CT scans collected from the Youan hospital were obtained using the PHILIPS Brilliance iCT 256 system (Which was also used for the LIDC-IDRI data set).
T329 43008-43121 Sentence denotes The slice thickness of the CT scans was 5 mm, and the CT-data images were grayscale images with 512 × 512 pixels.
T330 43122-43312 Sentence denotes Areas with 2–5 SAs were annotated by the two experienced radiologists using a rapid keystroke-entry format in the images for each case, and these areas ranged from 16 × 16 to 64 × 64 pixels.
T331 43313-43464 Sentence denotes The CT-data of the normal cases (CTNPDS) and that of the pneumonia cases (CTPPDS) from the public data sets constituted the CT public data set (CTPDS).
T332 43465-43709 Sentence denotes The CT-data of the COVID-19 cases from the Youan hospital (CTCHDS), the influenza cases from the Youan hospital (CTIHDS), and the normal cases from the Youan hospital (CTNHDS) constituted the CT hospital (clinically-diagnosed) data set (CTHDS).
T333 43710-43738 Sentence denotes For clinical indicator data:
T334 43739-43965 Sentence denotes Five clinical indicators (white blood cell count, neutrophil percentage, lymphocyte percentage, procalcitonin, C-reactive protein) of 95 COVID-19 cases were obtained from the Youan hospital, as shown in Supplementary Table 20.
T335 43966-44222 Sentence denotes A total of 95 data pairs from the 95 COVID-19 cases (369 images of the lesion area and the 95 × 5 clinical indicators) were collected from the Youan hospital for the correlation analysis of the lesion areas of the COVID-19 and the five clinical indicators.
T336 44223-44330 Sentence denotes The images of the SAs and the clinical indicator data constituted the correlation analysis data set (CADS).
T337 44331-44450 Sentence denotes We split the XPDS, XHDS, CTPDS, CTHDS, and CADS into the training-validation (train-val) and test data sets using TTSF.
T338 44451-44557 Sentence denotes The details of the hybrid data sets for the public data sets and Youan hospital data are shown in Table 1.
T339 44558-44645 Sentence denotes The train-val part of CTHDS is referred to as CTHTS, and the test part is called CTHVS.
T340 44646-44787 Sentence denotes The same naming scheme was adopted for XPDS, XHDS, CTPDS, and CADS, i.e., XPTS, XPVS, XHTS, XHVS, CTPTS, CTPVS, CATS, and CAVS, respectively.
T341 44788-44972 Sentence denotes The training-validation part of the four public data sets and the hospital (Youan Hospital) data set were mixed for X-data and CT-data, which were named as XMTS and CTMTS respectively.
T342 44973-45046 Sentence denotes While the test parts were split in the same way and named XMVS and CTMVS.
T343 45048-45067 Sentence denotes Image preprocessing
T344 45068-45250 Sentence denotes All image data (X-data and CT-data) in the DICOM format were loaded using the Pydicom library (version 1.4.0) and processed as arrays using the Numpy library (version 1.16.0).X-data:
T345 45251-45434 Sentence denotes The two-dimensional array (x axis and y axis) of the image of the X-data (size of 512 × 512) was normalized to pixel values of 0–255 and stored in png format using the OpenCV library.
T346 45435-45503 Sentence denotes Each preprocessed image was resized to 512 × 512 and had 3 channels.
T347 45504-45512 Sentence denotes CT-data:
T348 45513-45674 Sentence denotes The array of the CT-data was three-dimensional (x axis, y axis, and z axis), and the length of the z axis was ~300, which represented the number of image slices.
T349 45675-45751 Sentence denotes Each image slice was two-dimensional (x axis and y axis, size of 512 × 512).
T350 45752-45939 Sentence denotes As shown in Fig. 1b, the array of the image was divided into three groups in the z axis direction, and each group contained 100 image slices (each case was resampled to 300 image slices).
T351 45940-46070 Sentence denotes The image slices in each group were processed using a window center of −600 and a window width of 2000 to extract the lung tissue.
T352 46071-46209 Sentence denotes The images of the CT-data with 300 image slices were normalized to pixel values of 0–255 and stored in npy format using the Numpy library.
T353 46210-46419 Sentence denotes A convolution filter was applied with three 1 × 1 convolution kernels to preprocess the CT-data, which is a trainable layer with the aim of normalizing the input; the image size was 512 × 512, with 3 channels.
T354 46421-46455 Sentence denotes Annotation tool for medical images
T355 46456-46581 Sentence denotes The server program of the annotation tool was deployed in a computer with large network bandwidth and abundant storage space.
T356 46582-46717 Sentence denotes The client program of the annotation tool was deployed in the office computer of the experts, who were given unique user IDs for login.
T357 46718-46878 Sentence denotes The interface of the client program had a built-in image viewer with a window size of 512 × 512 and an export tool for obtaining the annotations in text format.
T358 46879-47102 Sentence denotes Multiple drawing tools were provided to annotate the lesion area in the images, including a rectangle tool for drawing a bounding box around the target, a polygon tool for outlining the target, and a circle tool the target.
T359 47103-47173 Sentence denotes Multiple categories could be defined and assigned to the target areas.
T360 47174-47400 Sentence denotes All annotations were stored in a structured query language (SQL) database, and the export tool was used to export the annotations to two common file formats (comma-separated values (csv) and JavaScript object notation (json)).
T361 47401-47448 Sentence denotes The experts could share the annotation results.
T362 47449-47586 Sentence denotes Since the size of the X-data and the CT slice-data were identical, the annotations for both data were performed with the annotation tool.
T363 47587-47682 Sentence denotes Here we use one image slice of the CT-data as an example to demonstrate the annotation process.
T364 47683-47752 Sentence denotes In this study, two experts were asked to annotate the medical images.
T365 47753-47813 Sentence denotes The normal cases were reviewed and confirmed by the experts.
T366 47814-47908 Sentence denotes The abnormal cases, including the COVID-19 and influenza cases, were annotated by the experts.
T367 47909-47999 Sentence denotes Bounding boxes of the lesion areas in the images were annotated using the annotation tool.
T368 48000-48060 Sentence denotes In general, each case contained 2–5 slices with annotations.
T369 48061-48204 Sentence denotes The cases with the annotated slices were considered positive cases, and each case was assigned to a category (COVID-19 case or influenza case).
T370 48205-48270 Sentence denotes The pipeline of the annotation was shown in Supplementary Fig. 1.
T371 48272-48303 Sentence denotes Model architecture and training
T372 48304-48535 Sentence denotes In this study, we proposed a modular CNNCF to identify the COVID-19 cases in the medical images and a CNNRF to determine the relationships between the lesion areas in the medical images and the five clinical indicators of COVID-19.
T373 48536-48612 Sentence denotes Both proposed frameworks consisted of two units (ResBlock-A and ResBlock-B).
T374 48613-48715 Sentence denotes The CNNCF and CNNRF had unique units, namely the control gate block and regressor block, respectively.
T375 48716-48841 Sentence denotes Both frameworks were implemented using two NVIDIA GTX 1080TI graphics cards and the open-source PyTorch framework.ResBlock-A:
T376 48842-48862 Sentence denotes As discussed in ref.
T377 48863-49004 Sentence denotes 57, the residual block is a CNN-based block that allows the CNN models to reuse features, thus accelerating the training speed of the models.
T378 49005-49165 Sentence denotes In this study, we developed a residual block (ResBlock-A) that utilized a skip-connection for retaining features in different layers in the forward propagation.
T379 49166-49378 Sentence denotes This block (Fig. 6a) consisted of a multiple-input multiple-output structure with two branches (an upper branch and a bottom branch), where input 1 and input 2 have the same size, but the values may be different.
T380 49379-49472 Sentence denotes In contrast, output 1 and output 2 had the same size, but output 1 did not have a ReLu layer.
T381 49473-49600 Sentence denotes The upper branch consisted of a max-pooling layer (Max-Pooling), a convolution layer (Conv 1 × 1), and a batch norm layer (BN).
T382 49601-49835 Sentence denotes The Max-Pooling had a kernel size of 3 × 3 and a stride of 2 to downsample the input 1 for retaining the features and ensuring the same size as the output layer before the element-wise add operation was conducted in the bottom branch.
T383 49836-50014 Sentence denotes The Conv 1 × 1 consisted of multiple 1 × 1 convolution kernels with the same number as that in the second convolution layer in the bottom branch to adjust the number of channels.
T384 50015-50164 Sentence denotes The BN used a regulation function to ensure the input in each layer of the model followed a normal distribution with a mean of 0 and a variance of 1.
T385 50165-50255 Sentence denotes The bottom branch consisted of two convolution layers, two BN layers, and two ReLu layers.
T386 50256-50464 Sentence denotes The first convolution layer in the bottom branch consisted of multiple 3 × 3 convolution kernels with a stride of 2 and a padding of 1 to reduce the size of the feature maps when local features were obtained.
T387 50465-50601 Sentence denotes The second convolution layer in the bottom branch consisted of multiple 3 × 3 convolution kernels with a stride of 1 and a padding of 1.
T388 50602-50721 Sentence denotes The ReLu function was used as the activation function to ensure a non-linear relationship between the different layers.
T389 50722-50856 Sentence denotes The output of the upper branch and the output of the bottom branch after the second BN were fused using an element-wise add operation.
T390 50857-50943 Sentence denotes The fused result was output 1, and the fused result after the ReLu layer was output 2.
T391 50944-50992 Sentence denotes Fig. 6 The four units of the proposed framework.
T392 50993-51673 Sentence denotes a ResBlock-A architecture, containing two convolution layers with 3 × 3 kernels, one convolution layer with a 1 × 1 kernel, three batch normalization layers, two ReLu layers, and one max-pooling layer with a 3 × 3 kernel. b ResBlock-B architecture; the basic unit is the same as the ResBlock-A, except for output 1. c The Control Gate Block has a synaptic-based frontend architecture that controls the direction of the feature map flow and the overall optimization direction of the framework. d The Regressor architecture is a skip-connection architecture containing one convolution layer with 3 × 3 kernels, one batch normalization layer, one ReLu layer, and three linear layers.
T393 51674-51685 Sentence denotes ResBlock-B:
T394 51686-51822 Sentence denotes The ResBlock-B (Fig. 6b) was a multiple-input single-output block that was similar to the ResBlock-A, except that there was no output 1.
T395 51823-51973 Sentence denotes The value of the stride and padding in each layer of the ResBlock-A and ResBlock-B could be adjusted using hyper-parameters based on the requirements.
T396 51974-51993 Sentence denotes Control Gate Block:
T397 51994-52247 Sentence denotes As shown in Fig. 6c, the Control Gate Block was a multiple-input single-output block consisting of a predictor module, a counter module, and a synapses module to control the optimization direction while controlling the information flow in the framework.
T398 52248-52372 Sentence denotes The pipeline of the predictor module is shown in Supplementary Fig. 19a, where the Input S1 is the output of the ResBlock-B.
T399 52373-52474 Sentence denotes The Input S1 was then flattened to a one-dimensional feature vector as the input of the linear layer.
T400 52475-52581 Sentence denotes The output of the linear layer was converted to a probability of each category using the softmax function.
T401 52582-52730 Sentence denotes A sensitivity calculator used the Vpred and Vtrue as inputs to calculate the TP, TN, FP, and false-negative (FN) rates to calculate the sensitivity.
T402 52731-52830 Sentence denotes The sensitivity calculation was followed by a step function to control the output of the predictor.
T403 52831-52977 Sentence denotes The ths was a threshold value; if the calculated sensitivity was greater or equal to ths, the step function output 1; otherwise, the output was 0.
T404 52978-53059 Sentence denotes The counter module was a conditional counter, as shown in Supplementary Fig. 19b.
T405 53060-53125 Sentence denotes If the input n was zero, the counter was cleared and set to zero.
T406 53126-53164 Sentence denotes Otherwise, the counter increased by 1.
T407 53165-53199 Sentence denotes The output of the counter was num.
T408 53200-53349 Sentence denotes The synapses block mimicked the synaptic structure, and the input variable num was similar to a neurotransmitter, as shown in Supplementary Fig. 19c.
T409 53350-53409 Sentence denotes The input num was the input parameter of the step function.
T410 53410-53538 Sentence denotes The ths was a threshold value; if the input num was greater or equal to ths, the step function output 1; otherwise, it output 0.
T411 53539-53642 Sentence denotes An element-wise multiplication was performed between the input S1 and the output of the synapses block.
T412 53643-53698 Sentence denotes The multiplied result was passed on to a discriminator.
T413 53699-53799 Sentence denotes If the sum of each element in the result was not zero, the Input S1 was passed on to the next layer.
T414 53800-53854 Sentence denotes Otherwise, the input S1 information was not passed on.
T415 53855-53871 Sentence denotes Regressor block:
T416 53872-54000 Sentence denotes The regressor block consisted of multiple linear layers, a convolution layer, a BN layer, and a ReLu layer, as shown in Fig. 6d.
T417 54001-54143 Sentence denotes A skip-connection architecture was adopted to retain the features and increase the ability of the block to represent non-linear relationships.
T418 54144-54274 Sentence denotes The convolution block in the skip-connection structure was a convolution layer with multiple numbers of 1 × 1 convolution kernels.
T419 54275-54430 Sentence denotes The number of the convolution kernels was the same as that of the output size of the second linear layer to ensure the consistency of the vector dimension.
T420 54431-54532 Sentence denotes The input size and output size of each linear layer were adjustable to be applicable to actual cases.
T421 54533-54675 Sentence denotes Based on the four blocks, two frameworks were designed for the classification task and regression task, respectively.Classification framework:
T422 54676-54741 Sentence denotes The CNNCF consisted of stage I and stage II, as shown in Fig. 3a.
T423 54742-54813 Sentence denotes Stage I was duplicated Q times in the framework (in this study, Q = 1).
T424 54814-54936 Sentence denotes It consisted of multiple ResBlock-A with a number of M (in this study, M = 2), one ResBlock-B, and one Control Gate Block.
T425 54937-55040 Sentence denotes Stage II consisted of multiple ResBlock-A with a number of N (in this study, N = 2) and one ResBlock-B.
T426 55041-55187 Sentence denotes The weighted cross-entropy loss function was used and was minimized using the SGD optimizer with a learning rate of a1 (in this study, a1 = 0.01).
T427 55188-55438 Sentence denotes A warm-up strategy58 was used in the initialization of the learning rate for a smooth training start, and a reduction factor of b1 (in this study, b1 = 0.1) was used to reduce the learning rate after every c1 (in this study, c1 = 10) training epochs.
T428 55439-55577 Sentence denotes The model was trained for d1 (in this study, d1 = 40) epochs, and the model parameters saved in the last epoch was used in the test phase.
T429 55578-55599 Sentence denotes Regression framework:
T430 55600-55672 Sentence denotes The CNNRF (Fig. 3b) consisted of two parts (stage II and the regressor).
T431 55673-55917 Sentence denotes The inputs to the regression framework were the images of the lesion areas, and the output was the corresponding vector with five dimensions, representing the five clinical indicators (all clinical indicators were normalized to a range of 0–1).
T432 55918-56022 Sentence denotes The stage II structure was the same as that in the classification framework, except for some parameters.
T433 56023-56166 Sentence denotes The loss function was the MSE loss function, which was minimized using the SGD optimizer with a learning rate of a2 (in this study, a2 = 0.01).
T434 56167-56415 Sentence denotes A warm-up strategy was used in the initialization of the learning rate for a smooth training start, and a reduction factor of b2 (in this study, b2 = 0.1) was used to reduce the learning rate after every c2 (in this study, c2 = 50) training epochs.
T435 56416-56560 Sentence denotes The framework was trained for d2 (in this study, d2 = 200) epochs, and the model parameters saved in the last epoch were used in the test phase.
T436 56561-56606 Sentence denotes The workflow of the classification framework.
T437 56607-56680 Sentence denotes The workflow of the classification framework was demonstrated in Fig. 3c.
T438 56681-56809 Sentence denotes The preprocessed images are sent to the first convolution block to expand the channels and processed as the input for the CNNCF.
T439 56810-56952 Sentence denotes Given the input Fi with a size of M × N × 64, the stage I output feature maps F′i with a size of M/8 × N/8 × 256 in the default configuration.
T440 56953-57092 Sentence denotes As we introduced above, the Control Gate Block controls the optimization direction while controlling the information flow in the framework.
T441 57093-57175 Sentence denotes If the Control Gate Block is open, the feature maps F′i are passed on to stage II.
T442 57176-57481 Sentence denotes Given the input F′i, the stage II output the feature maps F″i with a size of M/64 × N/64 × 512 which is defined as follows:1 Fi′=S1(Fi)Fi″=S2(Fi′)⊗CGB(Fi′),where S1 denotes the stage I block, S2 denotes the stage II block, and CGB is the Control Gate Block. ⊗ is the element-wise multiplication operation.
T443 57482-57638 Sentence denotes Stage II is Followed by a global average pooling layer (GAP) and a fully connect layer (FC layer) with a softmax function to generate the final predictions.
T444 57639-57729 Sentence denotes Given F″i as input, the GAP is adopted to generate a vector Vf with a size of 1 × 1 × 512.
T445 57730-58097 Sentence denotes Given Vf as input, the FC layer with the softmax function outputs a vector Vc with a size of 1 × 1 × C.2 Vf=GAPFi′Vc=SMaxFCVf,where GAP is the global average pooling layer, the FC is the fully connect layer, SMax is the softmax function, Vf is the feature vector generated by the GAP, Vc is the prediction vector, and C is the number of case types used in this study.
T446 58099-58176 Sentence denotes Training strategies and evaluation indicators of the classification framework
T447 58177-58270 Sentence denotes The training strategies and hyper-parameters of the classification framework were as follows.
T448 58271-58449 Sentence denotes We adopted a knowledge distillation method (Fig. 7) to train the CNNCF as a student network with one stage I block and one stage II block, each of which contained two ResBlock-A.
T449 58450-58654 Sentence denotes Four teacher networks (the hyper-parameters are provided in Supplementary Table 21) with the proposed blocks were trained on the train-val part of each sub-data set using a 5-fold cross-validation method.
T450 58655-58724 Sentence denotes All networks were initialized using the Xavier initialization method.
T451 58725-58803 Sentence denotes The initial learning rate was 0.01, and the optimization function was the SGD.
T452 58804-58914 Sentence denotes The CNNCF was trained using the image data and the label, as well as the fused output of the teacher networks.
T453 58915-59037 Sentence denotes The comparison of RT-PCR test results using throat specimen and the CNNCF results were provided in Supplementary Table 22.
T454 59038-59115 Sentence denotes Supplementary Fig. 20 shows the details of the knowledge distillation method.
T455 59116-59232 Sentence denotes The definitions and details of the five evaluation indicators used in this study were given in Supplementary Note 2.
T456 59233-59332 Sentence denotes Fig. 7 Knowledge distillation consisting of multiple teacher networks and a target student network.
T457 59333-59433 Sentence denotes The knowledge is transferred from the teacher networks to the student network using a loss function.
T458 59435-59474 Sentence denotes Gradient-weighted class activation maps
T459 59475-59618 Sentence denotes Grad-CAM59 in the Pytorch framework was used to visualize the salient features that contributed the most to the prediction output of the model.
T460 59619-59865 Sentence denotes Given a target category, the Grad-CAM performed back-propagation to obtain the final CNN feature maps and the gradient of the feature maps; only pixels with positive contributions to the specified category were retained through the ReLU function.
T461 59866-60065 Sentence denotes The Grad-CAM method was used for all test data set (X-data and CT-data) in the CNNCF without changing the framework structure to obtain a visual output of the framework’s high discriminatory ability.
T462 60067-60097 Sentence denotes Statistics and reproducibility
T463 60098-60216 Sentence denotes We used multiple statistical indices and empirical distributions to assess the performance of the proposed frameworks.
T464 60217-60376 Sentence denotes The equations of the statistical indices are shown in Supplementary Fig. 21 and all the abbreviations used in this study are defined in Supplementary Table 23.
T465 60377-60505 Sentence denotes All the data used in this study followed the criteria: (1) sign informed consent prior to enrollment. (2) At least 18 years old.
T466 60506-60637 Sentence denotes This study was conducted following the declaration of Helsinki and was approved by the Capital Medical University Ethics Committee.
T467 60638-60839 Sentence denotes The following statistical analyses of the data were conducted for both evaluating the classification framework and the regression framework.Statistical indices to evaluate the classification framework.
T468 60840-61067 Sentence denotes Multiple evaluation indicators (PRC, ROC, AUPRC, AUROC, sensitivity, specificity, precision, kappa index, and F1 with a fixed threshold) were computed for a comprehensive and accurate assessment of the classification framework.
T469 61068-61184 Sentence denotes Multiple threshold values were in the range from 0 to 1 with a step value of 0.005 to obtain the ROC and PRC curves.
T470 61185-61351 Sentence denotes The PRC showed the relationship between the precision and the sensitivity (or recall), and the ROC indicated the relationship between the sensitivity and specificity.
T471 61352-61439 Sentence denotes The two curves reflected the comprehensive performance of the classification framework.
T472 61440-61544 Sentence denotes The kappa index is a statistical method for assessing the degree of agreement between different methods.
T473 61545-61624 Sentence denotes In our use case, the indicator was used to measure the stability of the method.
T474 61625-61717 Sentence denotes The F1 score is a harmonic average of precision and sensitivity and considers the FP and FN.
T475 61718-61810 Sentence denotes The bootstrapping method was used to calculate the empirical distribution of each indicator.
T476 61811-62004 Sentence denotes The detailed calculation process was as follows: we conducted random sampling with replacement to generate 1000 new test data sets with the same number of samples as the original test data set.
T477 62005-62078 Sentence denotes The evaluation indicators were calculated to determine the distributions.
T478 62079-62152 Sentence denotes The results were displayed in boxplots (Fig. 5 and Supplementary Fig. 2).
T479 62153-62210 Sentence denotes Statistical indices to evaluate the regression framework.
T480 62211-62358 Sentence denotes Multiple evaluation indicators (MSE, RMSE, MAE, R2, and PCC) were computed for a comprehensive and accurate assessment of the regression framework.
T481 62359-62441 Sentence denotes The MSE was used to calculate the deviation between the predicted and true values.
T482 62442-62489 Sentence denotes The RMSE was the square root of the MSE result.
T483 62490-62551 Sentence denotes The two indicators show the accuracy of the model prediction.
T484 62552-62626 Sentence denotes The R2 was used to assess the goodness-of-fit of the regression framework.
T485 62627-62718 Sentence denotes The r was used to assess the correlation between two variables in the regression framework.
T486 62719-62813 Sentence denotes The indicators were calculated using the open-source tools scikit-learn and the scipy library.
T487 62815-62840 Sentence denotes Supplementary information
T488 62842-62858 Sentence denotes Peer Review File
T489 62859-62884 Sentence denotes Supplementary Information