PMC:7160614 / 5337-14267 JSONTXT 14 Projects

Annnotations TAB TSV DIC JSON TextAE

Id Subject Object Predicate Lexical cue
T46 0-20 Sentence denotes Patients and methods
T47 22-30 Sentence denotes Patients
T48 31-182 Sentence denotes Ethical approvals by the institutional review boards were obtained for this retrospective analysis, and the need to obtain informed consent was waived.
T49 183-500 Sentence denotes From January 1 to February 8, 2020, seventy consecutive patients with COVID-19 admitted in 5 independent hospitals from 4 cities were enrolled in this study (mean age, 42.9 years; range, 16–69 years), including 41 men (mean age, 41.8 years; range, 16–69 years) and 29 women (mean age, 44.5 years; range, 16–66 years).
T50 501-606 Sentence denotes All patients were confirmed with SARS-CoV-2 infection by real-time RT-PCR and next-generation sequencing.
T51 607-731 Sentence denotes Of these patients, 24 were from Huizhou City, 25 from Shantou City, 15 from Yongzhou City, and the rest 6 from Meizhou City.
T52 732-1018 Sentence denotes At the same period, another 66 pneumonia patients without COVID-19 from Meizhou People’s Hospital were recruited as controls (mean age, 46.7 years; range, 0.3–93 years), including 43 men (mean age, 46.0 years; range, 0.3–93 years) and 23 women (mean age, 48.0 years; range, 1–86 years).
T53 1019-1091 Sentence denotes All the controls were confirmed with consecutive negative RT-PCR assays.
T54 1092-1241 Sentence denotes Figure E1 in the Supplementary Material shows the patient recruitment pathway for the control group, along with the inclusion and exclusion criteria.
T55 1242-1379 Sentence denotes According to previous studies [19–21], whose sample size is comparable with ours, the ratio between primary and validation cohort is 7:3.
T56 1380-1500 Sentence denotes In this study, a total of 136 patients were divided into primary (n = 98) and validation (n = 38) cohorts, close to 7:3.
T57 1501-1782 Sentence denotes A total of 19 COVID-19 patients from two hospitals (6 patients from Meizhou People’s Hospital and 13 patients from the First Affiliated Hospital of Shantou University Medical College) and 19 randomly selected controls from Meizhou City were incorporated into the validation cohort.
T58 1783-1956 Sentence denotes The rest of the patients are incorporated in the primary cohort, including 51 COVID-19 patients from Huizhou, Yongzhou, and Shantou cities and 47 controls from Meizhou City.
T59 1957-2147 Sentence denotes The primary cohort was utilized to select the most valuable features and build the predictive model, and the validation cohort was used to evaluate and validate the performance of the model.
T60 2149-2183 Sentence denotes Image and clinical data collection
T61 2184-2484 Sentence denotes The chest CT imaging data without contrast material enhancement were obtained from multiple hospitals with different CT systems, including GE CT Discovery 750 HD (General Electric Company), SCENARIA 64 CT (Hitachi Medical), Philips Ingenuity CT (PHILIPS), and Siemens SOMATOM Definition AS (Siemens).
T62 2485-2559 Sentence denotes All images were reconstructed into 1-mm slices with a slice gap of 0.8 mm.
T63 2560-2649 Sentence denotes Detailed acquisition parameters were summarized in the Supplementary Material (Table E1).
T64 2650-2744 Sentence denotes The clinical history, nursing records, and laboratory findings were reviewed for all patients.
T65 2745-2937 Sentence denotes Clinical characteristics, including demographic information, daily body temperature, blood pressure, heart rate, clinical symptoms, and history of exposure to epidemic centers, were collected.
T66 2938-3162 Sentence denotes Total white blood cell (WBC) counts, lymphocyte counts, ratio of lymphocyte, neutrophil count, ratio of neutrophil, procalcitonin (PCT), C-reactive protein level (CRP), and erythrocyte sedimentation rate (ESR) were measured.
T67 3163-3278 Sentence denotes All threshold values chosen for laboratory metrics were based on the normal ranges set by each individual hospital.
T68 3280-3294 Sentence denotes Image analysis
T69 3295-3484 Sentence denotes For extraction of radiological semantic features, two senior radiologists (D.L. and X.C., more than 15 years of experience) reached a consensus, blinded to clinical and laboratory findings.
T70 3485-3580 Sentence denotes The radiological semantic features included both qualitative and quantitative imaging features.
T71 3581-3729 Sentence denotes The lesions in the outer third of the lung were defined as peripheral, and lesions in the inner two-thirds of the lung were defined as central [22].
T72 3730-3960 Sentence denotes The progression of COVID-19 lesions within each lung lobe was evaluated by scoring each lobe from 0 to 4 [7], corresponding to normal, 1~25% infection, 26~50% infection, 51~75% infection, and more than 75% infection, respectively.
T73 3961-4051 Sentence denotes The scores were combined for all five lobes to provide a total score ranging from 0 to 20.
T74 4052-4157 Sentence denotes A total of 41 radiological features (26 quantitative and 15 qualitative) were extracted for the analysis.
T75 4158-4261 Sentence denotes The descriptions of radiological semantic features are listed in the Supplementary Material (Table E2).
T76 4262-4318 Sentence denotes Figure 1 is one example of the evaluation of CT imaging.
T77 4319-4400 Sentence denotes Fig. 1 A 23-year-old female with a travel history to Wuhan presenting with fever.
T78 4401-4527 Sentence denotes Axial noncontrast CT image shows a consolidation with ground-glass opacities in the peripheral region by the right upper lobe.
T79 4528-4563 Sentence denotes Air bronchogram is found in lesion.
T80 4564-4605 Sentence denotes The maximum diameter of lesion is 2.8 cm.
T81 4606-4691 Sentence denotes The right upper lobe score is 1 because of the involved lung parenchyma less than 1/4
T82 4693-4736 Sentence denotes Clinical and radiological feature selection
T83 4737-4939 Sentence denotes To obtain the most valuable clinical and radiological semantic features, statistical analysis, univariate analysis, and the least absolute shrinkage and selection operator (LASSO) method were performed.
T84 4940-5138 Sentence denotes In statistical analysis, the chi-square test, the Kruskal-Wallis H test, and t test were utilized to compare the radiological semantic and clinical features between COVID-19 and non-COVID-19 groups.
T85 5139-5197 Sentence denotes The features with p value smaller than 0.05 were selected.
T86 5198-5326 Sentence denotes Then, univariate analysis was performed for clinical and radiological candidate features to determine the COVID-19 risk factors.
T87 5327-5413 Sentence denotes The features with p value smaller than 0.05 in univariate analysis were also selected.
T88 5414-5637 Sentence denotes The least absolute shrinkage and selection operator (LASSO) method [23] was utilized to select the most useful features with penalty parameter tuning that was conducted by 10-fold cross-validation based on minimum criteria.
T89 5638-5741 Sentence denotes Diagnostic models were then constructed by multivariate logistic regression with the selected features.
T90 5742-5860 Sentence denotes The flowchart of the feature selection process for these models was presented in the Supplementary Material (Fig. E2).
T91 5862-5912 Sentence denotes Development and validation of the diagnostic model
T92 5913-6208 Sentence denotes To develop an optimal model, we evaluated 3 models by analyzing (i) the clinical features model (C model), (ii) radiological semantic features model (R model), and (iii) the combination of clinical and radiological semantic features model (CR model) by multivariate logistic regression analysis.
T93 6209-6338 Sentence denotes The classification performances of the models were evaluated by the area under the receiver operating characteristic (ROC) curve.
T94 6339-6431 Sentence denotes The area under the curve (AUC), accuracy, sensitivity, and specificity were also calculated.
T95 6432-6633 Sentence denotes A decision curve analysis was conducted to determine the clinical usefulness of the diagnostic model by quantifying the net benefits at different threshold probabilities in the validation dataset [24].
T96 6634-6713 Sentence denotes The development of decision curve was described in the Supplementary Materials.
T97 6714-6795 Sentence denotes Figure 2 depicts the flowchart of the proposed analysis pipeline described above.
T98 6796-7000 Sentence denotes We also built a nomogram, which was a quantitative tool to predict the individual probability of infection by COVID-19, based on the multivariate logistic analysis of the CR model with the primary cohort.
T99 7001-7157 Sentence denotes Depending on the coefficient of the predictive factors in multivariate logistic regression model, all values of each predictive factor were assigned points.
T100 7158-7237 Sentence denotes A total point was obtained by summing all the points of each predictive factor.
T101 7238-7348 Sentence denotes The scale also showed the relationship between the total point and the prediction probability in the nomogram.
T102 7349-7496 Sentence denotes The corresponding calibration curves of the CR model in the primary cohort and validation cohort are shown in the Supplementary Material (Fig. E3).
T103 7497-7556 Sentence denotes Fig. 2 Workflow of data process and analysis in this study.
T104 7557-7687 Sentence denotes Radiological semantic features, including qualitative and quantitative imaging features, are extracted from axial lung CT section.
T105 7688-7780 Sentence denotes The clinical manifestation and laboratory parameters are provided by electronic case system.
T106 7781-7895 Sentence denotes Statistical analysis is performed for comparing the different features between COVID-19 and non-COVID-19 patients.
T107 7896-8073 Sentence denotes Univariate analysis, least absolute shrinkage, and selection operator (LASSO) are further performed to determine the COVID-19 risk factors with p < 0.05 in statistical analysis.
T108 8074-8170 Sentence denotes Three models based on the selected features are established by multivariate logistic regression.
T109 8171-8313 Sentence denotes These models include radiological mode (R model), clinical model (C model), and the combination of clinical and radiological model (CR model).
T110 8314-8491 Sentence denotes The performance and clinical benefits of the prediction model are assessed by the area under a receiver operating characteristic (ROC) curve and the decision curve, respectively
T111 8493-8513 Sentence denotes Statistical analysis
T112 8514-8574 Sentence denotes Statistical analysis was conducted with R software (Version:
T113 8575-8608 Sentence denotes 3.6.4, http: www.r-project.org/).
T114 8609-8717 Sentence denotes The reported significance levels were all two-sided, and the statistical significance level was set to 0.05.
T115 8718-8803 Sentence denotes The multivariate logistic regression analysis was performed with the “stats” package.
T116 8804-8864 Sentence denotes Nomogram construction was performed using the “rms” package.
T117 8865-8918 Sentence denotes Decision curve analysis was performed using the “dca.
T118 8919-8930 Sentence denotes R” package.