Id |
Subject |
Object |
Predicate |
Lexical cue |
T66 |
0-295 |
Sentence |
denotes |
For the purpose of selecting variables with predictive impact on the incidence of arrhythmia, the variables sex, age, hypertension, cardiovascular disease, hydroxychloroquine, and combined therapy with hydroxychloroquine and azithromycin were initially considered in terms of variable selection. |
T67 |
296-422 |
Sentence |
denotes |
First, regularized logistic regression using the elastic net penalty implemented in the package “glmnet” was computed [12,13]. |
T68 |
423-573 |
Sentence |
denotes |
The hyperparameters α (elastic net mixing parameter) and β (shrinkage parameter) were tuned conducting 5-fold cross-validation (CV) and a grid search. |
T69 |
574-757 |
Sentence |
denotes |
Subsequently, multiple logistic regression modeling was conducted only incorporating the selected variables, to estimate the odds ratios (ORs) and their 95% confidence intervals (CI). |
T70 |
758-920 |
Sentence |
denotes |
The area under the curve (AUC) value was computed applying the receiver operating characteristics (ROC) curve to evaluate the model using the package “pROC” [14]. |
T71 |
921-1031 |
Sentence |
denotes |
To prevent overestimation of the model’s performance measure, the AUC-value was calculated applying 5-fold CV. |
T72 |
1032-1154 |
Sentence |
denotes |
During 5-fold CV, each patient is part of the training set for four times and is assigned exactly once to the testing set. |
T73 |
1155-1334 |
Sentence |
denotes |
Hence, in each step a model is fitted based on 80% of the data whereas a probability of the remaining 20% of the patients is estimated with respect to the incidence of arrhythmia. |