alidation method. All networks were initialized with Xavier initialization. The initial learning rate was 0.01, and the optimization function was t