The output of the linear layer was converted to a probability of each category using the softmax function.