Here, the input is a vector containing the raw class scores from the last layer of the network, the numerator is the exponential of the score for a given class, and the denominator is the sum of the exponentials of all raw class scores, which acts as a normalization constant.
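
As a minimal sketch of this computation, the snippet below exponentiates each raw score and divides by the sum of all the exponentials. The function name `softmax`, the variable names, and the example scores are illustrative assumptions and are not taken from the text. Subtracting the maximum score before exponentiating is an added numerical-stability step; it cancels in the ratio, so the result is unchanged.

```python
import math

def softmax(scores):
    """Map a list of raw class scores to probabilities that sum to 1."""
    # Assumption: shift by the largest score to avoid overflow in exp().
    # The shift cancels between numerator and denominator.
    shift = max(scores)
    # Numerators: the exponential of each (shifted) class score.
    exps = [math.exp(s - shift) for s in scores]
    # Denominator: the sum of all exponentials (the normalization constant).
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw scores for three classes.
probs = softmax([2.0, 1.0, 0.1])
print(probs)
print(sum(probs))
```

Because every numerator is divided by the same denominator, the outputs are positive and sum to 1, and a larger raw score always yields a larger probability.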