Kizspy | Question: 24 (Choose 1 answer)
Suppose you are using YOLO on a 19x19 grid, on a detection problem with 20 classes, and with 5 anchor boxes. During training, for each image you will need to construct an output volume y as the target value for the neural network; this corresponds to the last layer of the neural network. (y may include some "?", or "don'tcares"). What is the dimension of this output volume?
A. 5x5x(19x25)
B. 19x19x(5x25)
C. 25x25x(5x19)
D. 5x29x(19x19)