Kizspy | Question: 10
(Choose 1 answer)
How does Expected Sarsa reduce the variance in Q-value updates compared to Q-learning?
A. By using a fixed learning rate.
B. By averaging over all possible next actions, weighted by their probability under the policy.
C. By always selecting the action with the highest Q-value.
D. By ignoring the reward signal.
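
For reference, here is a minimal sketch of how the update targets being compared could be computed for a single transition. It assumes an epsilon-greedy policy and a tabular setting; the function names (epsilon_greedy_probs, expected_sarsa_target, sarsa_target, q_learning_target) and parameters are hypothetical and chosen only for illustration. Sarsa is included because its target depends on one sampled next action, which is where the sampling variance comes from.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy (assumed policy)."""
    n = len(q_values)
    greedy = max(range(n), key=lambda a: q_values[a])
    probs = [epsilon / n] * n          # exploration mass spread evenly
    probs[greedy] += 1.0 - epsilon     # remaining mass on the greedy action
    return probs


def expected_sarsa_target(reward, q_next, epsilon, gamma):
    """Target using the expectation of Q over all next actions."""
    probs = epsilon_greedy_probs(q_next, epsilon)
    expected_q = sum(p * q for p, q in zip(probs, q_next))
    return reward + gamma * expected_q


def sarsa_target(reward, q_next, next_action, gamma):
    """Target using the single next action that was actually sampled."""
    return reward + gamma * q_next[next_action]


def q_learning_target(reward, q_next, gamma):
    """Target using the maximum Q-value over next actions."""
    return reward + gamma * max(q_next)


if __name__ == "__main__":
    q_next = [1.0, 2.0, 3.0]  # illustrative Q-values for the next state
    print(expected_sarsa_target(1.0, q_next, epsilon=0.3, gamma=0.9))
    print(q_learning_target(1.0, q_next, gamma=0.9))
    # The Sarsa target changes with whichever next action was sampled.
    for a in range(len(q_next)):
        print(a, sarsa_target(1.0, q_next, a, gamma=0.9))
```

With the same reward and next-state Q-values, the Sarsa target varies with the sampled next action, while the Expected Sarsa target is a single probability-weighted value.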