Question: 7
(Choose 1 answer)
Which of the following statements is true regarding the update rule of Expected Sarsa?
A. It only updates based on the greedy action
B. It updates based on a sum of the next state's Q-values over all possible actions, weighted by the policy's action probabilities
C. It requires a complete model of the environment
D. It updates based on the worst possible action
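
For reference, the Expected Sarsa update is usually written as Q(S, A) <- Q(S, A) + alpha * [R + gamma * sum_a pi(a | S') * Q(S', a) - Q(S, A)]. The sketch below illustrates one such update on a small tabular example. It assumes an epsilon-greedy target policy and a NumPy array for the Q-table; the function names, the table shape, and the numbers are hypothetical and chosen only for illustration. Note that it uses a single sampled transition (s, a, r, s_next) and no model of the environment.

```python
import numpy as np

def epsilon_greedy_probs(q_row, epsilon):
    """Action probabilities of an epsilon-greedy policy for one state.

    Assumption: ties are broken in favour of the first maximal action.
    """
    n = len(q_row)
    probs = np.full(n, epsilon / n)                 # exploration mass, spread evenly
    probs[int(np.argmax(q_row))] += 1.0 - epsilon   # remaining mass on the greedy action
    return probs

def expected_sarsa_update(Q, s, a, r, s_next, alpha, gamma, epsilon, done=False):
    """Apply one Expected Sarsa update to Q[s, a] in place and return the target."""
    if done:
        expected_q = 0.0                            # no bootstrap from a terminal state
    else:
        probs = epsilon_greedy_probs(Q[s_next], epsilon)
        # Expectation over ALL next actions, weighted by the policy's probabilities.
        expected_q = float(np.dot(probs, Q[s_next]))
    target = r + gamma * expected_q
    Q[s, a] += alpha * (target - Q[s, a])
    return target

# Hypothetical 2-state, 3-action table for illustration.
Q = np.zeros((2, 3))
Q[1] = [1.0, 2.0, 3.0]
target = expected_sarsa_update(Q, s=0, a=0, r=1.0, s_next=1,
                               alpha=0.5, gamma=0.9, epsilon=0.3)
```

With epsilon = 0.3 the policy probabilities in the next state are 0.1, 0.1 and 0.8, so the expected next value is 2.7 rather than the greedy maximum of 3.0 that Q-learning would use.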