Q10.webp
Sakura_chan

Q10.webp

Kizspy | Question: 10
(Choose 1 answer)
How does Expected Sarsa reduce the variance in Q-value updates compared to Q-learning?
A. By using a fixed learning rate.
B. By averaging over all possible actions.
C. By always selecting the action with the highest Q-value.
D. By ignoring the reward signal.

Thông tin

Category
REL301m
Thêm bởi
Sakura_chan
Ngày thêm
Lượt xem
719
Lượt bình luận
2
Rating
0.00 star(s) 0 đánh giá

Share this media

Back
Bên trên Bottom