Q27.webp
Sakura_chan

Q27.webp

Kizspy | Question: 27
(Choose 1 answer)
What role does the discount factor play in Semi-Gradient TD learning?
A. It determines the rate of eligibility trace decay
B. It controls the influence of future rewards on the updates
C. It sets the exploration-exploitation trade-off
D. It adjusts the learning rate dynamically during training

Thông tin

Category
REL301m
Thêm bởi
Sakura_chan
Ngày thêm
Lượt xem
504
Lượt bình luận
1
Rating
0.00 star(s) 0 đánh giá

Share this media

Back
Bên trên Bottom