Q17.webp
Sakura_chan

Q17.webp

Kizspy | Question: 17
(Choose 1 answer)
Which of the following is an advantage of off-policy learning?
A. It requires less computational resources
B. It guarantees convergence to the optimal policy
C. It allows learning from non-optimal behavior
D. It eliminates the need for exploration entirely

Thông tin

Category
REL301m
Thêm bởi
Sakura_chan
Ngày thêm
Lượt xem
597
Lượt bình luận
3
Rating
0.00 star(s) 0 đánh giá

Share this media

Back
Bên trên Bottom