Q41.webp
Sakura_chan

Q41.webp

Kizspy | Question: 41
(Choose 1 answer)
What does the action value Q(s,a) represent in reinforcement learning?
A. The probability of transitioning to state s from state a
B. The expected return (total future reward) of taking action a in state s
C. The immediate reward received after taking action a
D. The average time it takes to transition from state s to state a

Thông tin

Category
REL301m
Thêm bởi
Sakura_chan
Ngày thêm
Lượt xem
430
Lượt bình luận
1
Rating
0.00 star(s) 0 đánh giá

Share this media

Back
Bên trên Bottom