Kizspy | Question: 41
(Choose 1 answer)
What does the action-value function Q(s, a) represent in reinforcement learning?
A. The probability of transitioning to state s from state a
B. The expected return (total future reward) of taking action a in state s
C. The immediate reward received after taking action a
D. The average time it takes to transition from state s to state a
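
Note: as a rough illustration of the quantity in the question, Q(s, a) is usually defined as the expected discounted sum of future rewards after taking action a in state s. The sketch below estimates it by averaging returns over sampled episodes (a simple Monte Carlo estimate). The function names, the reward sequences, and the discount factor gamma = 0.5 are assumptions made up for this example; they do not come from the question itself.

```python
def discounted_return(rewards, gamma):
    """Return r_0 + gamma*r_1 + gamma^2*r_2 + ... for one reward sequence."""
    g = 0.0
    # Accumulate from the last reward backwards so each step applies one discount.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def mc_q_estimate(reward_sequences, gamma):
    """Average the discounted returns of several episodes.

    Each sequence is assumed to hold the rewards observed after taking
    the same action a in the same state s (hypothetical data).
    """
    returns = [discounted_return(rs, gamma) for rs in reward_sequences]
    return sum(returns) / len(returns)


# Hypothetical reward sequences, each starting right after taking a in s.
episodes = [
    [1.0, 0.0, 2.0],
    [0.0, 0.0, 1.0],
]
q_sa = mc_q_estimate(episodes, gamma=0.5)
print(q_sa)
```

The estimate depends on the whole reward sequence, not only the first reward, which is what separates an action value from an immediate reward.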