Kizspy | Question: 18
(Choose 1 answer)
What is the main objective of the Q-learning algorithm?
A. To minimize the state-action pair values
B. To maximize the expected cumulative reward over time
C. To minimize the exploration rate
D. To maximize the number of actions taken
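
For reference, a minimal sketch of tabular Q-learning may help when weighing the options. Everything here is an assumption for illustration and not part of the question: the 5-state corridor environment, the `step`, `q_learning`, and `greedy_policy` names, and the hyperparameter values. The update line moves Q(s, a) toward the observed reward plus the discounted best value of the next state.

```python
import random

N_STATES = 5       # hypothetical corridor: states 0..4, goal at state 4
ACTIONS = (0, 1)   # 0 = move left, 1 = move right


def step(state, action):
    """Hypothetical environment: reward 1.0 on reaching the goal, else 0.0."""
    if action == 0:
        nxt = max(0, state - 1)
    else:
        nxt = min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    reward = 1.0 if done else 0.0
    return nxt, reward, done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Target: reward plus discounted best next value (no bootstrap at the goal).
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES)]
```

Calling `greedy_policy(q_learning())` returns one action per state. In this toy corridor the learned values for states 0 to 3 favour moving right, toward the rewarding goal state.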