Deep Q-Learning
Jump to navigation
Jump to search
Intro
To understand deep Q-learning, it is imperative you first have an understanding of normal, table-based Q-learning.
- Define a table
Qwith a row for each state and a column for each action. We can index intoQwithQ(s,a). The value of each index will be the expected utility of being in statesand taking actiona - Define exploration probability
0 < p <= 1 - Begin in state
s - With probability
p, take actiona = argmax_a[Q(s,a)]. Otherwise, take a random action from the action space.
the sea