|
| QL (const size_t s, const size_t a, const double discount=1.0, const double alpha=0.1, const double lambda=0.1, const double tolerance=0.001, const double epsilon=0.1) |
| Basic constructor.
|
|
void | setLambda (double l) |
| This function sets the new lambda parameter.
|
|
double | getLambda () const |
| This function returns the currently set lambda parameter.
|
|
| OffPolicyControl (size_t s, size_t a, double discount=1.0, double alpha=0.1, double tolerance=0.001, double epsilon=0.1) |
| Basic constructor.
|
|
void | stepUpdateQ (const size_t s, const size_t a, const size_t s1, const double rew) |
| This function updates the internal QFunction using the discount set during construction.
|
|
void | setEpsilon (double e) |
| This function sets the epsilon parameter.
|
|
double | getEpsilon () const |
| This function returns the currently set epsilon parameter.
|
|
| OffPolicyBase (size_t s, size_t a, double discount=1.0, double alpha=0.1, double tolerance=0.001) |
| Basic constructor.
|
|
void | setLearningRate (double a) |
| This function sets the learning rate parameter.
|
|
double | getLearningRate () const |
| This function returns the currently set learning rate parameter.
|
|
void | setDiscount (double d) |
| This function sets the new discount parameter.
|
|
double | getDiscount () const |
| This function returns the currently set discount parameter.
|
|
void | setTolerance (double t) |
| This function sets the trace cutoff parameter.
|
|
double | getTolerance () const |
| This function returns the currently set trace cutoff parameter.
|
|
void | clearTraces () |
| This function clears the currently set traces.
|
|
const Traces & | getTraces () const |
| This function returns the currently set traces.
|
|
void | setTraces (const Traces &t) |
| This function overwrites the currently set traces with the given ones.
|
|
size_t | getS () const |
| This function returns the number of states on which QLearning is working.
|
|
size_t | getA () const |
| This function returns the number of actions on which QLearning is working.
|
|
const QFunction & | getQFunction () const |
| This function returns a reference to the internal QFunction.
|
|
void | setQFunction (const QFunction &qfun) |
| This function allows setting the internal QFunction directly.
|
|
This class implements off-policy control via Q(lambda).
See also: QLEvaluation
This method behaves as an inefficient QLearning if you set the lambda parameter to zero (effectively cutting all traces), and the epsilon parameter to zero (forcing a perfectly greedy target policy).
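The reduction to QLearning described above can be illustrated with a minimal, self-contained sketch. TinyQLambda below is a hypothetical toy class, not the library's implementation: it reuses the constructor parameter names listed above, and it assumes that epsilon is the probability of the target policy picking a uniformly random action, that traces decay by discount * lambda after each update, and that traces below tolerance are cut to zero.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical, simplified tabular Q(lambda) learner for illustration only.
// It mirrors the parameter names documented above but is NOT the library's code.
struct TinyQLambda {
    std::size_t S, A;
    double discount, alpha, lambda, tolerance, epsilon;
    std::vector<double> q;      // Q-values, row-major: q[s * A + a]
    std::vector<double> trace;  // eligibility trace per (state, action) pair

    TinyQLambda(std::size_t s, std::size_t a, double discount_ = 1.0,
                double alpha_ = 0.1, double lambda_ = 0.1,
                double tolerance_ = 0.001, double epsilon_ = 0.1)
        : S(s), A(a), discount(discount_), alpha(alpha_), lambda(lambda_),
          tolerance(tolerance_), epsilon(epsilon_),
          q(s * a, 0.0), trace(s * a, 0.0) {
        assert(s > 0 && a > 0);
    }

    // Expected value of state s under an epsilon-greedy target policy
    // (assumption: epsilon is the probability of a uniformly random action).
    double expectedValue(std::size_t s) const {
        double best = q[s * A];
        double sum = 0.0;
        for (std::size_t a = 0; a < A; ++a) {
            best = std::max(best, q[s * A + a]);
            sum += q[s * A + a];
        }
        return (1.0 - epsilon) * best + epsilon * sum / static_cast<double>(A);
    }

    // One learning step from the transition (s, a) -> s1 with reward rew.
    void stepUpdateQ(std::size_t s, std::size_t a, std::size_t s1, double rew) {
        assert(s < S && a < A && s1 < S);
        const double error = rew + discount * expectedValue(s1) - q[s * A + a];
        trace[s * A + a] = 1.0;  // the visited pair becomes fully eligible
        for (std::size_t i = 0; i < q.size(); ++i) {
            q[i] += alpha * error * trace[i];          // credit every traced pair
            trace[i] *= discount * lambda;             // decay the trace
            if (trace[i] < tolerance) trace[i] = 0.0;  // cut negligible traces
        }
    }
};
```

In this sketch, setting lambda to zero makes every trace drop below the cutoff right after it is used, so each step touches only the visited state-action pair; setting epsilon to zero makes expectedValue return the maximum Q-value of the next state. Together these give the usual one-step QLearning update, yet the loop still scans every trace on each step, which is where the inefficiency comes from.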