AIToolbox
A library that offers tools for AI problem solving.
|
Classes | |
class | EpsilonPolicy |
class | ESRLPolicy |
This class implements the Exploring Selfish Reinforcement Learning algorithm. More... | |
class | Experience |
This class computes averages and counts for a Bandit problem. More... | |
class | LRPPolicy |
This class implements the Linear Reward Penalty algorithm. More... | |
class | Model |
This class represent a multi-armed bandit. More... | |
class | PolicyInterface |
Simple typedef for most of a normal Bandit's policy needs. More... | |
class | QGreedyPolicy |
This class implements a simple greedy policy. More... | |
class | QGreedyPolicyWrapper |
This class implements some basic greedy policy primitives. More... | |
class | QSoftmaxPolicy |
This class implements a softmax policy through a QFunction. More... | |
class | QSoftmaxPolicyWrapper |
This class implements some basic softmax policy primitives. More... | |
class | RandomPolicy |
This class represents a random policy. More... | |
class | SuccessiveRejectsPolicy |
This class implements the successive rejects algorithm. More... | |
class | T3CPolicy |
This class implements the T3C sampling policy. More... | |
class | ThompsonSamplingPolicy |
This class implements a Thompson sampling policy. More... | |
class | TopTwoThompsonSamplingPolicy |
This class implements the top-two Thompson sampling policy. More... | |
Functions | |
template<typename V , typename Gen > | |
QGreedyPolicyWrapper (const V &&, std::vector< size_t > &, Gen &) -> QGreedyPolicyWrapper< V, Gen > | |
template<typename V , typename Gen > | |
QGreedyPolicyWrapper (const V &, std::vector< size_t > &, Gen &) -> QGreedyPolicyWrapper< const V &, Gen > | |
template<typename V , typename Gen > | |
QSoftmaxPolicyWrapper (double, const V &&, Vector &, std::vector< size_t > &, Gen &) -> QSoftmaxPolicyWrapper< V, Gen > | |
template<typename V , typename Gen > | |
QSoftmaxPolicyWrapper (double, const V &, Vector &, std::vector< size_t > &, Gen &) -> QSoftmaxPolicyWrapper< const V &, Gen > | |
using AIToolbox::Bandit::QFunction = typedef Vector |
AIToolbox::Bandit::QGreedyPolicyWrapper | ( | const V && | , |
std::vector< size_t > & | , | ||
Gen & | |||
) | -> QGreedyPolicyWrapper< V, Gen > |
AIToolbox::Bandit::QGreedyPolicyWrapper | ( | const V & | , |
std::vector< size_t > & | , | ||
Gen & | |||
) | -> QGreedyPolicyWrapper< const V &, Gen > |
AIToolbox::Bandit::QSoftmaxPolicyWrapper | ( | double | , |
const V && | , | ||
Vector & | , | ||
std::vector< size_t > & | , | ||
Gen & | |||
) | -> QSoftmaxPolicyWrapper< V, Gen > |
AIToolbox::Bandit::QSoftmaxPolicyWrapper | ( | double | , |
const V & | , | ||
Vector & | , | ||
std::vector< size_t > & | , | ||
Gen & | |||
) | -> QSoftmaxPolicyWrapper< const V &, Gen > |