A | AIToolbox::PolicyInterface< void, void, size_t > | protected |
Base typedef | AIToolbox::Bandit::PolicyInterface | |
getA() const | AIToolbox::PolicyInterface< void, void, size_t > | |
getActionProbability(const size_t &a) const override | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | virtual |
AIToolbox::Bandit::PolicyInterface::getActionProbability(const void &s, const size_t &a) const = 0 | AIToolbox::PolicyInterface< void, void, size_t > | pure virtual |
getExperience() const | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | |
getPolicy() const override | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | virtual |
getS() const | AIToolbox::PolicyInterface< void, void, size_t > | |
PolicyInterface(void s, size_t a) | AIToolbox::PolicyInterface< void, void, size_t > | |
rand_ | AIToolbox::PolicyInterface< void, void, size_t > | mutable, protected |
recommendAction() const | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | |
S | AIToolbox::PolicyInterface< void, void, size_t > | protected |
sampleAction() const override | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | virtual |
AIToolbox::Bandit::PolicyInterface::sampleAction(const void &s) const = 0 | AIToolbox::PolicyInterface< void, void, size_t > | pure virtual |
TopTwoThompsonSamplingPolicy(const Experience &exp, double beta) | AIToolbox::Bandit::TopTwoThompsonSamplingPolicy | |
~PolicyInterface() | AIToolbox::PolicyInterface< void, void, size_t > | virtual |