This class implements a simple greedy policy.

#include <AIToolbox/Bandit/Policies/QGreedyPolicy.hpp>

Public Member Functions
	QGreedyPolicy (const QFunction &q)
	Basic constructor.

virtual size_t	sampleAction () const override
	This function chooses the greediest action.

virtual double	getActionProbability (const size_t &a) const override
	This function returns the probability of taking the specified action.

virtual Vector	getPolicy () const override
	This function returns a vector containing all probabilities of the policy.

Public Member Functions inherited from AIToolbox::PolicyInterface< void, void, size_t >
	PolicyInterface (void s, size_t a)
	Basic constructor.

virtual	~PolicyInterface ()
	Basic virtual destructor.

virtual size_t	sampleAction (const void &s) const=0
	This function chooses a random action for state s, following the policy distribution.

virtual double	getActionProbability (const void &s, const size_t &a) const=0
	This function returns the probability of taking the specified action in the specified state.

const void &	getS () const
	This function returns the number of states of the world.

const size_t &	getA () const
	This function returns the number of available actions to the agent.

Additional Inherited Members
Public Types inherited from AIToolbox::Bandit::PolicyInterface
using	Base = AIToolbox::PolicyInterface< void, void, size_t >

Protected Attributes inherited from AIToolbox::PolicyInterface< void, void, size_t >
void	S

size_t	A

RandomEngine	rand_

Detailed Description

This class implements a simple greedy policy.

This class always selects the greediest action, that is, the action with the highest value in the QFunction built from the experience gathered so far.

Constructor & Destructor Documentation

◆ QGreedyPolicy()

AIToolbox::Bandit::QGreedyPolicy::QGreedyPolicy ( const QFunction & q )

Basic constructor.

Parameters

q	The QFunction to act upon.

Member Function Documentation

◆ getActionProbability()

virtual double AIToolbox::Bandit::QGreedyPolicy::getActionProbability ( const size_t & a ) const

override virtual

This function returns the probability of taking the specified action.

If several actions are tied as greedy, each of them is assigned an equal share of the probability, matching the way sampleAction() picks one of them at random.

Parameters

a	The selected action.

Returns: 0 if the action is not greedy; otherwise 1 divided by the number of greedy actions.
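The return rule above can be sketched in a few lines of standalone C++. This is only an illustration under stated assumptions, not the AIToolbox implementation: the helper name greedyActionProbability is hypothetical, the action values are held in a plain std::vector<double> rather than a QFunction, and ties are detected with exact floating-point equality.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of AIToolbox: probability that a greedy
// policy with uniform tie-breaking picks action `a`, given one value per action.
double greedyActionProbability(const std::vector<double>& q, std::size_t a) {
    // Out-of-range actions (and an empty value table) get probability 0.
    if (q.empty() || a >= q.size()) return 0.0;

    const double best = *std::max_element(q.begin(), q.end());

    // Assumption: exact comparison; a real implementation may use a tolerance.
    if (q[a] != best) return 0.0;

    // All actions tied at the maximum share the probability equally.
    const auto ties = std::count(q.begin(), q.end(), best);
    return 1.0 / static_cast<double>(ties);
}
```

For the values {1.0, 3.0, 3.0}, action 0 is not greedy and gets 0, while actions 1 and 2 are tied and get 0.5 each.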

◆ getPolicy()

virtual Vector AIToolbox::Bandit::QGreedyPolicy::getPolicy ( ) const

override virtual

This function returns a vector containing all probabilities of the policy.

Ideally, call this function only when the same policy values need to be accessed repeatedly and efficiently.

Implements AIToolbox::Bandit::PolicyInterface.
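As a rough illustration of why a full policy vector is useful for repeated lookups, the sketch below computes every action probability in one pass over the values. It is not the library's code: greedyPolicyVector is a hypothetical name, std::vector<double> stands in for both the QFunction and the returned Vector, and ties are again detected with exact equality.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of AIToolbox: builds the whole greedy
// policy at once, so later lookups are plain index accesses.
std::vector<double> greedyPolicyVector(const std::vector<double>& q) {
    std::vector<double> policy(q.size(), 0.0);
    if (q.empty()) return policy;

    const double best = *std::max_element(q.begin(), q.end());

    // Assumption: exact comparison is used to detect tied greedy actions.
    const auto ties = std::count(q.begin(), q.end(), best);
    const double share = 1.0 / static_cast<double>(ties);

    // Greedy actions split the probability equally; all others stay at 0.
    for (std::size_t a = 0; a < q.size(); ++a) {
        if (q[a] == best) policy[a] = share;
    }
    return policy;
}
```

Computing the vector once and indexing into it avoids rescanning the values for every queried action, which is the repeated-access case described above.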

◆ sampleAction()

virtual size_t AIToolbox::Bandit::QGreedyPolicy::sampleAction ( ) const

override virtual

This function chooses the greediest action.

If multiple actions are equally greedy, one of them is returned at random.

Returns: The chosen action.
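The tie-breaking behavior described above might look like the following standalone sketch. It is an assumption-laden illustration rather than the actual implementation: sampleGreedyAction is a hypothetical name, the values live in a std::vector<double>, the random engine is passed in by the caller instead of being the inherited rand_ member, and ties are chosen uniformly.

```cpp
#include <algorithm>
#include <cstddef>
#include <random>
#include <vector>

// Hypothetical helper, not part of AIToolbox: returns the index of a
// maximal value, choosing uniformly among ties.
// Precondition: q is non-empty.
template <typename Rng>
std::size_t sampleGreedyAction(const std::vector<double>& q, Rng& rng) {
    const double best = *std::max_element(q.begin(), q.end());

    // Collect every action tied at the maximum (exact comparison assumed).
    std::vector<std::size_t> ties;
    for (std::size_t a = 0; a < q.size(); ++a) {
        if (q[a] == best) ties.push_back(a);
    }

    // Pick one of the tied actions uniformly at random.
    std::uniform_int_distribution<std::size_t> pick(0, ties.size() - 1);
    return ties[pick(rng)];
}
```

With a single maximum the result is deterministic; with ties, repeated calls on the same engine can return any of the tied indices.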

The documentation for this class was generated from the following file:

include/AIToolbox/Bandit/Policies/QGreedyPolicy.hpp
