AIToolbox
A library that offers tools for AI problem solving.
This class represents the Multi Objective Variable Elimination process.
#include <AIToolbox/Factored/Bandit/Algorithms/Utils/MultiObjectiveVariableElimination.hpp>
Classes

    struct  Entry

Public Types

    using Factor  = std::vector<Entry>
    using GVE     = GenericVariableElimination<Factor>
    using Results = Factor

Public Member Functions

    template<typename Iterable>
    Results operator()(const Action & A, const Iterable & inputRules)
        This function finds the best Action-value pairs for the provided MOQFunctionRules.

    Results operator()(const Action & A, GVE::Graph & graph)
        This function performs the actual agent elimination process.
Detailed Description

This class represents the Multi Objective Variable Elimination process.
This class performs variable elimination on a factor graph. It first builds the graph starting from a list of MOQFunctionRules. These rules are sorted by the agents they affect, and each group is added to a single factor connected to those agents.
Each agent is then eliminated from the graph, and all rules connected to it are processed in order to find out which actions the agent being eliminated should take.
When doing multi-objective elimination, there is no real best action in general, since we do not know in advance the weights for the objectives' rewards. Thus, we keep all action-rewards pairs we found during the elimination and return them.
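For instance, with two objectives neither the reward vector (3, 1) nor (1, 3) dominates the other, so both entries (together with the actions that produced them) must be kept. Below is a minimal standalone sketch of such a Pareto-dominance check and pruning pass; it is only an illustration, not the library's internal pruning code:

    #include <vector>
    #include <Eigen/Core>

    using Rewards = Eigen::VectorXd;

    // lhs dominates rhs if it is at least as good in every objective
    // and strictly better in at least one.
    bool dominates(const Rewards & lhs, const Rewards & rhs) {
        return (lhs.array() >= rhs.array()).all() &&
               (lhs.array() >  rhs.array()).any();
    }

    // Keep only the non-dominated reward vectors (quadratic sketch).
    std::vector<Rewards> prune(const std::vector<Rewards> & entries) {
        std::vector<Rewards> front;
        for (size_t i = 0; i < entries.size(); ++i) {
            bool dominated = false;
            for (size_t j = 0; j < entries.size(); ++j)
                if (i != j && dominates(entries[j], entries[i])) {
                    dominated = true;
                    break;
                }
            if (!dominated) front.push_back(entries[i]);
        }
        return front;
    }

Both (3, 1) and (1, 3) survive such a pass; only a later scalarization with known objective weights could pick between them.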
This process is exponential in the maximum number of agents attached to a single factor (which may be higher than in the original graph, since elimination can create larger factors than the initial ones). However, as each factor is usually linked to only a few agents, and as this process avoids considering the full factored Action at any one time, it is usually much faster than a brute-force approach.
WARNING: This process only considers rules that have been explicitly passed to it. This may create problems if some of your rules have negative values, since the elimination process does not treat unmentioned actions as giving 0 reward, and so it may select a negative-valued action over an unmentioned one that would actually be better. To avoid this problem, either explicitly add 0-valued rules for every agent subgroup that contains negative rules, or convert the rules to an equivalent graph with only positive values.
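As an example of the first workaround, the sketch below adds explicit 0-valued rules for a two-agent subgroup that has one negative rule. The Rewards and PartialAction aliases, the MOQFunctionRule field layout, and the Types.hpp header path are assumptions to be checked against the actual headers; only the concept of the rules comes from this page:

    #include <AIToolbox/Factored/Bandit/Types.hpp>
    #include <vector>

    using namespace AIToolbox::Factored;
    using namespace AIToolbox::Factored::Bandit;

    // Two agents, two actions each, two objectives.
    Action A{2, 2};

    Rewards penalty(2);
    penalty << -1.0, -3.0;                 // the negative rule's values
    Rewards zero = Rewards::Zero(2);

    std::vector<MOQFunctionRule> rules;
    // The problematic negative rule: agents {0, 1} take actions {0, 0}.
    rules.push_back({PartialAction{{0, 1}, {0, 0}}, penalty});
    // Explicit 0-valued rules for the remaining joint actions of the
    // same subgroup, so elimination can actually prefer them.
    rules.push_back({PartialAction{{0, 1}, {0, 1}}, zero});
    rules.push_back({PartialAction{{0, 1}, {1, 0}}, zero});
    rules.push_back({PartialAction{{0, 1}, {1, 1}}, zero});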
Member Typedef Documentation

using AIToolbox::Factored::Bandit::MultiObjectiveVariableElimination::Factor = std::vector<Entry>

using AIToolbox::Factored::Bandit::MultiObjectiveVariableElimination::GVE = GenericVariableElimination<Factor>

Member Function Documentation

template<typename Iterable>
Results AIToolbox::Factored::Bandit::MultiObjectiveVariableElimination::operator()(const Action & A, const Iterable & inputRules)    [inline]
This function finds the best Action-value pairs for the provided MOQFunctionRules.

Parameters
    A           The multi-agent action space.
    inputRules  An iterable object over MOQFunctionRules.
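A hypothetical end-to-end call, reusing A and rules from the earlier sketch. That this overload accepts any iterable of MOQFunctionRules is taken from this page; that the class is default-constructible is an assumption:

    #include <AIToolbox/Factored/Bandit/Algorithms/Utils/MultiObjectiveVariableElimination.hpp>

    // Build the graph from the rules and run the elimination.
    AIToolbox::Factored::Bandit::MultiObjectiveVariableElimination move;
    auto results = move(A, rules);
    // `results` is a Factor, i.e. std::vector<Entry>: all surviving
    // joint actions, each paired with its reward vector.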
Results AIToolbox::Factored::Bandit::MultiObjectiveVariableElimination::operator()(const Action & A, GVE::Graph & graph)
This function performs the actual agent elimination process.
For each agent, its adjacent factors and the agents adjacent to those are found. Then all possible action combinations between those other agents are tried, in order to find the best action responses for the agent being eliminated.
All the responses found (possibly pruned) are added as Rules to a (possibly new) factor connected to those neighboring agents.
The process is repeated until all agents are eliminated.
What remains is then returned.
Parameters
    A      The action space of the agents.
    graph  The graph to perform VE on.
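As a toy illustration of a single elimination step (standalone code, not the library's implementation): three agents in a chain with two actions each and two objectives; agent 1 is eliminated, and a new factor over its neighbors (agents 0 and 2) is produced, whose entries remember agent 1's surviving responses:

    #include <array>
    #include <iostream>
    #include <map>
    #include <utility>
    #include <vector>

    using Rewards = std::array<double, 2>;   // two objectives

    Rewards operator+(const Rewards & l, const Rewards & r) {
        return {l[0] + r[0], l[1] + r[1]};
    }

    // l dominates r if it is at least as good in every objective and
    // strictly better in at least one.
    bool dominates(const Rewards & l, const Rewards & r) {
        return l[0] >= r[0] && l[1] >= r[1] && (l[0] > r[0] || l[1] > r[1]);
    }

    int main() {
        // Two factors: f01 over agents (0,1), f12 over agents (1,2),
        // indexed by the joint action of the two attached agents.
        const Rewards f01[2][2] = {{{3, 0}, {0, 3}}, {{1, 1}, {2, 2}}};
        const Rewards f12[2][2] = {{{0, 1}, {1, 0}}, {{2, 0}, {0, 2}}};

        // Entries of the new factor: a reward vector plus the choice
        // of the eliminated agent that produced it (its "tag").
        struct Entry { Rewards vals; size_t a1; };
        std::map<std::pair<size_t, size_t>, std::vector<Entry>> newFactor;

        // Eliminate agent 1: for each joint action of its neighbors,
        // cross-sum the adjacent factors over agent 1's actions and
        // keep only the non-dominated results.
        for (size_t a0 = 0; a0 < 2; ++a0)
            for (size_t a2 = 0; a2 < 2; ++a2) {
                std::vector<Entry> candidates;
                for (size_t a1 = 0; a1 < 2; ++a1)
                    candidates.push_back({f01[a0][a1] + f12[a1][a2], a1});

                std::vector<Entry> kept;
                for (const auto & c : candidates) {
                    bool dead = false;
                    for (const auto & d : candidates)
                        if (&c != &d && dominates(d.vals, c.vals)) {
                            dead = true;
                            break;
                        }
                    if (!dead) kept.push_back(c);
                }
                newFactor[{a0, a2}] = std::move(kept);
            }

        // The new factor replaces the two old ones in the graph;
        // print the surviving (response, rewards) pairs per joint
        // action of the remaining agents.
        for (const auto & [key, entries] : newFactor)
            for (const auto & e : entries)
                std::cout << "a0=" << key.first << " a2=" << key.second
                          << " -> a1=" << e.a1 << " vals=(" << e.vals[0]
                          << ", " << e.vals[1] << ")\n";
    }

Repeating this step for every agent, and accumulating the tags along the way, yields the final Results: full joint actions paired with their non-dominated reward vectors.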