double exploThreshold = temp < 0.1 ? 0.1 : temp;
if (explo > exploThreshold) {
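The two statements above clamp an exploration rate to a floor of 0.1 and compare it against a random draw. The following is a minimal, self-contained sketch of that decision rule, not the aGrUM implementation. It assumes that `temp` is a decaying exploration rate, that `explo` is a uniform draw such as the one `randomProba()` returns, and that the branch taken when `explo > exploThreshold` is the greedy one, as is usual in epsilon-greedy schemes. The helper names `clampedThreshold`, `shouldExploit` and `chooseAction`, and the use of a plain vector of action values, are hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <iterator>
#include <random>
#include <vector>

// Hypothetical helper: keeps the exploration rate at or above 0.1,
// mirroring the ternary expression in the excerpt.
double clampedThreshold(double temp) { return temp < 0.1 ? 0.1 : temp; }

// Hypothetical helper: true when the random draw exceeds the clamped
// threshold. Assumption: this case corresponds to acting greedily.
bool shouldExploit(double explo, double temp) {
  return explo > clampedThreshold(temp);
}

// Hypothetical selection routine over a non-empty vector of action values.
// Returns the index of the best-valued action when exploiting, and a
// uniformly random index otherwise.
std::size_t chooseAction(const std::vector<double>& actionValues,
                         double temp,
                         std::mt19937& rng) {
  // Stand-in for randomProba(); note that this distribution excludes 1.0.
  std::uniform_real_distribution<double> unit(0.0, 1.0);
  const double explo = unit(rng);

  if (shouldExploit(explo, temp)) {
    const auto best =
        std::max_element(actionValues.begin(), actionValues.end());
    return static_cast<std::size_t>(
        std::distance(actionValues.begin(), best));
  }

  std::uniform_int_distribution<std::size_t> pick(0, actionValues.size() - 1);
  return pick(rng);
}
```

With the floor at 0.1, the sketch explores at least about one time in ten even after `temp` has decayed to zero, so no action is ever permanently excluded.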
Headers of the epsilon-greedy decision maker class.
A class to store the optimal actions.
Size size() const
Gives the size.
void initialize(const FMDP< double > *fmdp)
Initializes the learner.
E_GreedyDecider()
Constructor.
ActionSet stateOptimalPolicy(const Instantiation &curState)
~E_GreedyDecider()
Destructor.
void checkState(const Instantiation &newState, Idx actionId)
SequenceIteratorSafe< const DiscreteVariable * > endVariables() const
Returns an iterator reference to the end of the list of variables.
SequenceIteratorSafe< const DiscreteVariable * > beginVariables() const
Returns an iterator reference to the beginning of the list of variables.
virtual void initialize(const FMDP< double > *fmdp)
Initializes the learner.
virtual ActionSet stateOptimalPolicy(const Instantiation &curState)
Class for assigning/browsing values to tuples of discrete variables.
Size Idx
Type for indexes.
double randomProba()
Returns a random double between 0 and 1, both bounds included.
gum is the global namespace for all aGrUM entities.
Contains utility functions for random number generation.