|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.ObjectPolicyEval
This is the template of a class that evaluates a given policy, i.e., computes the utility of each state when actions are chosen according to it. The utility is returned in the public utility field. You need to fill in the constructor. You may wish to add other fields with other useful information that you want this class to return (for instance, number of iterations before convergence).
Field Summary | |
static double |
precision
the precision used to determine when to stop iterating (called epsilon in lecture) |
double[] |
utility
the computed utility of each state under the given policy |
Constructor Summary | |
PolicyEval(Mdp mdp,
double discount,
int[] pi)
The constructor for this class. |
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
public double[] utility
public static double precision
Constructor Detail |
public PolicyEval(Mdp mdp, double discount, int[] pi)
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |