Index

Package

Class

Tree

Deprecated

Index

Help

PREV NEXT

FRAMES NO FRAMES

A C M N P R S T U V

A

actionName - Variable in class Mdp: an array containing the names of all of the actions
animate(int[], int) - Method in class CatMouseAnimator: This method will animate the mdp provided to the constructor for numSteps steps, following policy pi.

CatMouse - class CatMouse.: This class provides a simple main for running on cat and mouse data.
CatMouse() - Constructor for class CatMouse
CatMouseAnimator - class CatMouseAnimator.: This class can be used to "animate" the interaction between a cat and mouse.
CatMouseAnimator(Mdp) - Constructor for class CatMouseAnimator: The constructor for this class for printing results to standard output.
CatMouseAnimator(Mdp, PrintStream) - Constructor for class CatMouseAnimator: The constructor for this class for printing results to a given print stream.

Mdp - class Mdp.: This class represents an MDP, and includes a constructor that can read the MDP from a data file.
Mdp(String) - Constructor for class Mdp: This constructor reads in data from filename and sets up all of the public fields.
main(String[]) - Static method in class CatMouse: This is a simple main.

nextState - Variable in class Mdp: a list of the possible states that can be reached from each state under each action
numActions - Variable in class Mdp: total number of actions
numStates - Variable in class Mdp: total number of states

PolicyEval - class PolicyEval.: This is the template of a class that evaluates a given policy, i.e., computes the utility of each state when actions are chosen according to it.
PolicyEval(Mdp, double, int[]) - Constructor for class PolicyEval: The constructor for this class.
PolicyIter - class PolicyIter.: This is the template of a class that should run policy iteration on a given MDP to compute the optimal policy which is returned in the public policy field.
PolicyIter(Mdp, double) - Constructor for class PolicyIter: The constructor for this class.
policy - Variable in class PolicyIter: the computed optimal policy for the given MDP
policy - Variable in class ValueIter: the computed optimal policy for the given MDP
precision - Static variable in class PolicyEval: the precision used to determine when to stop iterating (called epsilon in lecture)
precision - Static variable in class ValueIter: the precision used to determine when to stop iterating (called epsilon in lecture)

reward - Variable in class Mdp: an array containing the (immediate) reward associated with every state

startState - Variable in class Mdp: the start state
stateName - Variable in class Mdp: an array containing the names of all of the states

transProb - Variable in class Mdp: a table of transition probabilities: transProb[s][a][i] is the probability of transitioning from state s under action a to state nextState[s][a][i].

utility - Variable in class PolicyEval: the computed utility of each state under the given policy
utility - Variable in class ValueIter: the computed optimal utility for the given MDP

ValueIter - class ValueIter.: This is the template of a class that should run value iteration on a given MDP to compute the optimal policy which is returned in the public policy field.
ValueIter(Mdp, double) - Constructor for class ValueIter: The constructor for this class.

A C M N P R S T U V

Package

Class

Tree

Deprecated

Index

Help

PREV NEXT

FRAMES NO FRAMES