ast_toolbox.mcts.MDP module¶

class ast_toolbox.mcts.MDP.TransitionModel(getInitialState, getNextState, isEndState, maxSteps, goToState)[source]¶

Bases: object

The wrapper for the transitin model used in the tree search.

Parameters:

getInitialState (function) – getInitialState() returns the initial AST state.
getNextState (function) – getNextState(s, a) returns the next state and the reward.
isEndState (function) – isEndState(s) returns whether s is a terminal state.
maxSteps (int) – The maximum path length.
goToState (function) – goToState(s) sets the simulator to the target state s.

ast_toolbox.mcts.MDP.simulate(model, p, policy, verbose=False, sleeptime=0.0)[source]¶

Simulate the environment model using the policy and the parameter p.

Parameters:

Returns: