ast_toolbox.rewards.ast_reward.ASTReward
Bases: object
Class that calculates the reward at each timestep when optimizing AST solver policies.
give_reward
Returns the reward for a given time step.
Returns: reward (float) – Reward based on the previous action.
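As a rough illustration of how a reward class with this shape might be subclassed, here is a minimal sketch. It does not import ast_toolbox: the ASTReward class below is a local stand-in, and the give_reward(action, **kwargs) signature, the "info" keyword, and its "is_goal" and "distance" keys are all assumptions made for the example, as is the DistanceReward class itself. Check the library source for the actual interface.

```python
class ASTReward(object):
    """Local stand-in for the documented base class (assumed interface)."""

    def give_reward(self, action, **kwargs):
        # Assumed signature; subclasses are expected to override this.
        raise NotImplementedError


class DistanceReward(ASTReward):
    """Hypothetical reward: penalize distance to a failure, bonus on failure."""

    def __init__(self, failure_bonus=0.0, distance_weight=1.0):
        self.failure_bonus = failure_bonus
        self.distance_weight = distance_weight

    def give_reward(self, action, **kwargs):
        # "info" and its keys are illustrative, not part of the documented API.
        info = kwargs.get("info", {})
        if info.get("is_goal", False):
            # A failure was found at this step: return the bonus.
            return float(self.failure_bonus)
        # Otherwise return a negative reward proportional to the distance.
        return -self.distance_weight * float(info.get("distance", 0.0))


reward_fn = DistanceReward(distance_weight=0.5)
step_reward = reward_fn.give_reward(
    action=[0.1, -0.2],
    info={"distance": 4.0, "is_goal": False},
)
goal_reward = reward_fn.give_reward(
    action=[0.0, 0.0],
    info={"distance": 0.0, "is_goal": True},
)
```

In this sketch the method returns a single float per call, matching the documented return type; the action argument is accepted but unused, since this particular reward depends only on the simulated state.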