Python Policy.getitem Examples

Programming Language: Python

Namespace/Package Name: policy

Class/Type: Policy

Method/Function: __getitem__

Examples at hotexamples.com: 1

Python Policy.__getitem__ - 1 examples found. These are the top rated real world Python examples of policy.Policy.__getitem__ extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

Policy(30)

action_prob(20)

__init__(13)

act(12)

checkWin(6)

build_deterministic(5)

action(4)

MakeMove(3)

build(3)

CheckLegal(3)

CRAWLER_NUMBER(2)

query(2)

qFunc(2)

choose_action(2)

fromString(2)

INVALID(2)

epsilonGreedy(2)

check_policy(1)

user(1)

classifier(1)

group(1)

script(1)

set_probability(1)

APPLY_TIME_INTERVAL(1)

actions_probas_from(1)

check(1)

calculate_probs(1)

apply_accumulated_gradients(1)

add_models(1)

B(1)

action_masks(1)

_placeholders(1)

_func(1)

__getitem__(1)

W(1)

TIME_INTERVAL_ST(1)

TIME_INTERVAL_ED(1)

CRAWLER_TYPE(1)

weights(1)

Example #1

Show file

    def compute_single_policy_backup(self, policy: Policy, gamma: float) -> Tuple[ValueFunction, float]:
        '''
        Performs a policy backup on the current value function 
        and using the specified policy.  
        This method does not modify the current value function; 
        instead it returns a new value function, 
        together with the error associated with the backup operation.
        '''
        # DONE
        new_value_function = ValueFunction(self._domain)
        error = 0
        for state in self._domain.get_observation_space().get_elements():
            if self._domain.is_terminal(state):
                new_value_function._values[state] = 0
            else:
                action = policy.__getitem__(state)
                # distribution = self._domain.get_next_state_distribution(state,action).get_values()
                new_value_function._values[state] = self.q_value(state,action,gamma)
                if error < abs(self.q_value(state,action,gamma) - self.__getitem__(state)):
                    error = abs(self.q_value(state,action,gamma) - self.__getitem__(state))


        return new_value_function, error

Python Policy.__getitem__ Examples

Python Policy.getitem Examples