Python Action.playの例

プログラミング言語: Python

名前空間/パッケージ名: models

クラス/型: Action

メソッド/関数: play

hotexamples.comのコード掲載数: 1

Python Action.play - 1件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのmodels.Action.playの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

Action(30)

create(2)

keywords(2)

get_or_create(1)

redirect_link(1)

points(1)

play(1)

phrase(1)

name(1)

latest_action(1)

get_action_count_for_user(1)

_output_action_names(1)

description(1)

create_without_saving(1)

createEdge(1)

connectTo(1)

by_id(1)

activists(1)

_output_canonical_action_names(1)

timestamp(1)

コード例 #1

ファイルを表示

ファイル: test_bandit.py プロジェクト: Fandekasp/reinforcement_learning_exercices

class TestAction(TestCase):
    """Tests related to the action"""

    def setUp(self):
        self.bandit = Bandit()
        self.action = Action(self.bandit)

    def test_action_value_is_gaussian(self):
        """
        Verify that our action will get a correct value from N(mu, sigma)
        - mu: the mean, here 0
        - sigma: the variance, here 1
        """
        results = [Action(self.bandit).value for x in range(5000)]
        law_1 = get_percent([x for x in results if x > 0], results)
        self.assertAlmostEqual(law_1, 50, delta=2, msg="%s%% of results are "
            "above mu instead of around 51%%" % law_1)
        law_2 = get_percent([x for x in results if x > -1 and x < 1], results)
        self.assertAlmostEqual(law_2, 68, delta=2, msg="%s%% of results are "
            "above mu instead of around 68%%" % law_2)

    def test_gauss_with_numpy(self):
        """We use numpy, let's add some verifications"""
        mu, sigma = 0, 1
        value = np.random.normal(mu, sigma, 1000)
        diff_mean = abs(mu - np.mean(value))
        self.assertLess(diff_mean, 0.12, "diff mean = %s > 0.1" % diff_mean)
        diff_vari = abs(sigma - np.std(value, ddof=1))
        self.assertLess(diff_vari, 0.12, "diff vari = %s > 0.1" % diff_vari)

    def test_action_can_play(self):
        """Let's play and check that action got a reward"""
        action = self.action  # shortcut
        [action.play() for x in range(1000)]
        lim = [action.mu - 3 * action.sigma, action.mu + 3 * action.sigma]
        rewards_in_range = [r for r in action.rewards if lim[0] <= r <= lim[1]]
        percent = get_percent(rewards_in_range, action.rewards)
        self.assertAlmostEqual(percent, 85.0, delta=15, msg="%s%% instead of "
            "99,7%% for a Normal Distribution" % percent)
    # TODO verify why it comes to 86% sometimes !

    def test_action_mean_reward(self):
        """Test that an action keep an history of rewards and that mean_reward
        become closer to value after multiples plays"""
        [self.action.play() for i in range(1, 1000)]
        vari = abs(self.action.sigma - np.std(self.action.rewards, ddof=1))
        self.assertLess(vari, 0.12, "diff variance = %s > 0.1" % vari)