Python MRSparkScriptWordcount.sandbox 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: mrjob.examples.mr_spark_wordcount_script

메소드/함수: sandbox

hotexamples.com에서의 예제들: 6

Python MRSparkScriptWordcount.sandbox - 6개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 mrjob.examples.mr_spark_wordcount_script.MRSparkScriptWordcount.sandbox에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

MRSparkScriptWordcount(4)

sandbox(4)

make_runner(3)

예제 #1

파일 보기

    def test_empty(self):
        # this doesn't work on the inline runner because
        # Spark doesn't have a working dir to upload stop_words.txt
        # to. See below for what does and doesn't work in inline
        # runner
        job = MRSparkScriptWordcount(['-r', 'local'])
        job.sandbox()

        with job.make_runner() as runner:
            runner.run()

            self.assertEqual(sorted(to_lines(runner.cat_output())), [])

예제 #2

파일 보기

    def test_spark_script_mrjob(self):
        text = b'one fish\ntwo fish\nred fish\nblue fish\n'

        job = MRSparkScriptWordcount(['-r', 'spark'])
        job.sandbox(stdin=BytesIO(text))

        counts = {}

        with job.make_runner() as runner:
            runner.run()

            for line in to_lines(runner.cat_output()):
                k, v = safeeval(line)
                counts[k] = v

        self.assertEqual(counts, dict(blue=1, fish=4, one=1, red=1, two=1))

예제 #3

파일 보기

파일: test_local.py 프로젝트: Affirm/mrjob

    def test_spark_script_mrjob(self):
        text = b'one fish\ntwo fish\nred fish\nblue fish\n'

        job = MRSparkScriptWordcount(['-r', 'local'])
        job.sandbox(stdin=BytesIO(text))

        counts = {}

        with job.make_runner() as runner:
            runner.run()

            for line in to_lines(runner.cat_output()):
                k, v = safeeval(line)
                counts[k] = v

        self.assertEqual(counts, dict(
            blue=1, fish=4, one=1, red=1, two=1))

예제 #4

파일 보기

    def test_count_words(self):
        job = MRSparkScriptWordcount(['-r', 'local'])
        job.sandbox(
            stdin=BytesIO(b'Mary had a little lamb\nlittle lamb\nlittle lamb'))

        with job.make_runner() as runner:
            runner.run()

            output = sorted(
                safeeval(line) for line in to_lines(runner.cat_output()))

            self.assertEqual(output, [
                ('a', 1),
                ('had', 1),
                ('lamb', 3),
                ('little', 3),
                ('mary', 1),
            ])

예제 #5

파일 보기

파일: test_inline.py 프로젝트: qui/mrjob

    def test_no_spark_script_steps(self):
        # just a sanity check; _STEP_TYPES is tested in a lot of ways
        job = MRSparkScriptWordcount(['-r', 'inline'])
        job.sandbox()

        self.assertRaises(NotImplementedError, job.make_runner)

예제 #6

파일 보기

파일: test_inline.py 프로젝트: Yelp/mrjob

    def test_no_spark_script_steps(self):
        # just a sanity check; _STEP_TYPES is tested in a lot of ways
        job = MRSparkScriptWordcount(['-r', 'inline'])
        job.sandbox()

        self.assertRaises(NotImplementedError, job.make_runner)