Python MRSparkWordcount.make_runner Examples

Programming language: Python

Namespace/package name: mrjob.examples.mr_spark_wordcount

Class/type: MRSparkWordcount

Method/function: make_runner

Examples at hotexamples.com: 4

Python MRSparkWordcount.make_runner - 4 examples found. These are top-rated real-world Python examples of mrjob.examples.mr_spark_wordcount.MRSparkWordcount.make_runner, extracted from open source projects.

Frequently used methods

MRSparkWordcount(3)

make_runner(3)

sandbox(3)

Example #1

File: test_mr_spark_wordcount.py Project: yzhanggithub/mrjob

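    def test_empty(self):
        # No arguments and no stdin: the sandboxed job reads empty input.
        job = MRSparkWordcount([])
        job.sandbox()

        with job.make_runner() as runner:
            runner.run()

            # Empty input should produce no output lines at all.
            self.assertEqual(sorted(to_lines(runner.cat_output())), [])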

Example #2

File: test_inline.py Project: qui/mrjob

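    def test_spark_mrjob(self):
        text = b'one fish\ntwo fish\nred fish\nblue fish\n'

        # '-r inline' selects the inline runner, which runs in this process.
        job = MRSparkWordcount(['-r', 'inline'])
        # sandbox() feeds the in-memory bytes to the job as its stdin.
        job.sandbox(stdin=BytesIO(text))

        counts = {}

        with job.make_runner() as runner:
            runner.run()

            # Parse each output line back into a (word, count) pair.
            for line in to_lines(runner.cat_output()):
                k, v = safeeval(line)
                counts[k] = v

        self.assertEqual(counts, dict(blue=1, fish=4, one=1, red=1, two=1))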

Example #3

File: test_inline.py Project: Yelp/mrjob

    def test_spark_mrjob(self):
        text = b'one fish\ntwo fish\nred fish\nblue fish\n'

        # Run with the inline runner on in-memory input.
        job = MRSparkWordcount(['-r', 'inline'])
        job.sandbox(stdin=BytesIO(text))

        counts = {}

        with job.make_runner() as runner:
            runner.run()

            # Collect the (word, count) pairs from the job's output.
            for line in to_lines(runner.cat_output()):
                k, v = safeeval(line)
                counts[k] = v

        self.assertEqual(counts, dict(
            blue=1, fish=4, one=1, red=1, two=1))
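
The loop in Examples #2 and #3 turns the runner's output back into Python values. As a rough illustration of that parsing step using only the standard library (this is not mrjob's code): `iter_lines` is a hypothetical stand-in for `to_lines`, `ast.literal_eval` stands in for `safeeval`, and the sample chunks assume that each output line is the `repr` of a `(word, count)` tuple.

```python
import ast


def iter_lines(chunks):
    """Regroup arbitrary byte chunks into newline-terminated lines.

    Hypothetical stand-in for to_lines; not mrjob's implementation.
    """
    leftover = b''
    for chunk in chunks:
        leftover += chunk
        # Emit every complete line currently held in the buffer.
        while b'\n' in leftover:
            line, leftover = leftover.split(b'\n', 1)
            yield line + b'\n'
    # A final line without a trailing newline is still yielded.
    if leftover:
        yield leftover


def parse_counts(chunks):
    """Build a {word: count} dict from repr-style (word, count) lines."""
    counts = {}
    for line in iter_lines(chunks):
        # literal_eval accepts only Python literals, never arbitrary code.
        key, value = ast.literal_eval(line.decode('utf-8').strip())
        counts[key] = value
    return counts


# Assumed sample output, deliberately split at awkward chunk boundaries.
sample_chunks = [b"('blue', 1)\n('fi", b"sh', 4)\n", b"('one', 1)\n"]
parsed = parse_counts(sample_chunks)
```

Regrouping chunks into lines before parsing matters because output read from files or streams is not guaranteed to arrive split on line boundaries.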

Example #4

File: test_mr_spark_wordcount.py Project: yzhanggithub/mrjob

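    def test_count_words(self):
        job = MRSparkWordcount([])
        # The last input line has no trailing newline; it must still be counted.
        job.sandbox(
            stdin=BytesIO(b'Mary had a little lamb\nlittle lamb\nlittle lamb'))

        with job.make_runner() as runner:
            runner.run()

            # Sort the parsed pairs so the comparison ignores output order.
            output = sorted(
                safeeval(line) for line in to_lines(runner.cat_output()))

            # 'Mary' is expected back as 'mary': words are lowercased.
            self.assertEqual(output, [
                ('a', 1),
                ('had', 1),
                ('lamb', 3),
                ('little', 3),
                ('mary', 1),
            ])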
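
The expected output in Example #4 implies that the job lowercases words and sums their occurrences across lines. A minimal pure-Python sketch of that behaviour, with no Spark involved: the `WORD_RE` pattern and the lowercasing are assumptions inferred from the expected output rather than taken from the `MRSparkWordcount` source, and `count_words` is a hypothetical helper.

```python
import re
from collections import Counter

# Assumed tokenizer: runs of word characters and apostrophes.
WORD_RE = re.compile(r"[\w']+")


def count_words(lines):
    """Return sorted (word, count) pairs for an iterable of text lines."""
    counts = Counter()
    for line in lines:
        # Lowercase so that 'Mary' and 'mary' count as the same word.
        counts.update(word.lower() for word in WORD_RE.findall(line))
    return sorted(counts.items())


# Same input text as the test above, as str instead of bytes.
text = 'Mary had a little lamb\nlittle lamb\nlittle lamb'
word_counts = count_words(text.splitlines())
```

A plain `Counter` plays the role that a per-key reduction would play in a distributed job; the sketch is only meant to make the expected result easy to check by hand.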