Example #1
0
 def expand(self, pcoll):
     """Write the input collection to MongoDB as a composite transform.

     Assigns object ids, redistributes elements across workers, then
     performs the batched writes.
     """
     with_ids = pcoll | beam.ParDo(_GenerateObjectIdFn())
     sharded = with_ids | Reshuffle()
     # NOTE(review): _WriteMongoFn presumably batches by self._batch_size —
     # confirm against its definition.
     writer = _WriteMongoFn(self._uri, self._db, self._coll,
                            self._batch_size, self._spec)
     return sharded | beam.ParDo(writer)
Example #2
0
 def expand(self, pcoll):
     """Expand into: singleton seed -> split query -> reshuffle -> read.

     Starts from a single-element seed so that the query-splitting and
     reading DoFns drive all of the actual work.
     """
     seed = pcoll.pipeline | 'UserQuery' >> beam.Create([1])
     pages = seed | 'SplitQuery' >> beam.ParDo(
         PaginateQueryDoFn(*self.args, **self.kwargs))
     # Redistribute the split queries across workers before reading.
     sharded = pages | "reshuffle" >> Reshuffle()
     return sharded | 'Read' >> beam.ParDo(
         SQLSourceDoFn(*self.args, **self.kwargs))
Example #3
0
    def expand(self, pcoll):
        """Read entities from Datastore as a composite transform.

        The expansion performs three steps:

        1. Create a singleton of the user-provided query and apply a
           ``ParDo`` that splits it into ``num_splits`` queries when
           possible. If ``num_splits`` is 0, the number of splits is
           computed dynamically based on the size of the data for the
           query.
        2. Shard the resulting ``PCollection`` across workers using a
           ``Reshuffle`` operation.
        3. Apply a ``ParDo`` that reads entities for each split query,
           producing a ``PCollection[Entity]``.
        """
        split_queries = (pcoll.pipeline
                         | 'UserQuery' >> Create([self._query])
                         | 'SplitQuery' >> ParDo(
                             ReadFromDatastore._SplitQueryFn(self._num_splits)))
        sharded = split_queries | Reshuffle()
        return sharded | 'Read' >> ParDo(ReadFromDatastore._QueryFn())