def configure_ddp(self, model: LightningModule, device_ids: List[int]) -> DistributedDataParallel:
    model = RPCPlugin(process_group=mpu.get_data_parallel_group()).configure_ddp(model, device_ids)
    # The plugin handles backwards across processes. Currently not supported for DDP + pipe parallel
    model.require_backward_grad_sync = False
    return model
def configure_ddp(self, model: LightningModule, device_ids: List[int]) -> DistributedDataParallel:
    ddp_plugin = RPCPlugin(process_group=mpu.get_data_parallel_group()).configure_ddp(model, device_ids)
    # The plugin handles backwards across processes. Currently not supported for DDP + pipe parallel
    ddp_plugin.PREPARE_FOR_BACKWARDS = False
    return ddp_plugin
def _sync_balance_to_all_parallel_groups(self, main_rank=0):
    """
    Ensures that we sync the balance to all main processes, so that the balance is the same per replica.

    Args:
        main_rank: The rank with the balance we'd like to replicate.
    """
    self.balance = torch.tensor(self.balance, dtype=torch.int, device='cuda')
    # Ensure we sync to all processes within the main data parallel group.
    # We use the data parallel group as all main processes are found within the same group.
    torch_distrib.broadcast(self.balance, src=main_rank, group=mpu.get_data_parallel_group())
    self.balance = self.balance.cpu()
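# A minimal runnable sketch of the broadcast pattern used above, assuming a single-node
# "gloo" process group spun up purely to show the semantics; the real plugin broadcasts
# CUDA tensors within mpu.get_data_parallel_group(). The helper name demo_balance_broadcast
# is hypothetical.
import os

import torch
import torch.distributed as torch_distrib


def demo_balance_broadcast():
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29500")
    torch_distrib.init_process_group("gloo", rank=0, world_size=1)
    balance = torch.tensor([2, 2], dtype=torch.int)
    # With world_size=1 this is a no-op; with several ranks every process would leave this
    # call holding the balance tensor owned by src, so all replicas split layers identically.
    torch_distrib.broadcast(balance, src=0)
    torch_distrib.destroy_process_group()
    return balance.tolist()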
def configure_ddp(self):
    if self.main_rpc_process:
        self.pre_configure_ddp()
        self._model = DistributedDataParallel(
            LightningDistributedModule(self.model),
            device_ids=self.determine_ddp_device_ids(),
            process_group=mpu.get_data_parallel_group(),
            **self._ddp_kwargs,
        )
        # The plugin handles backwards across processes. Currently not supported for DDP + pipe parallel
        self._model.require_backward_grad_sync = False
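# A hedged sketch of why the DDP wrapper above is given the data-parallel process group:
# gradient all-reduce then happens only between replicas of the same pipe stage, not across
# stages. The helper name wrap_stage_for_data_parallel and its arguments are illustrative,
# not part of the plugin.
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel


def wrap_stage_for_data_parallel(stage_module: nn.Module, data_parallel_group, device_id: int):
    # Only ranks inside `data_parallel_group` participate in syncing this module's gradients;
    # ranks holding other pipe stages are unaffected by this wrapper.
    return DistributedDataParallel(
        stage_module.to(device_id),
        device_ids=[device_id],
        process_group=data_parallel_group,
    )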
def data_parallel_group(self):
    return mpu.get_data_parallel_group()
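# A hedged sketch of how a data-parallel group like the one returned above could be created
# with raw torch.distributed, assuming 4 ranks arranged as 2 data-parallel groups of 2.
# mpu normally owns this bookkeeping; the rank layout and helper name are purely illustrative.
import torch.distributed as torch_distrib


def build_data_parallel_group():
    # new_group must be called by every rank in the same order, even for groups the rank
    # does not belong to; each rank then keeps the handle of the group it is a member of.
    groups = [torch_distrib.new_group(ranks=[0, 2]), torch_distrib.new_group(ranks=[1, 3])]
    return groups[torch_distrib.get_rank() % 2]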