Introduction

This repository holds NVIDIA-maintained utilities to streamline mixed precision and distributed training in PyTorch. Some of the code here will eventually be included in upstream PyTorch. The intention of Apex is to make up-to-date utilities available to users as quickly as possible.

Full API Documentation: https://nvidia.github.io/apex

apex.parallel.SyncBatchNorm extends torch.nn.modules.batchnorm._BatchNorm to support synchronized batch normalization (sync BN). It reduces batch statistics across processes during multiprocess distributed data parallel training. Synchronized BN has been used in cases where only a very small mini-batch fits on each GPU. Because the statistics are all-reduced, the effective batch size of a sync BN layer becomes the combined size of the mini-batches across all processes. It has improved the converged accuracy in some of our research models.
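To illustrate why reducing statistics across processes raises the effective batch size, here is a minimal pure-Python sketch. It is not Apex's implementation: the function name `combine_stats` and the toy data are hypothetical, and the summation over processes only stands in for what an all-reduce would do in a real distributed run. Each "process" contributes its local sum and sum of squares, and the combined mean and variance match those of the full global batch.

```python
from statistics import fmean, pvariance


def combine_stats(per_process_batches):
    """Return (mean, biased variance) over all processes' mini-batches.

    Hypothetical helper for illustration only. Each inner list plays the
    role of one process's local mini-batch for a single channel.
    """
    # In a distributed run these three sums would be all-reduced across
    # processes; here we simply add them up in one process.
    total_count = sum(len(batch) for batch in per_process_batches)
    total_sum = sum(sum(batch) for batch in per_process_batches)
    total_sq_sum = sum(sum(x * x for x in batch) for batch in per_process_batches)

    mean = total_sum / total_count
    # Biased variance via E[x^2] - E[x]^2, the form used for normalization.
    var = total_sq_sum / total_count - mean * mean
    return mean, var


# Four simulated processes, each holding a mini-batch of two values.
batches = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
mean, var = combine_stats(batches)

# Reference: statistics computed directly over the concatenated global batch.
flat = [x for batch in batches for x in batch]
ref_mean, ref_var = fmean(flat), pvariance(flat)
```

With only two samples per process, the local statistics would be noisy; the combined statistics are those of all eight samples, which is the effect described above.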

Requirements

Python 3

CUDA 9 or 10

PyTorch 0.4 or newer. We recommend using the latest stable release, obtainable from https://pytorch.org/. We also test against the latest master branch, obtainable from https://github.com/pytorch/pytorch.
If you have any problems building, please file an issue.

The C++ and CUDA extensions require PyTorch 1.0 or newer.

Quick Start

Linux

To build the extension, run

python setup.py install

in the root directory of the cloned repository.

To use the extension:

import apex
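As a quick sanity check after installation, a small sketch like the following can confirm that a package is importable from the active environment before training code relies on it. The helper name `is_installed` is hypothetical and not part of Apex; it only uses the standard library's `importlib.util.find_spec`, and it is meant for top-level package names.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found on the import path.

    Hypothetical helper for illustration; it does not import the module.
    """
    return importlib.util.find_spec(module_name) is not None


# Reports whether Apex is visible in the current Python environment.
apex_available = is_installed("apex")
print("apex importable:", apex_available)
```

If this reports that Apex is not importable, a common cause is that the build was installed into a different Python or Conda environment than the one currently active.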

CUDA/C++ extension

Apex contains optional CUDA/C++ extensions, installable via

python setup.py install [--cuda_ext] [--cpp_ext]

Currently, --cuda_ext enables

Fused kernels that improve the performance and numerical stability of apex.parallel.SyncBatchNorm.
Fused kernels required to use apex.optimizers.FusedAdam.
Fused kernels required to use apex.normalization.FusedLayerNorm.

--cpp_ext enables

C++-side flattening and unflattening utilities that reduce the CPU overhead of apex.parallel.DistributedDataParallel.

Windows support

Windows support is experimental, and Linux is recommended. However, since Apex can be installed as Python-only, there's a good chance the Python-only features "just work" the same way as on Linux. If you installed PyTorch in a Conda environment, make sure to install Apex in that same environment.


