def linear_with_grad_accumulation_and_async_allreduce_in16bit(
    input, weight, bias, gradient_accumulation_fusion, async_grad_allreduce,
):
    args = _cast_if_autocast_enabled(
        input, weight, bias, gradient_accumulation_fusion, async_grad_allreduce
    )
    with torch.cuda.amp.autocast(enabled=False):
        return LinearWithGradAccumulationAndAsyncAllreduceIn16Bit.apply(*args)

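
# All wrappers in this module follow the same pattern: cast the arguments according to the
# ambient autocast state, then run the fused kernel with autocast disabled so the custom
# autograd Function sees concrete dtypes. `_cast_if_autocast_enabled` is assumed to be
# imported from elsewhere in the codebase; the sketch below (hypothetical name, Apex-style
# behaviour assumed) is only meant to illustrate that contract, not to replace the import.
def _cast_if_autocast_enabled_reference(*args):
    # If autocast is off, pass the arguments through unchanged.
    if not torch.is_autocast_enabled():
        return args
    # Otherwise, cast floating-point tensors to the current autocast (GPU) dtype.
    return torch.cuda.amp.autocast_mode._cast(args, torch.get_autocast_gpu_dtype())
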
def scaled_upper_triang_masked_softmax(inputs, _, scale):
    b, np, sq, sk = inputs.size()
    assert sq == sk, "causal mask is only for self attention"

    # Reshaping input to 3D tensor (attn_batches, sq, sk)
    inputs = inputs.view(-1, sq, sk)
    args = _cast_if_autocast_enabled(inputs, scale)
    with torch.cuda.amp.autocast(enabled=False):
        probs = ScaledUpperTriangMaskedSoftmax.apply(*args)

    return probs.view(b, np, sq, sk)

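
# Eager-mode reference for the fused kernel above (assumption: the CUDA kernel applies the
# scale, masks out the strict upper triangle, and softmaxes over the last dimension). This
# hypothetical helper is for illustration and testing only.
def _scaled_upper_triang_masked_softmax_reference(inputs, scale):
    # inputs: (attn_batches, sq, sk) with sq == sk
    sq = inputs.size(-1)
    causal_mask = torch.triu(
        torch.ones(sq, sq, dtype=torch.bool, device=inputs.device), diagonal=1
    )
    scores = (inputs * scale).masked_fill(causal_mask, float("-inf"))
    return torch.nn.functional.softmax(scores, dim=-1)
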
def mixed_dtype_fused_layer_norm_affine(input, weight, bias, normalized_shape, eps=1e-6):
    args = _cast_if_autocast_enabled(input, weight, bias, normalized_shape, eps)
    with torch.cuda.amp.autocast(enabled=False):
        return FusedLayerNormAffineMixedDtypesFunction.apply(*args)

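
# Eager-mode reference (assumption): up to the mixed-dtype handling done inside the fused
# kernel, the affine layer norm above matches torch.nn.functional.layer_norm.
def _layer_norm_affine_reference(input, weight, bias, normalized_shape, eps=1e-6):
    return torch.nn.functional.layer_norm(input, normalized_shape, weight, bias, eps)
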
def scaled_masked_softmax(inputs, mask, scale):
    # input is 4D tensor (b, np, sq, sk)
    args = _cast_if_autocast_enabled(inputs, mask, scale)
    with torch.cuda.amp.autocast(enabled=False):
        return ScaledMaskedSoftmax.apply(*args)

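
# Eager-mode reference (assumption: `mask` is a boolean tensor broadcastable to
# (b, np, sq, sk), True at positions to be masked out). Hypothetical helper for illustration.
def _scaled_masked_softmax_reference(inputs, mask, scale):
    # Masked positions receive a large negative score before the softmax.
    scores = (inputs * scale).masked_fill(mask, -10000.0)
    return torch.nn.functional.softmax(scores, dim=-1)
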
def fused_bias_gelu(input, bias):
    args = _cast_if_autocast_enabled(input, bias)
    with torch.cuda.amp.autocast(enabled=False):
        return GeLUFunction.apply(*args)

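
# Eager-mode reference (assumption): the fused kernel computes GeLU(input + bias) with the
# tanh approximation commonly used by Megatron-style bias-GeLU fusions.
def _bias_gelu_reference(input, bias):
    x = input + bias
    return x * 0.5 * (1.0 + torch.tanh(0.7978845608 * x * (1.0 + 0.044715 * x * x)))
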
def _fast_layer_norm(x, weight, bias, epsilon):
    args = _cast_if_autocast_enabled(x, weight, bias, epsilon)
    with torch.cuda.amp.autocast(enabled=False):
        return FastLayerNormFN.apply(*args)

def bias_dropout_add_fused_inference(x, bias, residual, prob):
    args = _cast_if_autocast_enabled(x, bias, residual, prob)
    with torch.cuda.amp.autocast(enabled=False):
        return bias_dropout_add_fused_inference_(*args)

def bias_dropout_add_fused_train(x, bias, residual, prob):
    # re-enable torch grad to enable fused optimization.
    with torch.enable_grad():
        args = _cast_if_autocast_enabled(x, bias, residual, prob)
        with torch.cuda.amp.autocast(enabled=False):
            return bias_dropout_add_fused_train_(*args)

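
# Eager-mode reference for the two fused callables above (assumption: they are
# TorchScript-compiled versions of the standard bias-dropout-add residual pattern;
# inference corresponds to training=False).
def _bias_dropout_add_reference(x, bias, residual, prob, training):
    out = torch.nn.functional.dropout(x + bias, p=prob, training=training)
    return residual + out
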
def column_parallel_linear(input, weight, bias):
    args = _cast_if_autocast_enabled(input, weight, bias)
    with torch.cuda.amp.autocast(enabled=False):
        return ColumnParallelLinearWithAsyncAllreduce.apply(*args)

def fused_rms_norm(input, normalized_shape, eps=1e-6):
    args = _cast_if_autocast_enabled(input, normalized_shape, eps)
    with torch.cuda.amp.autocast(enabled=False):
        return FusedRMSNormFunction.apply(*args)

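
# Eager-mode reference (assumptions: `normalized_shape` is a shape tuple and the fused
# kernel applies RMS normalization without a learned weight over those trailing
# dimensions). Hypothetical helper for illustration only.
def _rms_norm_reference(input, normalized_shape, eps=1e-6):
    dims = tuple(range(-len(normalized_shape), 0))
    # Compute the mean square in fp32 for numerical stability, then cast back.
    variance = input.float().pow(2).mean(dim=dims, keepdim=True)
    return (input.float() * torch.rsqrt(variance + eps)).to(input.dtype)
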