Skip to content

HabanaAI/Model-References

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Intel® Gaudi® AI Accelerator Examples for Training and Inference

Model List and Performance Data

Please visit this page for performance information.

This repository is a collection of models that have been ported to run on Intel®️ Gaudi®️ AI accelerator. They are intended as examples, and will be reasonably optimized for performance while still being easy to read.

Computer Vision

Models Framework Validated on Gaudi Validated on Gaudi2
ResNet50, ResNeXt101 PyTorch Training Training, Inference
ResNet152 PyTorch Training -
MobileNetV2 PyTorch Training -
UNet 2D, Unet3D PyTorch Lightning Training, Inference Training, Inference
SSD PyTorch Training Training
GoogLeNet PyTorch Training -
Vision Transformer PyTorch Training -
DINO PyTorch Training -
YOLOX PyTorch Training -

Natural Language Processing

Models Framework Validated on Gaudi Validated on Gaudi2
BERT Pretraining and Finetuning PyTorch Training, Inference Training, Inference
DeepSpeed BERT-1.5B, BERT-5B PyTorch Training -
BART PyTorch Training -
HuggingFace BLOOM PyTorch Inference Inference

Audio

Models Framework Validated on Gaudi Validated on Gaudi2
Wav2Vec2ForCTC PyTorch Inference Inference

Generative Models

Models Framework Validated on Gaudi Validated on Gaudi2
Stable Diffusion PyTorch Lightning Training Training
Stable Diffusion FineTuning PyTorch Training Training
Stable Diffusion v2.1 PyTorch Inference Inference

MLPerf™ Training 3.1

Models Framework Validated on Gaudi Validated on Gaudi2
GPT3 PyTorch - Training
ResNet50 PyTorch - Training
BERT PyTorch - Training

MLPerf™ Inference 3.1

Models Framework Validated on Gaudi Validated on Gaudi2
GPT-J PyTorch - Inference

MLPerf™ is a trademark and service mark of MLCommons Association in the United States and other countries. All rights reserved. Unauthorized use is strictly prohibited.

Reporting Bugs/Feature Requests

We welcome you to use the GitHub issue tracker to report bugs or suggest features.

When filing an issue, please check existing open, or recently closed, issues to make sure somebody else hasn't already reported the issue. Please try to include as much information as you can. Details like these are incredibly useful:

  • A reproducible test case or series of steps
  • The version of our code being used
  • Any modifications you've made relevant to the bug
  • Anything unusual about your environment or deployment

Community

Hugging Face

Megatron-DeepSpeed

DeepSpeed-Chat

Fairseq