Apache MXNet (incubating) for Deep Learning

Note: this version is forked from tag 0.11.0.rc3

Experimental: supports partial pushing/pulling with timeout

Clone with submodules:
git clone --recursive https://github.com/xcgoner/dist-mxnet.git

Build with GPU and distributed KVStore:
make -j5 USE_OPENCV=0 USE_BLAS=openblas USE_CUDA=1 USE_CUDA_PATH=/usr/local/cuda USE_CUDNN=1 USE_DIST_KVSTORE=1

Build CPU-only with distributed KVStore:
make -j5 USE_OPENCV=0 USE_BLAS=openblas USE_CUDA=0 USE_DIST_KVSTORE=1

pkill -u cx2 python

python train_mnist.py

python ../../tools/launch.py -n 2 --launcher ssh -H ../../tests/distributed/hosts python train_mnist.py --kv-store dist_sync

python ../../tools/launch.py -n 3 -s 3 --launcher ssh -H local_hosts python lr.py

python ../../tools/launch.py -n 2 --launcher ssh -H hosts python train_mnist.py --kv-store dist_sync

python ../../tools/launch.py -n 7 -s 2 --launcher ssh -H hosts 'export MXNET_MERGE_THRESHOLD=3 && export MXNET_MERGE_TAU_MILLISECOND=0 && export DMLC_PS_PULL_THRESHOLD=0.7 && export DMLC_PS_PARTIAL_PULL_ACTIVE=0 && export MXNET_KVSTORE_PARTIAL_PULL_HISTORY=0.2 && export DMLC_PS_PULL_DELAY=200 && python train_mnist.py --kv-store dist_sync'
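The last launch command above packs several environment variables into one quoted string, which is easy to mistype. As a minimal sketch, the same command can be assembled from a dictionary in Python. The helper name build_launch_command is hypothetical and not part of this repository; the variable names and values are copied from the command above, and their exact semantics are defined by this fork.

```python
import shlex

# Values copied from the launch command above. What each setting does is
# defined by this fork; this sketch only assembles the command line.
PARTIAL_PULL_ENV = {
    "MXNET_MERGE_THRESHOLD": "3",
    "MXNET_MERGE_TAU_MILLISECOND": "0",
    "DMLC_PS_PULL_THRESHOLD": "0.7",
    "DMLC_PS_PARTIAL_PULL_ACTIVE": "0",
    "MXNET_KVSTORE_PARTIAL_PULL_HISTORY": "0.2",
    "DMLC_PS_PULL_DELAY": "200",
}


def build_launch_command(env, num_workers, num_servers, hostfile, train_cmd):
    """Hypothetical helper: return the argument list for tools/launch.py."""
    # Each variable is exported on the remote side before training starts.
    exports = " && ".join(f"export {key}={value}" for key, value in env.items())
    remote_cmd = f"{exports} && {train_cmd}"
    return [
        "python", "../../tools/launch.py",
        "-n", str(num_workers),   # number of workers
        "-s", str(num_servers),   # number of servers
        "--launcher", "ssh",
        "-H", hostfile,
        remote_cmd,               # passed to launch.py as a single argument
    ]


cmd = build_launch_command(
    PARTIAL_PULL_ENV,
    num_workers=7,
    num_servers=2,
    hostfile="hosts",
    train_cmd="python train_mnist.py --kv-store dist_sync",
)

# Print a shell-safe version that can be pasted into a terminal.
print(" ".join(shlex.quote(part) for part in cmd))
```

The argument list can also be handed to subprocess.run(cmd) directly, which avoids shell quoting altogether.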

Apache MXNet (incubating) is a deep learning framework designed for both efficiency and flexibility. It allows you to mix symbolic and imperative programming to maximize efficiency and productivity. At its core, MXNet contains a dynamic dependency scheduler that automatically parallelizes both symbolic and imperative operations on the fly. A graph optimization layer on top of that makes symbolic execution fast and memory efficient. MXNet is portable and lightweight, scaling effectively to multiple GPUs and multiple machines.
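To make the difference between the two programming styles concrete, here is a toy sketch in plain Python. The Sym class and the var helper are hypothetical and are not MXNet's API; they only illustrate that imperative code computes values immediately, while symbolic code first builds an expression graph and evaluates it later, once inputs are bound.

```python
# Toy illustration only: Sym and var are hypothetical, NOT MXNet's API.

class Sym:
    """A node in a deferred (symbolic) expression graph."""

    def __init__(self, op, args=(), name=None, value=None):
        self.op = op          # "var", "const", "add" or "mul"
        self.args = args      # child nodes for "add" / "mul"
        self.name = name      # variable name for "var"
        self.value = value    # literal value for "const"

    def __add__(self, other):
        return Sym("add", (self, _wrap(other)))

    def __mul__(self, other):
        return Sym("mul", (self, _wrap(other)))

    def eval(self, **bindings):
        """Evaluate the graph once concrete values are bound to variables."""
        if self.op == "var":
            return bindings[self.name]
        if self.op == "const":
            return self.value
        left, right = (arg.eval(**bindings) for arg in self.args)
        return left + right if self.op == "add" else left * right


def _wrap(value):
    """Turn a plain number into a constant node; pass Sym nodes through."""
    return value if isinstance(value, Sym) else Sym("const", value=value)


def var(name):
    """Create a named placeholder whose value is supplied at eval time."""
    return Sym("var", name=name)


# Imperative style: each statement computes a value immediately.
a = 3.0
b_imperative = a * 2 + 1

# Symbolic style: build the expression first, run it later with bound inputs.
x = var("x")
b_symbolic = x * 2 + 1              # no arithmetic happens here
b_value = b_symbolic.eval(x=3.0)    # the graph is evaluated here
```

Because the symbolic form exists as a graph before it runs, a framework can inspect and optimize it ahead of execution, which is the role the paragraph above assigns to the graph optimization layer.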

MXNet is more than a deep learning project. It is also a collection of blueprints and guidelines for building deep learning systems, with insights into DL systems for hackers.

What's New

Version 0.11.0.rc3 Release - MXNet 0.11.0.rc3 Release.
Apache Incubator - We are now an Apache Incubator project.
Version 0.10.0 Release - MXNet 0.10.0 Release.
Version 0.9.3 Release - First 0.9 official release.
Version 0.9.1 Release (NNVM refactor) - NNVM branch is merged into master now. An official release will be made soon.
Version 0.8.0 Release
Updated Image Classification with new Pre-trained Models
Python Notebooks for How to Use MXNet
MKLDNN for Faster CPU Performance
MXNet Memory Monger, Training Deeper Nets with Sublinear Memory Cost
Tutorial for NVidia GTC 2016
Embedding Torch layers and functions in MXNet
MXNet.js: Javascript Package for Deep Learning in Browser (without server)
Design Note: Design Efficient Deep Learning Data Loading Module
MXNet on Mobile Device
Distributed Training
Guide to Creating New Operators (Layers)
Go binding for inference
Amalgamation and Go Binding for Predictors - Outdated
Training Deep Net on 14 Million Images on A Single Machine

Features

Design notes providing useful insights that can be reused by other DL projects
Flexible configuration for arbitrary computation graph
Mix and match imperative and symbolic programming to maximize flexibility and efficiency
Lightweight, memory efficient and portable to smart devices
Scales up to multiple GPUs and distributed settings with automatic parallelism
Support for Python, R, Scala, C++ and Julia
Cloud-friendly and directly compatible with S3, HDFS, and Azure

Ask Questions

Please use mxnet/issues for questions about how to use MXNet and for reporting bugs.

License

Reference Paper

Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, and Zheng Zhang. MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems. In Neural Information Processing Systems, Workshop on Machine Learning Systems, 2015

History

MXNet emerged from a collaboration by the authors of cxxnet, minerva, and purine2. The project reflects what we have learned from the past projects. MXNet combines aspects of each of these projects to achieve flexibility, speed, and memory efficiency.

