forked from dmlc/minerva

Minerva: a fast and flexible tool for deep learning. It provides an ndarray programming interface, just like NumPy. Python and C++ bindings are both available. The resulting code can run on either CPU or GPU, and multi-GPU support is easy to use. Please refer to the examples to see how the multi-GPU setting is used.


Minerva: a fast and flexible system for deep learning

Goal

Make deep learning a home dish.

Features

  • Matrix programming interface
  • Easy interaction with NumPy
  • Multi-GPU, multi-CPU support
  • Good performance: ImageNet AlexNet training achieves 213 and 403 images/s with one and two Titan GPUs, respectively. Numbers for four GPU cards will be coming soon.

Installation

Requirements and Dependencies

Minerva depends on several packages. If they are not in the system search path, please set the variables EXTERN_INCLUDE_PATH and EXTERN_LIB_PATH in configure.in accordingly.

  • CUDA 6.5
  • cuDNN
  • LMDB
  • NumPy

In addition, Minerva depends on the following packages. The resolve_deps script resolves them automatically: running it downloads and installs them into the deps directory at the root of the repository. If you use it, please consult configure.in.example for a sample configuration of the paths.

  • Boost
  • Boost.NumPy
  • glog
  • gflags
  • Google Test

How to build

  1. Set CXX_COMPILER and C_COMPILER in configure.in. g++-4.8 is strongly recommended.
  2. Set EXTERN_INCLUDE_PATH and EXTERN_LIB_PATH in configure.in to reflect any dependencies not in the system search path. If there is more than one path, use a comma-separated list.
  3. Run ./configure.
  4. Change directory into release (or debug for more verbose logging) and run make.

After these steps, Minerva should be built and linked on your system. To run the Python interpreter with Owl enabled, however, you have to source ${MINERVA_ROOT}/owl_environ.sh. This sets the PYTHONPATH environment variable so that the Minerva modules can be imported in Python. You can also source this script from ~/.profile or ~/.bashrc to automate the process.

Minerva is not unusual in this approach; ssh-agent, pip, opam, and many others work similarly. It allows a local installation of Minerva and its dependencies while still letting you run it from anywhere.

Running apps

There are two ways to use Minerva: writing C++ code or writing Python code. The Python binding is preferred since it provides easy interaction with NumPy.

We have implemented several applications in Minerva, including ImageNet and MNIST training. After you have built Minerva, you can run both the C++ and Python versions.

The Python applications are located in minerva/owl/apps. After you have built Minerva, you can run them with python {app}.py.

The source code for the C++ applications is located in minerva/apps, and the compiled executables are located in minerva/release/apps. You can run the executables directly.

The MNIST training data can be downloaded from http://pan.baidu.com/s/1ntsQs0x.

Writing your own app

Minerva allows you to write your own machine learning code using a matrix interface just like Matlab or NumPy. You can use C++ or Python, whichever you prefer; the two interfaces are quite similar. With Python, you can load data with NumPy and use it in Minerva, or convert Minerva NArrays into NumPy arrays and plot or print them with the tools NumPy provides.

The NArray interface provided by Minerva is very intuitive. If you are familiar with a matrix programming tool such as Matlab or NumPy, it should be very easy to get started with Minerva. More detailed documentation will be available soon.

Minerva allows you to use multiple GPUs at the same time. The set_device function specifies which device you want an operation to run on: once set, all the operations and statements that follow are performed on that device. This simple primitive gives you the flexibility to parallelize across multiple devices, whether CPU or GPU.
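The set_device semantics can be illustrated with a small stand-alone sketch. This is plain Python, not the actual owl API; the names set_device and add here are invented for illustration, and a real system would dispatch the work to the selected device rather than record a tag.

```python
# Stand-alone sketch of set_device-style semantics (not the actual owl API):
# a module-level "current device" tags every operation issued after it is set.

current_device = "cpu"

def set_device(device):
    """Route all subsequent operations to `device`."""
    global current_device
    current_device = device

def add(a, b):
    # A real system would enqueue this operation on the current device;
    # here we just record where it would run, alongside the result.
    return {"op": "add", "device": current_device, "value": a + b}

set_device("gpu0")
x = add(1, 2)   # would run on gpu0
set_device("gpu1")
y = add(3, 4)   # would run on gpu1

print(x["device"], y["device"])  # gpu0 gpu1
```

The point of the primitive is that device selection is ambient rather than per-operation, so a block of statements can be moved to another device by changing a single call.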

Minerva uses lazy evaluation, meaning that operations are carried out only when necessary. For example, when you write c = a + b, the matrix addition is not performed immediately. Instead, a dependency graph is constructed internally to track the dependency relationships. Once you try to evaluate the matrix c, either by printing some of its elements or by calling c.WaitForEval(), Minerva will look up the dependency graph and carry out all computations specified by the graph. In this way, you can "push" multiple operations to different devices and then trigger the evaluation on both devices at the same time. This is how multi-GPU programming is done in Minerva. Please refer to the code for more details.
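The idea above can be sketched in a few lines of plain Python. This toy is only an illustration of lazy evaluation via a dependency graph, not Minerva's internal implementation; the class and method names (LazyNode, wait_for_eval) are invented for the example.

```python
# Minimal sketch of lazy evaluation with a dependency graph (plain Python,
# illustrating the idea only -- not Minerva's internals).

class LazyNode:
    def __init__(self, op, inputs=(), value=None):
        self.op = op          # e.g. "const" or "add"
        self.inputs = inputs  # upstream LazyNodes this node depends on
        self.value = value    # known only for constants, until evaluated

    def __add__(self, other):
        # Building c = a + b only records a graph node; nothing is computed.
        return LazyNode("add", (self, other))

    def wait_for_eval(self):
        # Walk the dependency graph and carry out the deferred computation.
        if self.value is None:
            vals = [n.wait_for_eval() for n in self.inputs]
            if self.op == "add":
                self.value = vals[0] + vals[1]
        return self.value

a = LazyNode("const", value=1)
b = LazyNode("const", value=2)
c = a + b                  # no addition performed here
print(c.value)             # None: not evaluated yet
print(c.wait_for_eval())   # 3: the graph is evaluated on demand
```

Because evaluation is deferred until wait_for_eval, a scheduler is free to batch the recorded operations and run independent subgraphs concurrently, which is what makes the multi-device pattern described above possible.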

License and support

Minerva is provided under the Apache v2 open-source license.

You can use the "issues" tab on GitHub to report bugs. For non-bug issues, please send an email to minerva-support@googlegroups.com.

History

Minerva evolved from an early prototype that follows the same principles. The system is described in a paper published at the NIPS 2014 workshop (http://nips.cc/Conferences/2014/Program/event.php?ID=4272); the paper is available at http://stanford.edu/~rezab/nips2014workshop/submits/minerva.pdf.

