OpenDeep: a fully modular & extensible deep learning framework in Python

OpenDeep is a deep learning framework for Python built from the ground up in Theano with a focus on flexibility and ease of use for both industry data scientists and cutting-edge researchers. OpenDeep is a modular and easily extensible framework for constructing any neural network architecture to solve your problem.

Use OpenDeep to:

- Quickly prototype complex networks through a focus on complete modularity and containers similar to Torch.
- Configure and train existing state-of-the-art models.
- Write your own models from scratch in Theano and plug into OpenDeep for easy training and dataset integration.
- Use visualization and debugging tools to see exactly what is happening with your neural net architecture.
- Plug into your existing NumPy/SciPy/pandas/scikit-learn pipeline.
- Run on the CPU or GPU.

This library is currently undergoing rapid development and is in its alpha stages.

Quick example usage

Train and evaluate a Multilayer Perceptron (MLP - your generic feedforward neural network for classification) on the MNIST handwritten digit dataset:

```
from opendeep.models import Prototype, Dense, SoftmaxLayer
from opendeep.optimization import AdaDelta
from opendeep.data import MNIST

print("Creating model...")
# Prototype is a container: layers are added in order
mlp = Prototype()
# two hidden layers of 512 rectifier units each, with dropout noise
mlp.add(Dense(input_size=28*28, output_size=512, activation='rectifier', noise='dropout'))
mlp.add(Dense(output_size=512, activation='rectifier', noise='dropout'))
# softmax output over the 10 digit classes
mlp.add(SoftmaxLayer(output_size=10))

print("Training...")
data = MNIST()
optimizer = AdaDelta(dataset=data, epochs=10)
mlp.train(optimizer)

print("Predicting...")
predictions = mlp.run(data.test_inputs)

# fraction of test examples whose predicted label matches the target
print("Accuracy: %f" % (float(sum(predictions == data.test_targets)) / len(data.test_targets)))
```
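The accuracy expression in the last line is the fraction of predictions that match their targets. A minimal pure-Python sketch of that computation follows, using made-up label lists rather than real MNIST outputs; the function name `accuracy` is illustrative and not part of OpenDeep.

```python
def accuracy(predictions, targets):
    """Return the fraction of predictions equal to their targets."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("cannot compute accuracy of an empty set")
    correct = sum(1 for p, t in zip(predictions, targets) if p == t)
    return float(correct) / len(targets)


# Made-up digit labels for illustration only (not real model output).
example_predictions = [7, 2, 1, 0, 4, 1, 4, 9]
example_targets = [7, 2, 1, 0, 4, 1, 4, 8]
example_accuracy = accuracy(example_predictions, example_targets)
print("Accuracy: %.3f" % example_accuracy)
```

The length checks are a design choice for the sketch: they turn a silent mismatch between predictions and targets into an explicit error.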

Installation

Because OpenDeep is still in alpha, it must be installed from source via setup.py. Make sure the dependencies below are installed first.

Dependencies

- Theano: Theano and its dependencies are required to use OpenDeep. Install the bleeding-edge version directly from the Theano GitHub repository, which includes installation instructions.
  - For GPU integration with Theano, you also need the latest CUDA drivers; see Theano's instructions for setting up the GPU. If you prefer to use a server on Amazon Web Services, instructions are also available for setting up an EC2 GPU server with Theano.
  - cuDNN (optional but recommended for CNNs): fast convolution support from NVIDIA. Move the cuDNN files into Theano's directory as described in the Theano cuDNN integration instructions.
- Pillow (PIL): image manipulation functionality.
- PyYAML (optional): YAML parsing of config files.
- Bokeh (optional): live charting/plotting of values during training or testing.
- NLTK (optional): NLP functions such as word tokenization.

All of these Python dependencies (but not the system-specific ones like CUDA or HDF5) can be installed by running pip install -r requirements.txt inside the root OpenDeep folder.
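To see which of the Python dependencies are already visible to your interpreter, a small check along these lines may help. It is only a sketch: it assumes Python 3.4 or later for `importlib.util.find_spec`, the "required"/"optional" labels are read off the list above, and the helper name `missing_modules` is made up for illustration. Note that Pillow is imported as `PIL` and PyYAML as `yaml`.

```python
import importlib.util

# Import names for the dependencies listed above. The PyPI package names
# differ from the import names for Pillow (PIL) and PyYAML (yaml).
DEPENDENCIES = {
    "theano": "required",
    "PIL": "required",
    "yaml": "optional",
    "bokeh": "optional",
    "nltk": "optional",
}


def missing_modules(module_names):
    """Return the names in module_names that cannot be found for import."""
    return [name for name in module_names
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = set(missing_modules(DEPENDENCIES))
    for name, status in DEPENDENCIES.items():
        state = "MISSING" if name in missing else "found"
        print("%-8s %-9s %s" % (name, status, state))
```

The check only looks the modules up without importing them, so it does not trigger Theano's startup or GPU initialization.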

Install from source

Navigate to your desired installation directory and clone the GitHub repository:
```
git clone https://github.com/vitruvianscience/opendeep.git
```
Navigate to the top-level folder (it should be named opendeep and contain the file setup.py) and run setup.py in develop mode:
```
cd opendeep
python setup.py develop
```

Using python setup.py develop instead of the normal python setup.py install allows you to update the repository files by pulling from git and have the whole package update! No need to reinstall when you get the latest files.

That's it! You should now be able to import opendeep in your Python modules.
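One way to confirm the install is to ask Python where it would load the package from. The sketch below assumes Python 3.4 or later; `package_origin` is a hypothetical helper name, not something OpenDeep provides.

```python
import importlib.util


def package_origin(name):
    """Return the file a top-level package would be imported from, or None."""
    spec = importlib.util.find_spec(name)
    return None if spec is None else spec.origin


if __name__ == "__main__":
    origin = package_origin("opendeep")
    if origin is None:
        print("opendeep is not importable from this environment")
    else:
        # With a develop install this path should point inside the git clone.
        print("opendeep will be imported from: %s" % origin)
```

If the printed path lies inside your cloned repository, pulling new commits should change the code that gets imported, which is the point of develop mode.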

Quick Start

To get up to speed on deep learning, check out the blog post Deep Learning 101. You can also go through the tutorials on OpenDeep's documentation site: http://www.opendeep.org/

Let's say you want to train a Denoising Autoencoder on the MNIST handwritten digit dataset. You can get started in just a few lines of code:

```
from opendeep.log import config_root_logger
from opendeep.data import MNIST
from opendeep.models import DenoisingAutoencoder
from opendeep.optimization import AdaDelta

# set up the logging to display to stdout and files so we can see what is happening.
config_root_logger()

# create the MNIST dataset
mnist = MNIST()

# define some model configuration parameters (this could have come from json!)
config = {
    "input_size": 28*28, # dimensions of the MNIST images
    "hidden_size": 1500  # number of hidden units - generally bigger than input size for DAE
}
# create the denoising autoencoder
dae = DenoisingAutoencoder(**config)

# create the optimizer to train the denoising autoencoder
# AdaDelta is normally a good generic optimizer
optimizer = AdaDelta(dataset=mnist, model=dae)
optimizer.train()
# note: the syntactic sugar of dae.train() calls optimizer.train() internally

# test the trained model and save some reconstruction images
n_examples = 100
# grab 100 test examples
test_xs = mnist.test_inputs[:n_examples]
# test and save the images
dae.create_reconstruction_image(test_xs)
```

Congrats, you just:

- set up a dataset (MNIST)
- instantiated a denoising autoencoder model with some configurations
- trained it with an AdaDelta optimizer
- and predicted some outputs given inputs (and saved them as an image)!
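The configuration comment in the example notes that the parameters could have come from JSON. A minimal sketch of that idea, using only the standard library: the JSON text is parsed into a dictionary and unpacked into keyword arguments. `build_model` here is a hypothetical stand-in so the snippet runs without OpenDeep installed; in the example above the dictionary is passed to DenoisingAutoencoder in the same way.

```python
import json

# Same keys as the config dictionary in the example, written as a JSON document.
CONFIG_JSON = '{"input_size": 784, "hidden_size": 1500}'


def build_model(input_size, hidden_size):
    """Hypothetical stand-in for a model constructor taking keyword arguments."""
    return {"input_size": input_size, "hidden_size": hidden_size}


config = json.loads(CONFIG_JSON)
model = build_model(**config)  # dictionary keys become keyword arguments
print(model)
```

Keeping the configuration in a file like this makes it possible to change sizes between runs without editing the training script.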

More Information

Source code: https://github.com/vitruvianscience/opendeep

Documentation and tutorials: http://www.opendeep.org/

User group: opendeep-users

Developer group: opendeep-dev

Twitter: @opendeep

We welcome any help in making this the best library possible! Feel free to fork the repository and join the Google groups.

Why OpenDeep?

- Modularity. A lot of recent deep learning progress has come from combining multiple models. Existing libraries are either too confusing or not extensible enough to support both novel research and quick setup of existing algorithms at scale. This need for transparency and modularity is the main motivation for creating the OpenDeep library, where we hope novel research and industry use can both be implemented easily.
- Ease of use. Many libraries require a lot of familiarity with deep learning or with their specific package structures. OpenDeep's goal is to be the best-documented deep learning library, with defaults smart enough that someone without a deep learning background can start training models, while experienced practitioners can easily create and customize their own algorithms.
- State of the art. As a side effect of modularity and ease of use, OpenDeep aims to maintain state-of-the-art performance as new algorithms and papers are published. As a research library, it places great importance on citing and crediting the authors and code it builds on.
