Weighted Ensemble Python (wepy)

Modular implementation and framework for running weighted ensemble simulations in pure python, where the aim is to have simple things simple and complicated things possible. The latter being the priority.

The goal of the architecture is that it should be highly modular to allow extension, but provide a “killer app” for most uses that just works, no questions asked.

Comes equipped with support for OpenMM molecular dynamics, parallelization using multiprocessing, the WExplore and REVO (Resampling Ensembles by Variance Optimization) resampling algorithms, and an HDF5 file format and library for storing and querying your WE datasets that can be used from the command line.

The deeper architecture of wepy is intended to be loosely coupled, so that unforeseen use cases can be accomodated, but tightly integrated for the most common of use cases, i.e. molecular dynamics.

This allows freedom for fast development of new methods.

Community

Discussion takes place on riot.im (#wepy:matrix.org) which is a slack-like app that works on the Matrix protocol: https://riot.im/app/#/room/#wepy:matrix.org

You can also contact me directly:

samuel.lotz@salotz.info

Installation

Wepy is still in beta but you can install by cloning this repository, switching to the last release and installing with pip:

git clone https://github.com/ADicksonLab/wepy
cd wepy
pip install -e .

PyPI and Anaconda repos are planned.

The only absolutely necessary dependencies are numpy and h5py which are used in the core classes.

Outside of the core classes there are a couple of dependencies which are needed for this distribution but not in general if you use wepy as a library:

OpenMM (http://openmm.org/) (7.2 suggested)
pandas (https://pandas.pydata.org/)
mdtraj (http://mdtraj.org)
networkx >=2 (https://networkx.github.io/)
geomm (https://github.com/ADicksonLab/geomm)

To install these see their pages and the instructions below for geomm.

The default Runner is for OpenMM and should also be installed automatically when following these instructions (defined in setup.py) although it is not necessary to use openmm when using wepy as a library.

Currently, some things are still coupled to mdtraj and thus this is also a dependency for some functionality, although this will be relaxed in the future and replaced with a dependency on our nascent project geomm.

To install geomm:

git clone https://github.com/ADicksonLab/geomm.git
cd geomm
# compile the cython modules
python setup.py build_ext --inplace
# install it
pip install -e .

There are other uses for mdtraj such as export of trajectories to mdtraj trajectories, which will not be removed.

Pandas is used for outputting some data records, for which there is always a non-pandas option.

NetworkX is used in the WExplore resampler for the region tree and also for the tree module for manipulating the walker cloning/merging trees.

Roadmap

[X] Weighted Ensemble Layer
- [X] simulation manager
[X] Resampling sub-module
- [X] Random clone-merge resampler
- [X] WExplore
- [X] REVO
[X] OpenMM support
[X] HDF5 output and API
[ ] Command Line Interface
[ ] PyPI and Anaconda repositories

Getting Started

There are a few examples here (https://github.com/ADicksonLab/wepy/tree/master/examples).

There is an example with a pair of Lennard-Jones particles that runs on the reference implementation. This is the “Hello World” example and should be your starting point.

A more advanced (and interesting) example is a non-equilibrium unbinding WExplore simulation of the soluble epoxide hydrolase (sEH) protein with the inhibitor TPPU, which was the subject of this paper:

Lotz and Dickson. 2018. JACS 140 (2) pp. 618-628 (DOI: 10.1021/jacs.7b08572)

Be sure to install the extra dependencies for the examples as above in the installation instructions.

Architecture

The overall architecture of the project is broken into separate modules:

Simulation Management

a framework for running simulations, needs:

Runner

module that implements whatever dynamics you want to run

e.g.
- OpenMM

Resampler

the key functionality of the Weighted Ensemble resampling procedure is implemented here

e.g.
- WExplore

WorkMapper

a function that implements the map function that allows for arbitrary methods of parallelization

Reporter

Responsible for the collection and saving of data from wepy runs

e.g. HDF5 or plaintext

BoundaryConditions

describes and performs boundary condition transformations as the simulation progresses

simulation manager

coordinates all of these components to run simulations

helper sub-modules will make the construction of new simulation management modules easier and standardized

Application Layer: This is a convenience layer for building the CLI and perhaps high level functions for users to write their own scripts

Name		Name	Last commit message	Last commit date
Latest commit History 575 Commits
examples		examples
wepy		wepy
.gitignore		.gitignore
CHANGELOG.org		CHANGELOG.org
LICENSE		LICENSE
README.org		README.org
doc.org		doc.org
setup.py		setup.py
todo.org		todo.org
todo.org_archive		todo.org_archive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

wepy

wepy

.gitignore

.gitignore

CHANGELOG.org

CHANGELOG.org

LICENSE

LICENSE

README.org

README.org

doc.org

doc.org

setup.py

setup.py

todo.org

todo.org

todo.org_archive

todo.org_archive

Repository files navigation

Weighted Ensemble Python (wepy)

Community

Installation

Roadmap

Getting Started

Architecture

About

Releases

Packages

Languages

License

fabian-paul/wepy

Folders and files

Latest commit

History

Repository files navigation

Weighted Ensemble Python (wepy)

Community

Installation

Roadmap

Getting Started

Architecture

About

Resources

License

Stars

Watchers

Forks

Languages