Skip to content

tgwizard/pythonrv

Repository files navigation

pythonrv

pythonrv is a runtime verification framework for python. It is the implementation of a master thesis currently being written. Please see tgwizard.github.com/thesis for more information.

Build Status

Installing

pythonrv is available as a pip package, with setup.py and everything. Use pip to install it like this:

pip install pythonrv

pythonrv has been tested with Python 2.7. It does not yet work with Python 3.

Coding

Clone the repository:

git clone git@github.com:tgwizard/pythonrv.git
cd pythonrv

Initialize the environment (preferrably through virtualenv, and possibly virtualenvwrapper):

pip install -r requirements.txt

Run the tests:

./runtests.sh

Runtime Verification

Runtime verification is the idea of verifying the correctness of a program during its execution. Verification is done by checking if the program conforms to its specifications.

There are several approaches to runtime verification, where pythonrv is one. It is based on the idea that the specifications should be written in the target programming language, in a similar manner to unit tests, and therefore make them both more expressive and easier to learn.

The specifications written in pythonrv get "baked" into the target's source code, so that whenever a function is called the specifications are executed as well.

Examples

Here are some simple examples showing the pythonrv API. For more realistic examples, see the examples folder. The unit tests might also be useful to see what works.

First, lets say we have a function which correctness we want to verify:

# factorial.py
def factorial(n):
    res = 1
    for i in range(2, n)
        res *= i
    return res

We can now write specifications for it, preferably in another file:

# rvspecs.py
from pythonrv import rv
import factorial

@rv.monitor(fact=factorial.factorial)
def input_only_spec(event):
    assert event.fn.fact.inputs[0] >= 0

@rv.monitor(fact=factorial.factorial)
@rv.spec(when=rv.POST)
def simple_specification(event):
    assert event.fn.fact.result >= event.fn.fact.inputs[0]

@rv.monitor(fact=factorial.factorial)
@rv.spec(when=rv.POST, history_size=rv.INFINITE_HISTORY_SIZE)
def simple_specification(event):
    in_out = (event.fn.fact.inputs[0], event.fn.fact.result)
    old_in_out = [(x.inputs[0], x.result) for x in event.fn.fact.history]

    for a in old_in_out:
        if in_out[0] > old_in_out[0]:
            assert in_out[1] >= old_in_out[1]

The first specification checks that all inputs are greater-than-or-equal-to zero. The second specification checks that all outputs are at least as big as the inputs. The third specification verifies the input/output against the historical data for the function; given a larger input, the output must be larger-or-equal than before. Note that the last two specs are executed after the factorial has been called. This gives them access to return values and output arguments.

A specification is sent an event with information of the called function, its history, and the history of all functions monitored by the specification.

from pythonrv import rv
import mymodule

@rv.monitor(foo=mymodule.foo, bar=mymodule.bar)
@rv.spec(when=rv.POST, history_size=20)
def more_specifications(event):
    # here are all functions monitored
    event.fn
    # the currently called function can be accessed like this
    event.called_function
    # which, if mymodule.foo was called, is the same as
    event.fn.foo

    # we can also check if a function was called
    assert event.fn.foo.called

    # the inputs, outputs and result can be accessed like this
    event.fn.foo.inputs        # a copy of the input argument tuple
    event.fn.foo.input_kwargs  # a copy of the input key-word argument dict
    event.fn.foo.outputs
    event.fn.foo.output_kwargs
    event.fn.foo.result

    # we can gain access to the previous event, and the previous function call
    event.prev
    event.fn.foo.prev

    # if two calls to mymodule.foo occurr consecutively
    assert event.prev.fn.foo == event.fn.foo.prev
    # but they needn't be if more than one function is monitored by a spec

    # we can gain access to the "entire" history
    for old_event in event.history:
        pass
    # and
    for old_foo_call in event.fn.foo.history:
        pass
    # this is obviously a big drain on the memory, so by default only two events
    # are stored in the history (this and the previous). this can be changed,
    # like we do here with @rv.spec(history_size=20)

    # we can also say that the next time some monitored function is called,
    # something should happen
    event.next(call_next_time)

    # or for just a specific monitored function
    event.fn.foo.next(call_next_time)

    # we can also specify which monitored function should be the next to be
    # called:
    event.next_called_should_be(event.fn.bar)

    # sometimes a specification can "finish" - it need not be verified again
    event.success("optional message telling that everything was ok")
    # or
    event.failure("we've failed, and there's no point continuing this verification")

def call_next_time(event):
    # here we gain access to all the same data as in the spec
    pass

When to Execute the Specification

The specification is executed before the call is passed on to the active monitoree. This can be customized:

@rv.monitor(f=func)
@rv.spec(when=rv.POST)
def spec_after(event):
    pass

# this is the default, it is not needed to explicitly specify PRE
@rv.monitor(f=func)
@rv.spec(when=rv.PRE)
def spec_before(event):
    pass

Dealing with Errors

Specifications signal verifications errors by raising the AssertionError exception (which the assert statement does). When this happens, pythonrv, by default, lets this error propagate, and, if uncaught, the program stops.

If this is not the desired behaviour, it can be changed. For a logging error handler, do

from pythonrv import rv
rv.configure(error_handler=rv.LoggingErrorHandler())

and then configure logging the normal way, through the python logging module.

Specifications can be marked with an error level:

@rv.monitor(f=func)
@rv.spec(level=rv.DEBUG)
def spec(event):
    pass

The available error levels are DEBUG, INFO, WARNING, ERROR and CRITICAL (just as in the logging module).

For more on error handling, see the source code, and the unit tests.

Technical Issues

Importing

It is recommended that you import your rv specifications among the first things you do in your program. The reasons will be detailed below.

When writing specs, this is the wrong way:

# this doesn't work
from mymodule import myfunc
@rv.monitor(f=myfunc)
def spec(event):
    pass

This is the correct way:

# this works
import mymodule
@rv.monitor(f=mymodule.myfunc)
def spec(event):
    pass

The first example creates a reference to myfunc and inserts it into the current module (the module defining the specification). Monitoring a function means adding a wrapper around it, and in this case we only add a wrapper for the myfunc reference in the current module. We do not modify mymodule, which all other code will use. The second example fixes this.

The above reason also explains why the specifications should be imported/defined at the very beginning of the execution: Other modules might use the from x import y style, and if they do so before the rv specifications have had a chance to monitor/instrument the functions, they will get unmonitored/uninstrumented references to them.

Copying arguments

When intercepting function calls, pythonrv copies the arguments to make sure that the history it stores doesn't get altered afterwards and/or from the outside. This also makes it so that the input arguments really are input arguments, and not modified by the function itself.

This might sometimes be deemed unnecessary, or needlessly expensive. It might sometimes not even work, for instance when cStringIO is involved. This is the case for Django requests in v1.4.x (but not on master).

Copying can be turned off for all specifications:

from pythonrv import rv
rv.configure(enable_copy_args=False)

Or for a specific specification:

from pythonrv import rv
@rv.monitor(func=somemodule.somefunc)
@rv.spec(enable_copy_args=False)
def spec(event):
    pass

Note: Disabling argument copying for one specification actually disables argument copying for all monitored functions for that specification. Other specifications that monitor the same functions won't get argument copying either. This is "a feature".

License

pythonrv is released under the MIT license.

About

A runtime verification framework for python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published