An intel 64 symbolic emulator.
Ekoparty slides: https://github.com/feliam/pysymemu/blob/master/doc/pysymemu.pdf?raw=true
- z3, an smt solver. http://z3.codeplex.com/ (1)
- pyelftool, an ELF parsing library. https://github.com/eliben/pyelftools
- distorm3, a disassembler. https://code.google.com/p/distorm/
doc/ Slides and papers
examples/ Asorted set of small C examples to emulate
tutorial/ Very simple test cases
test/ Unittests
setup.py Setuputils/pipy related (not used yet)
linux.py The Linux operating system micro model
memory.py The symbolic memory model
smtlibv2.py Smtlib v2 solver API
system.py A quick command line tool
You may use the discover command.
$ python -m unittest discover
Note that cpu.py testcases are generated semi-automatically using tools at test/auto
You may generate a fair amount of API doc using epydoc. epydoc.sourceforge.net/ The following command will generate an html/ folder with the api documentation:
$ epydoc cpu.py memory.py linux.py smtlibv2.py system.py
THIS IS APLHA SOFT. But you may play directly on binary ELF files until you hit an unimplemented instruction or systemcall(2). The commandline gives you a somehow confusing help. :)
$ python system.py --help
usage: system.py [-h] [--worspace WORSPACE] [--sym SYM] [--stdin STDIN]
[--stdout STDOUT] [--stderr STDERR] [--env ENV]
PROGRAM ...
Symbolically analize a program
positional arguments:
PROGRAM Program to analize
... Program arguments. Need a -- separator. Ex: -- -y 2453
optional arguments:
-h, --help show this help message and exit
--worspace WORSPACE A folder name fpor temporaries and results. (default pse_?????)
--sym SYM Consider a filename as symbolic
--stdin STDIN A filename to pass as standar stdin (default: stdin)
--stdout STDOUT A filename to pass as standar stdout (default: stdout)
--stderr STDERR A filename to pass as standar stderr (default: stderr)
--env ENV A environment variable to pass to the program (ex. VAR=VALUE)
Basicaly you pass a binary file for pysymemu to emulate. Let's try the toy examples:
$ cd examples
$ cat toy002-libc.c
int main()
{
unsigned int cmd;
if (read(0, &cmd, sizeof(cmd)) != sizeof(cmd))
{
printf("Error reading stdin!");
exit(-1);
}
if (cmd > 0x41)
{
printf("Message: It is greater than 0x41\n");
}
else
{
printf("Message: It is smaller or equal than 0x41\n");
}
return 0;
}
$ make
gcc -fno-builtin -static -nostdlib -m32 -fomit-frame-pointer toy001-nostdlib.c -o toy001-nostdlib
gcc toy002-libc.c -static -o toy002-libc
$ cd -
Now run it under the emulator like this. First create 3 dummy files to replace the virtual/emulated stdin, stdout and stderr
$ touch > stderr
$ touch > stdout
$ echo ++++++++++ > stdin
We'll be considering that the stdin is filled by symbolic data ( marked with '+' (yes, I know)). Also we need to tell pysymemu which part of the environment should be considered symbolic and which concret. We mark the 'stdin' file as symbolic (its '+' will be free 8bit variables) with --sym 'stdin', like this:
$ python system.py --sym stdin examples/toy002-libc
The quick and dirty command line tool will generate somthing like this..
$ python system.py --sym stdin examples/toy002-libc
[+] Running examples/toy002-libc
with arguments: []
with environment: []
[+] Detected arch: amd64
starting
Symbolic PC found, possible detinations are: ['4005ab', '40059d']
Saving state dump_00000000004005ab_8452.pkl PC: 0x4005ab
Program Finnished correctly
stdin: '\xc1\x00\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\x80\n'
Program Finnished correctly
stdin: ' \x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\n'
Results dumped in ./pse_xYhZwA
10392 3038.59649123
And an insanelly verbose system.log file. Also a folder with all intermediate states and results ...
$ ls ./pse_xYhZwA
dump_000000000040059d_8452.pkl dump_00000000004005ab_8452.pkl dump_init.pkl test_2.txt test_4.txt
$ tail -n 12341 ./pse_xYhZwA/test*
==> ./pse_xYhZwA/test_2.txt <==
stdin: '\xc1\x00\x80\x80\x80\x80\x80\x80'
==> ./pse_xYhZwA/test_4.txt <==
stdin: '\x20\x00\x00\x00\x00\x00\x00\x00'
- With a few mods it may accept any smtlibv2 solver that can handle (get-value) command.
- In such case you should go tu cpu.py or linux.py and add the necesarry code!