Chainer implementation of Formulanet

This is an experimental Chainer implementation of FormulaNet [1], a graph embedding based premise selection method for theorem proving.

Disclaimer: PFN provides no warranty or support for this implementation. Use it at your own risk.

Requirement

Python 3.6
Chainer >= 7.0.0
funcparserlib

Usage

Dataset preparation

$ wget http://cl-informatik.uibk.ac.at/cek/holstep/holstep.tgz
$ tar zxf holstep.tgz
$ python build_db.py -o db

Training

$ python formulanet_train.py --dataset db --devices @cupy:0

formulanet_train.py has several options for configuring models.

--conditional: Use contional model. Default is unconditional model.
--preserve-order: Use order-preserving model (i.e. FormulaNet in the original paper). Default is without order information (i.e. FormulaNet-basic in the original paper).
--steps STEPS: Number of update steps

Testing

Pretrained model formulanet-basic-unconditional-3steps.npz is included in this repository.

$ python formulanet_test.py --model formulanet-basic-unconditional-3steps.npz --dataset db/test.h5 --device @cupy:0
...
accuracy: 0.8891751170158386
precision: 0.9018562609300268
recall: 0.8733969290414733
F beta score: 0.887398477223135
support: [98015 98015]

Results

Observed accurarcy in our experiment is similar but somewhat lower compared to the original paper.

Classification accuracy on the test set of our approach versus baseline methods on HolStep

[1, Table 1] + our results:

	CNN	CNN-LSTM	Formulanet-basic (orig)	FormulaNet (orig)	Formulanet-basic (ours)	Formulanet (ours)
Unconditional	83	83	89.0	90.0	89.9	89.9
Conditional	82	83	89.1	90.3	89.4	89.8

Classification accuracy with different numbers of update steps on conditional premise selection.

Results reported in the paper ([1, Table 3]):

Number of steps	0	1	2	3	4
FormulaNet-basic	81.5	89.3	89.8	89.9	90.0
FormulaNet	81.5	90.4	91.0	91.1	90.8

Results of our experiment:

Number of steps	0	1	2	3	4
FormulaNet-basic	74.2	87.7	89.1	89.2	89.4
FormulaNet	74.2	89.0	89.8	89.6	89.8

Difference from the original paper

There are several differences from original paper. These difference might be the reason for lower accuracy compared to the original paper.

In the original paper batch normalization is applied within a single graph whereas our implementaion apply batch normalization across multiple graphs.
Number of constants: we used 2753 unique tokens + three special tokens "VAR", "VARFUNC", "UNKNOWN", whereas the original paper uses only 1906 + 3 tokens. We used only limited normalization of tokens, but the original paper might used more normilization.

References

[1] M. Wang, Y. Tang, J. Wang, and J. Deng, "Premise selection for theorem proving by deep graph embedding," In Advances in Neural Information Processing Systems 30 (NIPS 2017). Available: https://papers.nips.cc/paper/6871-premise-selection-for-theorem-proving-by-deep-graph-embedding

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
.gitignore		.gitignore
HolstepAttoparsec.hs		HolstepAttoparsec.hs
HolstepMegaparsec.hs		HolstepMegaparsec.hs
LICENSE		LICENSE
README.md		README.md
build_db.py		build_db.py
collect_symbols.hs		collect_symbols.hs
collect_tokens.hs		collect_tokens.hs
dmuxspec-build_db.yml		dmuxspec-build_db.yml
dmuxspec-haskell.yml		dmuxspec-haskell.yml
dmuxspec-train-chainermn.yml		dmuxspec-train-chainermn.yml
dmuxspec-train.yml		dmuxspec-train.yml
dmuxspec.yml		dmuxspec.yml
expr.py		expr.py
formulanet-basic-unconditional-3steps.npz		formulanet-basic-unconditional-3steps.npz
formulanet.py		formulanet.py
formulanet_test.py		formulanet_test.py
formulanet_train.py		formulanet_train.py
holstep.py		holstep.py
parser_funcparselib.py		parser_funcparselib.py
parser_parsy.py		parser_parsy.py
parser_pyparsing.py		parser_pyparsing.py
requirements.txt		requirements.txt
stack.yaml		stack.yaml
symbols.py		symbols.py
symbols.txt		symbols.txt
symbols_train.txt		symbols_train.txt
test_holstep_attoparsec.hs		test_holstep_attoparsec.hs
test_holstep_megaparsec.hs		test_holstep_megaparsec.hs
test_parser_funcparselib.py		test_parser_funcparselib.py
test_parser_parsy.py		test_parser_parsy.py
test_parser_pyparsing.py		test_parser_pyparsing.py
tokens.txt		tokens.txt
tokens_train.txt		tokens_train.txt
tree.py		tree.py

License

pfnet-research/chainer-formulanet

Folders and files

Latest commit

History

Repository files navigation

Chainer implementation of Formulanet

Requirement

Usage

Dataset preparation

Training

Testing

Results

Classification accuracy on the test set of our approach versus baseline methods on HolStep

Classification accuracy with different numbers of update steps on conditional premise selection.

Difference from the original paper

References

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages