This is roughly split into three parts: Input, Process, Output.
Getting the photos. We wait until the photos have been imported onto the processing computer.
- We query the user to determine the relevant folder (UI still under construction).
- We run over all of the photos to determine which ones are receipts (can we use ML for this?).
Investigation into the Photos naming scheme is still needed.
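The folder-scan step above could be sketched roughly as follows. The extension-based heuristic and the function name are placeholders, not the project's actual receipt-detection logic (which may end up ML-based, as noted above):

```python
import os

# Hypothetical heuristic: treat common photo extensions as candidates.
# Deciding which candidates are actually receipts is still an open question.
PHOTO_EXTENSIONS = {".jpg", ".jpeg", ".png", ".heic"}

def find_candidate_photos(folder):
    """Return paths of files in `folder` that look like photos."""
    candidates = []
    for name in sorted(os.listdir(folder)):
        _, ext = os.path.splitext(name)
        if ext.lower() in PHOTO_EXTENSIONS:
            candidates.append(os.path.join(folder, name))
    return candidates
```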
We apply OCR to each photo to extract the relevant tags and attributes.
- The extracted data is currently the date-time and the total amount (address extraction is in progress).
- The extracted data is then compared against the database, and duplicates are ignored (the date-time attribute should make this reliable). Unique entries are inserted into the relevant line.
- For questionable attributes such as the address, the program will prompt the user with an image of the text in question. (Can ML be used to improve this accuracy?)
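A minimal sketch of the extraction step, assuming pytesseract and Pillow are installed as described below. The regular expressions for the date and total are illustrative guesses, not the project's actual parsing rules:

```python
import re

def extract_fields(image_path):
    """OCR a receipt photo and pull out a date and total, if present."""
    # Imported lazily so parse_fields can be used without tesseract installed.
    import pytesseract
    from PIL import Image
    text = pytesseract.image_to_string(Image.open(image_path))
    return parse_fields(text)

def parse_fields(text):
    # Illustrative patterns: a dd/mm/yyyy-style date and a "Total: $x.yy" line.
    date_match = re.search(r"\b(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})\b", text)
    total_match = re.search(r"[Tt]otal[:\s]*\$?(\d+\.\d{2})", text)
    return {
        "date": date_match.group(1) if date_match else None,
        "total": total_match.group(1) if total_match else None,
    }
```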
We log all data into a .csv file.
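The duplicate check and CSV logging described above could be sketched like this, keyed on the date-time attribute. The column names are assumptions for illustration:

```python
import csv
import os

FIELDNAMES = ["datetime", "total", "address"]  # assumed column names

def append_unique(csv_path, entry):
    """Append `entry` to the CSV unless a row with the same datetime exists."""
    seen = set()
    if os.path.exists(csv_path):
        with open(csv_path, newline="") as f:
            seen = {row["datetime"] for row in csv.DictReader(f)}
    if entry["datetime"] in seen:
        return False  # duplicate: ignored, as described above
    write_header = not os.path.exists(csv_path)
    with open(csv_path, "a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDNAMES)
        if write_header:
            writer.writeheader()
        writer.writerow(entry)
    return True
```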
```
brew install tesseract
pip install Pillow
pip install pytesseract
```

```
xcode-select --install
ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"
brew install cartr/qt4/pyqt
brew install python
```

```
/usr/local/Cellar/python/2.7.13/Frameworks/Python.framework/Versions/2.7/bin/python2.7
/usr/local/Cellar/pyqt/
```
Thanks to robonobodojo for the excellent guide.
To view markdown in Atom, use Ctrl-Shift-M.