Skip to content
/ pas Public

PAS (parsing and storing) reads large json files and stores them into a postgres database for later use (like machine learning)

License

Notifications You must be signed in to change notification settings

jgoralcz/pas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pas

PAS (parsing and storing) reads large json files and stores them into a postgres database for later use (like machine learning)

I would suggest either using the Dockerfile or creating a virtual environment for ease of use.

You can find the files I used at: https://files.pushshift.io/reddit/comments/

you can then use ML or store it into a database for a full text search.

Virtual Environment

  1. virtualenv -p python3.6 venv_pas
  2. source venv_pas/bin/activate
  3. pip install -r requirements.txt
  4. python3 read_data.py

About

PAS (parsing and storing) reads large json files and stores them into a postgres database for later use (like machine learning)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published