Skip to content

afcarl/essay-scoring-spring

 
 

Repository files navigation

Installation

This code uses Python. Linux environment is needed for the code to run smoothly.

Setting up the environment

Create Python environment

virtualenv --no-site-packages env

Activate the environment

source env/bin/activate

Install requirements

pip install -r requirements.txt

Download wikipedia n-grams

./features/wikipedia/download.sh

After downloading you must create a database wiki in PostgreSQL.

And then configure file features/wiki_ngram_coverage.py

conn = psycopg2.connect("dbname='wiki' user='postgres' host='localhost' port=5432 password='XXXX'")

Change the user and password to be able to use wikipedia coverage features

***Download word2vec precalculated vectors```

Run script

./features/word2vec_download.sh

It will download a file GoogleNews-vectors-negative300.bin.gz with computed word2vec vectors.

Importing data

  1. Put xml files in data/xml/training/ and data/xml/validation
  2. Run script
python 1_import_raw_data.py

It will combine and transform the XML files into CSV format.

Creating models

Run script

python 2_model_generic.py

Ensembling models

Run script

python 3_create_ensemble.py

Exporting the results

Run script

python 4_export_results.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 86.2%
  • C++ 10.3%
  • R 2.4%
  • C 0.9%
  • Shell 0.1%
  • Makefile 0.1%