Skip to content

TongTongX/CMPUT651Project

Repository files navigation

CMPUT651Project

CMPUT 651 Project at University of Alberta in Fall 2019

Setup Procudure

  1. Ensure torch and sklearn are installed.
  2. Set up the project.
$ pip install -e .
$ pip install torchtext
$ pip install spacy
$ python3 -m spacy download en
  1. Download pre-trained GloVe word vectors. Place the Common Crawl (840B tokens, 2.2M vocab, cased, 300d vectors, 2.03 GB download) word vectors in /data/glove/ directory.
  2. Download the trial dataset and training dataset, unzip the .zip files and move the image directories in /data.
  3. Download the InferSent model trained with GloVe in /data.
$ mkdir encoder
$ curl -Lo encoder/infersent1.pkl https://dl.fbaipublicfiles.com/infersent/infersent1.pkl
  1. Clone the Facebook AI Research Sequence-to-Sequence Toolkit (for PyTorch implementation of RoBERTa) in the project directory.
$ git clone git@github.com:pytorch/fairseq.git
  1. Download the pretrained RoBERTa model roberta.large here. Decompress the file, and place the folder in the project directory.

About

CMPUT 651 Project at University of Alberta in Fall 2019

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages