CRCN

This project hosts the code for our NIPS 2015 paper.

Cesc Chunseong Park and Gunhee Kim. Expressing an Image Stream with a Sequence of Natural Sentences. In NIPS 2015 [pdf]

The CRCN stands for Coherent Recurrent Convolutional Networks. It integrates (i) convolutional networks for image description, (ii) bidirectional recurrent networks for the language model, and (iii) local coherence model for a smooth flow of multiple sentences.

The main objective of our model is, given a photo stream, to generate (retrieve) a coherent sequence of natural sentences. For example, if you visit New York City and takes lots of pictures, it can write a travelogue for your photo album. While almost all previous studies have dealt with the relation between a single image and a single natural sentence, our work extends both input and output dimension to a sequence of images and a sequence of sentences.

##Reference

If you use this code as part of any published research, please acknowledge the following paper.

@inproceedings{Cesc:2015:NIPS,
author    = {Cesc Chunseong Park and Gunhee Kim},
title     = "{Expressing an Image Stream with a Sequence of Natural Sentences}",
booktitle = {NIPS},
year      = 2015
}

#Running Code

git clone https://github.com/cesc-park/CRCN.git crcn

##Prerequisites

Install Stanford NLP

Download stanford-parser.jar, stanford-parser-3.5.2-models.jar and englishPCFG.caseless.ser.gz.

wget http://nlp.stanford.edu/software/stanford-parser-full-2015-04-20.zip
wget http://nlp.stanford.edu/software/stanford-corenlp-full-2015-04-20.zip
unzip stanford-parser-full-2015-04-20.zip
unzip stanford-corenlp-full-2015-04-20.zip
mv stanford-parser-full-2015-04-20 stanford-parser
mv stanford-corenlp-full-2015-04-20 stanford-core
cd stanford-parser
jar xvf stanford-parser-3.5.2-models.jar

Install Brown courpus

We need the browncourpus and wordnet packages to extract entity features.

sudo apt-get install wordnet-dev
wget https://bitbucket.org/melsner/browncoherence/get/d46d5cd3fc57.zip -O browncoherence.zip
unzip browncoherence.zip
mv melsner-browncoherence-d46d5cd3fc57 browncoherence
cd browncoherence
mkdir lib64
mkdir bin64

We have to change some lines in Makefile.

vim Makefile

Change the followings from top to bottom.

WORDNET = 1
WORDNET = 0

CFLAGS = $(WARNINGS) -Iinclude $(WNINCLUDE) $(TAO_PETSC_INCLUDE) $(GSLINCLUDE)
CFLAGS = $(WARNINGS) -Iinclude $(WNINCLUDE) $(TAO_PETSC_INCLUDE) $(GSLINCLUDE) -fpermissive

WNLIBS = -L$(WNDIR)/lib -lWN
WNLIBS = -L$(WNDIR)/lib -lwordnet

Then build TestGrid.

make TestGrid
cd ..

Install python modules of all dependencies.

for req in $(cat python_requirements.txt); do pip install $req; done

##Applying to New Dataset

Prepare dataset. Check out the data format.
```
less json_data_format.txt
```

Create parsed trees. We use the StanfordCoreNLP tool written in java to extract parsed trees.

cd tree
python spliter_for_parser.py
javac -d . -cp .:./json-simple-1.1.1.jar:../stanford-core/stanford-corenlp-3.5.2.jar:../stanford-core/xom.jar:../stanford-core/stanford-corenlp-3.5.2-models.jar:../stanford-core/joda-time.jar:../stanford-core/jollyday.jar: StanfordCoreNlpTreeAdder.java
java -cp .:./json-simple-1.1.1.jar:../stanford-core/stanford-corenlp-3.5.2.jar:../stanford-core/xom.jar:../stanford-core/stanford-corenlp-3.5.2-models.jar:../stanford-core/joda-time.jar:../stanford-core/jollyday.jar: parser.StanfordCoreNlpTreeAdder
python merger_for_parser.py

##Training

Make directory for training

mkdir model

Doc2Vec. Train the doc2vec model.
```
python doc2vec_training.py
```
RCN. Train the RCN model. If you want to use GPU (in this example device is 0), execute the below code.
```
CUDA_VISIBLE_DEVICES=0 THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32 python rcn_training.py
```
If you want to use CPU, run the below instead of the above.
```
python rcn_training.py
```
CRCN. Train the CRCN model. If you want to use GPU (In this example device is 0), execute the below code.
```
CUDA_VISIBLE_DEVICES=0 THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32 python crcn_training.py
```
If you want to use CPU, run the below instead of the above.
```
python crcn_training.py
```

##Output Generation

Generating output is easy. The following script loads training and test datasets, then automatically produces outputs.

python generate_output.py

##Acknowledgement

We implement our model using keras package. Thanks for keras developers. :)

Authors

Cesc Chunseong Park and Gunhee Kim,
Vision and Learning Lab, Seoul National University

License

MIT license

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
entity		entity
keras		keras
tree		tree
.gitignore		.gitignore
README.md		README.md
crcn_training.py		crcn_training.py
doc2vec_training.py		doc2vec_training.py
generate_output.py		generate_output.py
json_data_format.txt		json_data_format.txt
python_requirements.txt		python_requirements.txt
rank_sequence_utils.py		rank_sequence_utils.py
rcn_training.py		rcn_training.py
stream2text_nips.jpg		stream2text_nips.jpg
topk_utils.py		topk_utils.py

CVML/CRCN

Folders and files

Latest commit

History

Repository files navigation

CRCN

Authors

License

About

Resources

Stars

Watchers

Forks

Languages