Natural Language Object Retrieval in Tensorflow

Implement Natural Language Object Retrieval in Tensorflow. The original code is release here, written in Caffe.
note that some of the codes are from natural-language-object-retrieval

What is natural language object retrieval?

In real AI, you may refer something with its location, color, or other characteristic. The AI robot should know where the corresponding object is. Here's the illustration:

The blue box represents the ground truth, while the yellow stands for positive recall, red for negative recall.

Requirement

Training

Here, all the experiments are done on ReferIt dataset. You can also train on other dataset in the same way.

Install Caffe

Download caffe: ./external/download_caffe.sh
modify Makefile.config
make -j
make pycaffe

Preprocess

Download the pretrained models: ./models/download_trained_models.sh
Download the ReferIt dataset: ./datasets/ReferIt/ReferitData/download_referit_dataset.sh and ./datasets/ReferIt/ImageCLEF/download_data.sh
Download pre-extracted EdgeBox proposals: ./data/download_edgebox_proposals.sh
Preprocess the ReferIt dataset to generate metadata needed for training and evaluation: python ./exp-referit/preprocess_dataset.py
Cache the scene-level contextual features to disk: python ./exp-referit/cache_referit_context_features.py
Cache the bbox features to disk: python ./exp-referit/cache_referit_local_features.py
Build training batches: python ./exp-referit/cache_referit_training_batches.py

Start training

Once you prepare the data, you can run the training code python train.py
If you want to use pretrained model, you should convert the pretrained model into .npy and load the pretrained model when training.

python initialize.py

Testing

Before testing, I cache the proposal feature to disk due to the RAM contraints.

cache the proposal feature: python ./exp-referit/cache_edgebox_feature.py
choose which model you want to test by modifying this line.
python test.py

Experiments result

I use R@10 to evaluate the performance and all the test is on ReferIt dataset.
the performance through different epoch [without pretrained]:

the performance through different epoch [with pretrained]:

And here is the loss thourgh every epoch:

Reference

Natural language object retrieval, Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell, cvpr2016 oral
natural-language-object-retrieval

Feel free to contact me whenever you encounter unexpected errors in this repo.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data		data
datasets/ReferIt		datasets/ReferIt
exp-referit		exp-referit
external		external
img		img
models		models
prototxt		prototxt
util		util
Demo_retrieval.ipynb		Demo_retrieval.ipynb
README.md		README.md
initialize.py		initialize.py
retriever.py		retriever.py
test.py		test.py
train.py		train.py

andrewliao11/Natural-Language-Object-Retrieval-tensorflow

Folders and files

Latest commit

History

Repository files navigation

Natural Language Object Retrieval in Tensorflow

What is natural language object retrieval?

Requirement

Training

Install Caffe

Preprocess

Start training

Testing

Experiments result

Reference

About

Resources

Stars

Watchers

Forks

Languages