Introduction

In this project we apply concepts from Natural Language Processing (NLP) and Deep Learning (DL) to generate image descriptions.
The project presents a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and can be used to generate natural sentences describing an image. It uses a neural, probabilistic framework to generate descriptions from images.
The model is designed to include an attention mechanism, which is intended to improve the system's performance.
The model is trained and evaluated on Flickr8K, and the results are recorded and analyzed.
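
To make the attention idea concrete, here is a minimal sketch of one soft-attention step in plain Python. It is not taken from model.py: the function names, the dot-product scoring, and the toy vectors are all assumptions made for illustration, and the actual model may score regions differently.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(region_features, hidden_state):
    """Return (weights, context) for one decoding step.

    region_features: one feature vector per image region.
    hidden_state: decoder state, same length as each feature vector.
    Dot-product scoring is an assumption of this sketch.
    """
    scores = [sum(f * h for f, h in zip(feat, hidden_state))
              for feat in region_features]
    weights = softmax(scores)
    dim = len(hidden_state)
    # The context vector is the attention-weighted sum of the region features.
    context = [sum(w * feat[i] for w, feat in zip(weights, region_features))
               for i in range(dim)]
    return weights, context


if __name__ == "__main__":
    regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy values
    weights, context = attend(regions, hidden_state=[2.0, 0.0])
    print(weights, context)
```

At each step the decoder would receive the context vector, so regions that score higher against the current state contribute more to the next word.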

Dataset

The dataset can be requested and downloaded from this link.
However, the dataset has already been downloaded and saved in the 'Flickr8k' folder. If you download it yourself, move the files to the 'Flickr8k' folder and run the following commands:
unzip Flickr8k_Dataset.zip
unzip Flickr8k_Text.zip
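
Where the unzip command is not available, the same step could be done from Python. The sketch below is not part of the project; the helper name extract_archives is hypothetical, and it assumes both archives sit directly inside the 'Flickr8k' folder.

```python
import zipfile
from pathlib import Path


def extract_archives(folder, names=("Flickr8k_Dataset.zip", "Flickr8k_Text.zip")):
    """Extract each named archive found in `folder` into that folder.

    Returns the names of archives that were not found, so the caller
    can decide how to report them.
    """
    folder = Path(folder)
    missing = []
    for name in names:
        archive = folder / name
        if not archive.is_file():
            missing.append(name)
            continue
        with zipfile.ZipFile(str(archive)) as zf:
            zf.extractall(str(folder))
    return missing


if __name__ == "__main__":
    not_found = extract_archives("Flickr8k")
    if not_found:
        print("Missing archives:", ", ".join(not_found))
```

Returning the missing names instead of raising lets a setup script report both problems in one run.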

Installation

The application can be run in one of two ways: using the Python interpreter or using Jupyter Notebook.

Python Environment
This project requires a Python 3.6 interpreter.
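
A quick way to confirm the interpreter before installing anything is sketched below. Treating 3.6 as a minimum rather than an exact version is an assumption, and the helper name python_is_supported is hypothetical; newer interpreters may still be incompatible with the pinned packages.

```python
import sys

MINIMUM = (3, 6)  # version stated in this README; treated here as a lower bound


def python_is_supported(version_info=None, minimum=MINIMUM):
    """Return True when the interpreter is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if not python_is_supported():
        sys.exit("Python %d.%d or newer is expected" % MINIMUM)
```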

To run the project with the Python interpreter, first install the Python packages it uses:
pip3 install -r requirements.txt
or
pip install -r requirements.txt

To run the application:
$ python P3.py

Conda Environment
The project requires Anaconda 3.
To install Anaconda 3, download the shell script from the Anaconda website, then run the following command:
bash Anaconda-latest-Linux-x86_64.sh

Run the following to create the environment and install the packages:
conda env create -f project_env.yml

Run the following to start Jupyter Notebook:
jupyter notebook

Select the '.ipynb' file.

jayantkashyap/Image-Captioning