GitHub - xiongshufeng/siamese_sentiment: learning distance metric with siamese CNN to classify sentiment and attempting to show the siamese CNN is robust to blind spots whereas the CNN is not

xiongshufeng / siamese_sentiment Public

forked from jcavalieri8619/siamese_sentiment

Notifications You must be signed in to change notification settings
Fork 0
Star 0

learning distance metric with siamese CNN to classify sentiment and attempting to show the siamese CNN is robust to blind spots whereas the CNN is not

MIT license

0 stars 2 forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
model_data		model_data
testing_data		testing_data
training_data		training_data
CNN_model.py		CNN_model.py
LICENSE		LICENSE
README		README
convert_review.py		convert_review.py
data_perturbing.py		data_perturbing.py
loss_functions.py		loss_functions.py
modelParameters.py		modelParameters.py
network_utils.py		network_utils.py
preprocess.py		preprocess.py
siamese_activations.py		siamese_activations.py
siamese_model.py		siamese_model.py

Repository files navigation

The task is sentiment classification but the goal of the project is to show that
the CNN that comprises both branches of the siamese network is susceptible to adversial
examples and blind spots like those described in https://arxiv.org/abs/1312.6199 while the
siamese network built from the same CNN is not.

To train model, create the <train> directory inside <training_data>.
The <train> directory should contain sub-directories <pos> and <neg>
containing files with positive reviews and negative reviews from
Stanford sentiment dataset.

If your computer runs out of RAM during training its because of the
number of pairs that must be generated is massive. Set the
TRAIN_LOW_RAM_CUTOFF and DEV_LOW_RAM_CUTOFF paramters inside convert_review module
to prevent this from happening. I have 16GBs of RAM and the current
settings work for me but if you have less you will need to set these
parameters

training is set up with check points so after each epoch the weights will
be saved in the model_data/saved_weights directory and model_specs will
also be saved after training is complete.

EXAMPLE:

from siamese_model import build_siamese_model, train_siamese_model

#this call returns a dictionary containing the siamese model
#and the CNN model comprising the left and right branch of the siamese

models = build_siamese_model()

#this call will train the model. All training params can be set
#inside the siamese_network module (num epochs, CNN params, ect..).
#It returns the training data set, dev set, and training history.

trainingData,devData,hist = train_siamese_model(models)

#once training completes you will have a trained model in
#models['siamese'] and you can use models['siamese'].predict(...)
#with dev set data to test it out. predict input is a list
#of the form [Lreview,Rreview] and these reviews can be found
#in devData. Both devData and trainingData can be unpacked
#like Xdev_left, ydev_left, Xdev_right, ydev_right, dev_similarity = devData

About

learning distance metric with siamese CNN to classify sentiment and attempting to show the siamese CNN is robust to blind spots whereas the CNN is not

Readme

MIT license

Activity

0 stars

2 watching

0 forks

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model_data

model_data

testing_data

testing_data

training_data

training_data

CNN_model.py

CNN_model.py

LICENSE

LICENSE

README

README

convert_review.py

convert_review.py

data_perturbing.py

data_perturbing.py

loss_functions.py

loss_functions.py

modelParameters.py

modelParameters.py

network_utils.py

network_utils.py

preprocess.py

preprocess.py

siamese_activations.py

siamese_activations.py

siamese_model.py

siamese_model.py

Repository files navigation

About

Releases

Packages

Languages

License

xiongshufeng/siamese_sentiment

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Languages