MultimodalDNN

This repository provides code from the University of Edinburgh Team G25 for the ACL 2018 Workshop on Human Multimodal Language.

EMOTION - First Place in Emotion Recognition Challenge (all metrics) using MOSEI data

Paper: Recognizing Emotions in Video Using Multimodal DNN Feature Fusion

Code: emotion_recognition.py

Run: emotion_recognition.py [mode]

Where [mode] specifies the multimodal inputs (A=Audio, V=Video, T=Text): all, AV, AT, VT, V, T, or A
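A minimal sketch of how a mode string such as "all", "AV", or "T" could select which modality feature streams get concatenated before the network. The names, function signatures, and feature dimensionalities below are illustrative assumptions, not the actual interface of emotion_recognition.py:

```python
# Hypothetical sketch: expanding the [mode] argument into modality names and
# concatenating the corresponding per-timestep feature matrices.
import sys
import numpy as np

MODALITY_FLAGS = {"A": "audio", "V": "video", "T": "text"}

def select_modalities(mode: str) -> list:
    """Expand a mode string such as 'all', 'AV', or 'T' into modality names."""
    if mode == "all":
        return ["audio", "video", "text"]
    return [MODALITY_FLAGS[ch] for ch in mode]

def fuse_features(features: dict, mode: str) -> np.ndarray:
    """Concatenate the selected feature matrices along the feature axis."""
    selected = [features[m] for m in select_modalities(mode)]
    return np.concatenate(selected, axis=-1)

if __name__ == "__main__":
    mode = sys.argv[1] if len(sys.argv) > 1 else "all"
    # Toy features: 20 timesteps with made-up per-modality dimensionalities.
    toy = {
        "audio": np.zeros((20, 74)),
        "video": np.zeros((20, 35)),
        "text": np.zeros((20, 300)),
    }
    print(fuse_features(toy, mode).shape)
```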

This script runs a sweep over all parameters described in our paper, including the number of BLSTM layers and dropout rates. It is designed to run the sweep in parallel and therefore requires significant compute resources.
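The kind of parallel sweep meant here can be pictured as the sketch below. `train_and_evaluate` is a placeholder standing in for a full training run, not a function from this repository, and the grid values are examples only:

```python
# Hypothetical sketch of a parallel hyper-parameter sweep over the number of
# BLSTM layers and the dropout rate, with one worker process per grid point.
import itertools
from multiprocessing import Pool

def train_and_evaluate(config):
    n_layers, dropout = config
    # Build a BLSTM with n_layers layers and the given dropout, train it,
    # and return a validation score. Stubbed out here.
    return {"layers": n_layers, "dropout": dropout, "score": 0.0}

if __name__ == "__main__":
    grid = list(itertools.product([1, 2, 3], [0.1, 0.2, 0.3, 0.4]))
    with Pool(processes=4) as pool:
        results = pool.map(train_and_evaluate, grid)
    for r in sorted(results, key=lambda r: r["score"], reverse=True):
        print(r)
```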

To cite (BibTeX):

@inproceedings{williams2018a,
  title     = "Recognizing Emotions in Video Using Multimodal DNN Feature Fusion",
  author    = "Jennifer Williams and Steven Kleinegesse and Ramona Comanescu and Oana Radu",
  year      = "2018",
  pages     = "11--19",
  booktitle = "Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML)",
  publisher = "Association for Computational Linguistics",
}

SENTIMENT - Multimodal Sentiment Analysis using MOSI data

Paper: DNN Multimodal Fusion Techniques for Predicting Video Sentiment

Code: MOSI_*.py

Run: MOSI_*.py [mode] [task]

Where [mode] specifies the multimodal inputs (A=Audio, V=Video, T=Text): all, AV, AT, VT, V, T, or A, and [task] specifies whether the task is binary classification, 5-class classification, or regression.
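One way to picture the [task] argument is as a switch over the output layer and loss, as in the sketch below. The configuration table and task strings are illustrative assumptions, not the actual interface of the MOSI_*.py scripts:

```python
# Hypothetical sketch: selecting an output-layer size, activation, and loss
# from the [task] argument (binary, 5-class, or regression).
TASK_CONFIG = {
    "binary":     {"units": 1, "activation": "sigmoid", "loss": "binary_crossentropy"},
    "5-class":    {"units": 5, "activation": "softmax", "loss": "categorical_crossentropy"},
    "regression": {"units": 1, "activation": "linear",  "loss": "mean_absolute_error"},
}

def output_spec(task: str) -> dict:
    """Return the output-layer specification for a task string."""
    try:
        return TASK_CONFIG[task]
    except KeyError:
        raise ValueError(f"Unknown task: {task!r}; expected one of {sorted(TASK_CONFIG)}")

if __name__ == "__main__":
    for task in ("binary", "5-class", "regression"):
        print(task, output_spec(task))
```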

Each of these scripts runs a sweep over all parameters described in our paper.

To cite (BibTeX):

@inproceedings{williams2018b,
  title     = "DNN Multimodal Fusion Techniques for Predicting Video Sentiment",
  author    = "Jennifer Williams and Ramona Comanescu and Oana Radu and Leimin Tian",
  year      = "2018",
  pages     = "64--72",
  booktitle = "Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML)",
  publisher = "Association for Computational Linguistics",
}

Notes:

  1. Our code is designed to interface with the CMU MultiModalDataSDK. Please cite their dataset along with our paper.
  2. The MOSEI dataset in particular is currently very large, so working with it requires a large amount of RAM.
  3. If you have questions about this code, please open an issue on this repository.
  4. If you have questions related to the data itself, please contact the CMU team.
  5. This code is provided as-is, and is the code used for our University of Edinburgh Team G25 submission.
