multimodal-fusion

This repository contains the code for several of our recent works on multimodal fusion, including Divide, Conquer and Combine: Hierarchical Feature Fusion Network with Local and Global Perspectives for Multimodal Affective Computing and Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing, among others.

Below are detailed introductions to our fusion methods:

  1. HFFN (Hierarchical Feature Fusion Network)

Some of the code is borrowed from https://github.com/soujanyaporia/multimodal-sentiment-analysis. We thank the authors very much for sharing it.

Some modifications have been made to obtain better performance (80.45 on MOSI), so some details differ from the paper: https://www.aclweb.org/anthology/P19-1046/

The data were originally released at https://github.com/A2Zadeh/CMU-MultimodalSDK and are provided in processed form at https://github.com/soujanyaporia/multimodal-sentiment-analysis. If you use these data, please cite the corresponding papers. For the raw datasets, please download them from https://github.com/soujanyaporia/multimodal-sentiment-analysis/tree/master/dataset (you need to create a 'dataset' folder and place the downloaded data in it).

To run the code: python mosi_acl.py

We tested the code with Python 2, and the framework is Keras. You can also adjust the hyperparameters.
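For orientation, here is a minimal sketch of the "divide, conquer and combine" idea in Keras. This is not the code in mosi_acl.py: the window length, layer sizes, and fusion operators below are assumptions chosen for illustration only; refer to the script and the paper for the actual architecture.

```python
# Illustrative sketch of hierarchical (local-then-global) fusion; NOT the HFFN code.
import keras.backend as K
from keras.layers import Concatenate, Dense, Input, LSTM, Lambda
from keras.models import Model

T, D = 20, 32   # assumed sequence length and per-modality feature size
W = 4           # assumed local window length ("divide")

def to_local_chunks(x):
    # cut the time axis into T // W non-overlapping local chunks
    return K.reshape(x, (-1, T // W, W * int(x.shape[-1])))

text, audio, video = (Input(shape=(T, D)) for _ in range(3))

# divide: split each modality into local chunks, then align them per chunk
chunks = Concatenate()([Lambda(to_local_chunks)(m) for m in (text, audio, video)])

# conquer: fuse the modalities inside each local chunk
local_fusion = Dense(64, activation='relu')(chunks)

# combine: integrate the local fusions into a global, utterance-level view
global_fusion = LSTM(64)(local_fusion)
prediction = Dense(1)(global_fusion)   # e.g. MOSI sentiment regression

model = Model(inputs=[text, audio, video], outputs=prediction)
model.compile(optimizer='adam', loss='mae')
model.summary()
```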

If you use the code, please cite our paper:

Mai, Sijie, Haifeng Hu, and Songlong Xing. "Divide, Conquer and Combine: Hierarchical Feature Fusion Network with Local and Global Perspectives for Multimodal Affective Computing." Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019.

  2. LMFN (Locally Confined Modality Fusion Network)

If you use the code, please cite our paper:

Mai, Sijie, Songlong Xing, and Haifeng Hu. "Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing." IEEE Transactions on Multimedia 22.1 (2019): 122-137.

  3. ARGF (Adversarial Representation Learning and Graph Fusion Network)

The code for ARGF is released at: https://github.com/TmacMai/ARGF_multimodal_fusion

If you use the code, please cite our paper:

Mai, Sijie, Haifeng Hu, and Songlong Xing. "Modality To Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion." AAAI-20 (2020).
