This code is for cleaning and preparing data to analyse the MULTISIMO cultimodal corpus (Read about the data : https://dl.acm.org/citation.cfm?id=3151018)
A snap of the final dataset can be viewed at dataset snap.PNG added in this repository - https://github.com/royn5618/MULTISIMO-Multimodal-Corpus-Cleaning/blob/master/dataset%20snap.PNG
Resources Used:
Stanford POS Tagger : https://nlp.stanford.edu/software/ TextBlob: http://textblob.readthedocs.io/en/