Skip to content

gtziafas/3VGC

Repository files navigation

3VGC

A Tri-Modal Video Genre Classification Dataset 0. Regroup for data loading- Friday

  1. Audio - LSTM(Extract features manually) and 2d CNN(CNN Extraction for features)
  2. Video - 3dCNN(Exists) , Tune hyperparameters etc.
  3. Maybe text (optional)- Train CNN,Transformer,LSTM.
  4. Speech to text

About

A Tri-Modal Video Genre Classification Dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •