Skip to content

dr-costas/clotho-dataloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Clotho data loader


NOTICE: This code is transferred to a new repository

Link to new repository: https://github.com/audio-captioning/clotho-dataloader

The current repository will not be updated any more. Any issues will be taken into account, but the updated code will be pushed at the new repository.

Sorry for the inconvenience!


Welcome to Clotho data loader repository. This repository has the necessary code for using the DataLoader class from PyTorch package (torch.utils.data.dataloader.DataLoader) with the Clotho dataset.

You can use the present data loader of Clotho directly with the examples created by the Clotho baseline dataset repository.

If you are looking at this README file, then I suppose that you already know what is a DataLoader from PyTorch. Nevertheless, the Clotho dataset has sequences as inputs and outputs, and each sequence is of arbitrary length (15 to 30 seconds for the input and 8 to 20 words for the output). For that reason, this data loader already provides a collate function.

Collate function

To be able to use the sequences of Clotho in a batch, you most likely will need some kind of padding policy. This repository already offers a collate function to be used with the Clotho data.

With the provided collate function, you can choose to either:

  • pad the data with zeros (for input audio data) and end-of-sequence symbol (for the output/words), to the length of the longest input (for the inputs) and output (for the outputs) sequence in tha batch
  • truncate the input and the output to the minimum length of the input and output in the batch, and
  • use a constant length for input and output, and either truncate or pad.

Enjoy and if you have any issues, please let me know in the issue section.

About

Data loader code for the data of the Clotho dataset.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages