data preprocess: multi-channel conversion, sample rate conversion, framing, and labeling
features: feature extraction, including MFCC, log-mel, CQT, and Gammatone, plus scalar calculation
config: configuration files and parameters
main: three parts: training, validation, and evaluation
models:
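
As a rough illustration of the data preprocess and features entries above, the sketch below chains channel mixdown, resampling, framing, log-mel extraction, and scalar calculation using NumPy only. Everything in it is an assumption rather than this repository's code: the function names are hypothetical, the resampler is a crude linear interpolation with no anti-aliasing filter, the mel scale uses the HTK formula, and "scalar" is read here as the per-dimension mean and standard deviation used for normalization.

```python
import numpy as np


def to_mono(audio):
    """Average channels; expects shape (n_samples,) or (n_samples, n_channels)."""
    audio = np.asarray(audio, dtype=np.float64)
    return audio if audio.ndim == 1 else audio.mean(axis=1)


def resample_linear(audio, orig_sr, target_sr):
    """Crude linear-interpolation resampler (assumption: no anti-aliasing)."""
    n_out = int(round(len(audio) * target_sr / orig_sr))
    old_t = np.arange(len(audio)) / orig_sr
    new_t = np.arange(n_out) / target_sr
    return np.interp(new_t, old_t, audio)


def frame_signal(audio, frame_len, hop_len):
    """Split a 1-D signal into overlapping frames, dropping the incomplete tail."""
    n_frames = 1 + (len(audio) - frame_len) // hop_len
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return audio[idx]


def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the HTK mel scale (an assumption)."""
    def hz_to_mel(f):
        return 2595.0 * np.log10(1.0 + f / 700.0)

    def mel_to_hz(m):
        return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sr / 2), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2, n_fft // 2 + 1)
    # Rising and falling slopes of each triangle, clipped at zero.
    lower = (fft_freqs[None, :] - edges[:-2, None]) / (edges[1:-1] - edges[:-2])[:, None]
    upper = (edges[2:, None] - fft_freqs[None, :]) / (edges[2:] - edges[1:-1])[:, None]
    return np.maximum(0.0, np.minimum(lower, upper))


def log_mel(frames, sr, n_mels=40):
    """Log-mel energies per frame; the frame length doubles as the FFT size."""
    n_fft = frames.shape[1]
    power = np.abs(np.fft.rfft(frames * np.hanning(n_fft), axis=1)) ** 2
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    return np.log(mel + 1e-10)  # small offset avoids log(0)


def calculate_scalar(features):
    """Per-dimension mean and standard deviation over all frames."""
    return features.mean(axis=0), features.std(axis=0)


def demo():
    """Run the whole chain on one second of synthetic stereo noise."""
    rng = np.random.default_rng(0)
    stereo = rng.standard_normal((44100, 2))
    mono = resample_linear(to_mono(stereo), 44100, 16000)
    frames = frame_signal(mono, frame_len=1024, hop_len=512)
    feats = log_mel(frames, sr=16000)
    mean, std = calculate_scalar(feats)
    normalized = (feats - mean) / std
    return frames.shape, normalized.shape
```

Calling `demo()` returns the frame matrix shape and the normalized feature shape. In practice the mean and standard deviation would normally be computed on the training data only and reused for validation and evaluation, and a dedicated audio library would replace the hand-written resampler and filterbank.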