Temporal Pyramid Network for Action Recognition

configs: 各模型配置文件模板
metrics: Youtube-8，Kinetics数据集评估脚本，以及模型自定义评估方法
train.py: 一键式训练脚本，可通过指定模型名，配置文件等一键式启动训练
eval.py: 一键式评估脚本，可通过指定模型名，配置文件，模型权重等一键式启动评估
predict.py: 一键式推断脚本，可通过指定模型名，配置文件，模型权重，待推断文件列表等一键式启动推断

Get Started

Please refer to GETTING_STARTED for detailed usage.

Quick Demo

We provide test_video.py to inference a single video.

python ./test_video.py

video dir at './data/dataset/inferlist.txt'

the log will show the infer result,the infer.json also will be save at './data/predict_results/infer.json'

For example, we can predict for the demo video (download here and put it under demo/.) by running:

train

python train.py  

python multi_gpus_train.py

At Aistudio script task:

deliver main.py

Detail:

python train.py --model_name=TPN \
                    --config=./configs/tpn.yaml \
                    --log_interval=10 \
                    --valid_interval=1 \
                    --use_gpu=True \
                    --save_dir=./data/checkpoints \
                    --fix_random_seed=False \
                    --pretrain=$PATH_TO_PRETRAIN_MODEL

eval

python eval.py

eval file is './data/dataset/vallist.txt'

the log will show the eval result, the eval.json also will be save at './data/evaluate_results/eval.json'

测试时数据预处理的方式跟训练时不一样，crop区域的大小为256x256，不同于训练时的224x224，所以需要将训练中预测输出时使用的全连接操作改为1x1x1的卷积。每个视频抽取图像帧数据的时候，会选取10个不同的位置作为时间起始点，做crop的时候会选取三个不同的空间起始点。在每个视频上会进行10x3次采样，将这30个样本的预测结果进行求和，选取概率最大的类别作为最终的预测结果。

原文中是每个视频采样长度是32x2 单卡上做了一次验证采用的是8x8的方式，所以精度比原文低了很多

由于单卡资源受限，cpu解码限制了速度（多卡脚本环境问题导致eval程序跑不起来），尝试采用32x2程序跑一半内存溢出，给我退出了... 后来调试好后在脚本任务上测试得出了与原文相同的结果结果都保存在'./data/evaluate_results/'下

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.idea		.idea
configs		configs
data		data
demo		demo
docs		docs
metrics		metrics
model_tpn		model_tpn
reader		reader
utils		utils
精度对齐		精度对齐
README.md		README.md
eval.py		eval.py
inference_model.py		inference_model.py
main.py		main.py
multi_gpus_train.py		multi_gpus_train.py
predict.py		predict.py
requirement.txt		requirement.txt
train.py		train.py
train_dist.py		train_dist.py
train_single.py		train_single.py

ruyijidan/TPN

Folders and files

Latest commit

History

Repository files navigation

Temporal Pyramid Network for Action Recognition

Get Started

Quick Demo

train

eval

About

Resources

Stars

Watchers

Forks

Languages