README

About

This project uses a CNN to classify dangerous goods such as pistols, scissors, and knives.

Progress and method

train.py

Run train.py to start training. It requires a standard PyTorch environment.

The dataset is placed under ./data, with the following directory structure:

./data
├── eval
│   ├── 0
│   │   ├── 0.1007.jpg
│   │   ├── 0.1008.jpg
│   │   ├── 0.1016.jpg
│   │   ├── 0.1017.jpg
│   │   ├── 0.1024.jpg
│   │   ├── 0.1025.jpg
│   ├── 1
│   │   ├── 1.1001.jpg
│   │   ├── 1.1002.jpg
│   │   ├── 1.1011.jpg
│   │   ├── 1.1012.jpg
│   ├── 2
│   │   ├── 2.1001.jpg
│   │   ├── 2.1002.jpg
│   │   ├── 2.1011.jpg
│   │   ├── 2.1012.jpg
...
├── train
│   ├── 0
│   │   ├── 0.1007.jpg
│   │   ├── 0.1008.jpg
│   │   ├── 0.1016.jpg
│   │   ├── 0.1017.jpg
│   │   ├── 0.1024.jpg
│   │   ├── 0.1025.jpg
│   ├── 1
│   │   ├── 1.1001.jpg
│   │   ├── 1.1002.jpg
│   │   ├── 1.1011.jpg
│   │   ├── 1.1012.jpg
│   ├── 2
│   │   ├── 2.1001.jpg
│   │   ├── 2.1002.jpg
│   │   ├── 2.1011.jpg
│   │   ├── 2.1012.jpg
...
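
Before training, it can help to confirm that the folders on disk match the layout above. The following is a minimal sketch using only the standard library. The helper name `count_images` is hypothetical and not part of this repository, and it assumes each class folder holds its `.jpg` files directly.

```python
from pathlib import Path


def count_images(root, split):
    """Return {class_name: number of .jpg files} for one split under root.

    Hypothetical helper for checking the layout; not part of the repository.
    """
    split_dir = Path(root) / split
    counts = {}
    # Each sub-directory of the split (0, 1, 2, ...) is treated as one class.
    for class_dir in sorted(p for p in split_dir.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() == ".jpg"
        )
    return counts


if __name__ == "__main__":
    for split in ("train", "eval"):
        # Skip splits that are missing instead of raising an error.
        if (Path("./data") / split).is_dir():
            print(split, count_images("./data", split))
```

A class folder that is empty or missing in one split shows up immediately in the printed dictionaries.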

eval.py

A validator is currently implemented: pass an image path and a file of trained model parameters (e.g. chkpoint.bin) to the program to get the trained model's predicted class for that image.

usage: eval.py [-h] [-i IMG_PATH] [-m MODEL] [-v VERBOSE]

Pass in an image, it will show you its class

optional arguments:
  -h, --help            show this help message and exit
  -i IMG_PATH, --path IMG_PATH
                        Image file path
  -m MODEL, --model MODEL
                        Model file path (*.bin)
  -v VERBOSE, --verbose VERBOSE
                        Show model structure and full output
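
A help text like the one above could be produced by an `argparse` parser along the following lines. This is a sketch reconstructed from the usage text only, not the actual source of eval.py. The function name `build_parser`, the `int` type, and the default value for `-v` are assumptions.

```python
import argparse


def build_parser():
    """Build a parser whose options mirror the usage text shown above."""
    parser = argparse.ArgumentParser(
        prog="eval.py",
        description="Pass in an image, it will show you its class",
    )
    # dest="img_path" makes argparse display the value as IMG_PATH.
    parser.add_argument("-i", "--path", dest="img_path",
                        help="Image file path")
    parser.add_argument("-m", "--model", dest="model",
                        help="Model file path (*.bin)")
    # The usage line shows -v taking a value (VERBOSE); treating it as an
    # integer that defaults to 0 is an assumption.
    parser.add_argument("-v", "--verbose", dest="verbose", type=int, default=0,
                        help="Show model structure and full output")
    return parser


# Parse an explicit argument list so the sketch runs without a command line.
args = build_parser().parse_args(["-i", "img.jpg", "-m", "chkpoint.bin"])
print(args.img_path, args.model, args.verbose)
```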

Example:

python3 eval.py -i '/home/neoncloud/project/data/eval/3/3.3.jpg' -m '/home/neoncloud/project/chkpoint_res.bin'
output: tensor(3, device='cuda:0')

The model predicted the image's label correctly: the input image belongs to class 3 (it is stored under eval/3), and the prediction is 3.

ft_train.py

This script fine-tunes a pretrained model. Taking ResNet34 as an example, its network structure is as follows:

ResNet(
  (conv1): Conv2d(3, 64, kernel_size=(7, 7), stride=(2, 2), padding=(3, 3), bias=False)
  (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  (relu): ReLU(inplace=True)
  (maxpool): MaxPool2d(kernel_size=3, stride=2, padding=1, dilation=1, ceil_mode=False)
  (layer1): Sequential(
    (0): BasicBlock(
      (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (1): BasicBlock(
      (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (2): BasicBlock(
      (conv1): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
  )
  (layer2): Sequential(
    (0): BasicBlock(
      (conv1): Conv2d(64, 128, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (downsample): Sequential(
        (0): Conv2d(64, 128, kernel_size=(1, 1), stride=(2, 2), bias=False)
        (1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      )
    )
    (1): BasicBlock(
      (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (2): BasicBlock(
      (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (3): BasicBlock(
      (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
  )
  (layer3): Sequential(
    (0): BasicBlock(
      (conv1): Conv2d(128, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (downsample): Sequential(
        (0): Conv2d(128, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
        (1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      )
    )
    (1): BasicBlock(
      (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (2): BasicBlock(
      (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (3): BasicBlock(
      (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (4): BasicBlock(
      (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (5): BasicBlock(
      (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
  )
  (layer4): Sequential(
    (0): BasicBlock(
      (conv1): Conv2d(256, 512, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (downsample): Sequential(
        (0): Conv2d(256, 512, kernel_size=(1, 1), stride=(2, 2), bias=False)
        (1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      )
    )
    (1): BasicBlock(
      (conv1): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
    (2): BasicBlock(
      (conv1): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (relu): ReLU(inplace=True)
      (conv2): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
      (bn2): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    )
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(1, 1))
  (fc): Linear(in_features=512, out_features=5, bias=True)
)

After the input stem (conv1, bn1, relu, maxpool), the network consists of the following six top-level modules:

layer1
layer2
layer3
layer4
avgpool
fc

We train only layer4, avgpool, and fc, and freeze the parameters of all earlier layers.

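from itertools import chain

# Freeze every parameter first.
for param in model.parameters():
    param.requires_grad = False

# Re-enable gradients for the modules being fine-tuned.
# (avgpool has no learnable parameters, so it contributes nothing here.)
for param in chain(model.layer4.parameters(), model.avgpool.parameters(), model.fc.parameters()):
    param.requires_grad = True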
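
To get a feel for how much of the network remains trainable after this freeze, the parameter counts can be worked out from the layer shapes in the printed structure. The sketch below is plain arithmetic and does not load the model. The helper names are hypothetical. It counts only learnable weights and biases, so BatchNorm running statistics are not included.

```python
def conv_params(c_in, c_out, k):
    """Weights of a bias-free k x k convolution."""
    return c_in * c_out * k * k


def bn_params(channels):
    """Affine BatchNorm: one weight and one bias per channel."""
    return 2 * channels


def basic_block_params(c_in, c_out, downsample):
    """Parameters of one BasicBlock as printed in the structure dump."""
    total = conv_params(c_in, c_out, 3) + bn_params(c_out)
    total += conv_params(c_out, c_out, 3) + bn_params(c_out)
    if downsample:
        # 1x1 convolution plus BatchNorm on the shortcut branch.
        total += conv_params(c_in, c_out, 1) + bn_params(c_out)
    return total


def stage_params(c_in, c_out, blocks, downsample):
    """The first block may change the width; the rest keep c_out channels."""
    first = basic_block_params(c_in, c_out, downsample)
    return first + (blocks - 1) * basic_block_params(c_out, c_out, False)


counts = {
    "stem": conv_params(3, 64, 7) + bn_params(64),
    "layer1": stage_params(64, 64, 3, False),
    "layer2": stage_params(64, 128, 4, True),
    "layer3": stage_params(128, 256, 6, True),
    "layer4": stage_params(256, 512, 3, True),
    "fc": 512 * 5 + 5,  # Linear(512, 5) with bias; avgpool has no parameters
}
total = sum(counts.values())
trainable = counts["layer4"] + counts["fc"]
print(f"total={total:,} trainable={trainable:,} ({trainable / total:.1%})")
# prints: total=21,287,237 trainable=13,116,933 (61.6%)
```

By this count, layer4 and fc together hold roughly 62% of the parameters, so the freeze mainly fixes the earlier feature-extraction stages while leaving the largest stage trainable.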

Fine-tuning the network greatly reduces training time: the model reaches 91.9% accuracy after only 45 epochs.

Epoch done, evaluating: 45
Epoch 45: 100%|████████████████████████████████████████████████████████████████| 53/53 [00:08<00:00,  6.25batch/s, accuracy=91.9, loss=0.0289]

For comparison, training from scratch requires at least 80 epochs to reach the same level.

TODO

Basic training script available, including a dataset object (inherited from the ImageFolder class), a data loader object (inherited from DataLoader), simple data augmentation (transforms, including padding to square, random rotation, etc.), SGD with cross-entropy loss, and a fancy-looking progress bar.
Improve the training script: design command-line parameters, and split out config.py (hyperparameter settings) and network.py (network definition).
Perform fine-tuning and test its effect.
Improve data augmentation to increase the model's accuracy.
Design and train a custom network, aiming for good accuracy.
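
The "padding to square" step mentioned in the first item can be illustrated with a small calculation. This is a sketch of one common way to do it, not necessarily how preprocessing.py implements it. The function name `square_padding` is hypothetical. The returned order (left, top, right, bottom) follows the convention used for 4-element padding in torchvision's `pad` transform.

```python
def square_padding(width, height):
    """Return (left, top, right, bottom) padding that makes the image square.

    The shorter side is padded as evenly as possible on both ends; when the
    difference is odd, the extra pixel goes to the right or bottom.
    """
    side = max(width, height)
    pad_w = side - width
    pad_h = side - height
    left = pad_w // 2
    top = pad_h // 2
    return (left, top, pad_w - left, pad_h - top)


print(square_padding(100, 60))  # (0, 20, 0, 20)
print(square_padding(5, 10))    # (2, 0, 3, 0)
```

Padding rather than stretching keeps the object's aspect ratio intact before the image is resized to the network's input size.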

putiaopi/cnn_object_detection_project
